Productivity

Tabula OCR - Free Tool to Extract Tables from PDF Files for Windows and macOS

Hazem Abbas

Nov 19, 2024 — 2 min read

Table of Content

Tabula is a free self-hosted lightweight tool that enables you to read and extract table data from PDF files easily.

Because it is written using Java, It works for Windows, Linux and macOS.

How to use Tabula?

Upload a PDF file containing a data table.
Browse to the page you want, then select the table by clicking and dragging to draw a box around the table.
Click "Preview & Export Extracted Data". Tabula will try to extract the data and display a preview. Inspect the data to make sure it looks correct. If data is missing, you can go back to adjust your selection.
Click the "Export" button.
Now you can work with your data as text file or a spreadsheet rather than a PDF! (You can open the downloaded file in Microsoft Excel or the free LibreOffice Calc)

Install using Docker

You can also install it using Docker.

docker run \
	--name tabula \
	-p 5000:5000 \
	-d \
	turicas/tabula:1.2.1

Customize with Docker

docker run \
	--name tabula \
	-p 5001:5001 \
	-e PORT=5001 \
	-e JAVA_XMS="256M" \
	-e JAVA_XMX="1024M" \
	-d \
	turicas/tabula:1.2.1

License

Tabula is an open-source project that is released under the MIT License

Resources & Downloads

Tabula: Extract Tables from PDFs

Tabula is a free tool for extracting data from PDF files into CSV and Excel files.

Productivity pdf pdf ocr ocr Open-source Java Windows macos Linux Self-hosted Web-based Apps Ubuntu Fedora Debian Arch Linux Linux Mint Manjaro office

Tabula OCR - Free Tool to Extract Tables from PDF Files for Windows and macOS

Hazem Abbas

Table of Content

How to use Tabula?

Install using Docker

Customize with Docker

License

Resources & Downloads

Are You Truly Ready to Put Your Mobile or Web App to the Test?

Articles

Systems

Development

Apps

Science - Healthcare

Open-source Apps

Medical Apps

Lists

Dev. Resources

Read more

Beyond the Books: Why Bad Homeschooling Can Cage Curiosity, Creativity and How to Fix It, My Personal Take

Why Searching using AI Chat Interface Does not work, 7 Reasons

Boost Your Email Workflow: 12 Dockerized Mail Servers That Deliver (Literally!)

Oppo Find N3 is The Real Competitor for Samsung Fold

Table of Content

How to use Tabula?

Install using Docker

Customize with Docker

License

Resources & Downloads

Read More Articles in Productivity

Top 10 Piped Youtube Apps To Watch YouTube without Ads

Pomodoro: Not Just for Managing Tasks, But for Managing Your Health Too

WifiScreen Use Mobile or Table as a Second Screen for Your Windows System, Old But Still Usable

Escrcpy: Display and Control Android Devices on Windows, Linux, macOS with this Free App

8 Open-source Apps to Display and Control Android Devices From Your Desktop!

Traditional AI VS Prompt AI: How the New AI Wave is changing the World and Production and Creativity!

Articles

Systems

Development

Apps

Science - Healthcare

Open-source Apps

Medical Apps

Lists

Dev. Resources

Read more

Beyond the Books: Why Bad Homeschooling Can Cage Curiosity, Creativity and How to Fix It, My Personal Take

Why Searching using AI Chat Interface Does not work, 7 Reasons

Boost Your Email Workflow: 12 Dockerized Mail Servers That Deliver (Literally!)

Oppo Find N3 is The Real Competitor for Samsung Fold