OCR (Optical Character Recognition) is a technology that allows computers to recognize text in images or scanned documents and convert it to editable text.

OCR tools are commonly used in various industries, including:

  • Digitization of printed materials: OCR can be used to convert physical books, magazines, and newspapers into digital formats that can be easily searched and shared.
  • Document management: OCR can be used to extract data from documents, making it easier to organize and search through large amounts of information.
  • Accessibility: OCR can be used to convert printed materials into formats that are accessible to people with visual impairments.
  • Translation: OCR can be used to extract text from documents in one language and translate it into another.

There are many OCR tools available, each with its own unique features and use-cases. Some tools are designed for specific tasks, such as TessStudio for reviewing and correcting OCR data, while others like EasyOCR are more general purpose.

In this post, we will find the best free and open-source OCR tools, that you can download, install, and use for Windows, and other platforms.

1- Text-Grab

Text-Grab is a Windows 10/11 OCR utility that takes a screenshot, passes the image to the local Windows API OCR engine, and puts the text into the clipboard for use anywhere. It has different modes to make working with text fast and easy.

GitHub - TheJoeFin/Text-Grab: Use OCR in Windows quickly and easily with Text Grab. With optional background process and popups.
Use OCR in Windows quickly and easily with Text Grab. With optional background process and popups. - GitHub - TheJoeFin/Text-Grab: Use OCR in Windows quickly and easily with Text Grab. With optiona…

2- OCR Translator

OCR Translator enables you to effortlessly convert captured images into text, and confidently translate that text with ease.

Features

  • Desktop application with a user-friendly graphical user interface (GUI) provided by customtkinter.
  • Ability to select preferred OCR and translation services.
  • Option to run the program using either the START button or the keyboard shortcut (Alt+Win+T or bound from options).
  • Capability to choose the area of the screen to scan for text using OCR and save the position (for example, when watching a movie and the subtitles always appear in one spot, so you don't have to select the text area again).
  • Automatic translation of the captured text if a translation service has been selected.
  • Ability to capture subtitles from movies or games by selecting the corresponding area of the screen and displaying the translated text next to them.
  • Chat with chatGPT or edgeGPT.
  • Ability to translate from the clipboard or manually entered text (similar to a typical translation app).
  • Save all selected options and settings to a file and load them when the program is launched.
GitHub - Azornes/ocrTranslator: Convert captured images to text using BaiduOCR, GoogleOCR, WindowsOCR, tesseractOCR, RapidOCR or Capture2Text, and translate the resulting text using Google, Chatgpt, Edgegpt, DeepL or many more. Desktop application with a nice GUI provided by customtkinter.
Convert captured images to text using BaiduOCR, GoogleOCR, WindowsOCR, tesseractOCR, RapidOCR or Capture2Text, and translate the resulting text using Google, Chatgpt, Edgegpt, DeepL or many more. D…

3- TessStudio

TessStudio is a powerful Windows program designed for creating, reviewing, and correcting OCR data in searchable PDF files using the highly reliable Tesseract engine.

Features

  • Supports image and multipage PDF files, with or without prior OCR data.
  • Can run or re-run the Tesseract OCR process the current page, all pages or selected pages.
  • Preserves any visible text on a PDF page while performing OCR on the image elements only.
  • For multipage files, multiple instances of the tesseract engine run in parallel for improved performance. The speed improvement depends on the number of processor cores.
  • Identify and display OCR text at the word level with detected word boundaries visible.
  • The built-in spell checker automatically tags words not found in the dictionary.
  • Display PDF pages in the following modes: Image with OCR text hidden, OCR text visible and image hidden, and OCR text visible on faded image.
  • Use any installed font to display OCR text. Fonts are automatically scaled to fit word boundaries.
  • Click on a visible word to open a text editor to correct OCR mistakes.
  • Split a selected word at the current cursor position into two words, or merge the selected word with the next word.
  • Modify or move word boundaries.
  • Create new OCR words, delete existing words.
  • Supports any number of Undo and Redo operations.
  • Save corrections as searchable PDF files. Optionally save as PDF/A or encrypted PDF files.
  • Experimental support for removing grid lines and handling a mixed-mode page with both light text on dark background and dark text on light background. This is common with table headers.
  • Capture and examine debug intermediary images and OCR output in text.
GitHub - OpaitSoftware/TesseractStudio.Net: A free Windows graphical interface to the Tesseract 4.0 OCR engine.
A free Windows graphical interface to the Tesseract 4.0 OCR engine. - GitHub - OpaitSoftware/TesseractStudio.Net: A free Windows graphical interface to the Tesseract 4.0 OCR engine.

4- ImageScanOCR

Image Scan OCR is the ultimate tool for recognizing text from images and PDFs using Windows OCR. With this powerful OCR app, you can easily convert any text image to editable plain text.

And the best part? You can easily manage and save the OCR result text as a plain text file, making it simple to organize and access your important information.

GitHub - ttop32/ImageScanOCR: Convert image and pdf to text using Window OCR
Convert image and pdf to text using Window OCR. Contribute to ttop32/ImageScanOCR development by creating an account on GitHub.

5- EasyOCR

EasyOCR is an OCR software that is ready to use and supports over 80 languages, including popular writing scripts such as Latin, Chinese, Arabic, Devanagari, Cyrillic, and more.

EasyOCR: A Free Open-source OCR That Supports 80+ Languages
EasyOCR is a free developer-friendly OCR “Optical Character Recognition” that supports 80+ languages including Latin, Chinese, Arabic, and Cyrillic. EasyOCR is written in the Python programming language. It can be installed as a Python package, and integrates well with other Python Frameworks lik…
GitHub - JaidedAI/EasyOCR: Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc. - GitHub - JaidedAI/EasyOCR: Ready-to-use OCR with 80+…

6- (a9t9) Free OCR Software

This is a Free open-source OCR application for the Windows Store - A modern GUI front-end for the Microsoft OCR library. The application also includes support for reading and OCR'ing PDF files.

GitHub - A9T9/Free-OCR-Software: Free open-source OCR application for the Windows Store - A modern GUI front-end for the Microsoft OCR library. The application also includes support for reading and OCR’ing PDF files.
Free open-source OCR application for the Windows Store - A modern GUI front-end for the Microsoft OCR library. The application also includes support for reading and OCR'ing PDF files. - GitHub…

7- pdf2pdfocr

This is an exceptional tool that swiftly OCRs PDFs (and supported images) with the added feature of appending a text "layer" to the original file, transforming it into a highly searchable PDF.

The script exclusively employs open source tools, solidifying its status as a reliable asset.

GitHub - LeoFCardoso/pdf2pdfocr: A free tool to OCR a PDF and add a text “layer” in the original file, making a searchable PDF. Use only open source tools. Please tip!
A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip! - GitHub - LeoFCardoso/pdf2pdfocr: A free tool to OC…

8- Cuneiform-OCR (Python)

This repository contains code for line detection, character detection and recognition on the cuneiform 2d images.

Each folder contains the respective code for:

  • Line Detection: Using Image processing techniques, line indentations are found on Cuneiform rgb images.
  • Object Detection: Using maskrcnn Cuneiform characters and line indentations are found on Cuneiform rgb images.
  • Synthetic: To create synthetic cuneiform rgb images with annotations for maskrcnn training.
GitHub - cdli-gh/Cuneiform-OCR: This repository contains code for line detection, character detection and recognition on the cuneiform 2d images
This repository contains code for line detection, character detection and recognition on the cuneiform 2d images - GitHub - cdli-gh/Cuneiform-OCR: This repository contains code for line detection,…

9- NormCap

Our OCR-powered screen-capture tool captures information more efficiently than traditional image-based methods. Available for Linux, macOS, and Windows operating systems.

GitHub - dynobo/normcap: OCR powered screen-capture tool to capture information instead of images
OCR powered screen-capture tool to capture information instead of images - GitHub - dynobo/normcap: OCR powered screen-capture tool to capture information instead of images

10- LAREX

LAREX is a great open-source tool for analyzing the layout of early printed books. Its rule-based approach to connected components is not only efficient, but also easy to understand for the user. If needed, there is also an intuitive manual correction option.

In addition, the PAGE XML format makes it simple to integrate into existing OCR workflows. Evaluations have conclusively demonstrated that LAREX is an efficient and flexible solution for segmenting pages of early printed books, and is sure to meet all expectations.

LAREX can be installed thru Docker, or from source.

GitHub - OCR4all/LAREX: A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.
A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. - GitHub - OCR4all/LAREX: A semi-automatic open-source tool for Layout Analysis and Region EXtrac…

11- OCRFeeder

OCRFeeder is a powerful document layout analysis and optical character recognition system that automatically outlines content, distinguishes between graphics and text, and performs OCR over the latter. It generates multiple formats, with ODT as its main output.

With its complete GTK graphical user interface, users can confidently correct any unrecognized characters, define or correct bounding boxes, set paragraph styles, clean input images, import PDFs, save and load projects, and export everything to multiple formats.

OCRFeeder gives users the tools they need to efficiently and accurately process their documents.

Apps/OCRFeeder - GNOME Wiki!

12- QT Box Editor

QT Box Editor is a tool used for adjusting box files in tesseract-ocr. The goal of this project is to provide an easy and efficient way to edit files, regardless of their size. QT box editor is the successor of the tesseract-gui project, which is no longer being developed.

Qt-box-editor by zdenop
GitHub - zdenop/qt-box-editor: QT4 editor of tesseract-ocr box files
QT4 editor of tesseract-ocr box files. Contribute to zdenop/qt-box-editor development by creating an account on GitHub.

13- Rescribe

Rescribe is a desktop tool that allows you to easily perform OCR on image files, PDFs, and Google Books. It uses the Tesseract OCR engine along with efficient preprocessing and analysis pipelines to produce high-quality output. The tool is particularly useful for OCR of historical printed works, but it also includes modern language options and works well on modern printed works.

Rescribe is available for Windows, Linux, and macOS.

Rescribe desktop tool for OCR

14- Cognitive OpenOCR

Cognitive OpenOCR is a free OCR program that combines databases from other openware OCR programs and user feedback. It offers 23 different language options and has avoidable bloatware options during installation.

Cognitive OpenOCR
A free tool for picking words from images


15- OCRmyPDF

OCRmyPDF: OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched (Free software)
OCRmyPDF is a free open-source command-line tool that adds an OCR text layer to scanned PDF files, allowing them to be searched or copy-pasted. It is already being used to scan and search millions of heavy PDF files. Features Its features include: * Generates a searchable PDF/A f…

16- Paperwork

Paperwork is an open-source OCR and Scanner
What is Paperwork? Paperwork is a personal document manager. It manages scanned documents and PDFs. It’s designed to be easy and fast to use. The idea behind Paperwork is “scan & forget”: You can just scan a new document and forget about it until the day you need it again. In

17- Lios OCR

Lios is properly the best open-source Linux OCR
What is Lios? Lios stands for “Linux-Intelligent-Ocr-Solution” which is an open-source, free OCR solution that converts any text in image, PDF into a readable text. Lios is a free and open source software for converting prints to text using either a scanner or a camera. It supports multiple langua…

18- NormCap

NormCap: Extract Any Text from Your Screen
NormCap is a free open-source OCR and screen-capture tool that extract data from any part of your screen. Under the hood, NormCap uses Tesseract; the open-source OCR engine that supports dozens of languages by default and used in many enterprise apps. NormCap is written with Python and works for W…


13 Best Open Source Free PDF OCR Text Extractors
PDF file formats are a compact format widely used to create portable documents, reports, e-books, and more. Originally developed by Adobe in 1992, it has become a world standard. PDF files can contain text, images, and tables, and can be generated by many office suites, document editors, apps, web…
Convert image to Text with this Free App: Capture2Text OCR
Convert image to Text with this multilingual free app: Capture2Text Capture2Text is a free open-source OCR that allows you to convert any image capture to text. It supports about 90+ languages so far and comes with a highly customized shortcuts. Alongside its rich graphical user interface (GUI), C…