Lios or Linux-Intelligent-Ocr-Solution: Easy-OCR solution and Tesseract trainer for GNU/Linux

Lios or Linux-Intelligent-Ocr-Solution: Easy-OCR solution and Tesseract trainer for GNU/Linux

Lios (which stands for Linux-OCR-Software) is like that super-helpful friend who can read anything you throw at them. It's a free, open-source Optical Character Recognition (OCR) tool that turns images into editable text faster than you can say "accessibility rocks!"

The app is given total accessibility for visually impaired. A Tesseract Trainer GUI is also shipped with this package.

13 Best Open Source Free PDF OCR Text Extractors
PDF file formats are a compact format widely used to create portable documents, reports, e-books, and more. Originally developed by Adobe in 1992, it has become a world standard. PDF files can contain text, images, and tables, and can be generated by many office suites, document editors, apps, web services,


12 Free OCR Libraries and Projects
What is OCR (Optical Character Recognition)? OCR or Optical Character Recognition is a process that converts images that contains text into readable editable text formats which you can edit, copy, paste and save. It is not a new technology, as it was created decades ago to aid enterprise transform their
12 Top Free OCR Screen Capture Tools that Grab Text Directly from Your Screen
Screenshot OCR is a technology that allows users to extract text from screenshots and convert it into editable text. There are various screenshots to image OCR tools available that utilize Optical Character Recognition (OCR) algorithms to recognize and extract text from images of screenshots. These tools are useful in several
18 Open-source Free OCR for Windows
OCR (Optical Character Recognition) is a technology that allows computers to recognize text in images or scanned documents and convert it to editable text. OCR tools are commonly used in various industries, including: * Digitization of printed materials: OCR can be used to convert physical books, magazines, and newspapers into digital

Primary Cool

  1. Scanner Sorcery: Got a physical document? Lios works with your scanner to transform it into digital text in a snap.
  2. Camera Wizardry: No scanner? No problem! Snap a pic with your camera, and Lios will do the rest.
  3. PDF Prowess: Those pesky PDFs that won't let you copy text? Lios breaks through their defenses.
  4. Image Alchemy: Whether it's JPEGs, PNGs, or any other image format, Lios can extract the text.
  5. Bulk Brilliance: Got a folder full of images? Lios can process them all in one go. Talk about efficiency!
  6. Screenshot Savvy: Captured text in a screenshot? Lios has got you covered.

More cool features

  • Import images from Scanner, PDFs, Folder, or Webcam,
  • Take and Recognize Screenshot,
  • Recognize Selected Areas(Rectangle selection),
  • Support two OCR Engines (Cuneiform,Tesseract),
  • Tesseract-Trainer - Train your tesseract ocr engine to improve the accuracy
  • Full Auto Rotation for any Language(If aspell installed for the language, Eg : "sudo apt-get install aspell-hi" for Hindi,
  • Side by side view of image and output
  • Advanced Scanner Brightness optimizer
  • Text Reader for low vision with Highlighting, With user selected Color, Font, and Background Color,
  • Audio converter(espeak),
  • Spell-checker(aspell),
  • Export as pdf (text/images),
  • Dictionary Support for English(Artha)
  • Options for save, load and reset settings,
  • Text-Cleaner - Post process your output with match-replace dialog
  • Other options - Find, Find-and-Replace, Go-To-Page, Go-To-Line, Append file, Punch File, Selection of starting page number, page numbering mode and number of pages to scan, Selection of Scan area, brightness, resolution and time between repeated scanning, Output Insert position, image rotation and zoom options, etc
LaTeX-OCR: Free and Open-Source Python-based OCR for Scientific Document Conversion
In academia, research, and scientific fields, LaTeX has long been the preferred markup language for creating complex mathematical formulas and professional-grade documentation. However, converting printed or scanned documents containing LaTeX code back into an editable format can be challenging. Enter LaTeX-OCR, an open-source project designed to tackle this problem. What

Bonus Round: Teach Tesseract New Tricks

As if all that wasn't enough, Lios comes with a Tesseract Trainer GUI. For the OCR nerds out there (you know who you are), this means you can train the software to recognize new fonts or languages. It's like giving Lios superpowers!

Why Lios is a Game-Changer

  1. It's Free: In a world where good OCR software often comes with a hefty price tag, Lios is a breath of fresh air.
  2. Open Source Goodness: Tech-savvy users can dive into the code, customize it, and even contribute improvements.
  3. Versatility: From scanned documents to screenshots, Lios handles it all.
  4. Accessibility Focus: It's not just about converting text; it's about making information accessible to everyone.

Install

Dependency list : python3, python3-imaging-sane|python3-sane, python3-speechd, tesseract-ocr, imagemagick, cuneiform, espeak,poppler-utils, python3-enchant,aspell-en, gir1.2-gst-plugins-base-1.0, gir1.2-gstreamer-1.0

git clone https://github.com/zendalona/lios.git
cd lios
python3 setup.py install --install-data=/usr

Following the third step for a system-wide installation is not necessary. You can use the following commands instead, as long as you have all the dependecies installed. This will be useful if you are a Lios developer.

export PYTHONPATH=.
bin/lios --datadir 'share/lios'

License

GNU General Public License version 3.0 (GPLv3)

Resources & Downloads

Linux-Intelligent-Ocr-Solution
Download Linux-Intelligent-Ocr-Solution for free. Easy-OCR solution and Tesseract trainer for GNU/Linux. Linux-intelligent-ocr-solution Lios is a free and open source software for converting print in to text using either scanner or a camera, It can also produce text out of scanned images from other sources such as Pdf, Image, Folder containing Images or screenshot. Program is given total accessibility for visually impaired.

OCRmyPDF: OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched (Free software)
OCRmyPDF is a free open-source command-line tool that adds an OCR text layer to scanned PDF files, allowing them to be searched or copy-pasted. It is already being used to scan and search millions of heavy PDF files. Features Its features include: * Generates a searchable PDF/A file from a
Capture2Text: The Ultimate OCR Tool for Effortless Text Extraction and Translation
Capture2Text enables users to effortlessly perform optical character recognition (OCR) on a selected area of the screen by simply using a keyboard shortcut. The extracted text is automatically saved to the clipboard, ensuring seamless accessibility. This remarkable software supports a wide range of languages, including Chinese, English, French, German, Japanese,
Paperwork is an open-source OCR and Scanner
What is Paperwork? Paperwork is a personal document manager. It manages scanned documents and PDFs. It’s designed to be easy and fast to use. The idea behind Paperwork is “scan & forget”: You can just scan a new document and forget about it until the day you need it again. In
LaTeX-OCR: Free and Open-Source Python-based OCR for Scientific Document Conversion
In academia, research, and scientific fields, LaTeX has long been the preferred markup language for creating complex mathematical formulas and professional-grade documentation. However, converting printed or scanned documents containing LaTeX code back into an editable format can be challenging. Enter LaTeX-OCR, an open-source project designed to tackle this problem. What







Open-source Apps

9,500+

Medical Apps

500+

Lists

450+

Dev. Resources

900+