Performs a very fast OCR on a list of images (file path, url, base64, bytes, numpy, PIL ...) using Tesseract and returns the recognized text, its coordinates, and line-based word grouping in a DataFrame.
Updated Nov 14, 2023 - Python
Text extraction from image through OCR
The app extracts tabular data from PNG, JPG, or PDF files uploaded by the user and converts it into a downloadable CSV file.
This repository contains a document scanner app that can perform Optical Character Recognition.
The core objective of this project is to develop an algorithm that accurately identifies tables within scanned images, even across diverse layouts, fonts, and varying image quality levels. The algorithm will not only locate these tables but also extract data from them.
Scripts to convert low-quality scanned PDFs to text files using Google Cloud Vision and GPT-3 for spellchecking
Apache Solr Document Search and Indexing Analysis with OCR
This is a small and simple CLI OCR script that automatically runs OCR on an image, or splits a PDF into page images and then runs OCR on each page.
Medical Bill Information Extractor to automatically extract relevant information from medical bills using optical character recognition (OCR) technology, along with Google Tesseract, OpenCV2, and regular expressions.
Windows application for text decoding using the TesseractOCR library.
LiveScreenTranslator uses OCR and translation services to translate on-screen text instantly, with multi-monitor support, Text-to-Speech functionality, and the ability to save translated text to a file.
Extracting tabular data from scanned PDFs with OpenCV and PyTesseract.
Flask application for OCR and extraction of text from documents with support for repository applications