ocr

Here are 4,652 public repositories matching this topic...

hansalemaos / multitessiocr

Performs a very fast OCR on a list of images (file path, url, base64, bytes, numpy, PIL ...) using Tesseract and returns the recognized text, its coordinates, and line-based word grouping in a DataFrame.

fast files ocr position tesseract pandas multiple dataframe coordinates easyocr

Updated Nov 14, 2023
Python

Zaka801 / Text_extraction

Star

Text extraction from image through OCR

ocr text-extraction ocr-python

Updated Jun 24, 2023
Jupyter Notebook

KohHaruki / SummerHack2023

Star

The app extracts tabular data from PNG, JPG, or PDF files uploaded by the user and converts it into a downloadable CSV file.

react python typescript ocr image-processing expressjs tesseract image-analysis

Updated Feb 19, 2023
TypeScript

Ravindu-Yasas-Nagasinghe / Document-scanner-app-with-Optical-Character-Recognition.

Star

This repository contains a document scanner app that could perform Optical Character Recognition.

machine-learning ocr app-development

Updated Mar 6, 2023

cosmtrek / imgctl

Star

A command-line interface to control images with ease.

cli image ocr

Updated Feb 22, 2023
Go

47h4rv4-b / bankScribe

Star

All R&D related to bank statement transaction categorization and statement analysis.

python pdf ocr

Updated Mar 21, 2023
Python

oussama95boussaid / Automated_Table_Detection_and_recognition_from_Scanned_Images

Star

The core objective of this project is to develop a sophisticated algorithm capable of accurately identifying tables within scanned images, even when confronted with diverse layouts,fonts, and varying image quality levels.This algorithm will not only locate these tables but also perform data extraction from them.

data-science natural-language-processing ocr computer-vision image-processing transformers artificial-intelligence tesseract-ocr nlp-machine-learning ocr-python yolov8

Updated Sep 16, 2023
Jupyter Notebook

emilyhasson / Text-Recognition

Star

Scripts to convert low-quality scanned PDFs to text files using Google Cloud Vision and GPT-3 for spellchecking

nlp ocr computer-vision

Updated Jun 13, 2023
Python

liviobisogni / solr-ocr-indexing

Star

Apache Solr Document Search and Indexing Analysis with OCR

search search-engine pdf ocr solr tesseract indexing leptonica tesseract-ocr optical-character-recognition indexing-engine apache-solr document-search

Updated Apr 19, 2023
Java

AlDrAkU / Simple_OCR_cli

Star

This is a small and simple cli ocr script to automatically ocr an image or split a pdf into images and then ocr the images of the pages.

machine-learning ocr tesseract-ocr ocr-recognition

Updated Feb 10, 2024
Python

DoganK01 / YOLOV7-License-Plate-Recognition-with-Dashboard---Easy-OCR

Star

python machine-learning ocr recognition ai computer-vision dashboard deep-learning tesseract sort yolo object-detection object-tracking license-plate-recognition deepsort yolov7

Updated Mar 28, 2023
Python

RepZ97 / Medical-Bill-Information-Extraction

Star

Medical Bill Information Extractor to automatically extract relevant information from medical bills using optical character recognition (OCR) technology, along with Google Tesseract, OpenCV2, and regular expressions.

ocr regex tesseract-ocr opencv2

Updated Mar 21, 2023
Jupyter Notebook

Paulraj916 / video-to-slide

Star

A python based Tkinder application for converting video to slide

ocr tesseract

Updated Aug 20, 2023
Python

prbrq / ImageTextOCR

Star

Windows application for text decoding using the TesseractOCR library.

windows productivity ocr csharp dotnet

Updated Feb 15, 2023
C#

casual-lab / PDF-OCRSearch

Star

用于扫描版 pdf 书籍的内容检索

pdf ocr searching pdf-document-processor

Updated Nov 3, 2022
Python

ryanp3343 / LiveScreenTranslator

Star

LiveScreenTranslator utilizes OCR and translation services to provide instantaneous on-screen text translation, incorporating multi-monitor support, Text-to-Speech functionality, and the ability to save translated text to a file, delivering a solution for diverse translation requirements.

multilingual python text-to-speech real-time ocr translation desktop-application language-processing