Skip to content

Hello, this is Brix OCR, an open-source project where we try to build an OCR engine with the help of others as well as make a custom model.

Notifications You must be signed in to change notification settings

Artificial-Brix/BRIX-OCR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BRIX-OCR

OCR engines have been developed into many kinds of domain-specific OCR applications, such as receipt OCR, invoice OCR, check OCR, legal billing document OCR. They can be used for: Data entry for business documents, e.g. Cheque, passport, invoice, bank statement and receipt. Automatic number plate recognition. This project focuses to build a OCR engine with the help of datasets and images.

Available libraries

Some of the avalable OCR engines are:

Datasets:

Some of the available datasets for testing and training a OCR engine:

Drive Link of some datasets for testing and training a OCR engine:

Research papers:

TASKS to resolve:

  • Task 0: As we need to train the custom model as well as the pretrained models so we need datasets,please add datasets links or download them inside a drive and make hyper link in the datasets sections in the readme.md and complete the Task 0.
  • Task 1: There are three folders given Newspaper ,Posters and Sheets,go inside one folder, you can find a image there, as a sample, please find similar images only and push them inside perticuler folders, minimum 50 images inside a folder will be enough to make the dataset.
  • Task 2: In this task you have to make a jupyter notebook and in that try to use some of the given libraries in the readme section and you have to test their output using the images in the test images,and contribute a jupyter notebook as a name like this: Name_of_the_contributer.ipynb.
  • Task 3: This is the last step of the project, as you have tried all the libraries,make a custom model using the datasets and the take the help of the research papers as well as you mentor of the project,make a jupyter notebook and complete the Task 3.

How to contribute

Please don't push any commits in the main branch, in that case the PR will not be accepted,as there are 4 tasks, please join the discord server first to contribute and then comment under the respective issues and then fork the repo and start working. HAPPY CONTRIBUTING!!!