kojimano/ocr_extraction_attack

CS 6431 Final Project (Evaluating the Risk of Efficient Model Extraction Attack to Steal the Knowledge from Public OCR APIs)

Data

Data processing and annotation scripts are located under /data/script. We have not yet decided whether to release our Google OCR annotations, because of ethical concerns and a potential terms-of-use violation.

cd data
mkdir TextOCR
mkdir TextOCR/basic
cd TextOCR/basic
wget https://dl.fbaipublicfiles.com/textvqa/data/textocr/TextOCR_0.1_train.json
wget https://dl.fbaipublicfiles.com/textvqa/images/train_val_images.zip
unzip train_val_images.zip
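
The download commands above fetch large files, and an interrupted transfer can leave an empty or missing file behind. A minimal sketch of a sanity check follows. The function name `check_textocr_download` is hypothetical and not part of this repository, and it checks only the two file names that appear in the `wget` URLs above.

```shell
# Hypothetical helper (not part of this repository): report which of the
# two downloaded TextOCR files are missing or empty in the given directory.
check_textocr_download() {
    dir="${1:-.}"      # directory to check; defaults to the current one
    missing=0
    for f in TextOCR_0.1_train.json train_val_images.zip; do
        # -s is true only if the file exists and has a size greater than zero
        if [ ! -s "$dir/$f" ]; then
            echo "missing or empty: $dir/$f" >&2
            missing=1
        fi
    done
    return "$missing"  # 0 if both files are present, 1 otherwise
}
```

After defining the function, running `check_textocr_download TextOCR/basic` from the data directory returns a non-zero status and names any file that needs to be downloaded again. It does not verify file contents or checksums.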

Examples of Google OCR Annotations

Models

Please follow the steps below to clone the model repo. We will release our modification patches once we resolve the conflicts.

mkdir models
cd models
git clone https://github.com/clovaai/deep-text-recognition-benchmark.git

Visualization of Validation Curves Enabled by https://wandb.ai

Trained on Human Annotations

Trained on Google OCR Annotations
