CS 6431 Final Project (Evaluating the Risk of Efficient Model Extraction Attack to Steal the Knowledge from Public OCR APIs)

Data

Data processing and annotation scripts are located under /data/script. We have not yet decided whether to release our Google OCR annotations, because of ethical concerns and a potential terms-of-use violation.

cd /data
mkdir TextOCR
mkdir TextOCR/basic
cd TextOCR/basic
wget https://dl.fbaipublicfiles.com/textvqa/data/textocr/TextOCR_0.1_train.json
wget https://dl.fbaipublicfiles.com/textvqa/images/train_val_images.zip
unzip train_val_images.zip
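
Once the files are downloaded, the annotation file can be inspected with a few lines of Python. The sketch below is only an illustration: it assumes the JSON has a top-level `anns` map whose entries carry `image_id` and `utf8_string` fields, so check the key names against the file you downloaded. The function names are hypothetical and are not part of the scripts under /data/script.

```python
import json
from collections import defaultdict


def load_annotations(path):
    """Read the TextOCR annotation JSON into a Python dict."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def group_words_by_image(annotations):
    """Map each image id to the list of word strings annotated on it.

    Assumes annotations["anns"] is a dict of word-level entries, each with
    "image_id" and "utf8_string" keys (an assumption about the file layout).
    """
    words = defaultdict(list)
    for ann in annotations["anns"].values():
        words[ann["image_id"]].append(ann["utf8_string"])
    return dict(words)
```

For example, `group_words_by_image(load_annotations("TextOCR_0.1_train.json"))` would give the human-annotated words per image, which is a convenient shape for comparing against the words returned by an OCR API for the same image.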

Examples of Google OCR Annotations

Models

Please follow the steps below to clone the model repo. We will release our modification patches once we resolve the conflicts.

mkdir models
cd models
git clone https://github.com/clovaai/deep-text-recognition-benchmark.git
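
The cloned repository's README describes building its LMDB training data from a ground-truth text file with one `imagepath<TAB>label` pair per line. As a rough sketch of how word-level annotations (human or OCR-generated) could be written in that layout, assuming the word crops already exist on disk: the helper names and crop paths below are hypothetical, and this is not necessarily the procedure used in this project.

```python
def format_gt_lines(samples):
    """Turn (relative_image_path, label) pairs into tab-separated lines.

    Labels that are empty or contain a tab or newline are skipped, since
    they would break the one-sample-per-line layout.
    """
    lines = []
    for image_path, label in samples:
        if not label or "\t" in label or "\n" in label:
            continue
        lines.append(f"{image_path}\t{label}\n")
    return lines


def write_gt_file(samples, gt_path):
    """Write the formatted samples to a ground-truth text file."""
    with open(gt_path, "w", encoding="utf-8") as f:
        f.writelines(format_gt_lines(samples))
```

Writing one such file from the human annotations and another from the OCR API output would keep the two training sets in the same format, so the same training command can be reused for both.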

Visualization of Validation Curves Enabled by https://wandb.ai

Trained on Human Annotations

Trained on Google OCR Annotations
