This project uses scattering wavelet networks for image processing and classification which is so far typically done with convolutional neuronal networks (CNNs) or related models. This technique is used in order to detect tables and extract their content from scaned documents.
For more information see DocScatWaveNet
Table detection in PDF and scaned documents with scattering wavelets:
- OS name/OS Type:
Ubuntu 20.04.3 LTS/64-bit
- Processor: AMD® Ryzen threadripper 1900x 8-core processor × 16
- You need
zstd
for unpack data:sudo apt install zstd
- Install
git lfs
(https://git-lfs.github.com/) and then add information about the compressed files viagit lfs track "*.zst"
- run
pip install -r requirements.txt
- run
python setup.py install
- run
python startMeUp.py
- if you are asked "calculate SWCs (Y/N) ?" then type "Y"
For more information see DocScatWaveNet