Skip to content

parse tabular data with invisible border from pdf to csv easily.

Notifications You must be signed in to change notification settings

snifhex/tabular-pdf-to-csv

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 

Repository files navigation

pdf-tabular-to-csv

Parse tables from pdf with invisible border to csv easily.

Main goal of this script?

To parse table from the pdf having invisble border.

table

but you can parse tables with visible border too.

you will just have to remove flavor parameter form the line 42 of the script.

How to use this script ?

Put your pdf and this script in same folder and add page numbers in pages variable of the script. Now run the script. after a successful execution you will have output folder in the current directory and in that output folder you will have tables according to their page number extracted in csv formats.

What if I just want to parse only few pages/specific pages from the pdf?

Well good news is you can by simply updating pages variable to the string of your specific page numbers.

About

parse tabular data with invisible border from pdf to csv easily.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages