Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

is there any option to convert pdf to mark down with embedded images #3

Open
jayanh opened this issue Aug 17, 2017 · 5 comments
Open

Comments

@jayanh
Copy link

jayanh commented Aug 17, 2017

Hi!
This version can only convert to text, so is there any way/option to convert with media(images...)

Thanks

@jzillmann
Copy link
Owner

Currently not, sorry.
I've thought about it... think pdf.js allows to extract media... but haven't tried it and for my use it was irrelevant...

@marky-mark
Copy link

Table data would be great too

@berserkwarwolf
Copy link

@jzillmann Would you accept a bounty for this feature ?

@jzillmann
Copy link
Owner

@berserkwarwolf What exactly ?

  1. You want media extracted ?
  2. You want media extracted and included in the markdown as links (probably downloaded as a folder) ?
  3. Table data ?

@flywire
Copy link

flywire commented Aug 22, 2021

https://pdfbox.apache.org/

REM for %f in (*.pdf) do extract "%f"
java -jar pdfbox-app-2.0.24.jar ExtractImages %1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants