Dear @linux and @academicchatter folks:

Please suggest libre/open source tools that allow for the extraction of text and images from scientific pdf documents?

P.S: I’m on a linux machine. Would like something terminal friendly, if possible!

  • @CCRhode
    link
    1
    edit-2
    25 days ago

    I’m mystified that poppler-utils is not a viable option. Of course the *.pdf file would have to include the text itself, but many do.