Dear @linux and @academicchatter folks:

Please suggest libre/open source tools that allow for the extraction of text and images from scientific pdf documents?

P.S: I’m on a linux machine. Would like something terminal friendly, if possible!

  • Responsabilidade
    link
    fedilink
    5
    edit-2
    25 days ago

    The first tool I can think of is LibreOffice Draw

    Maybe there are other tools, but I think LibreOffice Draw do the job pretty well

    Edit: If the PDF has written text, you may wanna use an OCR tool, but I don’t have any to suggest