logoalt Hacker News

TightFibretoday at 9:11 AM1 replyview on HN

Long shot, but I wonder if an image of the pdf would do better if it did get unstuck on internal formats.


Replies

lxgrtoday at 10:48 AM

It definitely does. PDF is a vector-based image format historically, and all add-ons that make it behave a bit more sane as a text-oriented document format are optional, so your mileage using tools like pdftotext will vary greatly depending on who created a given PDF.