Is there an easy way to find out whether a given PDF is an image-only PDF? For example, can I tell whether the PDF came from a scanner?
Edit: I found this info as a Knowledge Base item:
Things that indicate a PDF might be image based include:
- if you know it came from a scanner
- if you cannot select text using the "Select Tool"
- if you get no results searching for a word that you know is in the document
- if you zoom the document greatly and it gets pixelated
But how can I answer these questions programmatically?
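To make the question concrete, here is a rough Python sketch of the kind of check I have in mind. It is only a heuristic under my own assumptions, not a real PDF parser: it treats a file as image-only when it contains image objects but no font objects, since text that can be selected or searched needs a font. The names looks_image_only and _searchable_bytes are made up for illustration. It only looks inside streams that inflate with zlib, so encrypted files or other stream filters could fool it.

```python
import re
import zlib

# Byte patterns for the raw PDF syntax (whitespace after a key is optional).
_STREAM_RE = re.compile(rb"stream\r?\n(.*?)endstream", re.DOTALL)
_FONT_RE = re.compile(rb"/Type\s*/Font\b")        # does not match /FontDescriptor
_IMAGE_RE = re.compile(rb"/Subtype\s*/Image\b")


def _searchable_bytes(data: bytes) -> bytes:
    """Return the raw file plus every stream body that zlib can inflate.

    Newer PDFs may keep object dictionaries inside compressed object
    streams, so scanning only the raw bytes could miss font objects.
    """
    parts = [data]
    for match in _STREAM_RE.finditer(data):
        try:
            # decompressobj tolerates the trailing newline before "endstream".
            parts.append(zlib.decompressobj().decompress(match.group(1)))
        except zlib.error:
            pass  # not Flate data (e.g. a JPEG image stream); skip it
    return b"\n".join(parts)


def looks_image_only(data: bytes) -> bool:
    """Heuristic (assumption): images present and no fonts means image-only."""
    blob = _searchable_bytes(data)
    has_font = _FONT_RE.search(blob) is not None
    has_image = _IMAGE_RE.search(blob) is not None
    return has_image and not has_font
```

I would call it as looks_image_only(open("scan.pdf", "rb").read()), where "scan.pdf" is just a placeholder file name. A proper PDF library that walks each page's resources would presumably be more reliable than scanning bytes like this.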
And another question: when will the OCR functionality be available?
Best regards,
Cantemir