How to find PDF-documents without "live" text?  SOLVED

Jensen Head
User
Posts: 823
Joined: Mon Sep 13, 2021 8:12 am

How to find PDF-documents without "live" text?

Post by Jensen Head »

Do I understand correctly that neither the multi-file search in PDF-XChange Editor nor PDF-Tools currently offers a way to list local PDF documents that contain no text objects (that is, no text that can be found by searching, selected, and copied as a sequence of characters)? Yes, it is possible to get a list of documents that contain at least one letter from the alphabets of all the languages that might be encountered. Then, using third-party tools, you can list the documents that were left out of that list, on the assumption that they have no text objects. There is also a tool that lets you perform some operations on pages that have no text objects.

But when the documents may be in any of dozens of languages, it is not practical to run text recognition on them with every possible language enabled: the process would be slow and inaccurate. Therefore, I need exactly the list of documents with no text inside, so that I can later recognize each one while specifying only the languages that actually occur in it.

What is the best way to do this with the available means, or can we hope to see this functionality in the application in the foreseeable future?
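The "list what was left out" step of the workaround above can be done with a short script. This is a minimal sketch, not a feature of PDF-XChange Editor or PDF-Tools: it assumes the search results (documents that do contain text) are already available as a plain list of file paths, and the function name `pdfs_missing_from_results` is hypothetical.

```python
import os
from pathlib import Path


def _key(path):
    # Normalise a path so that different spellings of the same file
    # (relative vs absolute, letter case on Windows) compare equal.
    return os.path.normcase(str(Path(path).resolve()))


def pdfs_missing_from_results(folder, matched_paths):
    """Return the PDFs under `folder` that are absent from `matched_paths`.

    `matched_paths` is assumed to be the list of files that a
    "search for any letter" pass returned, i.e. documents with live text.
    Whatever is left over is a candidate for text recognition.
    """
    matched = {_key(p) for p in matched_paths}
    # Walk the folder recursively and keep files with a .pdf extension.
    candidates = [
        p for p in Path(folder).rglob("*")
        if p.is_file() and p.suffix.lower() == ".pdf"
    ]
    return sorted(p for p in candidates if _key(p) not in matched)
```

Calling `pdfs_missing_from_results("D:/Scans", matched)` with a hypothetical folder and result list returns the remaining files as `Path` objects. The result is only as reliable as the search pass that produced `matched`: a document whose text is in a script that was not searched for will be listed as having no text.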
Dimitar - PDF-XChange
Site Admin
Posts: 2638
Joined: Mon Jan 15, 2018 9:01 am

Re: How to find PDF-documents without "live" text?

Post by Dimitar - PDF-XChange »

Hello Jensen Head,

As you have already noticed, this is currently not possible.

I will however forward your request to our developers for consideration.

I can't promise that this is something that will be implemented, but we'll see what our developers have to say about this feature.

I will keep you posted on any progress.

Regards.
Jensen Head
User
Posts: 823
Joined: Mon Sep 13, 2021 8:12 am

Re: How to find PDF-documents without "live" text?

Post by Jensen Head »

Dimitar - PDF-XChange wrote: Fri Jan 27, 2023 3:13 pm
we'll see what our developers have to say about this feature.
Did they say something?
Daniel - PDF-XChange
Site Admin
Posts: 12171
Joined: Wed Jan 03, 2018 6:52 pm

Re: How to find PDF-documents without "live" text?  SOLVED

Post by Daniel - PDF-XChange »

Hello, Jensen Head

At the moment there is no way to do this, and it is not currently planned, as the need is considered too niche.

Kind regards,
Dan McIntyre - Support Technician
PDF-XChange Co. LTD

Vladimir G - Tracker Dev
User
Posts: 90
Joined: Thu Nov 30, 2017 1:24 pm

Re: How to find PDF-documents without "live" text?

Post by Vladimir G - Tracker Dev »

Hello Jensen Head,

It's not entirely clear to me whether you want to search for documents without text or filter them out.

If you're looking for a search feature, PDF Tools doesn't provide any functionality for global searching across arbitrary documents.
The application works only with the specific documents or folders that you pass to it for processing.

However, you can analyze a set of input documents by specifying the negated “Any text” criterion, and then either save those documents or export a list of them to a PDTFL file.

Example of a tool that processes only documents with no text:
[Attached screenshot: image.png]
Best regards,
Vladimir Goshko
Software Developer
PDF-XChange Co. LTD