How to find PDF documents without a specified language?

Forum for the PDF-XChange Editor - Free and Licensed Versions

Moderators: PDF-XChange Support, Daniel - PDF-XChange, Chris - PDF-XChange, Sean - PDF-XChange, Paul - PDF-XChange, Vasyl - PDF-XChange, Ivan - Tracker Software, Stefan - PDF-XChange

User avatar
Jensen Head
User
Posts: 623
Joined: Mon Sep 13, 2021 8:12 am

How to find PDF documents without a specified language?

Post by Jensen Head »

Since the Language column in Windows File Explorer does not correspond to the same-name meta tag in PDF, it is currently difficult to find documents with a specific language, or without a specified language. At least with a large number of documents or with a large size.

Please consider adding this field to the PDF-XChange File Explorer add-on utility (PDF-XChange Shell Extensions), or adding a search parameter to the advanced document search (with the ability to search for a specific language in the list of languages ​​installed for a document or documents without an installed language).
User avatar
Daniel - PDF-XChange
Site Admin
Posts: 11252
Joined: Wed Jan 03, 2018 6:52 pm

Re: How to find PDF documents without a specified language?

Post by Daniel - PDF-XChange »

Hello, Jensen Head

Thank you for the suggestion, I have passed this along to the Dev team for review, but as usual, I cannot make any promises until I have heard back from them.

[Edit]
At the very least, they are committed to investigating if this is possible, and we have a ticket for it now:

RT#7469: FR: shell ext - Add language metadata

Kind regards,
Dan McIntyre - Support Technician
PDF-XChange Co. LTD

+++++++++++++++++++++++++++++++++++
Our Web site domain and email address has changed as of 26/10/2023.
https://www.pdf-xchange.com
Support@pdf-xchange.com
User avatar
Jensen Head
User
Posts: 623
Joined: Mon Sep 13, 2021 8:12 am

Re: How to find PDF documents without a specified language?

Post by Jensen Head »

Just keep in mind that the language can be specified not only in the Info dictionary (which is accessible via reader.metadata in PyPDF2), but also in the PDF (Root) catalog under the /Lang key (access via catalog = reader.trailer["/Root"]). As a separate script, this tool works, but for most users it would be convenient to use it via the GUI.
User avatar
Daniel - PDF-XChange
Site Admin
Posts: 11252
Joined: Wed Jan 03, 2018 6:52 pm

Re: How to find PDF documents without a specified language?

Post by Daniel - PDF-XChange »

Hello, Jensen Head

Thank you for that, I have made a note, just in case.

Kind regards,
Dan McIntyre - Support Technician
PDF-XChange Co. LTD

+++++++++++++++++++++++++++++++++++
Our Web site domain and email address has changed as of 26/10/2023.
https://www.pdf-xchange.com
Support@pdf-xchange.com