Filter documents for additional metadata in xmp

biofunc
User
Posts: 13
Joined: Tue Mar 03, 2020 6:35 am

Post by biofunc »

Hello,

I have many documents that have been scanned and the scanner software has incorrectly recognized the text:

For documents fed in landscape format, the text is recognized but placed in the wrong position in the document and in a vertical orientation (see photo). This is caused by the scanner's firmware, and several hundred documents have been scanned in the meantime.

I found a workaround: first clean the document (the metadata is preserved and the old text layer is removed, see photo), then run the PDF-XChange text recognition (OCR) on it.

All documents have the entry "Macro V3.6.1" in the metadata as producer and author.

Can I filter for the documents with the entry "Macro V3.6.1" so that the cleaning and OCR are applied only to those?

Best regards
biofunc
Picture-1.png
Picture-2.png
Picture-3.png
Stefan - PDF-XChange
Site Admin
Posts: 19887
Joined: Mon Jan 12, 2009 8:07 am

Re: Filter documents for additional metadata in xmp

Post by Stefan - PDF-XChange »

Hello biofunc,

I can see the "Producer" info in Windows Explorer (you might need to turn this column on):
image.png
Would that help you narrow down the files you need to reprocess?

I cannot think of a way to do the "filtering" inside the Editor or Tools themselves.
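
If scripting the narrowing-down outside PDF-Tools is an option, the idea could be sketched roughly as below. This is not a PDF-XChange feature, only a heuristic in plain Python: it searches the raw bytes of each file for the string "Macro V3.6.1". It assumes the string is stored uncompressed and unencrypted (which is common for XMP metadata, but not guaranteed), and the names `has_marker` and `find_marked_pdfs` are made up for this example.

```python
from pathlib import Path

MARKER = "Macro V3.6.1"  # producer/author entry mentioned in the question


def has_marker(pdf_path, marker=MARKER):
    """Return True if the marker string appears in the raw bytes of the file.

    Heuristic only: metadata held in compressed or encrypted objects
    will not be found this way.
    """
    data = Path(pdf_path).read_bytes()
    # Plain ASCII/UTF-8 form, e.g. inside an uncompressed XMP packet
    if marker.encode("utf-8") in data:
        return True
    # UTF-16BE form, which PDF text strings may also use
    return marker.encode("utf-16-be") in data


def find_marked_pdfs(folder, marker=MARKER):
    """Return a sorted list of *.pdf files under folder containing the marker."""
    return sorted(
        p for p in Path(folder).rglob("*.pdf")
        if p.is_file() and has_marker(p, marker)
    )
```

Calling `find_marked_pdfs` on the scan folder would give the list of files to copy into a separate folder, which can then be used as the input for the cleaning and OCR steps. Note that the `*.pdf` pattern may be case-sensitive depending on the operating system.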

Kind regards,
Stefan
Vladimir G - Tracker Dev
User
Posts: 70
Joined: Thu Nov 30, 2017 1:24 pm

Re: Filter documents for additional metadata in xmp

Post by Vladimir G - Tracker Dev »

Hello biofunc,

As Stefan mentioned, this kind of filtering isn't possible in PDF-Tools right now. After talking with the Development Team, we think this feature can be added, so I'm hopeful it will be in the next release.

I will reply to you here if we implement this feature, or you can follow the release notes to check if it has been implemented.

Kind regards,
Vladimir
Vladimir Goshko
Software Developer
PDF-XChange Co. LTD
biofunc
User
Posts: 13
Joined: Tue Mar 03, 2020 6:35 am

Re: Filter documents for additional metadata in xmp

Post by biofunc »

Hello,
The filter function is already very good, and I can manage with the solution suggested by Stefan.
But if you could now extend the search/filtering to cover the metadata, that would be great.

Thank you.

biofunc
Daniel - PDF-XChange
Site Admin
Posts: 11116
Joined: Wed Jan 03, 2018 6:52 pm

Re: Filter documents for additional metadata in xmp

Post by Daniel - PDF-XChange »

Hello, biofunc

I haven't heard whether this will make it into the upcoming release, but that release should be just around the corner: likely within the next week, possibly two if there are delays.
Keep an eye out for the release notes!

Kind regards,
Dan McIntyre - Support Technician
PDF-XChange Co. LTD

Jensen Head
User
Posts: 589
Joined: Mon Sep 13, 2021 8:12 am

Re: Filter documents for additional metadata in xmp

Post by Jensen Head »

As far as I can see, PDF-Tools currently has two Actions that do something similar:

Filter Documents
☐ Select documents with non-empty content
☐ Select documents with bookmarks
☐ Select documents with comments and fields
☐ Select documents with text
    Search in: Document Info
    Proximity:
    ☐ Page Text
    ☐ Bookmarks
    ☐ Comments
    ☐ Form Fields
    ☐ External Links
    ☑ Document Info

and

Filter Files
☐ File name
☐ File size
☐ Creation date
☐ Modified date

Does this mean that PDF-Tools does not yet support filtering by the value of a specific metadata field?
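
To illustrate what "filtering by a specific field" would mean, as opposed to a text search across all of Document Info, here is a rough sketch in plain Python that reads only the `pdf:Producer` value from a file's XMP packet. It is not how PDF-Tools works. It assumes the XMP packet is stored uncompressed and as UTF-8 (typical, but not guaranteed), and `xmp_producer` is a made-up helper name.

```python
import re
import xml.etree.ElementTree as ET
from pathlib import Path

PDF_NS = "http://ns.adobe.com/pdf/1.3/"  # XMP namespace that holds pdf:Producer


def xmp_producer(pdf_path):
    """Return the pdf:Producer value from the first readable XMP packet, or None."""
    data = Path(pdf_path).read_bytes()
    match = re.search(rb"<x:xmpmeta.*?</x:xmpmeta>", data, re.DOTALL)
    if match is None:
        return None  # no XMP packet found, or it is compressed/encrypted
    try:
        root = ET.fromstring(match.group(0))
    except ET.ParseError:
        return None  # packet present but not well-formed XML
    key = f"{{{PDF_NS}}}Producer"
    for elem in root.iter():
        # Element form: <pdf:Producer>...</pdf:Producer>
        if elem.tag == key and elem.text:
            return elem.text.strip()
        # Attribute form: <rdf:Description pdf:Producer="...">
        if key in elem.attrib:
            return elem.attrib[key].strip()
    return None
```

A field-specific filter would then be a comparison such as `xmp_producer(path) == "Macro V3.6.1"`, which ignores files where that string only appears in some other field.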
Stefan - PDF-XChange
Site Admin
Posts: 19887
Joined: Mon Jan 12, 2009 8:07 am

Re: Filter documents for additional metadata in xmp

Post by Stefan - PDF-XChange »

Hello Jensen Head,

I just asked our devs for an update on the status of this, and we will report back here as soon as there is any news!

Kind regards,
Stefan