Hello,
I have many scanned documents in which the scanner software recognized the text incorrectly:
For documents fed in landscape format, the text is recognized but placed in the wrong position on the page and in a vertical orientation (see photo). This is caused by the scanner's firmware, and several hundred documents have been scanned in the meantime.
I found a solution: first clean the document (metadata is preserved and the old text layer is removed, see photo), then run the PDF-XChange text recognition over it.
All of these documents have the entry "Macro V3.6.1" in the metadata as producer and author.
Can I filter for documents with the entry "Macro V3.6.1" so that the cleaning and OCR run only on those?
Best regards
biofunc
Filter documents for additional metadata in xmp
Moderators: PDF-XChange Support, Daniel - PDF-XChange, Chris - PDF-XChange, Sean - PDF-XChange, Vasyl - PDF-XChange, Stefan - PDF-XChange
-
- User
- Posts: 13
- Joined: Tue Mar 03, 2020 6:35 am
Filter documents for additional metadata in xmp
-
- Site Admin
- Posts: 19887
- Joined: Mon Jan 12, 2009 8:07 am
Re: Filter documents for additional metadata in xmp
Hello biofunc,
I can see the "Producer" info in Windows Explorer (you might need to turn this column on). Would that help you narrow down the files you need to reprocess?
I cannot think of a way to do the "filtering" inside the Editor or Tools themselves.
Kind regards,
Stefan
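Editorial note: the Explorer column approach above can also be approximated with a small script. The following is only a minimal sketch in Python, not a PDF-XChange feature. It assumes the marker string is stored uncompressed and in plain ASCII somewhere in the file (for example in the Info dictionary or the XMP packet); files that keep their metadata in compressed object streams or in UTF-16 would be missed. The function name `find_pdfs_with_marker` is hypothetical.

```python
from pathlib import Path

# Producer/author string quoted in the first post.
MARKER = b"Macro V3.6.1"


def find_pdfs_with_marker(root, marker=MARKER):
    """Return sorted paths of PDFs under root whose raw bytes contain marker.

    Assumption: the marker is stored uncompressed and as plain ASCII.
    This is a byte search, not a real PDF parse.
    """
    matches = []
    # Note: the "*.pdf" pattern is case-sensitive on most non-Windows systems.
    for path in Path(root).rglob("*.pdf"):
        # Reading the whole file at once is fine for ordinary scans.
        if marker in path.read_bytes():
            matches.append(path)
    return sorted(matches)
```

Calling `find_pdfs_with_marker("C:/Scans")` (the folder name is just an example) would return the list of candidate files, which could then be copied into a separate folder and processed with the cleaning and OCR steps.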
-
- User
- Posts: 70
- Joined: Thu Nov 30, 2017 1:24 pm
Re: Filter documents for additional metadata in xmp
Hello biofunc,
As Stefan mentioned, this kind of filtering isn't possible with PDF-Tools right now. After talking with the Development Team, we think this feature can be added, so I'm hopeful it will be in the next release.
I will reply to you here if we implement this feature, or you can follow the release notes to check if it has been implemented.
Kind regards,
Vladimir
Vladimir Goshko
Software Developer
PDF-XChange Co. LTD
-
- User
- Posts: 13
- Joined: Tue Mar 03, 2020 6:35 am
Re: Filter documents for additional metadata in xmp
Hello,
The filter function is already very good and I can manage with the solution suggested by Stefan.
But if you could now expand the search/filtering of the metadata, that would be great.
Thank you.
biofunc
-
- Site Admin
- Posts: 11111
- Joined: Wed Jan 03, 2018 6:52 pm
Re: Filter documents for additional metadata in xmp
Hello, biofunc
I haven't heard whether this is making it into the upcoming release, but that release should be just around the corner, likely in the next week, possibly two if there are delays.
Keep an eye out for the release notes!
Kind regards,
Dan McIntyre - Support Technician
PDF-XChange Co. LTD
+++++++++++++++++++++++++++++++++++
Our website domain and email address have changed as of 26/10/2023.
https://www.pdf-xchange.com
Support@pdf-xchange.com
-
- User
- Posts: 589
- Joined: Mon Sep 13, 2021 8:12 am
Re: Filter documents for additional metadata in xmp
As far as I can see, PDF-Tools currently has two Actions that do something similar:
Filter Documents
☐ Select documents with non-empty content
☐ Select documents with bookmarks
☐ Select documents with comments and fields
☐ Select documents with text
    Search in: Document Info
    Proximity:
    ☐ Page Text
    ☐ Bookmarks
    ☐ Comments
    ☐ Form Fields
    ☐ External Links
    ☑ Document Info
and
Filter Files
☐ File name
☐ File size
☐ Creation date
☐ Modified date
Should this be understood as meaning that PDF-Tools does not yet implement filtering by metadata in a specific field?
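Editorial note: to illustrate the difference between searching all of Document Info and matching one specific field, here is a minimal sketch in Python. It is not how PDF-Tools works; it simply looks for a single named entry in the raw file bytes. It assumes the value is written as an uncompressed literal string without nested parentheses or escapes (`/Producer (...)`), or as a `pdf:Producer` element in an uncompressed XMP packet. The helper name `read_info_field` is hypothetical.

```python
import re


def read_info_field(data, field):
    """Return the first value of one metadata field in raw PDF bytes, or None.

    Assumption: the value is an uncompressed literal string without nested
    parentheses or backslash escapes, or an XMP element in the pdf: namespace.
    """
    name = re.escape(field.encode("ascii"))
    # Info dictionary entry, e.g. /Producer (Macro V3.6.1)
    literal = re.search(rb"/" + name + rb"\s*\(([^()\\]*)\)", data)
    if literal:
        return literal.group(1).decode("latin-1")
    # XMP entry, e.g. <pdf:Producer>Macro V3.6.1</pdf:Producer>
    # (only fields stored under the pdf: prefix are found this way)
    xmp = re.search(rb"<pdf:" + name + rb">([^<]*)</pdf:" + name + rb">", data)
    if xmp:
        return xmp.group(1).decode("utf-8", errors="replace")
    return None
```

With this, a file would be selected only if `read_info_field(data, "Producer") == "Macro V3.6.1"`, whereas a plain text search would also match the same string appearing in any other field.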
-
- Site Admin
- Posts: 19887
- Joined: Mon Jan 12, 2009 8:07 am
Re: Filter documents for additional metadata in xmp
Hello Jensen Head,
I just asked our devs for an update on the status of this, and we will report back here as soon as there is any news!
Kind regards,
Stefan