Currently, when exporting to .txt and .docx, the entire text of the PDF document is exported, including content marked as artifacts. In .txt output, this data (header and footer values such as page numbers, book title, current chapter, author, web link to the source, etc.) is garbage that must be removed manually. When saving to .docx, the presence of these fragments on each page is also undesirable when using Layout Settings := Retain Flowing Text.
I suggest considering an option to ignore artifacts when exporting, either as a general application setting or as a checkbox in the export settings dialogs. Admittedly, this feature is missing from other PDF applications, but I hope that will not deter you in this particular case.
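To illustrate what "ignoring artifacts" could mean at the content-stream level: in tagged PDFs, artifacts are wrapped in marked-content sequences tagged /Artifact (opened with BMC or BDC, closed with EMC). The following is only a rough Python sketch of skipping such sequences while collecting text. It is not how PDF-XChange Editor works internally; the tokenizer is deliberately simplified, only the Tj operator is handled, and the function names are made up for this example.

```python
import re

# Simplified tokenizer (an assumption of this sketch): literal strings without
# nested parentheses, inline dictionaries without nesting, and everything else
# split on whitespace. Real content streams need a full PDF parser.
TOKEN = re.compile(r"\((?:[^()\\]|\\.)*\)|<<.*?>>|\S+", re.S)


def is_operand(tok: str) -> bool:
    """Names, strings, dictionaries and plain numbers count as operands."""
    if tok[0] in "/(<":
        return True
    return tok.lstrip("+-").replace(".", "", 1).isdigit()


def text_without_artifacts(content_stream: str) -> list[str]:
    """Collect strings shown with Tj, skipping /Artifact marked content."""
    shown = []
    open_sequences = []  # one flag per open sequence: True if tagged /Artifact
    operands = []
    for tok in TOKEN.findall(content_stream):
        if is_operand(tok):
            operands.append(tok)
            continue
        if tok in ("BMC", "BDC"):
            open_sequences.append(bool(operands) and operands[0] == "/Artifact")
        elif tok == "EMC":
            if open_sequences:
                open_sequences.pop()
        elif tok == "Tj":
            # Keep the text only if no enclosing sequence is an artifact.
            if operands and not any(open_sequences):
                shown.append(operands[-1][1:-1])  # strip the parentheses
        operands = []  # every operator consumes its operands
    return shown


# Hypothetical page content: a page-number artifact, body text, a title artifact.
sample = """
/Artifact <</Type /Pagination>> BDC
BT /F1 9 Tf (Page 12) Tj ET
EMC
/P <</MCID 0>> BDC
BT /F1 11 Tf (Body text.) Tj ET
EMC
/Artifact BMC
BT /F1 9 Tf (Book Title) Tj ET
EMC
"""
print(text_without_artifacts(sample))  # ['Body text.']
```

The flags are kept on a stack because marked-content sequences can nest, so text inside any enclosing /Artifact sequence is skipped.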
[suggestion] Add the ability to ignore artifacts when exporting
-
- User
- Posts: 635
- Joined: Mon Sep 13, 2021 8:12 am
-
- Site Admin
- Posts: 11385
- Joined: Wed Jan 03, 2018 6:52 pm
Re: [suggestion] Add the ability to ignore artifacts when exporting
Hello, Jensen Head
For now, the best option I can offer is to "crop" your pages before the export (you do not need to save those changes). It won't help with any items that are in the middle of the page, but it should work well for the outer edges.
Just enable the option to remove content outside the crop area, and define a size that keeps the main body intact but strips out the "edge" items. Then you can convert the cropped document to docx/txt/etc., and you should only have the central text to work with.
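For anyone who wants to see the idea behind the crop workaround in code form, here is a small Python sketch. It does not use PDF-XChange or any PDF library; the Fragment type, the coordinates, and the margin values are all hypothetical, and it only shows the principle of keeping text whose position falls inside a crop rectangle.

```python
from dataclasses import dataclass


@dataclass
class Fragment:
    """A piece of page text with its position in points (origin bottom-left)."""
    text: str
    x: float
    y: float


def keep_inside_crop(fragments, page_width, page_height,
                     top=0.0, bottom=0.0, left=0.0, right=0.0):
    """Return the fragments that lie inside the page minus the given margins."""
    x_min, x_max = left, page_width - right
    y_min, y_max = bottom, page_height - top
    return [f for f in fragments
            if x_min <= f.x <= x_max and y_min <= f.y <= y_max]


# Hypothetical A4 page (595 x 842 pt) with a running header and footer.
page = [
    Fragment("Book Title - Chapter 3", 72, 810),   # header near the top edge
    Fragment("First paragraph of the body.", 72, 700),
    Fragment("Second paragraph of the body.", 72, 400),
    Fragment("12", 297, 30),                       # page number in the footer
]

body = keep_inside_crop(page, 595, 842, top=50, bottom=50)
print([f.text for f in body])
```

As with the manual crop, anything sitting in the middle of the page (a watermark, for instance) would still come through, since only position is checked.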
I will pass this along for further consideration.
Kind regards,
Dan McIntyre - Support Technician
PDF-XChange Co. LTD
+++++++++++++++++++++++++++++++++++
Our website domain and email address have changed as of 26/10/2023.
https://www.pdf-xchange.com
Support@pdf-xchange.com