[suggestion] Add the ability to ignore artifacts when exporting

Forum for the PDF-XChange Editor - Free and Licensed Versions

Moderators: PDF-XChange Support, Daniel - PDF-XChange, Chris - PDF-XChange, Sean - PDF-XChange, Paul - PDF-XChange, Vasyl - PDF-XChange, Ivan - Tracker Software, Stefan - PDF-XChange

User avatar
Jensen Head
User
Posts: 635
Joined: Mon Sep 13, 2021 8:12 am

[suggestion] Add the ability to ignore artifacts when exporting

Post by Jensen Head »

Currently, when exporting to .txt and .docx, the entire text of the PDF document is exported, including that marked as artifacts. In the case of .txt, this data, for example, header and footer values (page numbers, book title, current chapter, author, web link to the source, etc.) are garbage that must be removed manually. In the case of saving to .docx, the presence of these fragments on each page is also undesirable when using Layout Settings := Retain Flowing Text.

I suggest considering the ability to ignore artifacts when exporting as a general application setting, or as a checkbox in the export settings dialogs. Of course, this is a feature that is missing from other PDF applications, but I hope this will not bother you in this particular case.
User avatar
Daniel - PDF-XChange
Site Admin
Posts: 11385
Joined: Wed Jan 03, 2018 6:52 pm

Re: [suggestion] Add the ability to ignore artifacts when exporting

Post by Daniel - PDF-XChange »

Hello, Jensen Head

For now, the best option I could offer is to "crop" your pages before the export (you do not need to save those changes). It wouldnt help with any items that are in the middle of the page, but should be good for the outer edges.

Just Enable the option to remove content outside the crop area, and define a size that keeps the main body intact, but quickly strips out the "edge" items. Then you can convert the cropped document to docx/txt/etc, and you should only have the central text to work with.

I will pass this along for further consideration.

Kind regards,
Dan McIntyre - Support Technician
PDF-XChange Co. LTD

+++++++++++++++++++++++++++++++++++
Our Web site domain and email address has changed as of 26/10/2023.
https://www.pdf-xchange.com
Support@pdf-xchange.com