Incorrect recognition of lines of text  SOLVED

Discussion for the End User use of OCR in PDF-XChange Editor and Viewer

Moderators: PDF-XChange Support, Daniel - PDF-XChange, Chris - PDF-XChange, Sean - PDF-XChange, Paul - PDF-XChange, Vasyl - PDF-XChange, Ivan - Tracker Software, Stefan - PDF-XChange

User avatar
Jensen Head
User
Posts: 862
Joined: Mon Sep 13, 2021 8:12 am

Incorrect recognition of lines of text

Post by Jensen Head »

Incorrect recognition of lines of text.gif
͏
1. Vacuklav rasterized page.pdf (7.21 MB) — https://drive.google.com/file/d/1RYRH9LBWWftrI9_QtqpCY8BSfBfU4EpY/view
2. Vacuklav rasterized page. OCRed.pdf (7.24 MB) — https://drive.google.com/file/d/1O5AZM5j2j2AxIEzDcMRy075_IDc-AIh3/view
͏
ABBYY FineReader PDF 15 recognized this document just as badly, but Adobe Acrobat Pro DC 2022.002.20212 x64-bit recognized it perfectly (usually, it recognizes documents worse than PDF-XChange).
You do not have the required permissions to view the files attached to this post.
Last edited by Jensen Head on Wed Dec 10, 2025 10:45 am, edited 2 times in total.
User avatar
Dimitar - PDF-XChange
Site Admin
Posts: 2666
Joined: Mon Jan 15, 2018 9:01 am

Re: Incorrect recognition of lines of text

Post by Dimitar - PDF-XChange »

Hi,

Could you please tell me which OCR tool have you used?

If it is possible please send me a screenshot of the OCR window itself.
User avatar
Jensen Head
User
Posts: 862
Joined: Mon Sep 13, 2021 8:12 am

Re: Incorrect recognition of lines of text

Post by Jensen Head »

2022-10-27_12-36-20.png
͏
The problem is with this document. Other documents both before and after it are recognized normally.
You do not have the required permissions to view the files attached to this post.
Last edited by Jensen Head on Wed Dec 10, 2025 10:45 am, edited 1 time in total.
User avatar
Dimitar - PDF-XChange
Site Admin
Posts: 2666
Joined: Mon Jan 15, 2018 9:01 am

Re: Incorrect recognition of lines of text

Post by Dimitar - PDF-XChange »

Hi,

Thank you for the provided information and files.

I tested your document in my PDF Editor but it seems that it behaves differently on my end:

2022-10-27_15-23-21.gif

May I ask you, what version is your PDF Editor, and is there any particular reason you are using Searchable Image mode for your OCR tool?


Regards.
You do not have the required permissions to view the files attached to this post.
User avatar
Jensen Head
User
Posts: 862
Joined: Mon Sep 13, 2021 8:12 am

Re: Incorrect recognition of lines of text  SOLVED

Post by Jensen Head »

К сожалению, у меня не сохранились образцы документов, и я закрою данную тему, как более неактуальную.
Dimitar - PDF-XChange wrote: Thu Oct 27, 2022 12:33 pmis there any particular reason you are using Searchable Image mode for your OCR tool?
I do this 99.9% of the time when I need to add recognized text to a PDF document. That is, I need text that was previously only found visually to be found by searching through files, and then selected and copied when the file is opened. But the document's appearance remains unchanged. This is exactly what Searchable Image is for, and it almost always does a great job.
User avatar
Sean - PDF-XChange
Site Admin
Posts: 769
Joined: Wed Sep 14, 2016 5:42 pm

Re: Incorrect recognition of lines of text

Post by Sean - PDF-XChange »

Hi Jensen,

Okay - thanks for the clarification, that makes sense. As I'm sure you know, you can achieve similar results with the other two OCR options, but if you want to retain the original page content then the option you're using is correct.

Kind regards,
Sean Godley
Technical Writer
PDF-XChange Co LTD
Sales: +1 (250) 324-1621
Fax: +1 (250) 324-1623