Odd character spacing after OCR

Discussion for the End User use of OCR in PDF-XChange Editor and Viewer

Moderators: Daniel - PDF-XChange, PDF-XChange Support, Vasyl - PDF-XChange, Chris - PDF-XChange, Sean - PDF-XChange, Paul - PDF-XChange, Ivan - Tracker Software, Stefan - PDF-XChange

Post Reply
cakeandcustard74
User
Posts: 6
Joined: Sun Jan 07, 2024 8:09 pm

Odd character spacing after OCR

Post by cakeandcustard74 »

Hi all,

There’s a strange thing that occurs to some words after using OCR, typically 2 to 4 lettered words, where the letters have unequal spacing between them. Is there any way to fix this?
Last edited by cakeandcustard74 on Wed Apr 03, 2024 5:42 pm, edited 1 time in total.
User avatar
Dimitar - PDF-XChange
Site Admin
Posts: 2191
Joined: Mon Jan 15, 2018 9:01 am

Re: Odd character spacing after OCR

Post by Dimitar - PDF-XChange »

Hello cakeandcustard74,

Welcome to our forum.

Could you please send us one of the files you are having this problem with as well as a screenshot of how the entire page looks on your end?
Also - please do let us know if you are using the Standard or Enhanced OCR, and which build of the Editor is currently installed on your end (You can check the version under Help -> About inside the Editor).

Regards.
cakeandcustard74
User
Posts: 6
Joined: Sun Jan 07, 2024 8:09 pm

Re: Odd character spacing after OCR

Post by cakeandcustard74 »

Hi Dimitar,

I’m using Enhanced OCR, and the build is 383, version 10.1.3. Here’s a PDF with just a couple of pages I used OCR on (I haven’t proofread and edited so there’s some typos), and here’s a screenshot of one of the pages. The majority of words are fine, except for short words such as ‘who’ and ‘the’, which have unequal spacing between the letters.
Thus spoke Zarathustra.pdf
(12.74 MiB) Downloaded 105 times
IMG_2628.png
User avatar
Paul - PDF-XChange
Site Admin
Posts: 7356
Joined: Wed Mar 25, 2009 10:37 pm
Contact:

Re: Odd character spacing after OCR

Post by Paul - PDF-XChange »

Hi, cakeandcustard74

the sample you provided already has OCR on it and the spacing already there. Do you have a "Pre-OCR" version we can look at?

Kind regards,
Paul - Tracker Supp
Best regards

Paul O'Rorke
PDF-XChange Support
http://www.pdf-xchange.com
cakeandcustard74
User
Posts: 6
Joined: Sun Jan 07, 2024 8:09 pm

Re: Odd character spacing after OCR

Post by cakeandcustard74 »

Hi Paul,

Here’s the original PDF with no OCR. In the original, the words I’ve underlined in the above screenshot are spaced normally, it’s just after OCR they get a bit weird. I’ve also tried using OCR on other pages, and the issue still occurs with short words, but like I stated in my other reply the majority of words are spaced fine.
Thus spoke Zarathustra original.pdf
(12.79 MiB) Downloaded 101 times
User avatar
Paul - PDF-XChange
Site Admin
Posts: 7356
Joined: Wed Mar 25, 2009 10:37 pm
Contact:

Re: Odd character spacing after OCR

Post by Paul - PDF-XChange »

Hi, cakeandcustard74

I see the same when I EOCR it here. I am reaching out to the OCR specialist to get his thoughts and will post here what we find.

Kind regards,
Paul - Tracker Supp
Best regards

Paul O'Rorke
PDF-XChange Support
http://www.pdf-xchange.com
cakeandcustard74
User
Posts: 6
Joined: Sun Jan 07, 2024 8:09 pm

Re: Odd character spacing after OCR

Post by cakeandcustard74 »

Hi Paul,

Ah, thank you - I look forward to seeing what you find.
User avatar
Paul - PDF-XChange
Site Admin
Posts: 7356
Joined: Wed Mar 25, 2009 10:37 pm
Contact:

Re: Odd character spacing after OCR

Post by Paul - PDF-XChange »

Hi, cakeandcustard74

the devs have to do some in depth investigation on this. It may take some time. I have raised a ticket around this so we can keep track of the progress. While for internal use only, should yo refer to RT#6740: OCR character spacing issue here any support staff member should be able to get you a status report.

I hope that helps.

Kind regards,
Paul - Tracker Supp
Best regards

Paul O'Rorke
PDF-XChange Support
http://www.pdf-xchange.com
cakeandcustard74
User
Posts: 6
Joined: Sun Jan 07, 2024 8:09 pm

Re: Odd character spacing after OCR

Post by cakeandcustard74 »

Hi Paul,

Thank you for raising the issue with the devs; I’ll wait a few days and ask about it then.
User avatar
Daniel - PDF-XChange
Site Admin
Posts: 10862
Joined: Wed Jan 03, 2018 6:52 pm

Odd character spacing after OCR

Post by Daniel - PDF-XChange »

:)
Dan McIntyre - Support Technician
PDF-XChange Co. LTD

+++++++++++++++++++++++++++++++++++
Our Web site domain and email address has changed as of 26/10/2023.
https://www.pdf-xchange.com
Support@pdf-xchange.com
Post Reply