As an example, a one-page searchable/image-on-text PDF is attached. (The verbage might give the impression the pdf software was Paperport, but it actually was Omnipage16 Standard)
With XChange Viewer (v2.0Bld39.2 and v2.0Bld37.2), use of the Select Text tool copies text to Notepad with many unknown characters when all should be simple english characters, and Viewer doesn't have successful searches for the corresponding words.
This behavior doesN'T happen with PDF-XChange Tools nor Foxit Reader. I don't have Adobe installed to check.
- PDF-XChange Tools (v4.0.0.149): The "Convert PDF to .txt" tool, with text encoding set to unicode, results in all simple english characters.
- Foxit Reader (v2.3Bld3201): use of the Select Text tool does copy text to Notepad in all simple english characters and does have successful searches for the corresponding words. Note Foxit lists different font info: Helvetica, Type1, Encoding Ansi, Actual Font: Helvetica, Actual Font Type: Type1.
Hoping there's a fix because in the next few months I'll have many similar documents!
.
Searchable Text Has Many Unknown Chars; All Should Be Engl.
Moderators: PDF-XChange Support, Daniel - PDF-XChange, Chris - PDF-XChange, Sean - PDF-XChange, Paul - PDF-XChange, Vasyl - PDF-XChange, Ivan - Tracker Software, Stefan - PDF-XChange
-
Pro User
- User
- Posts: 4
- Joined: Thu Feb 08, 2007 4:39 pm
Searchable Text Has Many Unknown Chars; All Should Be Engl.
You do not have the required permissions to view the files attached to this post.
-
Ivan - Tracker Software
- Site Admin
- Posts: 3603
- Joined: Thu Jul 08, 2004 10:36 pm
Re: Searchable Text Has Many Unknown Chars; All Should Be Engl.
Problem reproduced and we are working on the fix.
This fix will be available into next build of the viewer.
This fix will be available into next build of the viewer.
PDF-XChange Co Ltd. (Project Director)
When attaching files to any message - please ensure they are archived and posted as a .ZIP, .RAR or .7z format - or they will not be posted - thanks.
When attaching files to any message - please ensure they are archived and posted as a .ZIP, .RAR or .7z format - or they will not be posted - thanks.
-
Pro User
- User
- Posts: 4
- Joined: Thu Feb 08, 2007 4:39 pm
Re: Searchable Text Has Many Unknown Chars; All Should Be Engl.
Thank you for the fast investigation. Looking forward to the next build!
-
Pro User
- User
- Posts: 4
- Joined: Thu Feb 08, 2007 4:39 pm
Re: Searchable Text Has Many Unknown Chars; All Should Be Engl.
When I look down the End-User Downloads webpage at the listing for PDF-XChange Viewer, how do I tell if a new build is available?
I thought the one listed earlier today was a new build, because under "Last Update" it said 2 Sept, instead of 31 Aug, which is what it said on Sunday. But after I installed it and checked the version, is seems to be the same one I installed on Sunday, i.e. v2.0 Bld 39.2 Aug29 08 22:34:16.
Thank You.
P.S. Do you have a rough idea when the next build will be available for download? Thanks.
I thought the one listed earlier today was a new build, because under "Last Update" it said 2 Sept, instead of 31 Aug, which is what it said on Sunday. But after I installed it and checked the version, is seems to be the same one I installed on Sunday, i.e. v2.0 Bld 39.2 Aug29 08 22:34:16.
Thank You.
P.S. Do you have a rough idea when the next build will be available for download? Thanks.
-
John - Tracker Supp
- Site Admin
- Posts: 5225
- Joined: Tue Jun 29, 2004 10:34 am
Re: Searchable Text Has Many Unknown Chars; All Should Be Engl.
Hi,
There was a change in the download provided - the Help file was updated and the PDF manual removed by default - so now you must specifically download the PDF manual if you require this in addition to the Standard format (CHM) help file - this is in an efort to reduce the download size now the Help file itself has doubled in size as we have expanded to include the newest features and functions.
But you point is well taken and we will try anmd make it easier to identify.
There was a change in the download provided - the Help file was updated and the PDF manual removed by default - so now you must specifically download the PDF manual if you require this in addition to the Standard format (CHM) help file - this is in an efort to reduce the download size now the Help file itself has doubled in size as we have expanded to include the newest features and functions.
But you point is well taken and we will try anmd make it easier to identify.
If posting files to this forum - you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded - thank you.
Best regards
Tracker Support
http://www.tracker-software.com
Best regards
Tracker Support
http://www.tracker-software.com
-
Podhorny
- User
- Posts: 88
- Joined: Tue Oct 09, 2007 8:03 am
Re: Searchable Text Has Many Unknown Chars; All Should Be Engl.
Remark to help file - as you told size is now doubled. If you want (this is not request) you can reduce it by using images with reduced size (75%) like in previous version.
Regards, Jiri
Regards, Jiri