PPRuNe Forums - View Single Post - PDF to TXT conversion
Old 27th Apr 2009, 21:41
ChristiaanJ
 
Join Date: Jan 2005
Location: France
Posts: 2,315
Originally Posted by Jhieminga
ChristiaanJ, it looks as if the PDF is based on graphics
That's exactly the problem.

Originally Posted by Jhieminga
...in that case you will not be able to do anything but use an OCR program.
I totally agree.
I just vaguely hoped there already was a free program about somewhere that would do OCR directly on a graphics-based PDF, even if less-than-perfect...
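
For what it's worth, a free route of that kind can be sketched in Python. This is only a rough sketch under assumptions that are not from this thread: it supposes the open-source Tesseract OCR engine and the Poppler utilities are installed, along with the pytesseract and pdf2image packages. The function names and file names are hypothetical, and the output will still contain the usual OCR "typos".

```python
from pathlib import Path


def join_pages(page_texts):
    """Join per-page OCR text into one string, marking page boundaries."""
    parts = []
    for number, text in enumerate(page_texts, start=1):
        parts.append(f"--- page {number} ---\n{text.strip()}\n")
    return "\n".join(parts)


def ocr_pdf_to_txt(pdf_path, txt_path, dpi=300, lang="eng"):
    """Render each PDF page to an image, OCR it, and write one .txt file."""
    # Third-party imports are kept local so join_pages works without them.
    # Assumed installed: pdf2image (needs Poppler) and pytesseract (needs Tesseract).
    from pdf2image import convert_from_path
    import pytesseract

    # All pages are rendered into memory at once; fine for a short document.
    pages = convert_from_path(str(pdf_path), dpi=dpi)  # list of PIL images
    texts = [pytesseract.image_to_string(page, lang=lang) for page in pages]
    Path(txt_path).write_text(join_pages(texts), encoding="utf-8")
    return len(pages)
```

Calling ocr_pdf_to_txt("scanned.pdf", "scanned.txt") would then produce a single text file with a marker line before each page. A higher dpi generally helps recognition on small print, at the cost of speed and memory.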

Originally Posted by Jhieminga
I have done this before with Omnipage and that works quite well but it is not a one-click fix (mind you, there is no one-click solution for this!).
My main question about Omnipage is really whether it will open the PDF and OCR it directly, or whether I will have to print everything, scan it all and then OCR it. If I can use Omnipage to open the PDF, click and click and click, and end up with at least a .TXT file at the other end, I'll buy it.

Originally Posted by Jhieminga
If you're still stuck with this next week I could try to find the time to run it through my Omnipage version.
If you would, I'd be most grateful.
As I said, a basic text version, even with all the OCR "typos", would be a real help.

CJ