Extracting text from images
Thread Starter
Joined: May 1999
Posts: 529
Likes: 0
From: TMI
I’m looking for any products that can extract text from images. Anyone know of any?
The images contain large amounts of text and I would like to be able to process the text.
The images contain large amounts of text and I would like to be able to process the text.
Fleet Manager

Joined: Apr 2001
Aviation Qualifications: ATPL
Posts: 7,447
Likes: 310
From: various places .....
Probably around 18 months ago I saw a TV documentary on encryption .. one technique reviewed was the use of compressed bitmaps to hide the text data in amongst the otherwise discarded bytes. Sorry I can't remember the contact details but one ought to be able to chase it down via the search engines ....

Joined: Dec 1998
Posts: 3,038
Likes: 52
From: .
Where are the images from? Is it something you are scanning? If so when you scan it rather than save as an image use OCR software for the text. If you have not scanned it, if the quality of the image is ok you can print it out first and then try scanning it to extract the text with OCR. <img src="smile.gif" border="0">
Cunning Artificer

Joined: Jun 2001
Posts: 3,125
Likes: 7
From: The spiritual home of DeHavilland
I use OmniPage Pro 9.0 by Caere. It came bundled with my Canon Scanner but you should be able to buy it seperately. You can put image files straight through, there's no need to print the file and then scan it. I've tried other OCR software before, and you couldn't import image files from elsewhere but OmniPage is the best I've tried yet. Very useful.
<a href="http://www.caere.com/products/omnipage/pro/" target="_blank">http://www.caere.com/products/omnipage/pro/</a>
Just checked the link and found they are on version 11 now...
**********************************. .Through difficulties to the cinema
[ 26 January 2002: Message edited by: Blacksheep ]</p>
<a href="http://www.caere.com/products/omnipage/pro/" target="_blank">http://www.caere.com/products/omnipage/pro/</a>
Just checked the link and found they are on version 11 now...
**********************************. .Through difficulties to the cinema
[ 26 January 2002: Message edited by: Blacksheep ]</p>
Joined: Dec 2001
Posts: 140
Likes: 0
From: STL
If you find stand-alone commercial OCR expensive (as may be the case if you don't have a continuing need for the application) you can try WOCAR first (assuming it is Windows software you want). It is freeware for noncommercial use. It is intended for scanned text and your files must be black and white, not greyscale. The file must be saved in TIFF format. Depending on your images, a basic graphics editor such as IrfanView (also freeware) might/should be able to do the necessary things to convert to files WOCAR can handle.
I just checked and the zip-archive that is available now seems to be the same one I downloaded about five years ago. I don't know what the lack of development means.
<a href="http://ccambien.free.fr/wocar/" target="_blank">http://ccambien.free.fr/wocar/</a>
I just checked and the zip-archive that is available now seems to be the same one I downloaded about five years ago. I don't know what the lack of development means.
<a href="http://ccambien.free.fr/wocar/" target="_blank">http://ccambien.free.fr/wocar/</a>




