Wikiposts
Search

Notices
Computer/Internet Issues & Troubleshooting Anyone with questions about the terribly complex world of computers or the internet should try here. NOT FOR REPORTING ISSUES WITH PPRuNe FORUMS! Please use the subforum "PPRuNe Problems or Queries."

Extracting text from images

Thread Tools
 
Search this Thread
 
Old 26th January 2002 | 07:30
  #1 (permalink)  
Thread Starter
 
Joined: May 1999
Posts: 529
Likes: 0
From: TMI
Question Extracting text from images

I’m looking for any products that can extract text from images. Anyone know of any?

The images contain large amounts of text and I would like to be able to process the text.
LevelFive is offline  
Old 26th January 2002 | 09:20
  #2 (permalink)  
Fleet Manager
25 Anniversary
 
Joined: Apr 2001
Aviation Qualifications: ATPL
Posts: 7,447
Likes: 310
From: various places .....
Post

Probably around 18 months ago I saw a TV documentary on encryption .. one technique reviewed was the use of compressed bitmaps to hide the text data in amongst the otherwise discarded bytes. Sorry I can't remember the contact details but one ought to be able to chase it down via the search engines ....
john_tullamarine is offline  
Old 26th January 2002 | 11:24
  #3 (permalink)  
25 Anniversary
 
Joined: Dec 1998
Posts: 3,038
Likes: 52
From: .
Question

Where are the images from? Is it something you are scanning? If so when you scan it rather than save as an image use OCR software for the text. If you have not scanned it, if the quality of the image is ok you can print it out first and then try scanning it to extract the text with OCR. <img src="smile.gif" border="0">
spannersatcx is offline  
Old 26th January 2002 | 16:49
  #4 (permalink)  
Cunning Artificer
20 Anniversary
 
Joined: Jun 2001
Posts: 3,125
Likes: 7
From: The spiritual home of DeHavilland
Lightbulb

I use OmniPage Pro 9.0 by Caere. It came bundled with my Canon Scanner but you should be able to buy it seperately. You can put image files straight through, there's no need to print the file and then scan it. I've tried other OCR software before, and you couldn't import image files from elsewhere but OmniPage is the best I've tried yet. Very useful.

<a href="http://www.caere.com/products/omnipage/pro/" target="_blank">http://www.caere.com/products/omnipage/pro/</a>

Just checked the link and found they are on version 11 now...

**********************************. .Through difficulties to the cinema

[ 26 January 2002: Message edited by: Blacksheep ]</p>
Blacksheep is offline  
Old 26th January 2002 | 19:51
  #5 (permalink)  
 
Joined: Dec 2001
Posts: 140
Likes: 0
From: STL
Post

If you find stand-alone commercial OCR expensive (as may be the case if you don't have a continuing need for the application) you can try WOCAR first (assuming it is Windows software you want). It is freeware for noncommercial use. It is intended for scanned text and your files must be black and white, not greyscale. The file must be saved in TIFF format. Depending on your images, a basic graphics editor such as IrfanView (also freeware) might/should be able to do the necessary things to convert to files WOCAR can handle.

I just checked and the zip-archive that is available now seems to be the same one I downloaded about five years ago. I don't know what the lack of development means.

<a href="http://ccambien.free.fr/wocar/" target="_blank">http://ccambien.free.fr/wocar/</a>
bblank is offline  
Old 26th January 2002 | 20:20
  #6 (permalink)  
 
Joined: Feb 2000
Posts: 542
Likes: 0
From: asia
Post

There's also something called Textbridge which does a good job of going through images and converting text to word or excel format.. .I think u can use it either with a scanner or with an image file
stickyb is offline  
Old 28th January 2002 | 02:07
  #7 (permalink)  
Thread Starter
 
Joined: May 1999
Posts: 529
Likes: 0
From: TMI
Post

Thanks for the help. It’s very much appreciated.
LevelFive is offline  

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off



Contact Us - Archive - Advertising - Cookie Policy - Privacy Statement - Terms of Service

Copyright © 2026 MH Sub I, LLC dba Internet Brands. All rights reserved. Use of this site indicates your consent to the Terms of Use.