Notices
Computer/Internet Issues & Troubleshooting Anyone with questions about the terribly complex world of computers or the internet should try here. NOT FOR REPORTING ISSUES WITH PPRuNe FORUMS! Please use the subforum "PPRuNe Problems or Queries."

An old memoir

Old 20th October 2008 | 19:46
  #1 (permalink)  
Thread Starter
 
Joined: Nov 2005
Posts: 758
Likes: 2
From: morayshire
An old memoir

Evnin' all.
I've got a book/let produced by an uncle of mine about his time as a private soldier in WW1. It was retyped in the late '70s and seems to be on A4 size pages but was clearly done on a manual typewriter in (I think) Courier 14pt. I volunteered to scan it into my PC and turn it into a Word document, but it has turned into a nightmare task. Although the paper seems to be A4 it never seems to be a complete match on the scanner. There are multiple typos that need correcting but they seem to be generated by the scanner, but not all of them! I now have some individual pages scanned in and edited but there are only 375 left to do! I'm not sure if I'll live long enough to see the end. Has anyone any procedural suggestions as to a way of speeding up the process? Spending large sums on a sexy new scanner isn't an option.

Is making it a Word document the best way? Alll the pages I've done so far are one side only. The book itself is moderately fragile and lots of the pages are loose (which does have the benefit of making them easier to get into the scanner). All suggestions will receive due consideration. TIA
The Ancient Mariner
Rossian is offline  
Reply
Old 20th October 2008 | 20:01
  #2 (permalink)  
 
Joined: Jul 1999
Posts: 272
Likes: 5
From: England
Does it have to be a word document?

If you just want a digital version then you could scan the pages and save them as images and then convert them to a pdf document using something like open office.
Don Coyote is offline  
Reply
Old 20th October 2008 | 20:07
  #3 (permalink)  
More bang for your buck
 
Joined: Nov 2005
Posts: 3,513
Likes: 1
From: land of the clanger
I think I would agree that a PDF format would be the easiest, and fully editable, but still a pain scanning wise. There are commercial document scanning services available but at what cost I've no idea.
green granite is offline  
Reply
Old 20th October 2008 | 22:05
  #4 (permalink)  
Psychophysiological entity
20 Anniversary
 
Joined: Jun 2001
Aviation Qualifications: ATPL
Posts: 3,383
Likes: 169
From: Walton on the Naze Essex.
There are multiple typos that need correcting but they seem to be generated by the scanner, but not all of them!
I'm confused. Are you saying that the software is decoding / encoding the scanned image into a text format? Text recognition in other words.

I guess you haven't got the original pages. It would be a shame if they were scrapped upon transcribing.

I have many of my father's letters to my mother from WWII, I intend to scan them as jpg's and archive on DVD. Rough edges, mud, tea-stains and all.

This is by the way, but I also have a charming little Birthday Book. Many, many ancestors in there. My mother was one of the last when she signed it at age 12 -- born 100 years ago.

The book is hard-back and of course very precious so I didn't want to stress the 'hinge', yet the pages lie flat enough to get a good focus to the spine.
Loose rivets is offline  
Reply
Old 20th October 2008 | 22:35
  #5 (permalink)  
Thread Starter
 
Joined: Nov 2005
Posts: 758
Likes: 2
From: morayshire
So far so good

Thanks for your contributions so far guys. I'll have to read up on how to create a pdf doc.

Loose. I think the typos come from some limitations in my scanner. The type on the pages isn't always very sharp edged and I think it "misreads" what's on the page. Away from that; the original was handwritten in the 1920s after the still very young man was demobbed in late 1919. His parents were gone and he had a sister who in non-PC terms was called "simple", who needed looking after. He trained as a primary school teacher and it wasn't until he was a bit more settled that he had time to put his thoughts on paper. In the '40s he was a headmaster and had access to a typewriter of the sit-up-and-beg type. He then transferred the originals into typed pages. In the '70s his son had them professionally typed again and printed a limited edition (5) for the family and for the Black Watch museum. It's a rivetting read as you follow this 16 year old from Dundee through the trenches in France, through captivity and the 'flu pandemic of 1918, to just walking away from his prison camp as he saw the war ending, back through the retreating Germans to the safety of his own troops. It's a story that I want to make available to MY family and this seems a good way to do it - if only I can crack it. Thanks again.
The Ancient Mariner
Rossian is offline  
Reply
Old 20th October 2008 | 22:44
  #6 (permalink)  
 
Joined: Nov 2002
Posts: 1,630
Likes: 0
From: 39N 77W
Friends gave me a set of "The Complete National Geographic Magazine on CD". It covers from the first issue in 1888 up to the end of the 1990s.

The NGS just scanned all the pages as JPG and let it go at that. The resolution is poor enough that the text is not too easy to read. About three or four CDs cover a decade of the magazine including all the adverts.

A page of text occupies about 56 kilobytes. A double-page color image takes about 150 kilobytes. I would recommend against such severe compression.

seacue
seacue is offline  
Reply
Old 21st October 2008 | 19:10
  #7 (permalink)  
A really irritating PPRuNer
25 Anniversary
Veteran: Air Force
 
Joined: Jun 2000
Aviation Qualifications: PPL
Posts: 905
Likes: 17
From: Just popping my head back up above the parapet
Rossian,
check your PMs

Brian
Brian Dixon is offline  
Reply
Old 22nd October 2008 | 07:29
  #8 (permalink)  
 
Joined: Feb 2000
Posts: 542
Likes: 0
From: asia
A couple of years ago i did quite a big job scanning a document and converting into word format.

I found that the software which did the conversion from scan to word (I think it was called textbridge) was very sensitive to some of the scanning parameters.

Experiment a bit. Don't scan in colour, make sure it is black/white, not greyscale). Adjest the contrast values to give the best distinction. Don't scan at too high a resolution (Sounds crazy, but try it)
stickyb is offline  
Reply
Old 22nd October 2008 | 12:05
  #9 (permalink)  
Gnome de PPRuNe
Community Builder
Community Influencer
20 Anniversary
 
Joined: Jan 2002
Aviation Qualifications: Spotter
Posts: 15,198
Likes: 1,201
From: Too close to Croydon for comfort
A useful general scanning tip: if you are scanning thinnish paper with text or images on both sides, place a sheet of black paper behind to stop the reverse side showing through during the scan.
treadigraph is offline  
Reply

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off


Thread Tools
Search this Thread

Contact Us - Archive - Advertising - Cookie Policy - Privacy Statement - Terms of Service

Copyright © 2026 MH Sub I, LLC dba Internet Brands. All rights reserved. Use of this site indicates your consent to the Terms of Use.