Extracting info for html page
Thread Starter
Per Ardua ad Astraeus
Joined: Mar 2000
Posts: 18,575
Likes: 4
From: UK
Extracting info for html page
I am supplied with a wide variety of inputs for news on a village website.
First problem is extracting text from a MS Pub file. Html formatted text then goes into tables on a prepared page using my style sheets. I cannot get 'Save as html' to allow insertion of the code into tables, and each page generates over 2000 NEW lines of css style in true MS fashion. At the moment I am transcribing longhand into html - any tricks?
Second one is extracting an image from a PDF file. I have tried various progs (I have Acrobat) but the resulting colour is not true. Resorting to jpg screensahots at this time.
First problem is extracting text from a MS Pub file. Html formatted text then goes into tables on a prepared page using my style sheets. I cannot get 'Save as html' to allow insertion of the code into tables, and each page generates over 2000 NEW lines of css style in true MS fashion. At the moment I am transcribing longhand into html - any tricks?
Second one is extracting an image from a PDF file. I have tried various progs (I have Acrobat) but the resulting colour is not true. Resorting to jpg screensahots at this time.
Joined: Jul 2003
Posts: 852
Likes: 3
From: Brum
If you have access to a Mac, FileJuicer will extract all text/jpegs etc. from almost any file...
FileJuicer
Nige
FileJuicer
Nige
Administrator
Joined: Mar 2001
Aviation Qualifications: PPL
Posts: 8,121
Likes: 686
From: Twickenham, home of rugby
I use Notepad a lot to strip out almost everything except the plain ASCII text - works great for copying text from web pages for re-processing, as you lose all the crap.
Dunno if that helps.
SD
Dunno if that helps.
SD




