PPRuNe Forums - View Single Post - Extracting info for html page
Old 8th Dec 2009, 08:35
BOAC
Per Ardua ad Astraeus
 
Join Date: Mar 2000
Location: UK
Posts: 18,579
Extracting info for html page

I am supplied with a wide variety of inputs for news on a village website.

The first problem is extracting text from an MS Publisher file. The HTML-formatted text then goes into tables on a prepared page that uses my own style sheets. I cannot get the output of 'Save as html' into a form that can be inserted into the tables, and each page generates over 2000 NEW lines of CSS in true MS fashion. At the moment I am transcribing longhand into HTML - any tricks?
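
One possible trick, offered as a rough sketch rather than a tested recipe: run the 'Save as html' output through a small script that throws away the generated style block and all class/style attributes, leaving only plain tags that the site's own style sheet can format. The version below uses only the Python standard library. The names `KEEP`, `SKIP_CONTENT`, `Cleaner` and `clean_fragment` are made up for illustration, and the list of tags worth keeping is an assumption that would need adjusting to whatever Publisher actually emits.

```python
import re
from html import escape
from html.parser import HTMLParser

# Tags copied through to the output (assumed list; adjust to taste).
KEEP = {"p", "b", "i", "em", "strong", "br", "ul", "ol", "li", "a", "h1", "h2", "h3"}
# Elements whose whole contents are discarded, e.g. the generated CSS.
SKIP_CONTENT = {"style", "script", "head", "title"}


class Cleaner(HTMLParser):
    """Re-emit only whitelisted tags, with no class or style attributes."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []
        self.skip_depth = 0  # > 0 while inside a discarded element

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_CONTENT:
            self.skip_depth += 1
        elif self.skip_depth == 0 and tag in KEEP:
            # Keep href on links only; every other attribute is dropped.
            kept = "".join(
                f' {name}="{escape(value or "", quote=True)}"'
                for name, value in attrs
                if tag == "a" and name == "href"
            )
            self.out.append(f"<{tag}{kept}>")

    def handle_endtag(self, tag):
        if tag in SKIP_CONTENT:
            self.skip_depth = max(0, self.skip_depth - 1)
        elif self.skip_depth == 0 and tag in KEEP and tag != "br":
            self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if self.skip_depth == 0:
            # Collapse runs of whitespace and re-escape the text.
            self.out.append(escape(re.sub(r"\s+", " ", data), quote=False))


def clean_fragment(html_text):
    """Return html_text reduced to plain, attribute-free markup."""
    parser = Cleaner()
    parser.feed(html_text)
    parser.close()
    return "".join(parser.out).strip()
```

The cleaned fragment could then be dropped into a cell of the prepared page, for example with `"<td>" + clean_fragment(raw) + "</td>"`, where `raw` is the text of the saved file. Tags not in the whitelist (span, font, div and so on) are removed but their text is kept.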

The second is extracting an image from a PDF file. I have tried various programs (I have Acrobat), but the resulting colour is not true. I am resorting to JPG screenshots for the time being.
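
A possible alternative to screenshots, again only a sketch under stated assumptions: when a PDF stores a photo as a JPEG (the /DCTDecode filter), the bytes between `stream` and `endstream` are the original JPEG file, so they can be copied out without any re-encoding. The script below does a crude scan for such streams using only the Python standard library. It assumes the image really is stored as a plain JPEG and is not wrapped in another filter or packed inside a compressed object; `STREAM_RE`, `extract_jpegs` and `save_jpegs` are hypothetical names.

```python
import re

# Payload between the "stream" and "endstream" keywords of a PDF object.
STREAM_RE = re.compile(rb"stream\r?\n(.*?)\r?\nendstream", re.DOTALL)


def extract_jpegs(pdf_bytes):
    """Return the raw bytes of every stream that begins with the JPEG SOI marker."""
    return [
        match.group(1)
        for match in STREAM_RE.finditer(pdf_bytes)
        if match.group(1).startswith(b"\xff\xd8")
    ]


def save_jpegs(pdf_path, prefix="image"):
    """Write each embedded JPEG to prefix-1.jpg, prefix-2.jpg, ... and return the names."""
    with open(pdf_path, "rb") as handle:
        images = extract_jpegs(handle.read())
    names = []
    for number, payload in enumerate(images, start=1):
        name = f"{prefix}-{number}.jpg"
        with open(name, "wb") as out:
            out.write(payload)
        names.append(name)
    return names
```

Because the bytes are copied untouched, nothing is lost in conversion, but the colour may still look wrong in a browser if the embedded JPEG is CMYK or carries a colour profile the viewer ignores. Images stored with other filters would not be found by this scan at all.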