> I have obtained a Postscript file (apparently generated with
> Microsoft Word) of which I can only read the first page using gv
> (an error - moveto - is generated on the next page).
First of all, be sure to use Ghostscript 7.0. I was using the old 5.x
and I find 7.0 much improved.
> a) How do I extract just the text from the Postscript file? How is
> the raw text in a Postscript file encoded?
There should be a ps2ascii utility included with ghostscript.
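
If you want to script it, here is a minimal sketch in Python that just
wraps the ps2ascii call. The function names and the file names in.ps
and out.txt are made up for illustration, and it assumes ps2ascii is
on your PATH and accepts an input file followed by an output file.

```python
import shutil
import subprocess


def ps2ascii_command(src, dst):
    # Build the argument list: ps2ascii <input.ps> <output.txt>
    return ["ps2ascii", src, dst]


def extract_text(src, dst):
    # Fail early if Ghostscript's ps2ascii is not installed.
    if shutil.which("ps2ascii") is None:
        raise FileNotFoundError("ps2ascii not found on PATH")
    # check=True raises CalledProcessError if the conversion fails,
    # which is likely on a page that also trips up gv.
    subprocess.run(ps2ascii_command(src, dst), check=True)


# Hypothetical usage:
# extract_text("in.ps", "out.txt")
```

Since your file already fails on the second page, expect the output to
stop wherever the interpreter hits the error.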
> b) Is it possible to fix a corrupted Postscript file (e.g. by
> extracting the usable portions to a new file)?
The fixps utility might help (I believe it ships with GNU a2ps rather
than with psutils).
I once had luck using file and dd to extract PostScript readable by
5.x from a newer PostScript file generated by some Adobe program. Good
old file told me something like 'x bytes of garbage at the beginning,
PostScript file from byte x+1 to y, TIFF image from byte y+1 to z',
and with dd I extracted only the x+1-to-y part.
Stefano - Hodie septimo Kalendas Iunias MMI est