Grabbing text from PDFs

Grabbing text from PDFs

Post by Kai Rohrbache » Sat, 05 Aug 2000 04:00:00



Hi!

I tried Strucurise's free "Kleptomania 2.1" trial (www.structurise.com) to
grab text from a PDF with disabled text copy&paste feature.
(Kleptomania is a special OCR-program which "reverse engineers texts" from
bitmap pictures, i.e.: transforms pixels back into ASCII-chars)

Disabling anti-aliasing in Acrobat and then experimenting a bit showed that
the recognition rate depends mainly on the PDF's zoom factor (~128% gave
best results), but still the results are not good.

Does anyone know of a better tool?

br,
 Kai

 
 
 

Grabbing text from PDFs

Post by robertrutle.. » Sun, 06 Aug 2000 04:00:00




> Hi!

> I tried Strucurise's free "Kleptomania 2.1" trial

(www.structurise.com) to
Quote:> grab text from a PDF with disabled text copy&paste feature.

(edit)

If you want to extract text from a PDF file, try this, GSView utility:

http://www.cs.wisc.edu/~ghost/gsview/get34.html

You will also need to install Ghostscript with it (everything you need
is freeware).

It works very well to extract the text from PDF files.

Bob Rutledge

Sent via Deja.com http://www.deja.com/
Before you buy.