[lnkForumImage]
TotalShareware - Download Free Software

Confronta i prezzi di migliaia di prodotti.
Asp Forum
 Home | Login | Register | Search 


 

Forums >

comp.lang.ruby

Re: Text extraction from PDF files (non-European languages)...?

Hannes Wyss

11/21/2006 5:15:00 PM

Axel

On 11/21/06, Nuralanur@aol.com <Nuralanur@aol.com> wrote:
> is there a way of extracting text from a PDF, if the latter
> is in some non-European language, such as Arabic or
> Chinese?

rpdf2txt (1) _should_ work with Unicode PDF-Documents. If you run into
any problems let me know, I'm happy to tinker with the beast.

http://download.ywesee.com/rpdf2txt/rpdf2txt-1.0...
http://raa.ruby-lang.org/project...

hth

Hannes