Dave Burt
3/3/2005 12:06:00 PM
"Bil Kleb" <Bil.Kleb@NASA.Gov> asked:
> However, they want "deep" search capabilities for this
> internal site, i.e., they want a text search of not only
> titles and page content, but of file attachments such as
> Word documents, PowerPoint, PDF, and so forth.
>
> What's the path toward this end?
You can use WIN32OLE to fire up and automate Word, PowerPoint, etc., grab
the text, and update your indexes (after each upload or periodically). WIN32OLE
scripts are strange animals; I find it easiest to half-write them in Microsoft's
VBA IDE, with its context help and Object Browser, and then port them, since
they translate quite easily into Ruby/win32ole.
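To give a flavour of what the Word half might look like, here is a rough
sketch, not tested against Soks. The helper names extract_word_text and
with_word are made up for illustration; the OLE calls (Documents.Open,
Content.Text, Close, Quit) come from Word's object model. It assumes a
Windows box with Word installed, and the file path is a placeholder.

```ruby
# Hypothetical helper: pull the plain text out of one Word document.
# `word` is Word's Application object (or anything that quacks like it).
def extract_word_text(path, word)
  doc = word.Documents.Open(path)
  begin
    doc.Content.Text      # whole document body as a single string
  ensure
    doc.Close(false)      # false = do not save changes
  end
end

# Hypothetical helper: start Word invisibly, yield it, and always quit.
def with_word
  require 'win32ole'      # Windows only, so load it lazily
  word = WIN32OLE.new('Word.Application')
  word.Visible = false
  yield word
ensure
  word.Quit if word
end

# On Windows with Word installed (placeholder path):
#   with_word { |word| puts extract_word_text('C:\docs\report.doc', word) }
```

Passing the application object in, rather than creating it inside
extract_word_text, means one Word instance can be reused across many
attachments, which should matter when reindexing periodically. An absolute
path is probably the safer choice, since Word resolves relative paths
against its own working directory.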
I'm sure there's a Ruby Way to read PDFs too, but I know nothing about it.
If you come up with the code to do this in Soks, please share it :)
Cheers,
Dave