[lnkForumImage]
TotalShareware - Download Free Software

Confronta i prezzi di migliaia di prodotti.
Asp Forum
 Home | Login | Register | Search 


 

Forums >

comp.lang.ruby

ANN: English 0.2.0

Trans

2/6/2008 11:11:00 PM

= English

http://english.rub...

English is a spin-off of Facets, collecting together all the English
language libraries together under a single unified project. The
English project intends to provided a general purpose set of English-
oriented text manipulation and natural language processing Ruby
libraries in one convenient package.

The library thus far consists of censor.rb, double_metaphone.rb,
dresner.rb, inflect.rb, jumble.rb, levenshtein.rb, metaphone.rb,
numerals.rb, patterns.rb, porter_stemming.rb, roman.rb, similarity.rb,
soundex.rb and textfilter.rb.

== Release Notes

This version incorporates some new libraries gathered from various
sources and cleans up the library as a whole. It should be a pretty
usable little library now.

== Usage

Please see the API documentation.

== Install

% gem install english

or download the .zip file and install manually:

% unzip english-x.y.z.zip
% cd english-x.y.z
% sudo task/setup

For Windows users the last line needs to be 'ruby task/setup'.

== Authors

* Trans
* Paul Battley
* Tim Fletcher
* Derek Lewis
* George Moschovitis
* Michael Neumann

== Changes

=== 0.2.0 // 2008-02-06

* Removed Language module. All libraries (except extensions) are now
in toplevel English module.
* Incorporated a couple libraries from the Text project and other
sources.

== Copying

English, Copyright (c) 2007 Tiger Ops

Some libraries within English are copyrighted to respective authors,
or are derivatives of such acknowledged work.

English is distributed under the terms of the Ruby License. Which is a
dual GPL license.


2 Answers

Michael Granger

4/13/2008 7:01:00 PM

0

On Feb 6, 2008, at 3:10 PM, Trans wrote:
> English is a spin-off of Facets, collecting together all the English
> language libraries together under a single unified project. The
> English project intends to provided a general purpose set of English-
> oriented text manipulation and natural language processing Ruby
> libraries in one convenient package.


Just out of curiosity, did you know about the Linguistics library
before you released this? "English" has some stuff the 'en' module of
Linguistics doesn't have (and vice-versa), but I'd have happily added
new functions to it if you had suggested it.

If you did know about it beforehand, was there something about it you
found objectionable? Just professional curiosity. I'd like to make
Linguistics usable for people other than myself, and if you felt
compelled to write something else to avoid having to use it, I'd take
that as a sign that it needs changing.

Thanks,
--
Michael Granger <ged@FaerieMUD.org>
Rubymage, Architect, Believer
The FaerieMUD Consortium <http://www.FaerieMU...


Trans

4/14/2008 1:07:00 AM

0



On Apr 13, 3:00 pm, Michael Granger <g...@FaerieMUD.org> wrote:
> On Feb 6, 2008, at 3:10 PM, Trans wrote:
>
> > English is a spin-off of Facets, collecting together all the English
> > language libraries together under a single unified project. The
> > English project intends to provided a general purpose set of English-
> > oriented text manipulation and natural language processing Ruby
> > libraries in one convenient package.
>
> Just out of curiosity, did you know about the Linguistics library
> before you released this? "English" has some stuff the 'en' module of
> Linguistics doesn't have (and vice-versa), but I'd have happily added
> new functions to it if you had suggested it.
>
> If you did know about it beforehand, was there something about it you
> found objectionable? Just professional curiosity. I'd like to make
> Linguistics usable for people other than myself, and if you felt
> compelled to write something else to avoid having to use it, I'd take
> that as a sign that it needs changing.

I had done a little searching and knew about Lingustics. My main usage
was #plural and #singular, and I already had code that used these. So
I didn't want to have to go back and add ".en" in all those cases. Of
course, I now know one can use the :proxy option to get around this,
but at the the time I did not know that. But beyond that, I suppose it
just a matter of design "levels", if you will. Lingustics is a much
bigger idea, not just for English but also ultimately having in mind
support for other languages. I think that's great, but my feeling was
I just wanted something simpler, focused on English and some commonly
useful language processing.

But I wouldn't mind talking to you more off list and see what we might
be able to do mutaully.

T.