[lnkForumImage]
TotalShareware - Download Free Software

Confronta i prezzi di migliaia di prodotti.
Asp Forum
 Home | Login | Register | Search 


 

Forums >

comp.lang.ruby

Transliteration problems

Axel Etzold

11/8/2008 10:10:00 PM

Dear all,

I'd like to transliterate some accented text, just removing the accents.
The encoding should be iso-8859-1 (source and destination).

I am on Ubuntu 8.04, and I get the following:


require "rubygems"
require "iconv"

s="caffè"
p Iconv.iconv("ascii//translit","iso-8859-1",s)[0] => "caff?"

Is there any way to get "caffe" ?

I read in some earlier post to a Ruby list that there are different language locales to Iconv,
but I couldn't find how to specify any in Ruby Iconv :

http://groups.google.de/group/comp.lang.ruby/browse_thread/thread/40d823...

Thank you very much for your help!

Best regards,

Axel

--
GMX Download-Spiele: Preizsturz! Alle Puzzle-Spiele Deluxe über 60% billiger.
http://games.entertainment.gmx.net/de/entertainment/games/download/puzzle/...

2 Answers

Josef 'Jupp' Schugt

11/8/2008 11:05:00 PM

0

On Sat, 08 Nov 2008 23:10:12 +0100, Axel Etzold <AEtzold@gmx.de> wrote:

> p Iconv.iconv("ascii//translit","iso-8859-1",s)[0] => "caff?"

It seems as if either your system is broken or the file is not in ISO
8859-1 because that conversion *should* work and it does so on my Fedora 9:

Linux pen2.homeunix.net 2.6.26.6-79.fc9.x86_64 #1 SMP Fri Oct 17 14:20:33
EDT 2008 x86_64 x86_64 x86_64 GNU/Linux

$ iconv --version|head -1
iconv (GNU libc) 2.8

Are you sure that you do not get "caffA?" in place of "caff?"? That would
suggest that the file is encoded using utf8, not an encoding of the ISO
8859 familiy.

Josef 'Jupp' Schugt
--
Blog: http://penpen.gooda...
PGP key (id 6CC6574F): http://wwwkeys.d...
Jabber - http://www.j... - contact information on request

Henrik Nyh

11/9/2008 10:09:00 AM

0

See if http://groups.google.com/group/ruby-talk-google/browse_thread/thread/96c75d...
helps.

On Sun, Nov 9, 2008 at 12:05 AM, Josef 'Jupp' Schugt <jupp@gmx.de> wrote:
> On Sat, 08 Nov 2008 23:10:12 +0100, Axel Etzold <AEtzold@gmx.de> wrote:
>
>> p Iconv.iconv("ascii//translit","iso-8859-1",s)[0] => "caff?"
>
> It seems as if either your system is broken or the file is not in ISO 8859-1
> because that conversion *should* work and it does so on my Fedora 9:
>
> Linux pen2.homeunix.net 2.6.26.6-79.fc9.x86_64 #1 SMP Fri Oct 17 14:20:33
> EDT 2008 x86_64 x86_64 x86_64 GNU/Linux
>
> $ iconv --version|head -1
> iconv (GNU libc) 2.8
>
> Are you sure that you do not get "caffA?" in place of "caff?"? That would
> suggest that the file is encoded using utf8, not an encoding of the ISO 8859
> familiy.
>
> Josef 'Jupp' Schugt
> --
> Blog: http://penpen.gooda...
> PGP key (id 6CC6574F): http://wwwkeys.d...
> Jabber - http://www.j... - contact information on request
>
>