[lnkForumImage]
TotalShareware - Download Free Software

Confronta i prezzi di migliaia di prodotti.
Asp Forum
 Home | Login | Register | Search 


 

Forums >

comp.lang.ruby

word boundaries for regular expressions

Adam Akhtar

8/26/2008 7:33:00 AM

Hi did a search for word boundaries but didnt quite find what i was
looking for.

If i have strings containing products and model numbers

e.g.
"JP-ATH Headphones JP"

and I want to remove the last JP but not the one in the modle number how
do i go about it,

i tried

string.gsub(/\bJP\b/, '')
but it removes both.
I guess the hypen in the model number doesnt count as a word letter so
it gets knocked off.

am i doing something wrong here?
--
Posted via http://www.ruby-....

4 Answers

Stefano Crocco

8/26/2008 7:38:00 AM

0

On Tuesday 26 August 2008, Adam Akhtar wrote:
> Hi did a search for word boundaries but didnt quite find what i was
> looking for.
>
> If i have strings containing products and model numbers
>
> e.g.
> "JP-ATH Headphones JP"
>
> and I want to remove the last JP but not the one in the modle number how
> do i go about it,
>
> i tried
>
> string.gsub(/\bJP\b/, '')
> but it removes both.
> I guess the hypen in the model number doesnt count as a word letter so
> it gets knocked off.
>
> am i doing something wrong here?

You can replace the first \b with \s, which only matches spaces:

string.gsub(/\sJP\b/, '')

I hope this helps

Stefano




Michael Morin

8/26/2008 9:33:00 AM

0

Adam Akhtar wrote:
> Hi did a search for word boundaries but didnt quite find what i was
> looking for.
>
> If i have strings containing products and model numbers
>
> e.g.
> "JP-ATH Headphones JP"
>
> and I want to remove the last JP but not the one in the modle number how
> do i go about it,
>
> i tried
>
> string.gsub(/\bJP\b/, '')
> but it removes both.
> I guess the hypen in the model number doesnt count as a word letter so
> it gets knocked off.
>
> am i doing something wrong here?

If the spaces are consistent, you can do something like this

"JP-ATH Headphones JP".split(/\s+/)[0..-2].join(" ")

or this if they're not.

"JP-ATH Headphones JP".sub(/\s+\w+$/,'')

The advantage of the top one is you can remove something out of the
middle of the string if necessary. The bottom one is probably faster
and generally makes more sense.

--
Michael Morin
Guide to Ruby
http://ruby....
Become an About.com Guide: beaguide.about.com
About.com is part of the New York Times Company

Robert Klemme

8/26/2008 4:23:00 PM

0

2008/8/26 Adam Akhtar <adamtemporary@gmail.com>:
> Hi did a search for word boundaries but didnt quite find what i was
> looking for.
>
> If i have strings containing products and model numbers
>
> e.g.
> "JP-ATH Headphones JP"
>
> and I want to remove the last JP but not the one in the modle number how
> do i go about it,
>
> i tried
>
> string.gsub(/\bJP\b/, '')
> but it removes both.
> I guess the hypen in the model number doesnt count as a word letter so
> it gets knocked off.

Yep, that sums it up pretty well.

> am i doing something wrong here?

Obviously, since your results do not match your expectations / requirements. :-)

You could use lookahead

irb(main):002:0> "JP-ATH Headphones JP".gsub /\bJP\b(?=\s|$)/, 'XXX'
=> "JP-ATH Headphones XXX"

It all depends on what other occurrences you have and which of them
you want to match.

Kind regards

robert

--
use.inject do |as, often| as.you_can - without end

Adam Akhtar

8/28/2008 5:07:00 AM

0

Thanks everyone. Yes the strings to match vary a lot in terms of
positiong. Some dont have model numbers, some do, some dont have jp some
do. Ive know of look ahead but never used it before. ill give that a
shot.

Thanks!

adam
--
Posted via http://www.ruby-....