[lnkForumImage]
TotalShareware - Download Free Software

Confronta i prezzi di migliaia di prodotti.
Asp Forum
 Home | Login | Register | Search 


 

Forums >

comp.lang.ruby

Multiple matching with ()*

Alessandro [AkiRoss] Re

7/31/2007 1:34:00 PM

Hi there!
I'm Alessandro from Italy and I started using ruby some days ago,
so... Hello, Community! :)

Well, I was trying to match a pattern multiple times. I tried both
with normal match() and scan(), but i can't get the desired result.

The subject string is something like:
"1a2bend" or "beg1a2b3c4dend"
more generally, it should match /^beg(\d\w)*end$/ : always a begin and
ending pattern, and a unspecified number of central pattern.
The problem is that the central pattern must be extracted for every
time it's encountered.
For example, trying with
"x1A2B3C4Dz".scan /^(x)(\d\w)*(z)$/
returns
[["x", "4D", "z"]]
while i need something like
[["x", "1A", "2B", "3C", "4D", "z"]]

Why does ()* match just the last one? How can i get all the ()* that it matches?

Probabily i'm doing something wrong, but can't understand where :
Thanks!
--
~Ale

17 Answers

Jano Svitok

7/31/2007 1:49:00 PM

0

On 7/31/07, Alessandro Re <akirosspower@gmail.com> wrote:
> Hi there!
> I'm Alessandro from Italy and I started using ruby some days ago,
> so... Hello, Community! :)
>
> Well, I was trying to match a pattern multiple times. I tried both
> with normal match() and scan(), but i can't get the desired result.
>
> The subject string is something like:
> "1a2bend" or "beg1a2b3c4dend"
> more generally, it should match /^beg(\d\w)*end$/ : always a begin and
> ending pattern, and a unspecified number of central pattern.
> The problem is that the central pattern must be extracted for every
> time it's encountered.
> For example, trying with
> "x1A2B3C4Dz".scan /^(x)(\d\w)*(z)$/
> returns
> [["x", "4D", "z"]]
> while i need something like
> [["x", "1A", "2B", "3C", "4D", "z"]]
>
> Why does ()* match just the last one? How can i get all the ()* that it matches?
>
> Probabily i'm doing something wrong, but can't understand where :
Try:

if "x1A2B3C4Dz" =~ /^(x)((?:\d\w)*)(z)$/

return [

Jano Svitok

7/31/2007 1:57:00 PM

0

On 7/31/07, Alessandro Re <akirosspower@gmail.com> wrote:
> Hi there!
> I'm Alessandro from Italy and I started using ruby some days ago,
> so... Hello, Community! :)
>
> Well, I was trying to match a pattern multiple times. I tried both
> with normal match() and scan(), but i can't get the desired result.
>
> The subject string is something like:
> "1a2bend" or "beg1a2b3c4dend"
> more generally, it should match /^beg(\d\w)*end$/ : always a begin and
> ending pattern, and a unspecified number of central pattern.
> The problem is that the central pattern must be extracted for every
> time it's encountered.
> For example, trying with
> "x1A2B3C4Dz".scan /^(x)(\d\w)*(z)$/
> returns
> [["x", "4D", "z"]]
> while i need something like
> [["x", "1A", "2B", "3C", "4D", "z"]]
>
> Why does ()* match just the last one? How can i get all the ()* that it matches?
>
> Probabily i'm doing something wrong, but can't understand where :
Try:

if "x1A2B3C4Dz" =~ /^(x)((?:\d\w)*)(z)$/
a, b = $1, $3 #
return [a] + $2.scan(/\d\w/).flatten + [b]
end

I don't know if it's possible to do it in one run though, maybe you
could use split as well...
Take care when doing nested searches as they will overwrite $1..9
(that's why I used a and b)

J.

Harry Kakueki

7/31/2007 2:01:00 PM

0

On 7/31/07, Alessandro Re <akirosspower@gmail.com> wrote:
> For example, trying with
> "x1A2B3C4Dz".scan /^(x)(\d\w)*(z)$/
> returns
> [["x", "4D", "z"]]
> while i need something like
> [["x", "1A", "2B", "3C", "4D", "z"]]
>
Hi,

Try this.

str = "x1A2B3C4Dz"
p str.scan(/\d?\w/) #>["x", "1A", "2B", "3C", "4D", "z"]

Harry

--
A Look into Japanese Ruby List in English
http://www.ka...

Alessandro [AkiRoss] Re

7/31/2007 2:09:00 PM

0

Thanks, but i need to match the pattern OR don't match anything.
"lol1a2vasd".scan(/\d?\w/) => ["l", "o", "l", "1a", "2v", "a", "s", "d"]
while i need to be sure that the pattern begins with a regex "x" and
ends with "z"

(of course, x 1 a 2 b 3 c should be regexes not just chars)

thanks, you help is apreciated :)

On 7/31/07, Harry Kakueki <list.push@gmail.com> wrote:
> On 7/31/07, Alessandro Re <akirosspower@gmail.com> wrote:
> > For example, trying with
> > "x1A2B3C4Dz".scan /^(x)(\d\w)*(z)$/
> > returns
> > [["x", "4D", "z"]]
> > while i need something like
> > [["x", "1A", "2B", "3C", "4D", "z"]]
> >
> Hi,
>
> Try this.
>
> str = "x1A2B3C4Dz"
> p str.scan(/\d?\w/) #>["x", "1A", "2B", "3C", "4D", "z"]
>
> Harry
>
> --
> A Look into Japanese Ruby List in English
> http://www.ka...
>
>


--
~Ale

Alessandro [AkiRoss] Re

7/31/2007 2:12:00 PM

0

Mh well, to me it seems a normal regex processing (i mean, it *should*
require only one instruction, since this pattern can be read with just
one regex, even if ruby doesn't allow it... but it would be really
bad).
Anyway well, splitting it there are different ways to do it - thanks
for your sudjestion.
But if ruby make it possible with one call, i'd prefer to use it.

Thanks!

On 7/31/07, Jano Svitok <jan.svitok@gmail.com> wrote:
> On 7/31/07, Alessandro Re <akirosspower@gmail.com> wrote:
> > Hi there!
> > I'm Alessandro from Italy and I started using ruby some days ago,
> > so... Hello, Community! :)
> >
> > Well, I was trying to match a pattern multiple times. I tried both
> > with normal match() and scan(), but i can't get the desired result.
> >
> > The subject string is something like:
> > "1a2bend" or "beg1a2b3c4dend"
> > more generally, it should match /^beg(\d\w)*end$/ : always a begin and
> > ending pattern, and a unspecified number of central pattern.
> > The problem is that the central pattern must be extracted for every
> > time it's encountered.
> > For example, trying with
> > "x1A2B3C4Dz".scan /^(x)(\d\w)*(z)$/
> > returns
> > [["x", "4D", "z"]]
> > while i need something like
> > [["x", "1A", "2B", "3C", "4D", "z"]]
> >
> > Why does ()* match just the last one? How can i get all the ()* that it matches?
> >
> > Probabily i'm doing something wrong, but can't understand where :>
> Try:
>
> if "x1A2B3C4Dz" =~ /^(x)((?:\d\w)*)(z)$/
> a, b = $1, $3 #
> return [a] + $2.scan(/\d\w/).flatten + [b]
> end
>
> I don't know if it's possible to do it in one run though, maybe you
> could use split as well...
> Take care when doing nested searches as they will overwrite $1..9
> (that's why I used a and b)
>
> J.
>
>


--
~Ale

Harry Kakueki

7/31/2007 2:33:00 PM

0

On 7/31/07, Alessandro Re <akirosspower@gmail.com> wrote:
> Thanks, but i need to match the pattern OR don't match anything.
> "lol1a2vasd".scan(/\d?\w/) => ["l", "o", "l", "1a", "2v", "a", "s", "d"]
> while i need to be sure that the pattern begins with a regex "x" and
> ends with "z"

str = "lol1a2vasd"
p str.scan(/\d\w|\w{3}/)

Harry

--
A Look into Japanese Ruby List in English
http://www.ka...

Robert Klemme

7/31/2007 2:56:00 PM

0

2007/7/31, Alessandro Re <akirosspower@gmail.com>:
> Mh well, to me it seems a normal regex processing (i mean, it *should*
> require only one instruction, since this pattern can be read with just
> one regex, even if ruby doesn't allow it... but it would be really
> bad).
> Anyway well, splitting it there are different ways to do it - thanks
> for your sudjestion.
> But if ruby make it possible with one call, i'd prefer to use it.

irb(main):006:0> s="x1A2B3C4Dz"
=> "x1A2B3C4Dz"
irb(main):007:0> s.scan /x(\d\w)*z/
=> [["4D"]]
irb(main):008:0> s.scan /x((?:\d\w)*?)z/
=> [["1A2B3C4D"]]
irb(main):009:0> s.scan(/x((?:\d\w)*?)z/).map {|a| a[0].scan(/\d\w/)}
=> [["1A", "2B", "3C", "4D"]]

Kind regards

robert

Alessandro [AkiRoss] Re

7/31/2007 3:19:00 PM

0

Thanks, this is an interesting solution!

On 7/31/07, Robert Klemme <shortcutter@googlemail.com> wrote:
> 2007/7/31, Alessandro Re <akirosspower@gmail.com>:
> > Mh well, to me it seems a normal regex processing (i mean, it *should*
> > require only one instruction, since this pattern can be read with just
> > one regex, even if ruby doesn't allow it... but it would be really
> > bad).
> > Anyway well, splitting it there are different ways to do it - thanks
> > for your sudjestion.
> > But if ruby make it possible with one call, i'd prefer to use it.
>
> irb(main):006:0> s="x1A2B3C4Dz"
> => "x1A2B3C4Dz"
> irb(main):007:0> s.scan /x(\d\w)*z/
> => [["4D"]]
> irb(main):008:0> s.scan /x((?:\d\w)*?)z/
> => [["1A2B3C4D"]]
> irb(main):009:0> s.scan(/x((?:\d\w)*?)z/).map {|a| a[0].scan(/\d\w/)}
> => [["1A", "2B", "3C", "4D"]]
>
> Kind regards
>
> robert
>
>


--
~Ale

botp

7/31/2007 3:23:00 PM

0

On 7/31/07, Alessandro Re <akirosspower@gmail.com> wrote:
> Mh well, to me it seems a normal regex processing (i mean, it *should*
> require only one instruction, since this pattern can be read with just
> one regex, even if ruby doesn't allow it... but it would be really bad).

seems like you have a pattern within a pattern.
it may be easy to unwrap outer pattern first, then work on the inner
pattern. something like,

irb(main):096:0> "lol1a2vasd".scan(/lol(.+)asd/).to_s.scan(/\d\w/)
=> ["1a", "2v"]
irb(main):097:0> "beg1a2vend".scan(/beg(.+)end/).to_s.scan(/\d\w/)
=> ["1a", "2v"]
irb(main):098:0> "beg1a2vendxbeg3c4dend".scan(/beg(.+)end/).to_s.scan(/\d\w/)
=> ["1a", "2v", "3c", "4d"]

is that ok?
kind regards -botp

Wolfgang Nádasi-donner

7/31/2007 9:41:00 PM

0

Alessandro Re wrote:
> For example, trying with
> "x1A2B3C4Dz".scan /^(x)(\d\w)*(z)$/
> returns
> [["x", "4D", "z"]]
> while i need something like
> [["x", "1A", "2B", "3C", "4D", "z"]]

Does this goes more into the direction you wanted:

irb(main):001:0> "x1A2B3C4Dz".scan
/(?:^(?:x)|\G)(\d\w)(?=(?:\d\w)*(?:z)$)/
=> [["1A"], ["2B"], ["3C"], ["4D"]]

???

Wolfgang Nádasi-Donner
--
Posted via http://www.ruby-....