Asp Forum - hpricot - parse html

K. R.

1/2/2008 9:13:00 PM

hi @all

I would like to parse HTML code and remove all tags that start with


How can I remove these tags with a regex? I used the gsub! method to
manipulate the string.

Thanks for helping...
--
Posted via http://www.ruby-....

3 Answers

Jim Clark

1/3/2008 3:38:00 AM

Try this...

C:\temp>irb
irb(main):001:0> mystring = "xxx yy  zz"
=> "xxx yy  zz"
irb(main):002:0> mystring.gsub(//,'')
=> "xxx yy zz"

Regards,
Jim

K. R. wrote:
> hi @all
>
> I would like to parse HTML code and remove all tags that start with
> 
>
> How can I remove these tags with a regex? I used the gsub! method to
> manipulate the string.
>
> Thanks for helping...
>
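
Since the question mentions gsub! while the irb session above uses gsub, a minimal sketch of the difference may help. The sample string and the <b> tag pattern are assumptions made up for illustration; the tag from the original question did not survive in this thread.

```ruby
# Hypothetical sample string; the <b> tags are only for illustration.
text = "xxx <b>yy</b> zz"

# gsub returns a new string and leaves the receiver untouched.
copy = text.gsub(/<\/?b>/, '')

# gsub! edits the receiver in place. It returns nil when nothing
# matched, so its return value should not be chained blindly.
changed = text.dup
first  = changed.gsub!(/<\/?b>/, '')   # returns the modified string
second = changed.gsub!(/<\/?b>/, '')   # nothing left to replace: nil

puts copy
puts changed
p second
```

If the result is going to be passed on or chained, gsub is usually the safer choice, because gsub! can hand back nil.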

Dingding Ye

1/3/2008 10:37:00 AM

[Note: parts of this message were removed to make it a legal post.]

You should also handle the \n and \r characters.

So I think the regex should be "".

On Jan 3, 2008 11:37 AM, Jim Clark <diegoslice@gmail.com> wrote:

> Try this...
>
> C:\temp>irb
> irb(main):001:0> mystring = "xxx yy  zz"
> => "xxx yy  zz"
> irb(main):002:0> mystring.gsub(//,'')
> => "xxx yy zz"
>
> Regards,
> Jim
>
> K. R. wrote:
> > hi @all
> >
> > I would like to parse HTML code and remove all tags that start with
> > 
> >
> > How can I remove these tags with a regex? I used the gsub! method to
> > manipulate the string.
> >
> > Thanks for helping...
> >
>
>
>

Daniel Brumbaugh Keeney

1/3/2008 5:52:00 PM

On Jan 3, 2008 4:37 AM, sishen <yedingding@gmail.com> wrote:
> You should also handle the \n and \r characters.
>
> So I think the regex should be "".

Don't forget about the multiline option. It's easy: just stick an 'm'
after the regexp, so that '.' also matches newline characters.

Daniel Brumbaugh Keeney
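
To illustrate the multiline option, here is a minimal sketch. The regexps posted in this thread were lost, so the pattern below is an assumption: an HTML comment that spans two lines stands in for whatever tag the original poster wanted to strip.

```ruby
# Hypothetical input: an HTML comment spanning two lines stands in
# for the tag from the original question, which was lost.
html = "xxx yy <!-- a comment\nthat spans lines --> zz"

# Without /m, '.' does not match "\n", so the comment is left in place.
without_m = html.gsub(/<!--.*?-->/, '')

# With /m, '.' also matches newlines, so the whole comment is removed.
with_m = html.gsub(/<!--.*?-->/m, '')

puts without_m
puts with_m
```

The non-greedy `.*?` is a separate choice in this sketch: it keeps the match from running on to the last `-->` in the string when there are several comments.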

comp.lang.ruby
