Brad Wilson
6/27/2005 7:35:00 PM
If you're comfortable "cleaning it up", why not tidy it to XHTML then
use the XML parser? This is the approach I took recently when I needed
it.
On 6/27/05, mathew <meta@pobox.com> wrote:
> For invalid "tag soup" HTML, your best bet is probably to use
> html/htmltokenizer.