Discussion:
SGML parsing
Zubin Mithra
2010-12-04 06:40:10 UTC
Permalink
Hey everyone,

I was trying to port feedparser over to Python3.1 and I realized that
sgmllib had been deprecated from Python2.7 upward and subsequently removed.
Which alternatives would you suggest I use in Python3.1 to perform analogous
tasks?

Thanks,
zm
Éric Araujo
2010-12-04 14:59:13 UTC
Permalink
Hi Zubin,

I don’t know about 3.1, but in 3.2 the html.parser module has very
recently received improvements to make it more lax when needed, with
Beautiful Soup cited as example:
http://mail.python.org/pipermail/python-checkins/2010-December/100097.html

Apart from that, people say that lxml is the new Beautiful Soup.

Cheers

Loading...