<?xml version="1.0" encoding="ISO-8859-15"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Web scraping with Python</title>
	<atom:link href="http://www.akasig.org/2004/09/03/web-scraping-with-python/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.akasig.org/2004/09/03/web-scraping-with-python/</link>
	<description>Innover, servir, entreprendre.</description>
	<lastBuildDate>Thu, 02 Sep 2010 03:48:04 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Laura Ramos</title>
		<link>http://www.akasig.org/2004/09/03/web-scraping-with-python/comment-page-1/#comment-169770</link>
		<dc:creator>Laura Ramos</dc:creator>
		<pubDate>Thu, 02 Sep 2010 03:48:04 +0000</pubDate>
		<guid isPermaLink="false">http://sig.levillage.org/?p=563#comment-169770</guid>
		<description>The post will be interesting if the information is more relevant and precise at the same time. The short content will not grab the attention in the users.</description>
		<content:encoded><![CDATA[<p>The post will be interesting if the information is more relevant and precise at the same time. The short content will not grab the attention in the users.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sig</title>
		<link>http://www.akasig.org/2004/09/03/web-scraping-with-python/comment-page-1/#comment-151761</link>
		<dc:creator>Sig</dc:creator>
		<pubDate>Wed, 07 Oct 2009 11:10:45 +0000</pubDate>
		<guid isPermaLink="false">http://sig.levillage.org/?p=563#comment-151761</guid>
		<description>JohnMc, thanks for pointing to PyQuery and lxml. I was a bit concerned that lxml would not handle malformed HTML files. But it now can interface with BeautifulSoup and use its parsing abilities for malformed HTML files. Here is the lxml module for this :

&lt;a href=&quot;http://codespeak.net/lxml/elementsoup.html&quot; rel=&quot;nofollow&quot;&gt;http://codespeak.net/lxml/elementsoup.html&lt;/a&gt;</description>
		<content:encoded><![CDATA[<p>JohnMc, thanks for pointing to PyQuery and lxml. I was a bit concerned that lxml would not handle malformed HTML files. But it now can interface with BeautifulSoup and use its parsing abilities for malformed HTML files. Here is the lxml module for this :</p>
<p><a href="http://codespeak.net/lxml/elementsoup.html">http://codespeak.net/lxml/elementsoup.html</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JohnMc</title>
		<link>http://www.akasig.org/2004/09/03/web-scraping-with-python/comment-page-1/#comment-150663</link>
		<dc:creator>JohnMc</dc:creator>
		<pubDate>Mon, 07 Sep 2009 03:58:29 +0000</pubDate>
		<guid isPermaLink="false">http://sig.levillage.org/?p=563#comment-150663</guid>
		<description>Beautifulsoup is ok, but it does have lexical issues on certain constructs. Nor it is consistent.

Something I find much better is lxml and PyQuery. lxml is extremely fast. PyQuery&#039;s advantage is that if you know jQuery already you bypass the learning curve in using the tool.</description>
		<content:encoded><![CDATA[<p>Beautifulsoup is ok, but it does have lexical issues on certain constructs. Nor it is consistent.</p>
<p>Something I find much better is lxml and PyQuery. lxml is extremely fast. PyQuery&#8217;s advantage is that if you know jQuery already you bypass the learning curve in using the tool.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sig</title>
		<link>http://www.akasig.org/2004/09/03/web-scraping-with-python/comment-page-1/#comment-19467</link>
		<dc:creator>Sig</dc:creator>
		<pubDate>Mon, 14 Mar 2005 08:47:13 +0000</pubDate>
		<guid isPermaLink="false">http://sig.levillage.org/?p=563#comment-19467</guid>
		<description>Check also &lt;a href=&quot;http://sig.levillage.org/?p=599&quot; rel=&quot;nofollow&quot;&gt;Web scraping with Python (part II)&lt;/a&gt; where I present a python app that helps you scraping the web.</description>
		<content:encoded><![CDATA[<p>Check also <a href="http://sig.levillage.org/?p=599">Web scraping with Python (part II)</a> where I present a python app that helps you scraping the web.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: AkaSig &#187; Web scraping with python (part 1 : crawling)</title>
		<link>http://www.akasig.org/2004/09/03/web-scraping-with-python/comment-page-1/#comment-8134</link>
		<dc:creator>AkaSig &#187; Web scraping with python (part 1 : crawling)</dc:creator>
		<pubDate>Wed, 29 Dec 2004 15:41:32 +0000</pubDate>
		<guid isPermaLink="false">http://sig.levillage.org/?p=563#comment-8134</guid>
		<description>[...] wling design is based on the John L. Lee&#8217;s mechanize framework for python. There are &lt;a href=&quot;http://sig.levillage.org/index.php?p=563&quot;&gt;other tools available in Python&lt;/a&gt; (see also . And several other approaches are available when you want to [...]</description>
		<content:encoded><![CDATA[<p>[...] wling design is based on the John L. Lee&#8217;s mechanize framework for python. There are <a href="http://sig.levillage.org/index.php?p=563">other tools available in Python</a> (see also . And several other approaches are available when you want to [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>
