<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: RDF dump vs. dereferencable URIs</title>
	<atom:link href="http://fgiasson.com/blog/index.php/2007/02/01/rdf_dump_vs_dereferencable_uris/feed/" rel="self" type="application/rss+xml" />
	<link>http://fgiasson.com/blog/index.php/2007/02/01/rdf_dump_vs_dereferencable_uris/</link>
	<description></description>
	<lastBuildDate>Thu, 09 Feb 2012 21:05:51 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Fred</title>
		<link>http://fgiasson.com/blog/index.php/2007/02/01/rdf_dump_vs_dereferencable_uris/comment-page-1/#comment-654</link>
		<dc:creator>Fred</dc:creator>
		<pubDate>Sun, 04 Feb 2007 17:20:03 +0000</pubDate>
		<guid isPermaLink="false">#comment-654</guid>
		<description>Hi Marc,&lt;br /&gt;
&lt;br /&gt;
Yeah, I read the story. As I said in my last blog post, I will add a repository of available RDF data dumps to PTSW, hoping it can prevent such situations in the future. However, you did the only thing you could do: ban the IP from crawling GeoNames. Unfortunately, this is the only thing that will really work (semweb or not ;) ).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Take care,&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Fred</description>
		<content:encoded><![CDATA[<p>Hi Marc,</p>
<p>Yeah, I read the story. As I said in my last blog post, I will add a repository of available RDF data dumps to PTSW, hoping it can prevent such situations in the future. However, you did the only thing you could do: ban the IP from crawling GeoNames. Unfortunately, this is the only thing that will really work (semweb or not <img src='http://fgiasson.com/blog/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' />  ).</p>
<p>
Take care,</p>
<p>
Fred</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: marc</title>
		<link>http://fgiasson.com/blog/index.php/2007/02/01/rdf_dump_vs_dereferencable_uris/comment-page-1/#comment-655</link>
		<dc:creator>marc</dc:creator>
		<pubDate>Sat, 03 Feb 2007 19:21:03 +0000</pubDate>
		<guid isPermaLink="false">#comment-655</guid>
		<description></description>
		<content:encoded><![CDATA[<p>Frédérick,</p>
<p>Not only is a dump of a huge dataset preferable from a crawler&#8217;s point of view, it also eases the strain on the data provider&#8217;s resources. Fetching a database with millions of documents row by row requires a lot of resources to create and deliver those documents. A semantic web crawler may thus have the effect of a denial-of-service attack. More about a recent episode of a semantic web crawler DDoS on my blog:</p>
<p><a href="http://geonames.wordpress.com/2007/02/03/friendly-fire-semantic-web-crawler-ddos/" rel="nofollow">http://geonames.wordpress.com/2007/02/03/friendly-fire-semantic-web-crawler-ddos/</a></p>
<p>Marc</p>
]]></content:encoded>
	</item>
</channel>
</rss>

