<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>kantenwerk &#187; linked data</title>
	<atom:link href="http://kantenwerk.org/category/linked-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://kantenwerk.org</link>
	<description>Just another WordPress weblog</description>
	<lastBuildDate>Tue, 04 May 2010 15:15:34 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0-beta1</generator>
		<item>
		<title>Close, but a Cigar Nevertheless</title>
		<link>http://kantenwerk.org/2010/05/04/close-but-a-cigar-nevertheless/</link>
		<comments>http://kantenwerk.org/2010/05/04/close-but-a-cigar-nevertheless/#comments</comments>
		<pubDate>Tue, 04 May 2010 12:53:02 +0000</pubDate>
		<dc:creator>Knud</dc:creator>
				<category><![CDATA[conference]]></category>
		<category><![CDATA[deriblog]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[travel]]></category>
		<category><![CDATA[web science]]></category>

		<guid isPermaLink="false">http://kantenwerk.org/2010/05/04/close-but-a-cigar-nevertheless/</guid>
		<description><![CDATA[I just came back from this year&#8217;s Web Science Confernce in Raleigh, NC. The idea of the conference &#8211; as of Web Science in general &#8211; is to give a holistic, multi-disciplinary view on the Web, and while I&#8217;m still not sure if and exactly how this will work like in the end (there was [...]]]></description>
			<content:encoded><![CDATA[<p>I just came back from this year&#8217;s <a href="http://www.websci10.org/" alt="WebSci 2010">Web Science Confernce</a> in Raleigh, NC. The idea of the conference &#8211; as of Web Science in general &#8211; is to give a holistic, multi-disciplinary view on the Web, and while I&#8217;m still not sure if and exactly how this will work like in the end (there was a heated discussion between social and computer scientists in the closing panel), I found the event very interesting and a lot of fun. Of course, the best surprise came right at the end, when our paper on <a href="http://journal.webscience.org/302/" alt="Learning from Linked Open Data Usage: Patterns &amp; Metrics">Linked Data Usage</a> (I had reported on <a href="http://kantenwerk.org/2009/02/04/linked-data-access-analysis/" title="Linked Data Access Analysis">early stages of this</a> quite a while ago on this blog) was shortlisted as one of three papers for the best paper award! In the end we didn&#8217;t win (the prize went to the paper by Metaxas and Mustafaraj: <a href="http://journal.webscience.org/317/" alt="From Obscurity to Prominence in Minutes: Political Speech and Real-Time Search">From Obscurity to Prominence in Minutes: Political Speech and Real-Time Search</a>), but just to get the nomination was pretty awesome. I really didn&#8217;t expect this, considering that this paper had been in the pipeline for more that a year now, but never quite made it for any submission deadline, and was therefore delayed time and time again. This is great encouragement for continuing our work in this area!</p>
<div style="width:425px" id="__ss_3862202"><strong style="display:block;margin:12px 0 4px"><a href="http://www.slideshare.net/dunken69/linked-data-usagewebsci2010" title="Learning from Linked Open Data Usage">Learning from Linked Open Data Usage</a></strong><object id="__sse3862202" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=linkeddatausagewebsci2010-100426153950-phpapp02&#038;stripped_title=linked-data-usagewebsci2010" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed name="__sse3862202" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=linkeddatausagewebsci2010-100426153950-phpapp02&#038;stripped_title=linked-data-usagewebsci2010" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object>
<div style="padding:5px 0 12px">View more <a href="http://www.slideshare.net/">presentations</a> from <a href="http://www.slideshare.net/dunken69">Knud Möller</a>.</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://kantenwerk.org/2010/05/04/close-but-a-cigar-nevertheless/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Semantic User Agents</title>
		<link>http://kantenwerk.org/2009/10/08/semantic-user-agents/</link>
		<comments>http://kantenwerk.org/2009/10/08/semantic-user-agents/#comments</comments>
		<pubDate>Thu, 08 Oct 2009 17:10:34 +0000</pubDate>
		<dc:creator>Knud</dc:creator>
				<category><![CDATA[deriblog]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[semanticweb]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[linked]]></category>
		<category><![CDATA[semweb]]></category>

		<guid isPermaLink="false">http://kantenwerk.org/2009/10/08/semantic-user-agents/</guid>
		<description><![CDATA[I&#8217;m still very much interested in the topic of analysing usage of linked data sites. To that end, an interesting question to ask is what kinds of agents access a linked data site. And here, apart from the usual categorisation into bots, browsers and such, it makes sense to differentiate between semantic and non-semantic agents. [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m still very much interested in the topic of <a href="http://kantenwerk.org/2009/02/04/linked-data-access-analysis/">analysing usage of linked data sites</a>. To that end, an interesting question to ask is what kinds of agents access a linked data site. And here, apart from the usual categorisation into bots, browsers and such, it makes sense to differentiate between <em>semantic</em> and <em>non-semantic</em> agents. Very loosely, we could say that</p>
<blockquote><p>
  Semantic agents are agents which are aware of RDF data and actively request it.
</p></blockquote>
<p>To know whether or not an agent requests RDF, we could look at the header of an individual HTTP request and check if the agent had specified <code>Accept: application/rdf+xml</code>. However, the Apache server log files unfortunately don&#8217;t tell us anything about the request header. Luckily though, there is an indirect way of finding out about this. If our linked data site uses <a href="http://www.w3.org/TR/cooluris/" title="Cool URIs for the Semantic Web">best practice content negotiation and 303 redirects</a>, we can look at pairs of requests in the log files. E.g., the <a href="http://data.semanticweb.org/" title="Semantic Web Dog Food">Semantic Web Dog Food</a> site uses a particular URI pattern for resources and their HTML and RDF representations. E.g.:</p>
<pre>

http://data.semanticweb.org/organization/deri-nui-galway

http://data.semanticweb.org/organization/deri-nui-galway/html

http://data.semanticweb.org/organization/deri-nui-galway/rdf
</pre>
<p>If the plain URI is requested, the server will either redirect to the HTML or the RDF representation, based on what was specified by the agent. Therefore, if we find a request for a plain URI and a request for the corresponding RDF URI, from the same IP address and the same agent, within a short time frame (e.g. 5 seconds), then we can infer that the agent had requested <code>application/rdf+xml</code> and can therefore be classified as a <em>semantic agent</em>.</p>
<pre>
90.21.243.141 - - [06/Oct/2008:16:07:58 +0100] "GET /organization/vrije-universiteit-amsterdam-the-netherlands HTTP/1.1" 303 7592 "-" "rdflib-2.4.0 (http://rdflib.net/; eikeon@eikeon.com)"
90.21.243.141 - - [06/Oct/2008:16:08:02 +0100] "GET /organization/vrije-universiteit-amsterdam-the-netherlands/rdf HTTP/1.1" 200 45358 "-" "rdflib-2.4.0 (http://rdflib.net/; eikeon@eikeon.com)"
</pre>
<p>The example above shows this: the &#8220;rdflib.net&#8221; agent requested the plain URI <code>.../organization/vrije-universiteit-amsterdam-the-netherlands</code> and was 303 redirected to <code>.../organization/vrije-universiteit-amsterdam-the-netherlands/rdf</code> a few seconds later. From this we can automatically infer that &#8220;rdflib.net&#8221; is a semantic agent.</p>
<p>A list of 423 semantic agents found in this way for the dog food site from 10/2008-10/2009 is <a href="http://kantenwerk.org/downloads/semantic_agents.txt" title="List of Semantic Agents">here</a>. Looking at the list, we can find a lot of agents that are clearly &#8220;semantic&#8221;, such as the &#8220;SindiceFetcher&#8221; or a SIOC browser. However, most of them are actually not what I would normally consider &#8220;semantic&#8221;, such as hordes of &#8220;Mozilla&#8221;-branded agents or dodgy looking bots. More research is awaiting&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://kantenwerk.org/2009/10/08/semantic-user-agents/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Growth of the Web of Linked Data</title>
		<link>http://kantenwerk.org/2009/09/04/growth-of-the-web-of-linked-data/</link>
		<comments>http://kantenwerk.org/2009/09/04/growth-of-the-web-of-linked-data/#comments</comments>
		<pubDate>Thu, 03 Sep 2009 23:20:24 +0000</pubDate>
		<dc:creator>Knud</dc:creator>
				<category><![CDATA[deriblog]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[semanticweb]]></category>
		<category><![CDATA[semweb]]></category>

		<guid isPermaLink="false">http://kantenwerk.org/2009/09/04/growth-of-the-web-of-linked-data/</guid>
		<description><![CDATA[Bob DuCharme points out nicely how much the Web of Linked Data has grown in the past year by comparing to versions of Richard Cyganiak&#8217;s LOD cloud diagram. It looks pretty impressive when you compare the two versions side by side!]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.snee.com/bobdc.blog/2009/09/growth-of-the-linked-data-clou.html" title="Bob DuCharme's weblog">Bob DuCharme points out</a> nicely how much the Web of Linked Data has grown in the past year by comparing to versions of <a href="http://richard.cyganiak.de/2007/10/lod/">Richard Cyganiak&#8217;s LOD cloud diagram</a>. It looks pretty impressive when you compare the two versions side by side!</p>
]]></content:encoded>
			<wfw:commentRss>http://kantenwerk.org/2009/09/04/growth-of-the-web-of-linked-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Linked Data for WWW2009 Online</title>
		<link>http://kantenwerk.org/2009/04/16/linked-data-for-www2009-online/</link>
		<comments>http://kantenwerk.org/2009/04/16/linked-data-for-www2009-online/#comments</comments>
		<pubDate>Thu, 16 Apr 2009 17:03:58 +0000</pubDate>
		<dc:creator>Knud</dc:creator>
				<category><![CDATA[conference]]></category>
		<category><![CDATA[deriblog]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[semanticweb]]></category>
		<category><![CDATA[semweb]]></category>

		<guid isPermaLink="false">http://kantenwerk.org/2009/04/16/linked-data-for-www2009-online/</guid>
		<description><![CDATA[I don&#8217;t announce every new addition to the Semantic Web Dog Food Server, but this is a big one: based on the data available from EPrints, we managed to get information about papers and authors for the upcoming WWW2009 in Madrid up as linked data on the dog food server. You can get all the [...]]]></description>
			<content:encoded><![CDATA[<p>I don&#8217;t announce every new addition to the <a href="http://data.semanticweb.org/" title="Semantic Web Dog Food">Semantic Web Dog Food Server</a>, but this is a big one: based on the data available from <a href="http://www2009.eprints.org/">EPrints</a>, we managed to get information about papers and authors for the upcoming <a href="http://www2009.org/">WWW2009</a> in Madrid up as linked data on the dog food server. You can get all the papers, authors and their affiliations, all nicely integrated with the rest of the dog food data from other conferences. You can start start browsing <a href="http://data.semanticweb.org/conference/www/2009" title="Linked data for WWW2009">here</a> or get a <a href="http://data.semanticweb.org/conference/www/2009/complete" title="RDF dump for WWW2009">dump of the data</a>. Enjoy!</p>
]]></content:encoded>
			<wfw:commentRss>http://kantenwerk.org/2009/04/16/linked-data-for-www2009-online/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>&#8220;There is probably no Semantic Web &#8230;</title>
		<link>http://kantenwerk.org/2009/03/16/there-is-probably-no-semantic-web/</link>
		<comments>http://kantenwerk.org/2009/03/16/there-is-probably-no-semantic-web/#comments</comments>
		<pubDate>Mon, 16 Mar 2009 17:06:05 +0000</pubDate>
		<dc:creator>Knud</dc:creator>
				<category><![CDATA[linked data]]></category>
		<category><![CDATA[semanticweb]]></category>
		<category><![CDATA[semweb]]></category>

		<guid isPermaLink="false">http://kantenwerk.org/2009/03/16/there-is-probably-no-semantic-web/</guid>
		<description><![CDATA[&#8230; now stop infering and get lodding!&#8221; A great little (a great little?!) photoshop tribute to the Atheist Bus Campaign in London and elsewhere (now also in Germany). I don&#8217;t know exactly where this picture appeared originally &#8211; a friend of a friend saw it on Twitter somewhere, and I don&#8217;t use Twitter. Anyway, I [...]]]></description>
			<content:encoded><![CDATA[<p><em>&#8230; now stop infering and get lodding!&#8221;</em> A great little (a great little?!) photoshop tribute to the <a href="http://www.atheistbus.org.uk/">Atheist Bus Campaign</a> in London and elsewhere (now <a href="http://www.buskampagne.de/">also in Germany</a>). I don&#8217;t know exactly where this picture appeared originally &#8211; a friend of a friend saw it on Twitter somewhere, and I don&#8217;t use Twitter. Anyway, I love it! I also love the fact that we now have a new verb. I wonder how it is inflected? It&#8217;s probably regular, so it should look like this:</p>
<blockquote><p>
  <em>to lod (verb): lod, lodded, lodding &#8211; the act of publishing linked open data on the World Wide Web, adhering to the <a href="http://www.w3.org/DesignIssues/LinkedData.html" title="Linked Data Rules">rules of linked data</a>.</em>
</p></blockquote>
<p><a href="http://www.whatreallypissesmeoff.com/hugh/lod-bus2.jpg"><img src="http://www.whatreallypissesmeoff.com/hugh/lod-bus2.jpg" class="displayed"  alt="There is probably no Semantic Web - LOD Bus"/></a></p>
]]></content:encoded>
			<wfw:commentRss>http://kantenwerk.org/2009/03/16/there-is-probably-no-semantic-web/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Tim Berners-Lee on Linked Data at TED</title>
		<link>http://kantenwerk.org/2009/03/16/tim-berners-lee-on-linked-data-at-ted/</link>
		<comments>http://kantenwerk.org/2009/03/16/tim-berners-lee-on-linked-data-at-ted/#comments</comments>
		<pubDate>Sun, 15 Mar 2009 23:20:14 +0000</pubDate>
		<dc:creator>Knud</dc:creator>
				<category><![CDATA[linked data]]></category>
		<category><![CDATA[semanticweb]]></category>
		<category><![CDATA[semweb]]></category>

		<guid isPermaLink="false">http://kantenwerk.org/2009/03/16/tim-berners-lee-on-linked-data-at-ted/</guid>
		<description><![CDATA[Tim Berners-Lee1 gave an enthusiastic talk about linked data at TED, urging everybody to get their data out there or, if they don&#8217;t have any, to demand access to data in a proper format. Interestingly, he didn&#8217;t mention the words &#8220;Semantic Web&#8221; once during the talk, nor did he ever say &#8220;RDF&#8221; or even &#8220;URI&#8221; [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.w3.org/People/Berners-Lee/card#i">Tim Berners-Lee</a><sup>1</sup> gave an enthusiastic talk about <a href="http://linkeddata.org/">linked data</a> at <a href="http://conferences.ted.com/TED2009/">TED</a>, urging everybody to get their data out there or, if they don&#8217;t have any, to demand access to data in a proper format.</p>
<p>Interestingly, he didn&#8217;t mention the words <em>&#8220;Semantic Web&#8221;</em> once during the talk, nor did he ever say <em>&#8220;RDF&#8221;</em> or even <em>&#8220;URI&#8221;</em> &#8211; instead he spoke about <em>&#8220;names starting with &#8216;http&#8217;&#8221;</em>. Cool enough, his slides had the <a href="http://data.semanticweb.org/">dog food data set</a> in them! :)</p>
<p>A video of the talk and a link to the slides can be found on the <a href="http://ebiquity.umbc.edu/blogger/2009/03/14/video-from-tim-berners-lee-2009-ted-talk-on-linked-data/">ebiquity blog</a>.</p>
<p><a href="http://www.flickr.com/photos/dunken69/3357435585/" title="LOD Cloud with dogfood by Dunkoman, on Flickr"><img src="http://farm4.static.flickr.com/3585/3357435585_1b84b2f095_o.png" width="363" height="273" alt="LOD Cloud with dogfood" class="displayed"/></a></p>
<p><sup>1</sup>I wish this link would lead me to something nice when I go to it with a Web browser!</p>
]]></content:encoded>
			<wfw:commentRss>http://kantenwerk.org/2009/03/16/tim-berners-lee-on-linked-data-at-ted/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>RDF for all of O&#8217;Reilly&#8217;s titles (with OPMI)</title>
		<link>http://kantenwerk.org/2009/03/14/rdf-for-all-of-oreillys-titles-with-opmi/</link>
		<comments>http://kantenwerk.org/2009/03/14/rdf-for-all-of-oreillys-titles-with-opmi/#comments</comments>
		<pubDate>Sat, 14 Mar 2009 15:23:39 +0000</pubDate>
		<dc:creator>Knud</dc:creator>
				<category><![CDATA[linked data]]></category>
		<category><![CDATA[semanticweb]]></category>
		<category><![CDATA[semweb]]></category>

		<guid isPermaLink="false">http://kantenwerk.org/2009/03/14/rdf-for-all-of-oreillys-titles-with-opmi/</guid>
		<description><![CDATA[I might be a bit late (one month) to discover this, but IT book publisher O&#8217;Reilly have recently started a service called O&#8217;Reilly Product Metadata Interface (OPMI), which provides RDF metadata for their whole catalogue of books. More details about this can be found on the O&#8217;Reilly Labs page. I think it&#8217;s great news that [...]]]></description>
			<content:encoded><![CDATA[<p>I might be a bit late (one month) to discover this, but IT book publisher O&#8217;Reilly have recently started a service called <a href="http://opmi.labs.oreilly.com/" title="OPMI">O&#8217;Reilly Product Metadata Interface (OPMI)</a>, which provides RDF metadata for their whole catalogue of books. More details about this can be found on the <a href="http://labs.oreilly.com/opmi.html">O&#8217;Reilly Labs</a> page.</p>
<p>I think it&#8217;s great news that a major publisher starts to open up their data to the Semantic Web! Term-wise, they do the right thing and use vocabularies that have turned into de-facto standards (<a href="http://xmlns.com/foaf/spec/" title="FOAF specification">FOAF</a> and <a href="http://dublincore.org/documents/dcmi-terms/" title="DC Terms specification">DC (terms)</a> in particular), as well as some newly coined terms in their own <a href="http://purl.oreilly.com/ns/meta/">O&#8217;Reilly namespace</a>. They also get brownie points for actually making their namespace dereferencable. Good practice!</p>
<p>There are a few things that could be improved to make their data more useful, though:</p>
<ul>
<li>They use non-http URIs like this: <code>urn:x-domain:oreilly.com:agent:pdb:1210</code>. That&#8217;s perfectly fine RDF, but it breaks the <a href="http://www.w3.org/DesignIssues/LinkedData.html" title="Linked Data Rules">linked data rules</a> &#8211; URIs like that are not dereferencable, which means it is impossible for interested agents to find out more about those resources.</li>
<li>Both the book URIs and the ontology namespace URI lead only to RDF. It would be nice if, upon a request for HTML, their servers would provide something human-readable as well. They acknowledge this problem themselves, so hopefully it will be addressed soon. Content negotiation to the rescue? For their vocabulary, these <a href="http://www.w3.org/TR/swbp-vocab-pub/" title="Best-practice recipes for publishing RDFS vocabularies">vocabulary publishing recipes</a> might help (in combination with a tool like <a href="http://kantenwerk.org/vocdoc/" title="VocDoc vocabulary documentation generator">VocDoc</a>).</li>
<li>The ontology source looks a bit messy, with weird namespace declarations like <code>xmlns:p3="http://purl.org/dc/terms/#"</code>. These might be artifacts from the ontology editor they used, though. Not really harmful, just ugly.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://kantenwerk.org/2009/03/14/rdf-for-all-of-oreillys-titles-with-opmi/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Linked Data Access Analysis</title>
		<link>http://kantenwerk.org/2009/02/04/linked-data-access-analysis/</link>
		<comments>http://kantenwerk.org/2009/02/04/linked-data-access-analysis/#comments</comments>
		<pubDate>Wed, 04 Feb 2009 17:30:33 +0000</pubDate>
		<dc:creator>Knud</dc:creator>
				<category><![CDATA[linked data]]></category>
		<category><![CDATA[semanticweb]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[linked]]></category>

		<guid isPermaLink="false">http://kantenwerk.org/2009/02/04/linked-data-access-analysis/</guid>
		<description><![CDATA[I&#8217;m currently working on an analysis of the log files of the Semantic Web Dog Food server. Apart from the obvious queries such as &#8220;How much traffic was there?&#8221;, &#8220;When were the peaks in traffic?&#8221; or &#8220;Where did the traffic come from?&#8221;, Semantic Web-type linked data inspires some other questions as well. Examples of such [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m currently working on an analysis of the log files of the <a href="http://data.semanticweb.org" title="Semantic Web Dog Food">Semantic Web Dog Food</a> server. Apart from the obvious queries such as &#8220;How much traffic was there?&#8221;, &#8220;When were the peaks in traffic?&#8221; or &#8220;Where did the traffic come from?&#8221;, Semantic Web-type linked data inspires some other questions as well. Examples of such questions are to figure out how intensively the Semantic Web portion of the data was used (i.e., how often was RDF requested compared to HTML), how the distribution of &#8220;semantic&#8221; vs. &#8220;conventional&#8221; user agents was or what kind of data was requested.</p>
<p>Using the techniques described earlier in <a href="http://confuseddevelopment.blogspot.com/2009/01/creating-bar-charts-with-gnuplot.html">a post on my Confused Development blog</a> I sifted through about 7 months worth of log files and generated some pretty pictures. Here is what I came up with so far:</p>
<p><a href="http://www.flickr.com/photos/dunken69/3252798997/" title="Linked data hit analysis (Data tail) by Dunkoman, on Flickr"><img src="http://farm4.static.flickr.com/3459/3252798997_5b3234a157.jpg" width="500" height="350" alt="Linked data hit analysis (Data tail)" /></a></p>
<p>The serving of linked data on the dog food server works through content negotiation &#8211; basically, the first request by an agent would be to the URI of the resource (&#8220;plain&#8221; in the graph), specifying in the header whether an RDF or HTML representation is desired. The server then redirects to either the HTML or RDF document with the desired representation. In theory, this means that <code>requests(rdf) + requests(html) = requests(plain)</code>. However, since it is perfectly feasible to request the HTML or RDF documents directly, the total of RDF+HTML is slightly higher. The total numbers are:</p>
<table>
<tbody>
<tr>
<td><strong>HTML:</strong></td>
<td align="right">238486</td>
</tr>
<tr>
<td><strong>RDF:</strong></td>
<td align="right">35491</td>
</tr>
<tr>
<td><strong>HTML+RDF:</strong></td>
<td align="right">273977</td>
</tr>
<tr>
<td><strong>Plain:</strong></td>
<td align="right">247576</td>
</tr>
</tbody>
</table>
<p>As the graph and the numbers show, the usage in terms of RDF requests is relatively low at the moment, indicating that there is still a long way to go for the Semantic Web to really take off (and that we need to work on making the site more popular).</p>
<p><a href="http://www.flickr.com/photos/dunken69/3252799039/" title="Linked data hit analysis (Resource type) by Dunkoman, on Flickr"><img src="http://farm4.static.flickr.com/3323/3252799039_beb0b97261.jpg" width="500" height="350" alt="Linked data hit analysis (Resource type)" /></a></p>
<p>This second graph shows the distribution of hits over time for the different kinds of resources which the server offers, as indicated by the requested namespace (<code>dogfood:person</code>, <code>dogfood:conference</code>, &#8230;). Interest in people resources is highest almost all of the time. Partially, this may be due to ego surfing of Semantic Web researchers. However, as the graphs below will show, bot traffic far exceeds traffic by human visitors, so my hunch is that the preference of people pages can be explained through the search strategies of the big search engine players out there &#8211; people information is probably considered more valuable. Of course, another factor is the fact that there are about three times as many people resources on the dog food server than e.g. conference resources.</p>
<p>Regarding the conference and workshop resources, those need to be examined in a more fine-grained fashion, since the respective namespaces cover everything connected to an event: papers, talks, chairs, the event itself, etc.</p>
<p><a href="http://www.flickr.com/photos/dunken69/3252798951/" title="Linked data hit analysis (Agent tail) by Dunkoman, on Flickr"><img src="http://farm4.static.flickr.com/3400/3252798951_842236821e.jpg" width="500" height="350" alt="Linked data hit analysis (Agent tail)" /></a></p>
<p>No self-respecting analysis can live without a nice longtail graph these days. Looking at visiting agents, we get such a distribution (y-scale is logarithmic). The agents in the head are the big search engine crawlers &#8211; GoogleBot, Yahoo! Slurp and MSNBot -, as well as the big name browsers. In the middle and long tail we find lots and lots of different other bots, crawlers and browsers, as well as various tools, data services and agents who didn&#8217;t give themselves a proper identifier and instead just show up as &#8220;Java&#8221; or &#8220;perl-libwww&#8221; (very naughty behaviour indeed&#8230;).</p>
<p><a href="http://www.flickr.com/photos/dunken69/3253625570/" title="Linked data hit analysis (Agent types) by Dunkoman, on Flickr"><img src="http://farm4.static.flickr.com/3258/3253625570_935a841e5f.jpg" width="500" height="350" alt="Linked data hit analysis (Agent types)" /></a></p>
<p>More interesting is probably this graph, which shows the agent distribution after I had sliced and diced it manually according to some criteria:</p>
<ul>
<li><strong>What type of agent is it:</strong> <em>bot/crawler</em>, <em>browser</em> (=human visitor), <em>unspecified programming library</em>, <em>debugging or scripting tool</em> (curl, wget, &#8230;) or <em>data-service</em>. The latter is <a href="http://richard.cyganiak.de/" title="Richard Cyganiak's homepage">Richard&#8217;s</a> term for agents which provide a service for other agents by processing some data on the Web. In contrast to crawlers, the purpose here is not archiving or indexing. Examples are format converters, snapshot generators, etc.</li>
<li><strong>What is the &#8220;semanticity&#8221; of the agent:</strong> is it a conventional agent, or one that operates in a Semantic Web-aware fashion?</li>
<li><strong>Mobile or not:</strong> I noticed a (small) amount of visits by mobile browsers, which I thought could be interesting to record separately.</li>
</ul>
<p>All this and more will become part of my thesis and also (hopefully) make into some sort of more polished publication soon.</p>
]]></content:encoded>
			<wfw:commentRss>http://kantenwerk.org/2009/02/04/linked-data-access-analysis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
