<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>underused.org</title>
	<atom:link href="http://underused.org/feed/" rel="self" type="application/rss+xml" />
	<link>http://underused.org</link>
	<description>The notoriously underused weblog by Michael Scharkow</description>
	<lastBuildDate>Tue, 12 Jan 2010 20:06:11 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Changing Places</title>
		<link>http://underused.org/2010/01/12/changing-places/</link>
		<comments>http://underused.org/2010/01/12/changing-places/#comments</comments>
		<pubDate>Tue, 12 Jan 2010 20:06:11 +0000</pubDate>
		<dc:creator>Michael Scharkow</dc:creator>
				<category><![CDATA[Other Stuff]]></category>
		<category><![CDATA[Science]]></category>

		<guid isPermaLink="false">http://underused.org/?p=86</guid>
		<description><![CDATA[As of this year, I am in Stuttgart at the University of Hohenheim. Thorsten Quandt has offered me a full-time senior researcher position in the ERC-funded project on the Social Foundations of Online Gaming. I&#8217;m joining a great team and get to spend lots of time and money on gaming&#8230;Yay!
]]></description>
			<content:encoded><![CDATA[<p>As of this year, I am in Stuttgart at the <a href="http://www.uni-hohenheim.de">University of Hohenheim</a>. <a href="http://online.uni-hohenheim.de">Thorsten Quandt</a> has offered me a full-time senior researcher position in the ERC-funded project on the <a href="http://sofoga.uni-hohenheim.de">Social Foundations of Online Gaming</a>. I&#8217;m joining a great team and get to spend lots of time and money on gaming&#8230;Yay!</p>
]]></content:encoded>
			<wfw:commentRss>http://underused.org/2010/01/12/changing-places/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Collaborative Map-Reduce in the Browser</title>
		<link>http://underused.org/2009/03/03/collaborative-map-reduce-in-the-browser/</link>
		<comments>http://underused.org/2009/03/03/collaborative-map-reduce-in-the-browser/#comments</comments>
		<pubDate>Tue, 03 Mar 2009 18:42:40 +0000</pubDate>
		<dc:creator>Michael Scharkow</dc:creator>
				<category><![CDATA[tumblelog]]></category>
		<category><![CDATA[clustering software]]></category>

		<guid isPermaLink="false">http://underused.org/2009/03/03/collaborative-map-reduce-in-the-browser/</guid>
		<description><![CDATA[This is such a cool idea.&#160;
]]></description>
			<content:encoded><![CDATA[<p><br /><a href="http://www.igvita.com/2009/03/03/collaborative-map-reduce-in-the-browser/">This is such a cool idea.</a>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://underused.org/2009/03/03/collaborative-map-reduce-in-the-browser/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Research Note: Using Google Search Data for Agenda Setting Research</title>
		<link>http://underused.org/2009/02/07/research-note-using-google-search-data-for-agenda-setting-research/</link>
		<comments>http://underused.org/2009/02/07/research-note-using-google-search-data-for-agenda-setting-research/#comments</comments>
		<pubDate>Fri, 06 Feb 2009 22:43:11 +0000</pubDate>
		<dc:creator>Michael Scharkow</dc:creator>
				<category><![CDATA[Science]]></category>
		<category><![CDATA[agenda setting]]></category>
		<category><![CDATA[Google]]></category>

		<guid isPermaLink="false">http://underused.org/?p=75</guid>
		<description><![CDATA[Jens Vogelgesang and I presented a small paper on using Google&#8217;s search query data for the measurement of the public agenda at a joint Journalism/Methods conference today. The working paper is available here and the abstract below. Comments are, of course, very welcome.

The reliable and valid measurement of issue salience is a key problem of [...]]]></description>
			<content:encoded><![CDATA[<p>Jens Vogelgesang and I presented a small paper on using Google&#8217;s search query data for the measurement of the public agenda at a joint Journalism/Methods conference today. <a href="http://underused.org/docs/google_scharkow_vogelgesang2009.pdf">The working paper is available here</a> and the abstract below. Comments are, of course, very welcome.</p>

<p><blockquote>The reliable and valid measurement of issue salience is a key problem of agenda setting research. While exposure, awareness and salience are usually investigated using surveys, the behavioral consequences of those processes, like follow-up communication or information seeking, lend themselves very well to observation. Many theoretical models of agenda setting incorporate these concepts, but only few empirical studies exist that actually measure salience-driven behavior. We propose a new method for measuring aggregate issue salience by analyzing data from search queries typed into Google. We illustrate this approach with a case study from the 2005 Bundestag elections in Germany, focussing on the fiscal expert Paul Kirchhof of Angela Merkel&#8217;s campaign team. Using both survey and online observation data, we find substantial correlations between the two longitudinal measures of issue salience.</blockquote>We both should be working on other things, like our PhD theses, but those small empirical studies are too much fun.</p>
]]></content:encoded>
			<wfw:commentRss>http://underused.org/2009/02/07/research-note-using-google-search-data-for-agenda-setting-research/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>2009 AAAS/Science Dance Contest</title>
		<link>http://underused.org/2008/11/28/2009-aaasscience-dance-contest/</link>
		<comments>http://underused.org/2008/11/28/2009-aaasscience-dance-contest/#comments</comments>
		<pubDate>Fri, 28 Nov 2008 19:03:38 +0000</pubDate>
		<dc:creator>Michael Scharkow</dc:creator>
				<category><![CDATA[tumblelog]]></category>
		<category><![CDATA[arts]]></category>
		<category><![CDATA[dance]]></category>
		<category><![CDATA[Science]]></category>

		<guid isPermaLink="false">http://underused.org/2008/11/28/2009-aaasscience-dance-contest/</guid>
		<description><![CDATA[Taking science to the dance, and back again2009 AAAS/Science Dance Contest
]]></description>
			<content:encoded><![CDATA[<p>Taking science to the dance, and back again<br /><a href="http://gonzolabs.org/dance/contestants/">2009 AAAS/Science Dance Contest</a></p>
]]></content:encoded>
			<wfw:commentRss>http://underused.org/2008/11/28/2009-aaasscience-dance-contest/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Journal of Articles in Support of the Null Hypothesis</title>
		<link>http://underused.org/2008/07/11/journal-of-articles-in-support-of-the-null-hypothesis/</link>
		<comments>http://underused.org/2008/07/11/journal-of-articles-in-support-of-the-null-hypothesis/#comments</comments>
		<pubDate>Thu, 10 Jul 2008 22:28:06 +0000</pubDate>
		<dc:creator>Michael Scharkow</dc:creator>
				<category><![CDATA[Science]]></category>
		<category><![CDATA[statistical significance]]></category>

		<guid isPermaLink="false">http://underused.org/2008/07/11/journal-of-articles-in-support-of-the-null-hypothesis/</guid>
		<description><![CDATA[After my PhD supervisor recently advised me to better not end up with non-results in my thesis, JASNH&#160;looks like a very cool plan B for publication.
]]></description>
			<content:encoded><![CDATA[<p>After my PhD supervisor recently advised me to better not end up with non-results in my thesis, <a href="http://www.jasnh.com/">JASNH</a>&nbsp;looks like a very cool plan B for publication.</p>
]]></content:encoded>
			<wfw:commentRss>http://underused.org/2008/07/11/journal-of-articles-in-support-of-the-null-hypothesis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>posterous &#8211; Email based blogging redux</title>
		<link>http://underused.org/2008/06/29/posterous-email-based-blogging-redux/</link>
		<comments>http://underused.org/2008/06/29/posterous-email-based-blogging-redux/#comments</comments>
		<pubDate>Sun, 29 Jun 2008 15:57:25 +0000</pubDate>
		<dc:creator>Michael Scharkow</dc:creator>
				<category><![CDATA[tumblelog]]></category>
		<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Web Apps]]></category>

		<guid isPermaLink="false">http://underused.org/2008/06/29/posterous-email-based-blogging-redux/</guid>
		<description><![CDATA[posterous&#160;lets you create a blog by sending one simple email to post@posterous.com. Bidirectional email commenting works, too. Does it get any simpler?
]]></description>
			<content:encoded><![CDATA[<p><a href="http://posterous.com/">posterous</a>&nbsp;lets you create a blog by sending one simple email to post@posterous.com. Bidirectional email commenting works, too. Does it get any simpler?</p>
]]></content:encoded>
			<wfw:commentRss>http://underused.org/2008/06/29/posterous-email-based-blogging-redux/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Harvard enforces Open Access for all research</title>
		<link>http://underused.org/2008/02/15/harvard-enforces-open-access-for-all-research/</link>
		<comments>http://underused.org/2008/02/15/harvard-enforces-open-access-for-all-research/#comments</comments>
		<pubDate>Fri, 15 Feb 2008 17:49:47 +0000</pubDate>
		<dc:creator>Michael Scharkow</dc:creator>
				<category><![CDATA[Science]]></category>
		<category><![CDATA[open access]]></category>

		<guid isPermaLink="false">http://underused.org/2008/02/15/harvard-enforces-open-access-for-all-research/</guid>
		<description><![CDATA[Gary King just announced that the Faculty of Arts and Sciences at Harvard unanimously decided to enforce Open Access for all faculty members. This bold move should seriously advance OA world-wide.

I&#8217;d like to see similar steps forward from the German DFG and the likes, or my university. I guess most of the faculty at UdK [...]]]></description>
			<content:encoded><![CDATA[<p>Gary King <a href="http://www.iq.harvard.edu/blog/sss/archives/2008/02/open_access_to.shtml">just announced</a> that the Faculty of Arts and Sciences at Harvard unanimously decided to enforce Open Access for all faculty members. This bold move should seriously advance OA world-wide.</p>

<p>I&#8217;d like to see similar steps forward from the German DFG and the likes, or my university. I guess most of the faculty at UdK does not even know what OA means, and who needs to if there&#8217;s no research anyway ;-)</p>
]]></content:encoded>
			<wfw:commentRss>http://underused.org/2008/02/15/harvard-enforces-open-access-for-all-research/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SimpleBits realigned</title>
		<link>http://underused.org/2008/02/01/simplebits-realigned/</link>
		<comments>http://underused.org/2008/02/01/simplebits-realigned/#comments</comments>
		<pubDate>Fri, 01 Feb 2008 09:05:20 +0000</pubDate>
		<dc:creator>Michael Scharkow</dc:creator>
				<category><![CDATA[tumblelog]]></category>
		<category><![CDATA[Web Design]]></category>

		<guid isPermaLink="false">http://underused.org/2008/02/01/simplebits-realigned/</guid>
		<description><![CDATA[Simplebits gets yet another great mini-redesign.
]]></description>
			<content:encoded><![CDATA[<p><a href="http://simplebits.com/">Simplebits</a> gets yet another great mini-redesign.</p>
]]></content:encoded>
			<wfw:commentRss>http://underused.org/2008/02/01/simplebits-realigned/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Typoscript&#8217;s RECORDS, how I love thee</title>
		<link>http://underused.org/2008/01/06/typoscripts-records-how-i-love-thee/</link>
		<comments>http://underused.org/2008/01/06/typoscripts-records-how-i-love-thee/#comments</comments>
		<pubDate>Sun, 06 Jan 2008 20:21:09 +0000</pubDate>
		<dc:creator>Michael Scharkow</dc:creator>
				<category><![CDATA[Hacking]]></category>
		<category><![CDATA[TYPO3]]></category>

		<guid isPermaLink="false">http://underused.org/2008/01/06/typoscripts-records-how-i-love-thee/</guid>
		<description><![CDATA[After hacking it for so many years, there are still some surprising nuggets hidden in TYPO3. Our TYPO3-newbie webmaster Johannes recently pointed me to the RECORDS type in our template which is extremely useful for including portlet-style content elements in your template. In order to include a plugin somewhere in your page template, simply add [...]]]></description>
			<content:encoded><![CDATA[<p>After hacking it for so many years, there are still some surprising nuggets hidden in TYPO3. Our TYPO3-newbie webmaster <a href="http://sitegraph.de">Johannes</a> recently pointed me to the RECORDS type in our template which is extremely useful for including portlet-style content elements in your template. In order to include a plugin somewhere in your page template, simply add it to a sysfolder or hidden page somewhere and refer to the content record like this:</p>

<pre><code>subparts.TAGCLOUD = RECORDS
subparts.TAGCLOUD {
tables = tt_content
source = 444
dontCheckPid = 1
}
</code></pre>

<p>That&#8217;s it. The plugin content is rendered without USER_INT fiddling or COA tricks, configuration is dummy proof with Flexforms which seem to be more popular than TS configuration anyway. You can also include link lists, search forms, user login or normal content etc. like this, all nicely editable by your average users.</p>
]]></content:encoded>
			<wfw:commentRss>http://underused.org/2008/01/06/typoscripts-records-how-i-love-thee/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Scraping Youtube with Beautiful Soup</title>
		<link>http://underused.org/2007/12/11/scraping-youtube-with-beautiful-soup/</link>
		<comments>http://underused.org/2007/12/11/scraping-youtube-with-beautiful-soup/#comments</comments>
		<pubDate>Tue, 11 Dec 2007 09:58:52 +0000</pubDate>
		<dc:creator>Michael Scharkow</dc:creator>
				<category><![CDATA[Hacking]]></category>
		<category><![CDATA[BeautifulSoup]]></category>
		<category><![CDATA[Python]]></category>
		<category><![CDATA[Youtube]]></category>

		<guid isPermaLink="false">http://underused.org/2007/12/11/scraping-youtube-with-beautiful-soup/</guid>
		<description><![CDATA[For an upcoming project I need to track some usage statistics of  Youtube videos which are not provided via the GData API. The common solution to this problem is screen scraping the HTML pages and extracting the information.

Here&#8217;s a quick howto using Python and the  BeautifulSoup HTML/XML parser. 

First off, we choose a [...]]]></description>
			<content:encoded><![CDATA[<p>For an upcoming project I need to track some usage statistics of  Youtube videos which are not provided via the GData API. The common solution to this problem is screen scraping the HTML pages and extracting the information.</p>

<p>Here&#8217;s a quick howto using Python and the  <a href="http://www.crummy.com/software/BeautifulSoup/">BeautifulSoup</a> HTML/XML parser. </p>

<p>First off, we choose a Youtube video page, like <a href="http://youtube.com/watch?v=Xe1a1wHxTyo">this one</a> and stuff it into the BeautifulSoup parser:</p>

<pre><code>#!/usr/bin/env python
from urllib2 import urlopen
from BeautifulSoup import BeautifulSoup
import re # we need regular expressions later

monty_vid = urlopen('http://youtube.com/watch?v=Xe1a1wHxTyo')
page = BeautifulSoup(monty_vid)

print page.prettify()
</code></pre>

<p>The last line pretty-prints the HTML you just retrieved, so you can check if it&#8217;s an existing page or a 404. Next, we&#8217;d like to extract some meta data like title, description and tags for the video. Luckily, those are provided in the HTML head as meta tags, in order, so we can extract the content attribute from those. The result object is a list with elements that act like dictionaries:</p>

<pre><code>meta = {}
meta['title'] = page.head('meta')[0]['content']
meta['description'] = page.head('meta')[1]['content']
meta['tags'] = page.head('meta')[2]['content'].split(', ')
</code></pre>

<p>Notice that all the extracted strings are Unicode, and we made a list of tags by splitting the string. Next up, we want the number of views and the number of ratings. Luckily, the former is available in a span tag with a dedicated class which we can retrieve with the following search on the document body:</p>

<pre><code>views = page.body('span',"viewCount")[0].string
</code></pre>

<p>The number of ratings is not readily marked up, but available as a string like &#8220;55 ratings&#8221;, so we need another technique &#8212; pattern matching within a certain div:</p>

<pre><code>numratings = page.body('div', id='defaultRatingMessage',    
                        text=re.compile('ratings'))[0].string.split()[0]
</code></pre>

<p>The body() method with the text parameter gives us all tags in the named div that match our simple pattern, from which we extract the first part with split()[0]. Finally, we do not only want the number of ratings, but the rating itself. The rating is not available as a number in the document, but indicated by 0 to 5 star images which we can count:</p>

<pre><code>rating = len(page.body('img','rating icn_star_full_19x20png'))
</code></pre>

<p>Of course, there are dozens of different ways to query the HTML for some values, but these work with the current layout. Since you have to update your scraping script with every change, why bother with optimal queries?</p>
]]></content:encoded>
			<wfw:commentRss>http://underused.org/2007/12/11/scraping-youtube-with-beautiful-soup/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.363 seconds -->
<!-- Cached page served by WP-Cache -->
<!-- Compression = gzip -->