<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/" xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>OUseful.Info, the blog...</title>
	
	<link>http://blog.ouseful.info</link>
	<description>Trying to find useful things to do with emerging technologies in open education</description>
	<lastBuildDate>Thu, 09 Feb 2012 20:40:59 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain="blog.ouseful.info" port="80" path="/?rsscloud=notify" registerProcedure="" protocol="http-post" />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>OUseful.Info, the blog...</title>
		<link>http://blog.ouseful.info</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://blog.ouseful.info/osd.xml" title="OUseful.Info, the blog..." />
	
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/ouseful" /><feedburner:info uri="ouseful" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://blog.ouseful.info/?pushpress=hub" /><feedburner:emailServiceId>ouseful</feedburner:emailServiceId><feedburner:feedburnerHostname>http://feedburner.google.com</feedburner:feedburnerHostname><item>
		<title>Do Retweeters Lack Commitment to a Hashtag?</title>
		<link>http://feedproxy.google.com/~r/ouseful/~3/d1TEqhodE_4/</link>
		<comments>http://blog.ouseful.info/2012/02/09/do-retweeters-lack-commitment-to-a-hashtag/#comments</comments>
		<pubDate>Thu, 09 Feb 2012 18:30:46 +0000</pubDate>
		<dc:creator>Tony Hirst</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Anything you want]]></category>
		<category><![CDATA[Visualisation]]></category>
		<category><![CDATA[hashtag community]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://blog.ouseful.info/?p=7022</guid>
		<description>I seem to be going down more ratholes than usual at the moment, in this case relating to activity round Twitter hashtags. Here&amp;#8217;s a quick bit of reflection around a chart from Visualising Activity Around a Twitter Hashtag or Search Term Using R that shows activity around a hashtag that was minted for an event [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;amp;blog=325417&amp;amp;post=7022&amp;amp;subd=ouseful&amp;amp;ref=&amp;amp;feed=1" width="1" height="1" /&gt;</description>
			<content:encoded><![CDATA[<p>I seem to be going down more ratholes than usual at the moment, in this case relating to activity round Twitter hashtags. Here&#8217;s a quick bit of reflection around a chart from <a href="http://blog.ouseful.info/2012/02/06/visualising-activity-round-a-twitter-hashtag-or-search-term-using-r/">Visualising Activity Around a Twitter Hashtag or Search Term Using R</a> that shows  activity around a hashtag that was minted for an event that took place before the sample period.</p>
<p>The y-axis is organised according to the time of first use (within the sample period) of the tag by a particular user. The x axis is time. The dots represent tweets containing the hashtag, coloured blue by default, red if they are an old-style RT (i.e. they begin <tt>RT @username:</tt>).</p>
<p><a href="http://ouseful.files.wordpress.com/2012/02/twcoversationperuserwithrts.png"><img src="http://ouseful.files.wordpress.com/2012/02/twcoversationperuserwithrts.png?w=700" alt="" title="twConversationPerUserWithRTs" class="alignnone size-medium wp-image-6986" /></a></p>
<p>So what sorts of thing might we look for in this chart, and what are the problems with it? Several things jump out at me:</p>
<ul>
<li>For many of the users, their first tweet (in this sample period at least) is an RT; that is, they are brought into the hashtag community through issuing an RT;</li>
<li>Many of the users whose first use is via an RT don&#8217;t use the hashtag again within the sample period. Is this typical? Does this signal represent amplification of the tag without any real sense of  engagement with it?</li>
<li>A noticeable proportion of folk whose first use is not an RT go on to post further non-RT tweets. Does this represent an ongoing commitment to the tag? Note that this chart does not show whether tweets are replies, or &#8220;open&#8221; tweets. Replies (that is, tweets beginning <tt>@username</tt> are likely to represent conversational threads within a tag context rather than &#8220;general&#8221; tag usage, so it would be worth using an additional colour to identify reply based conversational tweets as such.</li>
<li>&#8220;New style&#8221; retweets are diaplayed as retweets by colouring&#8230; I need to check whether or nor newstyle RT information is available that I could use to colour such tweets appropriately. (or alternatively, I&#8217;d have to do some sort of string matching to see whether or not a tweet was the same as a previously seen tweet, which is a bit of a pain:-(</li>
</ul>
<p>(Note that when I started mapping hashtag communities, I used to generate tag user names based on a filtered list of tweets that excluded RTs. this meant that folk who only used the tag as part of an RT and did not originate tweets that contained the tag, either in general or as part of a conversation,  would not be counted as a member of the hashtag community. More recently, I have added filters that include RTs but exclude users who used the tag only once, for example, thus retaining serial RTers, but not single use users.)</p>
<p>So what else might this chart tell us? Looking at vertical slices, it seems that news entrants to the tag community appear to come in waves, maybe as part of rapid fire RT bursts. This chart doesn&#8217;t tell us for sure that this is happening, but it does highlight areas of the timelime that might be worth investigating more closely if we are interested in what happened at those times when there does appear to be a spike in activity. (Are there any modifications we could make to this chart to make them more informative in this respect? The time resolution is very poor, for example, so being able to zoom in on a particular time might be handy. Or are there other charts that might provide a different lens that can help us see what was happening at those times?)</p>
<p>And as a final point &#8211; this stuff may be all very interesting, <em>but is it useful?</em>, And if so, <strong>how?</strong> I also wonder how generalisable it is to other sorts of communication analysis. For example, I think we could use similar graphical techniques to explore engagement with an active comment thread on a blog, or Google+, or additions to an online forum thread. (For forums with mutliple threads, we maybe need to rethink how this sort of chart would work, or how it might be coloured/what symbols we might use, to distinguish between starting a new thread, or adding to a pre-existing one, for example. I&#8217;m sure the literature is filled with dozens of examples for how we might visualise forum activity, so if you know of any good references/links&#8230;?! ;-) #lazyacademic)</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ouseful.wordpress.com/7022/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ouseful.wordpress.com/7022/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ouseful.wordpress.com/7022/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ouseful.wordpress.com/7022/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ouseful.wordpress.com/7022/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ouseful.wordpress.com/7022/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ouseful.wordpress.com/7022/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ouseful.wordpress.com/7022/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ouseful.wordpress.com/7022/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ouseful.wordpress.com/7022/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ouseful.wordpress.com/7022/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ouseful.wordpress.com/7022/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ouseful.wordpress.com/7022/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ouseful.wordpress.com/7022/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;blog=325417&amp;post=7022&amp;subd=ouseful&amp;ref=&amp;feed=1" width="1" height="1" /><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/ouseful?a=d1TEqhodE_4:n57pwxTgTcQ:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/ouseful?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=d1TEqhodE_4:n57pwxTgTcQ:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=d1TEqhodE_4:n57pwxTgTcQ:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/ouseful?i=d1TEqhodE_4:n57pwxTgTcQ:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=d1TEqhodE_4:n57pwxTgTcQ:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/ouseful?i=d1TEqhodE_4:n57pwxTgTcQ:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=d1TEqhodE_4:n57pwxTgTcQ:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/ouseful?i=d1TEqhodE_4:n57pwxTgTcQ:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=d1TEqhodE_4:n57pwxTgTcQ:cGdyc7Q-1BI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=cGdyc7Q-1BI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/ouseful/~4/d1TEqhodE_4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.ouseful.info/2012/02/09/do-retweeters-lack-commitment-to-a-hashtag/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Tony Hirst</media:title>
		</media:content>

		<media:content url="http://ouseful.files.wordpress.com/2012/02/twcoversationperuserwithrts.png" medium="image">
			<media:title type="html">twConversationPerUserWithRTs</media:title>
		</media:content>
	<creativeCommons:license>http://creativecommons.org/licenses/by/2.0/</creativeCommons:license><feedburner:origLink>http://blog.ouseful.info/2012/02/09/do-retweeters-lack-commitment-to-a-hashtag/</feedburner:origLink></item>
		<item>
		<title>What is the Potential Audience Size for a Hashtag Community?</title>
		<link>http://feedproxy.google.com/~r/ouseful/~3/WQve9WB_TsA/</link>
		<comments>http://blog.ouseful.info/2012/02/09/what-is-the-potential-audience-size-for-a-hashtag-community/#comments</comments>
		<pubDate>Thu, 09 Feb 2012 00:30:53 +0000</pubDate>
		<dc:creator>Tony Hirst</dc:creator>
				<category><![CDATA[Anything you want]]></category>
		<category><![CDATA[Rstats]]></category>
		<category><![CDATA[Visualisation]]></category>
		<category><![CDATA[backchannel]]></category>
		<category><![CDATA[event amplification]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://ouseful.wordpress.com/?p=7008</guid>
		<description>What&amp;#8217;s the potential audience size around a Twitter hashtag? Way back when, in the early days of webs stats, reported figures tended to centre around the notion of hits, the number of calls made to a server via website activity. I forget the details, but the metric was presumably generated from server logs. This measure [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;amp;blog=325417&amp;amp;post=7008&amp;amp;subd=ouseful&amp;amp;ref=&amp;amp;feed=1" width="1" height="1" /&gt;</description>
			<content:encoded><![CDATA[<p>What&#8217;s the potential audience size around a Twitter hashtag?</p>
<p>Way back when, in the early days of webs stats, reported figures tended to centre around the notion of <em>hits</em>, the number of calls made to a server via website activity. I forget the details, but the metric was presumably generated from server logs. This measure was always totally unreliable, because in the course of serving a web page, a server might be hit multiple times, once for each separately delivered asset, such as images, javascript files, css files and so on. Hits soon gave way to the notion of <em>Page Views</em>, which more accurately measured the number of pages (rather than assets) served via a website. This was complemented with the notion of <em>Visits</em> and <em>Unique Visits</em>: Visits, as tracked by a cookies, represent a set of pages viewed around about the same time by the same person. Unique Visits (or &#8220;Uniques&#8221;), represent the number of <em>different</em> people who appear to have visited the site in any given period.</p>
<p>What we see here, then, is a steady evolution in the complexity of website metrics that reflects on the one hand dissatisfaction with one way of measuring or reporting activity, and on the other practical considerations with respect to instrumentation and the ability to capture certain metrics once they are conceived of.</p>
<p>Widespread social media monitoring/tracking is largely still in the realm of &#8220;hits&#8221; measurement. Personal dashboards for services such as Twitter typically display direct measures provided by the Twitter API, or measures trivially/directly identified from Twitter API or archived data &#8211; number of followers, numbers of friends, distribution of updates over time, number of mentions, and so on.</p>
<p>Something both myself and Martin Hawksey have been thinking about on and off for some time are ways of reporting activity around Twitter hashtags. A commonly(?!) asked question in this respect relates to how much <em>engagement</em> (whatever that means) there has been with a particular tag. So here&#8217;s a quick mark in the sand about some of my current thinking about this. (Note that these ideas may well have been more formally developed in the academic literature &#8211; I&#8217;m a bit behind in my reading! If you know something that covers this in more detail, or that I should cite, please feel free to add a link in the comments&#8230; #lazyAcademic.)</p>
<p>One of the first metrics that comes to my mind is the number of people who have used a particular hashtag, and the number of their followers. Easily stated, it doesn&#8217;t take a lot of thought to realise even these &#8220;simple&#8221; measures are fraught with difficulty:</p>
<ul>
<li>what counts as a <em>use</em> of the hashtag? If I retweet a measure of yours that contains a hashtag, have I <em>used</em> it in any meaningful sense? Does a &#8220;use&#8221; mean the creation of a new tweet containing the tag? What about if I reply to a tweet from you than contains the tag and I include the tag in my reply to you, even if I&#8217;m not sure what that tag relates to?</li>
<li>the potential audience size for the tag (potential uniques?), based on the number of followers of the tag users. At first glance, we might think this can be easily calculated by adding together the follower counts of the tag users, but this is more strictly an approximation of the potential audience: the set of followers of A may include some of the followers of B, or C; do we count the tag users themselves amongst the audience? If so, the upper bound also needs to take into account the fact that none of the users may be followers of any of the other tag users.<br />
Note there is also a lower bound &#8211; the largest follower count amongst the tag users (whatever that means&#8230;) of the hashtag. Furthermore, if we want to count the number of folk not using the tag but who may have seen the tag, this lower bound can be revised downwards by subtracting the number of tag users minus one (for the tag user with the largest follower count). The value is still only an approximation, though, becuase it assumes that all the tag users are actually included as followers of at least one, each, of the tag users. (If you think these points are &#8220;just academic&#8221;, they are and they aren&#8217;t &#8211; observations like these can often be used to help formulate gaming strategies around metrics based on these measures.)</li>
<li>the potential number of <em>views</em> of a tag, for example based on the product of the number of times a user tweets and their follower count?</li>
<li>the <em>reach</em> of (or active engagement with?) the tag, as measured by the number of people who actually see the tag, or the number of people who take and action around it (such as replying to a tagged tweet, RTing it, or clicking on a link a tagged tweet contains); note that we may be able ot construct probabilistic models (albeit quite involved ones) of the potential reach based on factors like the number of people someone follows, when they are online, the rate at which the people they follow tweet, and so on..</li>
</ul>
<p>To try to make this a little more concrete, here are a couple of scripts for exploring the potential audience size of a tag based on the followers of the tag users (where a user is someone who publishes or retweets a tweet containing the tag over a specified period). The first, Python script runs a Twitter search and generates a list of unique users of the tag, along with the timestamp of their first use of the tag within the sample period. This script also grabs all the followers of the tag users, along with their counts, and generates running cumulative (upper bound approximation) count of the tag user follower numbers as well as calculating the rolling set of unique followers to date as each new tag user is observed. The second, R script plots the values.</p>
<p>The first thing we can do is look at the incidence of new users of the hashtag over time:</p>
<p><a href="http://ouseful.files.wordpress.com/2012/02/new-twitter-entrants-to-hashtag-conversation.png"><img src="http://ouseful.files.wordpress.com/2012/02/new-twitter-entrants-to-hashtag-conversation.png?w=700" alt="" title="New twitter entrants to hashtag conversation"   class="alignnone size-full wp-image-7009" /></a></p>
<p>(For a little more discussion of this sort of chart, see <a href="http://blog.ouseful.info/2012/02/06/visualising-activity-round-a-twitter-hashtag-or-search-term-using-r/">Visualising Activity Around a Twitter Hashtag or Search Term Using R</a> and its inspiration, @mediaczar&#8217;s <a href="http://blog.magicbeanlab.com/networkanalysis/how-should-page-admins-deal-with-flame-wars/">How should Page Admins deal with Flame Wars?</a>.)</p>
<p>More relevant to this post, however, is a plot showing some counts relating to followers of users of the hashtag:</p>
<p><a href="http://ouseful.files.wordpress.com/2012/02/potentialreachoftag.png"><img src="http://ouseful.files.wordpress.com/2012/02/potentialreachoftag.png?w=700" alt="" title="potentialReachofTag"   class="alignnone size-full wp-image-7010" /></a></p>
<p>In this case, the top, green line represents the summed total number of followers for tag users as they enter the conversation. If every user had completely different followers, this might be meaningful, but where conversation takes place around a tag between folk who know each other, it&#8217;s highly likely that they have followers in common.</p>
<p>The middle, red line shows a count of the number of unique followers to date, based on the the followers of users of the tag to date.</p>
<p>The lower, blue line shows the difference between the red and green lines. This represents the error between the summed follower counts and the actual number of unique followers.</p>
<p>Here&#8217;s a view over the number of new unique potential audience members at each time step (I think the use of the line chart here may be a mistake&#8230; I think bars/lineranges would probably be more appropriate&#8230;):</p>
<p><a href="http://ouseful.files.wordpress.com/2012/02/newuniquefollowersateachtimestep.png"><img src="http://ouseful.files.wordpress.com/2012/02/newuniquefollowersateachtimestep.png?w=700" alt="" title="newUniqueFollowersAtEachTImeStep"   class="alignnone size-full wp-image-7017" /></a></p>
<p>In the following chart, I overplot oneline with another. The lower layer (a red line) is the total follower account for each new tag user. The blue is the increase in the potential audience count (that is, the number of the new users&#8217; followers that haven&#8217;t potentially seen the tag so far). The range of the visible part of the red line thus shows the number of a new tag user&#8217;s followers who have potentially already seen the tag. Err&#8230; maybe (that is, if my code is correct and all the scripts are doing what I think they&#8217;re doing! If they aren&#8217;t, then just treat this post as an exploration of the sorts of charts we might be able to produce to explore audience reach;-)</p>
<p><a href="http://ouseful.files.wordpress.com/2012/02/potentialaudiencegrowth.png"><img src="http://ouseful.files.wordpress.com/2012/02/potentialaudiencegrowth.png?w=700" alt="" title="potentialaudiencegrowth"   class="alignnone size-full wp-image-7012" /></a></p>
<p>Here are the scripts (such as they are!)</p>
<p><pre class="brush: python;">import newt,csv,tweepy
import networkx as nx

#the term we're going to search for
tag='ddj'
#how many tweets to search for (max 1500)
num=500

##Something along lines of:
auth = tweepy.OAuthHandler(CONSUMER_KEY, CONSUMER_SECRET)
auth.set_access_token(SKEY, SSECRET)
api = tweepy.API(auth, cache=tweepy.FileCache('cache',cachetime), retry_errors=[500], retry_delay=5, retry_count=2)

#You need to do some work here to search the Twitter API
tweeters, tweets=yourSearchTwitterFunction(api,tag,num)
#tweeters is a list of folk who tweeted the term of interest
#tweets is a list of the Twitter tweet objects returned from the search
#My code for this is tightly bound up in a large and rambling library atm...

#Put tweets into chronological order
tweets.reverse()

#I was being lazy and wasn't sure what vars I needed or what I was trying to do when I started this!
#The whole thing really needs rewriting...
tweepFo={}
seenToDate=set([])
uniqSourceFo=[]
#runtot is crude and doesn't measure overlap
runtot=0
oldseentodate=0

#Construct a digraph from folk using the tag to their followers
DG=nx.DiGraph()

for tweet in tweets:
	user=tweet['from_user']
	if user not in tweepFo:
		tweepFo[user]=[]
		print &quot;Getting follower data for&quot;, str(user), str(len(tweepFo)), 'of', str(len(tweeters))
		mi=tweepy.Cursor(api.followers_ids,id=user).items()
		userID=tweet['from_user_id'] #check
		DG.add_node(userID,label=user)
		for m in mi:
			tweepFo[user].append(m)
			#construct graph
			DG.add_edge(userID,m,weight=1)
			DG.node[m]['label']=''
		ufc=len(tweepFo[user])
		runtot=runtot+ufc
		#seen to date is all people who have seen so far, plus new ones, so it's the union
		oldseentodate=len(seenToDate)
		seenToDate=seenToDate.union(set(tweepFo[user]))
		uniqSourceFo.append((tweet['created_at'],len(seenToDate),user,runtot,ufc,oldseentodate))
	else:
		#I'm weighting the edges so we can count how many times folk see the hashtag
		if len(DG.edges(userID))&gt;0:
			tmp1,tmp2=DG.edges(userID)[0]
			weight=DG[userID][tmp2]['weight']+1
			for fromN,toN in DG.edges(userID):
				DG[fromN][toN]['weight']=weight


fo='reports/tmp/'+tag+'_ncount.csv'
f=open(fo,'wb+')
writer=csv.writer(f)
writer.writerow(['datetime','count','newuser','crudetot','userFoCount','previousCount'])
for ts,l,u,ct,ufc,ols in uniqSourceFo:
	print ts,l
	writer.writerow([ts,l,u,ct,ufc,ols])

f.close()

print &quot;Writing graph..&quot;
filter=[]
for n in DG:
	if DG.degree(n)&gt;1: filter.append(n)
filter=set(filter)
H=DG.subgraph(filter)
nx.write_graphml(H, 'reports/tmp/'+tag+'_ncount_2up.graphml')
print &quot;Writing other graph..&quot;
nx.write_graphml(DG, 'reports/tmp/'+tag+'_ncount.graphml')</pre></p>
<p>Here&#8217;s the R script&#8230;</p>
<p><pre class="brush: r;">ddj_ncount &lt;- read.csv(&quot;~/code/twapps/newt/reports/tmp/ddj_ncount.csv&quot;)
#Convert the datetime string to a time object
ddj_ncount$ttime=as.POSIXct(strptime(ddj_ncount$datetime, &quot;%a, %d %b %Y %H:%M:%S&quot;),tz='UTC')

#Order the newuser factor levels into the order in which they first use the tag
dda=subset(ddj_ncount,select=c('ttime','newuser'))
dda=arrange(dda,-desc(ttime))
ddj_ncount$newuser=factor(ddj_ncount$newuser, levels = dda$newuser)

#Plot when each user first used the tag against time
ggplot(ddj_ncount) + geom_point(aes(x=ttime,y=newuser)) + opts(axis.text.x=theme_text(size=6),axis.text.y=theme_text(size=4))

#Plot the cumulative and union flavours of increasing possible audience size, as well as the difference between them
ggplot(ddj_ncount) + geom_line(aes(x=ttime,y=count,col='Unique followers')) + geom_line(aes(x=ttime,y=crudetot,col='Cumulative followers')) + geom_line(aes(x=ttime,y=crudetot-count,col='Repeated followers')) + labs(colour='Type') + xlab(NULL)

#Number of new unique followers introduced at each time step
ggplot(ddj_ncount)+geom_line(aes(x=ttime,y=count-previousCount,col='Actual delta'))

#Try to get some idea of how many of the followers of a new user are actually new potential audience members
ggplot(ddj_ncount) + opts(axis.text.x=theme_text(angle=-90,size=4)) + geom_linerange(aes(x=newuser,ymin=0,ymax=userFoCount,col='Follower count')) + geom_linerange(aes(x=newuser,ymin=0,ymax=(count-previousCount),col='Actual new audience'))

#This is still a bit experimental
#I'm playing around trying to see what proportion or number of a users followers are new to, or subsumed by, the potential audience of the tag to date...
ggplot(ddj_ncount) + geom_linerange(aes(x=newuser,ymin=0,ymax=1-(count-previousCount)/userFoCount)) + opts(axis.text.x=theme_text(angle=-90,size=6)) + xlab(NULL)</pre></p>
<p>In the next couple of posts in this series, I&#8217;ll start to describe how we can chart the potential increase in audience count as a delta for each new tagger, along with a couple of ways of trying to get some initial sort of sense out of the graph file, such as the distribution of the potential number of &#8220;views&#8221; of a tag across the unique potential audience members&#8230;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ouseful.wordpress.com/7008/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ouseful.wordpress.com/7008/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ouseful.wordpress.com/7008/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ouseful.wordpress.com/7008/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ouseful.wordpress.com/7008/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ouseful.wordpress.com/7008/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ouseful.wordpress.com/7008/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ouseful.wordpress.com/7008/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ouseful.wordpress.com/7008/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ouseful.wordpress.com/7008/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ouseful.wordpress.com/7008/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ouseful.wordpress.com/7008/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ouseful.wordpress.com/7008/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ouseful.wordpress.com/7008/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;blog=325417&amp;post=7008&amp;subd=ouseful&amp;ref=&amp;feed=1" width="1" height="1" /><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/ouseful?a=WQve9WB_TsA:QzWYpGtrkKo:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/ouseful?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=WQve9WB_TsA:QzWYpGtrkKo:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=WQve9WB_TsA:QzWYpGtrkKo:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/ouseful?i=WQve9WB_TsA:QzWYpGtrkKo:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=WQve9WB_TsA:QzWYpGtrkKo:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/ouseful?i=WQve9WB_TsA:QzWYpGtrkKo:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=WQve9WB_TsA:QzWYpGtrkKo:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/ouseful?i=WQve9WB_TsA:QzWYpGtrkKo:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=WQve9WB_TsA:QzWYpGtrkKo:cGdyc7Q-1BI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=cGdyc7Q-1BI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/ouseful/~4/WQve9WB_TsA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.ouseful.info/2012/02/09/what-is-the-potential-audience-size-for-a-hashtag-community/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Tony Hirst</media:title>
		</media:content>

		<media:content url="http://ouseful.files.wordpress.com/2012/02/new-twitter-entrants-to-hashtag-conversation.png" medium="image">
			<media:title type="html">New twitter entrants to hashtag conversation</media:title>
		</media:content>

		<media:content url="http://ouseful.files.wordpress.com/2012/02/potentialreachoftag.png" medium="image">
			<media:title type="html">potentialReachofTag</media:title>
		</media:content>

		<media:content url="http://ouseful.files.wordpress.com/2012/02/newuniquefollowersateachtimestep.png" medium="image">
			<media:title type="html">newUniqueFollowersAtEachTImeStep</media:title>
		</media:content>

		<media:content url="http://ouseful.files.wordpress.com/2012/02/potentialaudiencegrowth.png" medium="image">
			<media:title type="html">potentialaudiencegrowth</media:title>
		</media:content>
	<creativeCommons:license>http://creativecommons.org/licenses/by/2.0/</creativeCommons:license><feedburner:origLink>http://blog.ouseful.info/2012/02/09/what-is-the-potential-audience-size-for-a-hashtag-community/</feedburner:origLink></item>
		<item>
		<title>Dangers of a Walled Garden…</title>
		<link>http://feedproxy.google.com/~r/ouseful/~3/z8fh2RT6zgg/</link>
		<comments>http://blog.ouseful.info/2012/02/08/dangers-of-a-walled-garden/#comments</comments>
		<pubDate>Wed, 08 Feb 2012 11:46:56 +0000</pubDate>
		<dc:creator>Tony Hirst</dc:creator>
				<category><![CDATA[Anything you want]]></category>
		<category><![CDATA[ads]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://blog.ouseful.info/?p=6975</guid>
		<description>Reading a recent Economist article (The value of friendship) about the announcement last week that Facebook is to float as a public company, and being amazed as ever about how these valuations, err, work, I recalled a couple of observations from a @currybet post about the Guardian Facebook app (&amp;#8220;The Guardian&amp;#8217;s Facebook app&amp;#8221; &amp;#8211; Martin [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;amp;blog=325417&amp;amp;post=6975&amp;amp;subd=ouseful&amp;amp;ref=&amp;amp;feed=1" width="1" height="1" /&gt;</description>
			<content:encoded><![CDATA[<p>Reading a recent Economist article (<a href="http://www.economist.com/node/21546020">The value of friendship</a>) about the announcement last week that Facebook is to float as a public company, and being amazed as ever about how these valuations, err, work, I recalled a couple of observations from a @currybet post about the Guardian Facebook app (<a href="http://www.currybet.net/cbet_blog/2012/02/newsrewired-guardian-facebook.php">&#8220;The Guardian&#8217;s Facebook app&#8221; &#8211; Martin Belam at news:rewired</a>). The first related to using Facebook apps to (only partially successfully) capture attention of folk on Facebook and get them to refocus it on the Guardian website:</p>
<blockquote><p> We knew that 77% of visits to the Guardian from facebook.com only lasted for one page. A good hypothesis for this was that leaving the confines of Facebook to visit another site was an interruption to a Facebook session, rather than a decision to go off and browse another site. We began to wonder what it would be like if you could visit the Guardian whilst still within Facebook, signed in, chatting and sharing with your friends. Within that environment could we show users a selection of other content that would appeal to them, and tempt them to stay with our content a little bit longer, even if they weren’t on our domain.</p></blockquote>
<p>The second thing that came to mind related to the economic/business models around the app Facebook app itself:</p>
<blockquote><p>The Guardian Facebook app is a canvas app. That means the bulk of the page is served by us within an iFrame on the Facebook domain. All the revenue from advertising served in that area of the page is ours, and for launch we engaged a sponsor to take the full inventory across the app. Facebook earn the revenue from advertising placed around the edges of the page.</p></blockquote>
<p>I&#8217;m not sure if Facebook runs CPM (cost per thousand) display based ads, where advertisers pay per impression, or follow the Google AdWords model, where advertisers pay per click (PPC), but it got me wondering&#8230; A large number of folk on Facebook (and Twitter) share links to third party websites external to Facebook. As Martin Belam points out, the user return rate back to Facebook for folk visiting third party sites <em>from</em> Facebook seems very high &#8211; folk seem to follow a link from Facebook, consume that item, <em>return to Facebook</em>. Facebook makes an increasing chunk of its revenue from ads it sells on Facebook.com (though with the amount of furniture and Facebook open graph code it&#8217;s getting folk to include on their own websites, it presumably wouldn&#8217;t be so hard for them to roll out their own ad network to place ads on third party sites?) so keeping eyeballs on Facebook is presumably in their commercial interest.</p>
<p>In Twitter land, where the VC folk are presumably starting to wonder when the money tap will start to flow, I notice &#8220;sponsored tweets&#8221; are starting to appear in search results:</p>
<p><a href="http://www.flickr.com/photos/psychemedia/6806085033/" title="ANother twitter search irrelevance by psychemedia, on Flickr"><img src="http://farm8.staticflickr.com/7150/6806085033_41d5a2ab0c_z.jpg" width="544" height="253" alt="ANother twitter search irrelevance"></a></p>
<p>Relevance still appears to be quite low, possibly because they haven&#8217;t yet got enough ads to cover a wide range of keywords or prompts:</p>
<p><a href="http://www.flickr.com/photos/psychemedia/6806079193/" title="Dodgy twitter promoted tweet by psychemedia, on Flickr"><img src="http://farm8.staticflickr.com/7168/6806079193_715b438aa7_z.jpg" width="538" height="360" alt="Dodgy twitter promoted tweet"></a></p>
<p>(Personally, if the relevance score was low, I wouldn&#8217;t place the ad, or I&#8217;d serve an ad tuned to the user, rather than the content, <em>per se</em>&#8230;)</p>
<p>Again, with Twitter, a lot of sharing results in users being taken to external sites, from which they quickly return to the Twitter context. Keeping folk in the Twitter context for images and videos through pop-up viewers or embedded content in the client is also a strategy pursued in may Twitter clients.</p>
<p>So here&#8217;s the thought, though it&#8217;s probably a commercially suicidal one: at the moment, Facebook and Twitter and Google+ all <em>automatically</em> &#8220;linkify&#8221; URLs (though Google+ also takes the strategy of previewing the first few lines of a single linked to page within a Google+ post). That is, given a URL in a post, they turn it into a link. But what if they turned that linkifier off for a domain, unless a fee was paid to turn it back on. Or what if the linkifier was turned off if the number of clickthrus on links to a particular domain, or page within a domain, exceeded a particular threshold, and could only be turned on again at a metered, CPM rate. (Memories here of different models for getting folk to pay for bandwidth, because what we have here is access to bandwidth out of the immediate Facebook, Twitter or Google+ context).</p>
<p>As a revenue model, the losses associated with irritating users would probably outweigh any revenue benefits, but as a thought experiment, it maybe suggests that we need to start paying more attention to how these large attention-consuming services are increasingly trying to cocoon us in their context (anyone remember AOL, or to a lesser extent Yahoo, or Microsoft?), rather than playing nicely with the rest of the web.</p>
<p>PS Hmmm&#8230;&#8221;app&#8221;. One default interpretation of this is &#8220;app on phone&#8221;, but &#8220;Facebook app&#8221; means an app that runs on the Facebook platform&#8230; So for any give app, that it is an &#8220;app&#8221; implies that that particular variant means &#8220;software application that runs on a proprietary platform&#8221;, which might actually be a combination of hardware and software platforms (e.g. Facebook API and Android phone)???</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ouseful.wordpress.com/6975/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ouseful.wordpress.com/6975/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ouseful.wordpress.com/6975/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ouseful.wordpress.com/6975/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ouseful.wordpress.com/6975/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ouseful.wordpress.com/6975/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ouseful.wordpress.com/6975/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ouseful.wordpress.com/6975/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ouseful.wordpress.com/6975/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ouseful.wordpress.com/6975/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ouseful.wordpress.com/6975/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ouseful.wordpress.com/6975/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ouseful.wordpress.com/6975/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ouseful.wordpress.com/6975/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;blog=325417&amp;post=6975&amp;subd=ouseful&amp;ref=&amp;feed=1" width="1" height="1" /><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/ouseful?a=z8fh2RT6zgg:1MJQMXZ10OY:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/ouseful?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=z8fh2RT6zgg:1MJQMXZ10OY:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=z8fh2RT6zgg:1MJQMXZ10OY:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/ouseful?i=z8fh2RT6zgg:1MJQMXZ10OY:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=z8fh2RT6zgg:1MJQMXZ10OY:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/ouseful?i=z8fh2RT6zgg:1MJQMXZ10OY:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=z8fh2RT6zgg:1MJQMXZ10OY:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/ouseful?i=z8fh2RT6zgg:1MJQMXZ10OY:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=z8fh2RT6zgg:1MJQMXZ10OY:cGdyc7Q-1BI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=cGdyc7Q-1BI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/ouseful/~4/z8fh2RT6zgg" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.ouseful.info/2012/02/08/dangers-of-a-walled-garden/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Tony Hirst</media:title>
		</media:content>

		<media:content url="http://farm8.staticflickr.com/7150/6806085033_41d5a2ab0c_z.jpg" medium="image">
			<media:title type="html">ANother twitter search irrelevance</media:title>
		</media:content>

		<media:content url="http://farm8.staticflickr.com/7168/6806079193_715b438aa7_z.jpg" medium="image">
			<media:title type="html">Dodgy twitter promoted tweet</media:title>
		</media:content>
	<creativeCommons:license>http://creativecommons.org/licenses/by/2.0/</creativeCommons:license><feedburner:origLink>http://blog.ouseful.info/2012/02/08/dangers-of-a-walled-garden/</feedburner:origLink></item>
		<item><title>Links for 2012-02-06 [del.icio.us]</title><link>http://feedproxy.google.com/~r/ouseful/~3/wlgnyiNiyg8/feedthru</link><pubDate>Tue, 07 Feb 2012 00:00:00 PST</pubDate><guid isPermaLink="false">http://del.icio.us/psychemedia/feedthru#2012-02-06</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="http://cameronneylon.net/blog/network-enabled-research/"&gt;Science in the Open &amp;raquo; Blog Archive &amp;raquo; Network Enabled Research: Maximise scale and connectivity, minimise friction&lt;/a&gt;&lt;br/&gt;
A good overview of why the web makes a difference... That there is something qualitatively different about working as part of a large, loose, distributed network is something that is not, I think, widely appreciated...&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.cabinetoffice.gov.uk/news/behavioural-insights-team-publish-paper-fraud-error-and-debt"&gt;Behavioural Insights Team publish paper on fraud, error and debt | Cabinet Office&lt;/a&gt;&lt;br/&gt;
I always feel as if the phrase "Behavioural Insights Team" has something of the Orwellian nightmare about it...&lt;/li&gt;
&lt;/ul&gt;&lt;img src="http://feeds.feedburner.com/~r/ouseful/~4/wlgnyiNiyg8" height="1" width="1"/&gt;</description><feedburner:origLink>http://del.icio.us/psychemedia/feedthru#2012-02-06</feedburner:origLink></item><item>
		<title>Visualising Activity Around a Twitter Hashtag or Search Term Using R</title>
		<link>http://feedproxy.google.com/~r/ouseful/~3/VAyVFcp3ZCs/</link>
		<comments>http://blog.ouseful.info/2012/02/06/visualising-activity-round-a-twitter-hashtag-or-search-term-using-r/#comments</comments>
		<pubDate>Mon, 06 Feb 2012 13:14:19 +0000</pubDate>
		<dc:creator>Tony Hirst</dc:creator>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Visualisation]]></category>
		<category><![CDATA[Anything you want]]></category>
		<category><![CDATA[Rstats]]></category>

		<guid isPermaLink="false">http://blog.ouseful.info/?p=6982</guid>
		<description>I think one of valid criticisms around a lot of the visualisations I post here and on my various #f1datajunkie blogs is that I often don&amp;#8217;t post any explanatory context around the visualisations. This is partly a result of the way I use my blog posts in a selfish way to document the evolution of [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;amp;blog=325417&amp;amp;post=6982&amp;amp;subd=ouseful&amp;amp;ref=&amp;amp;feed=1" width="1" height="1" /&gt;</description>
			<content:encoded><![CDATA[<p>I think one of valid criticisms around a lot of the visualisations I post here and on my various #f1datajunkie blogs is that I often don&#8217;t post any explanatory context around the visualisations. This is partly a result of the way I use my blog posts in a selfish way to document the evolution of my own practice, but not necessarily the &#8220;so what&#8221; elements that represent any meaning or sense I take from the visualisations. In many cases, this is because the understanding I come to of a dataset is typically the result of an (inter)active exploration of the data set; what I blog are the pieces of the puzzle that show how I personally set about developing a conversation with a dataset, pieces that you can try out if you want to&#8230;;-)</p>
<p>An approach that might get me more readers would be to post commentary around what I&#8217;ve learned about a dataset from having a conversation with it. A good example of this can be seen in @mediaczar&#8217;s post on <a href="http://blog.magicbeanlab.com/networkanalysis/how-should-page-admins-deal-with-flame-wars/">How should Page Admins deal with Flame Wars?</a>, where this visualisation of activity around a Facebook post is analysed in terms of effective (or not!) strategies for moderating a flame war.</p>
<p><img src="http://blog.magicbeanlab.com/wp-content/uploads/Kingshthorpe-Comments.png" alt="@mediaczar visualisation of engagement around facebook flamewars" /></p>
<p>The chart shows a sequential ordering of posts in the order they were made along the x-axis, and the unique individual responsible for each post, ordered by accession to the debate along the y-axis. For interpretation and commentary, see the original post: <a href="http://blog.magicbeanlab.com/networkanalysis/how-should-page-admins-deal-with-flame-wars/">How should Page Admins deal with Flame Wars?</a> ;-)</p>
<p>One take away of the chart for me is that it provides a great snapshot of new people entering into a conversation (vertical lines) as well as engagement by an individual (horizontal lines). If we use a time proportional axis on x, we can also see engagement over time.</p>
<p>In a Twitter context, it&#8217;s likely that a rapid increase in numbers of folk engaging with a hashtag, for example, might be the result of an RT related burst of activity. For folk who have already engaged in hashtag usage, for example as part of a live event backhannel, a large number of near co-occurring tweets that are not RTs might signal some notable happenstance within the event.</p>
<p>To explore this idea, here&#8217;s a quick bit of R tooling inspired by Mat&#8217;s post&#8230; It uses the twitteR library and sources tweets via a Twitter search.</p>
<p><pre class="brush: r;">require(twitteR)
#Pull in a search around a hashtag.
searchTerm='#ukgc12'
rdmTweets &lt;- searchTwitter(searchTerm, n=500)
# Note that the Twitter search API only goes back 1500 tweets

#Plot of tweet behaviour by user over time
#Based on @mediaczar's http://blog.magicbeanlab.com/networkanalysis/how-should-page-admins-deal-with-flame-wars/
#Make use of a handy dataframe creating twitteR helper function
tw.df=twListToDF(rdmTweets)
#@mediaczar's plot uses a list of users ordered by accession to user list
## 1) find earliest tweet in searchlist for each user [ http://stackoverflow.com/a/4189904/454773 ]
require(plyr)
tw.dfx=ddply(tw.df, .var = &quot;screenName&quot;, .fun = function(x) {return(subset(x, created %in% min(created),select=c(screenName,created)))})
## 2) arrange the users in accession order
tw.dfxa=arrange(tw.dfx,-desc(created))
## 3) Use the username accession order to order the screenName factors in the searchlist
tw.df$screenName=factor(tw.df$screenName, levels = tw.dfxa$screenName)
#ggplot seems to be able to cope with time typed values...
require(ggplot2)
ggplot(tw.df)+geom_point(aes(x=created,y=screenName))
</pre></p>
<p><a href="http://ouseful.files.wordpress.com/2012/02/twitteractivityperuserbytime.png"><img src="http://ouseful.files.wordpress.com/2012/02/twitteractivityperuserbytime.png?w=700" alt="" title="twitterActivityPerUserByTime" class="alignnone size-medium wp-image-6985" /></a></p>
<p>We can get a feeling for which occurrences were old-style RTs by identifying tweets that start with a classic RT, and then colouring each tweet appropriately (note there may be some overplotting/masking of points&#8230;I&#8217;m not sure how big the x-axis time bins are&#8230;)</p>
<p><pre class="brush: r;">#Identify and colour the RTs...
library(stringr)
#A helper function to remove @ symbols from user names...
trim &lt;- function (x) sub('@','',x)
#Identify classic style RTs
tw.df$rt=sapply(tw.df$text,function(tweet) trim(str_match(tweet,&quot;^RT (@[[:alnum:]_]*)&quot;)[2]))
tw.df$rtt=sapply(tw.df$rt,function(rt) if (is.na(rt)) 'T' else 'RT')
ggplot(tw.df)+geom_point(aes(x=created,y=screenName,col=rtt))</pre></p>
<p><a href="http://ouseful.files.wordpress.com/2012/02/twcoversationperuserwithrts.png"><img src="http://ouseful.files.wordpress.com/2012/02/twcoversationperuserwithrts.png?w=700" alt="" title="twConversationPerUserWithRTs" class="alignnone size-medium wp-image-6986" /></a></p>
<p>So now we can see when folk entered into the hashtag community via a classic RT.</p>
<p>We can also start to explore who was classically retweeted when:</p>
<p><pre class="brush: r;">#Generate a plot showing how a person is RTd
tw.df$rtof=sapply(tw.df$text,function(tweet) trim(str_match(tweet,&quot;^RT (@[[:alnum:]_]*)&quot;)[2]))
#Note that this doesn't show how many RTs each person got in a given time period if they got more than one...
ggplot(subset(tw.df,subset=(!is.na(rtof))))+geom_point(aes(x=created,y=rtof))</pre></p>
<p><a href="http://ouseful.files.wordpress.com/2012/02/twwhenfolkrtd.png"><img src="http://ouseful.files.wordpress.com/2012/02/twwhenfolkrtd.png?w=700" alt="" title="twWhenFolkRTd" class="alignnone size-medium wp-image-6987" /></a></p>
<p>Another view might show who was classically RTd by whom (activity along a row indicating someone was retweeted a lot through one or more tweets, activity within a column identifying an individual who RTs a lot&#8230;):</p>
<p><pre class="brush: r;">#We can start to get a feel for who RTs whom...
require(gdata)
#We don't want to display screenNames of folk who tweeted but didn't RT
tw.df.rt=drop.levels(subset(tw.df,subset=(!is.na(rtof))))
#Order the screennames of folk who did RT by accession order (ie order in which they RTd)
tw.df.rta=arrange(ddply(tw.df.rt, .var = &quot;screenName&quot;, .fun = function(x) {return(subset(x, created %in% min(created),select=c(screenName,created)))}),-desc(created))
tw.df.rt$screenName=factor(tw.df.rt$screenName, levels = tw.df.rta$screenName)
# Plot who RTd whom
ggplot(subset(tw.df.rt,subset=(!is.na(rtof))))+geom_point(aes(x=screenName,y=rtof))+opts(axis.text.x=theme_text(angle=-90,size=6)) + xlab(NULL)</pre></p>
<p><a href="http://ouseful.files.wordpress.com/2012/02/twwhortbywhoh.png"><img src="http://ouseful.files.wordpress.com/2012/02/twwhortbywhoh.png?w=700" alt="" title="twWhoRTbyWhom"   class="alignnone size-full wp-image-6990" /></a></p>
<p>What sense you might make of all this, or where to take it next, is down to you of course&#8230; Err, erm&#8230;?! ;-)</p>
<p>PS see also: <a href="http://blog.ouseful.info/2012/02/06/visualising-activity-round-a-twitter-hashtag-or-search-term-using-r/">http://blog.ouseful.info/2012/01/21/a-quick-view-over-a-mashe-google-spreadsheet-twitter-archive-of-ukgc2012-tweets/</a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ouseful.wordpress.com/6982/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ouseful.wordpress.com/6982/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ouseful.wordpress.com/6982/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ouseful.wordpress.com/6982/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ouseful.wordpress.com/6982/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ouseful.wordpress.com/6982/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ouseful.wordpress.com/6982/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ouseful.wordpress.com/6982/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ouseful.wordpress.com/6982/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ouseful.wordpress.com/6982/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ouseful.wordpress.com/6982/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ouseful.wordpress.com/6982/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ouseful.wordpress.com/6982/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ouseful.wordpress.com/6982/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;blog=325417&amp;post=6982&amp;subd=ouseful&amp;ref=&amp;feed=1" width="1" height="1" /><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/ouseful?a=VAyVFcp3ZCs:tUJ1WnaCGyc:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/ouseful?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=VAyVFcp3ZCs:tUJ1WnaCGyc:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=VAyVFcp3ZCs:tUJ1WnaCGyc:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/ouseful?i=VAyVFcp3ZCs:tUJ1WnaCGyc:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=VAyVFcp3ZCs:tUJ1WnaCGyc:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/ouseful?i=VAyVFcp3ZCs:tUJ1WnaCGyc:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=VAyVFcp3ZCs:tUJ1WnaCGyc:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/ouseful?i=VAyVFcp3ZCs:tUJ1WnaCGyc:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=VAyVFcp3ZCs:tUJ1WnaCGyc:cGdyc7Q-1BI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=cGdyc7Q-1BI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/ouseful/~4/VAyVFcp3ZCs" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.ouseful.info/2012/02/06/visualising-activity-round-a-twitter-hashtag-or-search-term-using-r/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Tony Hirst</media:title>
		</media:content>

		<media:content url="http://blog.magicbeanlab.com/wp-content/uploads/Kingshthorpe-Comments.png" medium="image">
			<media:title type="html">@mediaczar visualisation of engagement around facebook flamewars</media:title>
		</media:content>

		<media:content url="http://ouseful.files.wordpress.com/2012/02/twitteractivityperuserbytime.png" medium="image">
			<media:title type="html">twitterActivityPerUserByTime</media:title>
		</media:content>

		<media:content url="http://ouseful.files.wordpress.com/2012/02/twcoversationperuserwithrts.png" medium="image">
			<media:title type="html">twConversationPerUserWithRTs</media:title>
		</media:content>

		<media:content url="http://ouseful.files.wordpress.com/2012/02/twwhenfolkrtd.png" medium="image">
			<media:title type="html">twWhenFolkRTd</media:title>
		</media:content>

		<media:content url="http://ouseful.files.wordpress.com/2012/02/twwhortbywhoh.png" medium="image">
			<media:title type="html">twWhoRTbyWhom</media:title>
		</media:content>
	<creativeCommons:license>http://creativecommons.org/licenses/by/2.0/</creativeCommons:license><feedburner:origLink>http://blog.ouseful.info/2012/02/06/visualising-activity-round-a-twitter-hashtag-or-search-term-using-r/</feedburner:origLink></item>
		<item>
		<title>OU Marketers Go After Competition Supported Editorial…?</title>
		<link>http://feedproxy.google.com/~r/ouseful/~3/PT8o_ySX5Hk/</link>
		<comments>http://blog.ouseful.info/2012/02/06/ou-marketers-go-after-competition-supported-editorial/#comments</comments>
		<pubDate>Mon, 06 Feb 2012 10:45:05 +0000</pubDate>
		<dc:creator>Tony Hirst</dc:creator>
				<category><![CDATA[OU2.0]]></category>

		<guid isPermaLink="false">http://blog.ouseful.info/?p=6978</guid>
		<description>Over the weekend, I noticed that the Guardian was offering readers a chance to win the chance to study for an OU degree for free. Today, via a tweet, I see a link to a piece of editorial coverage from Friday &amp;#8211; Live and learn with distance learning &amp;#8211; on some of the motivations for [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;amp;blog=325417&amp;amp;post=6978&amp;amp;subd=ouseful&amp;amp;ref=&amp;amp;feed=1" width="1" height="1" /&gt;</description>
			<content:encoded><![CDATA[<p>Over the weekend, I noticed that the Guardian was offering readers a chance to <a href="http://www.guardian.co.uk/lifeandstyle/competition/2012/feb/04/open-university-competition">win the chance to study for an OU degree for free</a>. Today, via a tweet, I see a link to a piece of editorial coverage from Friday &#8211; <a href="http://www.guardian.co.uk/money/2012/feb/03/distance-learning">Live and learn with distance learning</a> &#8211; on some of the motivations for studying for an OU degree &#8211; as well as a look at the commitment that&#8217;s involved in taking a distance learning degree.</p>
<p>The competition is prominently linked to:</p>
<p><a href="http://www.guardian.co.uk/money/2012/feb/03/distance-learning" title="Photo Sharing"><img src="http://farm8.staticflickr.com/7017/6828903939_eaf0f0b28d.jpg" width="444" height="500" alt="OU advertorial and linked competition" /></a></p>
<p>I suspect we are going to see more of this&#8230;</p>
<p>I was also interested to see this tweet from @barnstormed on Sunday: <em>Nice to see the @openuniversity on one of the electronic pitch-side advertising boards at Murrayfield :) #rugby #6nations</em> [Anyone got a screenshot?]</p>
<p>See also a previous campaign: <a href="http://ouseful.open.ac.uk/blogarchive/009219.html">OU Course Discounts with the Tesco Clubcard</a>, although I note this is about to come to an end?</p>
<p><a href="http://www.tesco.com/clubcard/deals/product.aspx?R=456&amp;bci=4294966476%7CEducation" title="Photo Sharing"><img src="http://farm8.staticflickr.com/7161/6828910077_c42666c641.jpg" width="500" height="307" alt="End of OU/Tesco Clubcard deal" /></a></p>
<p>Hmmm&#8230;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ouseful.wordpress.com/6978/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ouseful.wordpress.com/6978/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ouseful.wordpress.com/6978/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ouseful.wordpress.com/6978/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ouseful.wordpress.com/6978/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ouseful.wordpress.com/6978/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ouseful.wordpress.com/6978/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ouseful.wordpress.com/6978/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ouseful.wordpress.com/6978/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ouseful.wordpress.com/6978/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ouseful.wordpress.com/6978/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ouseful.wordpress.com/6978/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ouseful.wordpress.com/6978/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ouseful.wordpress.com/6978/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;blog=325417&amp;post=6978&amp;subd=ouseful&amp;ref=&amp;feed=1" width="1" height="1" /><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/ouseful?a=PT8o_ySX5Hk:O1hrb-GsPaw:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/ouseful?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=PT8o_ySX5Hk:O1hrb-GsPaw:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=PT8o_ySX5Hk:O1hrb-GsPaw:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/ouseful?i=PT8o_ySX5Hk:O1hrb-GsPaw:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=PT8o_ySX5Hk:O1hrb-GsPaw:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/ouseful?i=PT8o_ySX5Hk:O1hrb-GsPaw:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=PT8o_ySX5Hk:O1hrb-GsPaw:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/ouseful?i=PT8o_ySX5Hk:O1hrb-GsPaw:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=PT8o_ySX5Hk:O1hrb-GsPaw:cGdyc7Q-1BI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=cGdyc7Q-1BI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/ouseful/~4/PT8o_ySX5Hk" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.ouseful.info/2012/02/06/ou-marketers-go-after-competition-supported-editorial/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Tony Hirst</media:title>
		</media:content>

		<media:content url="http://farm8.staticflickr.com/7017/6828903939_eaf0f0b28d.jpg" medium="image">
			<media:title type="html">OU advertorial and linked competition</media:title>
		</media:content>

		<media:content url="http://farm8.staticflickr.com/7161/6828910077_c42666c641.jpg" medium="image">
			<media:title type="html">End of OU/Tesco Clubcard deal</media:title>
		</media:content>
	<creativeCommons:license>http://creativecommons.org/licenses/by/2.0/</creativeCommons:license><feedburner:origLink>http://blog.ouseful.info/2012/02/06/ou-marketers-go-after-competition-supported-editorial/</feedburner:origLink></item>
		<item>
		<title>When A Comment Spammer’s Script Goes Wrong 2</title>
		<link>http://feedproxy.google.com/~r/ouseful/~3/dmVHqHxvei4/</link>
		<comments>http://blog.ouseful.info/2012/02/06/when-a-comment-spammers-script-goes-wrong-2/#comments</comments>
		<pubDate>Mon, 06 Feb 2012 09:44:17 +0000</pubDate>
		<dc:creator>Tony Hirst</dc:creator>
				<category><![CDATA[Anything you want]]></category>
		<category><![CDATA[comment spam]]></category>

		<guid isPermaLink="false">http://blog.ouseful.info/?p=6971</guid>
		<description>Another insight into the world of comment spam engines, a set of spam comments that were dumped wholesale as a single comment on one of my posts over the weekend&amp;#8230; Here&amp;#8217;s the file&amp;#8230; Hello there, just became alert to your weblog thru Google, and located that it is truly informative. I’m going to be careful [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;amp;blog=325417&amp;amp;post=6971&amp;amp;subd=ouseful&amp;amp;ref=&amp;amp;feed=1" width="1" height="1" /&gt;</description>
			<content:encoded><![CDATA[<p>Another insight into the world of comment spam engines, a set of spam comments that were dumped wholesale as a single comment on one of my posts over the weekend&#8230;</p>
<p><a href="http://www.flickr.com/photos/psychemedia/6828732115/" title="Photo Sharing"><img src="http://farm8.staticflickr.com/7166/6828732115_40d760f227.jpg" width="500" height="318" alt="Commment spam script - as worldcloud" /></a></p>
<p>Here&#8217;s the file&#8230;</p>
<blockquote><p>Hello there, just became alert to your weblog thru Google, and located that it is truly informative. I’m going to be careful for brussels. I will be grateful if you happen to continue this in future. Lots of people will likely be benefited from your writing. Cheers!<br />
It is perfect time to make some plans for the future and it’s time to be happy. I have read this put up and if I may I desire to counsel you some attention-grabbing things or suggestions. Maybe you could write subsequent articles regarding this article. I want to read more things approximately it!<br />
Excellent post. I used to be checking constantly this weblog and I am inspired! Extremely helpful information specifically the closing part :) I deal with such information much. I used to be seeking this certain info for a long time. Thanks and good luck.<br />
hello there and thank you in your information ? I have definitely picked up something new from proper here. I did alternatively expertise a few technical points using this site, as I skilled to reload the web site many times prior to I may just get it to load properly. I were thinking about in case your hosting is OK? Not that I’m complaining, but sluggish loading circumstances instances will often affect your placement in google and could injury your high-quality rating if advertising and marketing with Adwords. Well I am including this RSS to my email and can look out for a lot extra of your respective fascinating content. Ensure that you update this once more soon..<br />
Excellent goods from you, man. I have take into account your stuff prior to and you’re just too wonderful. I actually like what you’ve acquired here, certainly like what you’re saying and the way in which wherein you say it. You make it enjoyable and you continue to care for to stay it sensible. I cant wait to read much more from you. That is actually a terrific web site.<br />
Very nice post. I simply stumbled upon your blog and wished to mention that I have really enjoyed surfing around your blog posts. In any case I will be subscribing on your rss feed and I am hoping you write again very soon!<br />
I just like the helpful information you supply for your articles. I will bookmark your blog and test once more here frequently. I am moderately certain I’ll be informed lots of new stuff proper right here! Best of luck for the following!<br />
I feel this is one of the most vital information for me. And i’m happy reading your article. However should statement on some basic things, The site taste is great, the articles is in reality nice : D. Good process, cheers<br />
We are a group of volunteers and starting a new scheme in our community. Your site offered us with useful information to paintings on. You’ve performed an impressive task and our whole neighborhood might be grateful to you.<br />
Undeniably consider that which you said. Your favorite justification seemed to be on the net the simplest factor to be mindful of. I say to you, I certainly get annoyed even as other people consider concerns that they plainly do not know about. You controlled to hit the nail upon the highest and defined out the whole thing with no need side-effects , folks can take a signal. Will probably be again to get more. Thank you<br />
That is very attention-grabbing, You’re an excessively skilled blogger. I’ve joined your feed and look forward to in quest of extra of your magnificent post. Also, I’ve shared your web site in my social networks<br />
Hello There. I discovered your blog using msn. This is a very neatly written article. I will be sure to bookmark it and come back to learn more of your useful information. Thank you for the post. I’ll certainly return.<br />
I beloved up to you will obtain carried out proper here. The caricature is tasteful, your authored material stylish. nonetheless, you command get bought an nervousness over that you would like be handing over the following. in poor health unquestionably come more before again since exactly the similar nearly very ceaselessly inside case you defend this increase.<br />
Hello, i believe that i saw you visited my web site so i came to return the want?.I am trying to find things to enhance my site!I assume its good enough to use some of your ideas!!<br />
Just want to say your article is as amazing. The clarity in your put up is simply great and that i can suppose you are a professional in this subject. Well together with your permission let me to grasp your feed to stay up to date with approaching post. Thank you one million and please carry on the gratifying work.<br />
Its such as you learn my mind! You seem to know so much about this, such as you wrote the ebook in it or something. I think that you simply could do with a few percent to drive the message house a little bit, however other than that, that is wonderful blog. A great read. I will definitely be back.<br />
Thanks for the good writeup. It actually was once a amusement account it. Look advanced to more delivered agreeable from you! However, how can we be in contact?<br />
Hey there, You have done a fantastic job. I’ll definitely digg it and personally recommend to my friends. I am confident they’ll be benefited from this website.<br />
Magnificent beat ! I wish to apprentice whilst you amend your site, how could i subscribe for a weblog website? The account aided me a appropriate deal. I have been a little bit familiar of this your broadcast offered vibrant transparent concept<br />
I’m really impressed with your writing abilities as neatly as with the structure on your weblog. Is this a paid theme or did you modify it yourself? Anyway stay up the excellent high quality writing, it’s rare to peer a nice blog like this one today..<br />
Attractive component of content. I just stumbled upon your blog and in accession capital to claim that I get actually loved account your blog posts. Any way I’ll be subscribing on your feeds and even I fulfillment you get admission to persistently fast.<br />
My brother recommended I may like this website. He was once totally right. This publish actually made my day. You can not consider just how much time I had spent for this information! Thanks!<br />
I don’t even understand how I finished up here, but I thought this publish was once great. I do not recognize who you might be but definitely you’re going to a famous blogger for those who are not already. Cheers!<br />
Heya i’m for the first time here. I found this board and I find It truly useful &amp; it helped me out much. I hope to give one thing again and help others like you aided me.<br />
I was recommended this blog by means of my cousin. I am not sure whether or not this post is written by means of him as no one else realize such certain about my difficulty. You’re incredible! Thank you!<br />
Excellent weblog right here! Also your web site quite a bit up fast! What host are you the use of? Can I am getting your affiliate hyperlink for your host? I desire my site loaded up as quickly as yours lol<br />
Wow, amazing blog structure! How lengthy have you been blogging for? you make running a blog look easy. The total glance of your site is wonderful, as smartly as the content material!<br />
I am now not certain where you’re getting your info, however great topic. I needs to spend some time learning much more or figuring out more. Thank you for magnificent information I used to be in search of this info for my mission.<br />
You really make it appear really easy together with your presentation however I find this topic to be actually one thing which I think I would never understand. It seems too complicated and extremely huge for me. I am taking a look forward in your subsequent post, I’ll try to get the cling of it!<br />
I’ve been browsing online more than three hours as of late, but I never found any attention-grabbing article like yours. It’s beautiful value sufficient for me. In my opinion, if all site owners and bloggers made good content material as you did, the web shall be much more helpful than ever before.<br />
I do agree with all of the ideas you have introduced on your post. They are really convincing and will definitely work. Nonetheless, the posts are too short for novices. May you please lengthen them a bit from next time? Thanks for the post.<br />
You can definitely see your skills in the work you write. The world hopes for even more passionate writers like you who aren’t afraid to mention how they believe. All the time go after your heart.<br />
I’ll immediately snatch your rss feed as I can’t in finding your e-mail subscription link or e-newsletter service. Do you have any? Please allow me recognize in order that I could subscribe. Thanks.<br />
Someone essentially lend a hand to make severely posts I might state. This is the very first time I frequented your web page and thus far? I amazed with the research you made to make this particular post incredible. Wonderful activity!<br />
Great site. Lots of useful info here. I am sending it to a few buddies ans also sharing in delicious. And of course, thank you on your sweat!<br />
hello!,I really like your writing very much! proportion we keep in touch extra about your article on AOL? I need a specialist on this space to unravel my problem. Maybe that is you! Looking ahead to look you.<br />
Amazing issues here. I’m very happy to peer your article. Thanks a lot and I’m looking forward to touch you. Will you kindly drop me a e-mail?<br />
I simply couldn’t go away your website prior to suggesting that I actually enjoyed the usual information a person provide in your visitors? Is gonna be back ceaselessly in order to inspect new posts<br />
You are actually a good webmaster. The site loading velocity is amazing. It sort of feels that you’re doing any unique trick. In addition, The contents are masterpiece. you’ve performed a excellent task on this subject!<br />
Thanks a bunch for sharing this with all folks you really know what you’re talking approximately! Bookmarked. Please also seek advice from my website =). We will have a hyperlink change contract between us<br />
Wonderful paintings! This is the type of info that are meant to be shared across the internet. Disgrace on the search engines for now not positioning this put up higher! Come on over and visit my web site . Thanks =)<br />
Helpful info. Lucky me I found your web site by chance, and I am stunned why this coincidence did not took place in advance! I bookmarked it.<br />
I’ve been exploring for a bit for any high quality articles or weblog posts on this sort of house . Exploring in Yahoo I eventually stumbled upon this website. Studying this information So i am happy to show that I have an incredibly just right uncanny feeling I found out exactly what I needed. I most surely will make certain to don?t forget this web site and give it a look on a relentless basis.<br />
Woah this weblog is fantastic i like studying your articles. Stay up the good work! You realize, many persons are looking around for this information, you can help them greatly.<br />
I get pleasure from, result in I found exactly what I used to be taking a look for. You have ended my 4 day lengthy hunt! God Bless you man. Have a great day. Bye<br />
Thanks for another fantastic article. Where else may anybody get that type of information in such a perfect method of writing? I’ve a presentation next week, and I am on the look for such info.<br />
It’s actually a cool and helpful piece of information. I’m happy that you just shared this helpful information with us. Please stay us up to date like this. Thank you for sharing.<br />
Wonderful post, very informative. I ponder why the opposite specialists of this sector do not notice this. You should continue your writing. I am sure, you have a huge readers’ base already!|What’s Going down i’m new to this, I stumbled upon this I have found It absolutely useful and it has helped me out loads. I’m hoping to give a contribution &amp; aid other customers like its aided me. Great job.<br />
Thank you, I’ve recently been looking for information approximately this subject for a long time and yours is the best I’ve found out till now. However, what about the conclusion? Are you positive in regards to the source?|What i don’t understood is in reality how you are not actually much more neatly-liked than you may be now. You’re so intelligent.<br />
You already know therefore significantly in relation to this matter, produced me personally believe it from a lot of numerous angles. Its like women and men are not fascinated unless it’s something to accomplish with Girl gaga! Your own stuffs great. Always handle it up!<br />
Generally I don’t learn article on blogs, however I would like to say that this write-up very pressured me to take a look at and do it! Your writing taste has been amazed me. Thank you, quite great post.<br />
Hello my loved one! I want to say that this post is awesome, nice written and come with almost all vital infos. I would like to see extra posts like this .<br />
obviously like your web site however you have to test the spelling on quite a few of your posts. Many of them are rife with spelling problems and I find it very troublesome to tell the reality then again I’ll definitely come again again.<br />
Hello, Neat post. There is a problem along with your web site in web explorer, would check this? IE nonetheless is the marketplace chief and a large component of people will miss your great writing because of this problem.<br />
I have read several good stuff here. Definitely worth bookmarking for revisiting. I surprise how so much attempt you put to make this sort of wonderful informative site.<br />
Howdy very nice web site!! Man .. Excellent .. Superb .. I’ll bookmark your website and take the feeds additionally?I’m satisfied to search out so many useful info here within the post, we need develop extra techniques on this regard, thank you for sharing. . . . . .<br />
It’s in point of fact a nice and useful piece of information. I’m happy that you just shared this helpful info with us. Please keep us up to date like this. Thanks for sharing.<br />
Wonderful issues altogether, you simply won a emblem new reader. What might you suggest in regards to your post that you simply made a few days in the past? Any positive?<br />
Thanks for any other informative blog. Where else may I get that kind of info written in such a perfect method? I’ve a venture that I’m simply now operating on, and I have been at the look out for such information.<br />
Hi there, I discovered your blog by the use of Google at the same time as searching for a related topic, your site came up, it looks great. I’ve bookmarked to my favourites|added to bookmarks.</p></blockquote>
<p>See also: <a href="http://blog.ouseful.info/2011/11/30/when-a-comment-spammers-script-goes-wrong/">When a Comment Spammer’s Script Goes Wrong…</a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ouseful.wordpress.com/6971/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ouseful.wordpress.com/6971/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ouseful.wordpress.com/6971/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ouseful.wordpress.com/6971/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ouseful.wordpress.com/6971/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ouseful.wordpress.com/6971/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ouseful.wordpress.com/6971/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ouseful.wordpress.com/6971/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ouseful.wordpress.com/6971/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ouseful.wordpress.com/6971/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ouseful.wordpress.com/6971/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ouseful.wordpress.com/6971/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ouseful.wordpress.com/6971/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ouseful.wordpress.com/6971/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;blog=325417&amp;post=6971&amp;subd=ouseful&amp;ref=&amp;feed=1" width="1" height="1" /><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/ouseful?a=dmVHqHxvei4:F2hPfQkNb_s:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/ouseful?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=dmVHqHxvei4:F2hPfQkNb_s:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=dmVHqHxvei4:F2hPfQkNb_s:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/ouseful?i=dmVHqHxvei4:F2hPfQkNb_s:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=dmVHqHxvei4:F2hPfQkNb_s:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/ouseful?i=dmVHqHxvei4:F2hPfQkNb_s:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=dmVHqHxvei4:F2hPfQkNb_s:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/ouseful?i=dmVHqHxvei4:F2hPfQkNb_s:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=dmVHqHxvei4:F2hPfQkNb_s:cGdyc7Q-1BI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=cGdyc7Q-1BI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/ouseful/~4/dmVHqHxvei4" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.ouseful.info/2012/02/06/when-a-comment-spammers-script-goes-wrong-2/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Tony Hirst</media:title>
		</media:content>

		<media:content url="http://farm8.staticflickr.com/7166/6828732115_40d760f227.jpg" medium="image">
			<media:title type="html">Commment spam script - as worldcloud</media:title>
		</media:content>
	<creativeCommons:license>http://creativecommons.org/licenses/by/2.0/</creativeCommons:license><feedburner:origLink>http://blog.ouseful.info/2012/02/06/when-a-comment-spammers-script-goes-wrong-2/</feedburner:origLink></item>
		<item><title>Links for 2012-02-03 [del.icio.us]</title><link>http://feedproxy.google.com/~r/ouseful/~3/QSSiA9HpSFA/feedthru</link><pubDate>Sat, 04 Feb 2012 00:00:00 PST</pubDate><guid isPermaLink="false">http://del.icio.us/psychemedia/feedthru#2012-02-03</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="http://digital.cabinetoffice.gov.uk/2012/02/03/government-policy-a-spotters-guide/"&gt;Government policy &amp;ndash; a spotter&amp;rsquo;s guide | Government Digital Service&lt;/a&gt;&lt;br/&gt;
I've been trying to get my head around what (government) policy actually is in the sense of "policy development", and it seems the gov.uk folk have come up with a pragmatic definition - "statements of the government’s position, intent or action" - that will shape the area of the gov.uk website describing govt policy and actions around it under the following headings:

The issue, problem or opportunity, and gov’s aims
Actions, what govt is doing/will do/has done to address problem or seize opportunity
Background, how policy has developed to date, why govt has chosen this course and rejected other options, including evidence
Engagement, who govt has asked/is asking/will ask, when and how
Impact, who benefits or is otherwise affected
Bills and legislation, the legal framework in which this policy is operating/how policy might change that legislation
Partner organisations, what govt and NGOs are involved, and in what capacity

Related news, speeches, publications and consultation&lt;/li&gt;
&lt;/ul&gt;&lt;img src="http://feeds.feedburner.com/~r/ouseful/~4/QSSiA9HpSFA" height="1" width="1"/&gt;</description><feedburner:origLink>http://del.icio.us/psychemedia/feedthru#2012-02-03</feedburner:origLink></item><item>
		<title>Journalist Filters on Twitter – The Reuters View</title>
		<link>http://feedproxy.google.com/~r/ouseful/~3/3jIObOe0yhw/</link>
		<comments>http://blog.ouseful.info/2012/02/03/journalist-filters-on-twitter-the-reuters-view/#comments</comments>
		<pubDate>Fri, 03 Feb 2012 12:57:58 +0000</pubDate>
		<dc:creator>Tony Hirst</dc:creator>
				<category><![CDATA[Infoskills]]></category>
		<category><![CDATA[onlinejournalismblog]]></category>

		<guid isPermaLink="false">http://blog.ouseful.info/?p=6962</guid>
		<description>It seems that Reuters has a new product out &amp;#8211; Reuters Social Pulse. As well as highlighting &amp;#8220;the stories being talked about by the newsmakers we follow&amp;#8221;, there is an area highlighting &amp;#8220;the Reuters &amp;#38; Klout 50 where we rank America’s most social CEOs.&amp;#8221; Of note here is that this list is ordered by Klout [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;amp;blog=325417&amp;amp;post=6962&amp;amp;subd=ouseful&amp;amp;ref=&amp;amp;feed=1" width="1" height="1" /&gt;</description>
			<content:encoded><![CDATA[<p>It seems that Reuters has a new product out &#8211; <a href="http://blogs.reuters.com/mediafile/2012/02/02/reuters-social-pulse/">Reuters Social Pulse</a>. As well as highlighting &#8220;the stories being talked about by the newsmakers we follow&#8221;, there is an area highlighting &#8220;the <a href="http://www.reuters.com/reuters-klout-50">Reuters &amp; Klout 50</a> where we rank America’s most social CEOs.&#8221; Of note here is that this list is ordered by Klout score. Reuters don&#8217;t own Klout (yet?!) do they?! </p>
<p>The offering also includes a view of the world through the tweets of Reuters own staff. Apparently, &#8220;Reuters has over 3,000 journalists around the world, many of whom are doing amazing work on Twitter. That is too many to keep up with on a Twitter list, so we created a directory <a href="http://www.reuters.com/journalist-twitter-directory">Reuters Twitter Directory</a>] that shows you our best tweeters by topic. It let’s you find our reporters, bloggers and editors by category and location so you can drill down to business journalists in India, if you so choose, or tech writers in the UK.&#8221;</p>
<p>If you view the source of Reuters Twitter directory page, you can find a Javascript object that lists all(?) the folk in the Reuters Twitter directory and the tags they are associated with&#8230; Hmm, I thought&#8230; Hmmm&#8230;</p>
<p>If we grab that object, and pop it into Python, it&#8217;s easy enough to create a  bipartite network that links journalists to the categories they are associated with:</p>
<p><pre class="brush: python;">import simplejson
import networkx as nx
#http://mlg.ucd.ie/files/summer/tutorial.pdf
from networkx.algorithms import bipartite

g = nx.Graph()

#need to bring in reutersJournalistList
users=simplejson.loads(reutersJournalistList)

#I had some 'issues' with the parsing for some reason? Required this hack in the end...
for user in users:
	for x in user:
		if x=='users':
			u=user[x][0]['twitter_screen_name']
			print 'user:',user[x][0]['twitter_screen_name']
			for topic in user[x][0]['topics']:
				print '- topic:',topic
				#Add edges from journalist name to each tag they are associated with
				g.add_edge(u,topic)
#print bipartite.is_bipartite(g)
#print bipartite.sets(g)

#Save a graph file we can visualise in Gephi corresponding to bipartite graph
nx.write_graphml(g, &quot;usertags.graphml&quot;)

#We can find the sets of names/tags associated with the disjoint sets in the graph
users,tags=bipartite.sets(g)

#Collapse the bipartite graph to a graph of journalists connected via a common tag
ugraph= bipartite.projected_graph(g, users)
nx.write_graphml(ugraph, &quot;users.graphml&quot;)

#Collapse the bipartite graph to a set of tags connected via a common journalist
tgraph= bipartite.projected_graph(g, tags)
nx.write_graphml(tgraph, &quot;tags.graphml&quot;)

#Dump a list of the journalists Twitter IDs
f=open(&quot;users.txt&quot;,&quot;w+&quot;)
for uo in users: f.write(uo+'\n')
f.close()</pre></p>
<p>Having generated graph files, we can then look to see how the tags cluster as a result of how they were applied to journalists associated with several tags:</p>
<p><a href="http://www.flickr.com/photos/psychemedia/6811403673/" title="Photo Sharing"><img src="http://farm8.staticflickr.com/7030/6811403673_65a245e938.jpg" width="500" height="428" alt="Reuters journalists twitter directory cotags" /></a></p>
<p>Alternatively, we can look to see which journalists are connected by virtue of being associated with similar tags (hmm, I wonder if edge weight carries information about how many tags each connected pair may be associated through?). In this case, I size the nodes by betweenness centrality to try to highlight journalists that bridge topic areas:</p>
<p><a href="http://www.flickr.com/photos/psychemedia/6811417129/" title="Photo Sharing"><img src="http://farm8.staticflickr.com/7166/6811417129_1da8b8329e.jpg" width="500" height="447" alt="Reuters twitter journalists list via cotags, sized by betweenness centrality" /></a></p>
<p>Association through shared tags (as applied by Reuters) is one thing, but there is also structure arising from friendship networks&#8230;So to what extent do the Reuters Twitter List journalists follow each other (again, sizing by betweenness centrality):</p>
<p><a href="http://www.flickr.com/photos/psychemedia/6811428169/" title="Photo Sharing"><img src="http://farm8.staticflickr.com/7156/6811428169_4c0727687e.jpg" width="500" height="479" alt="Reuters twitter journalists friend connections sized by betweenness centrality" /></a></p>
<p>Finally, here&#8217;s a quick look at folk followed by 15 or more of the folk in the Reuters Twitter journalists list: this is the common source area on Twitter for the journalists on the list. This time, I size nodes by eigenvector centrality.</p>
<p><a href="http://www.flickr.com/photos/psychemedia/6811440445/" title="Photo Sharing"><img src="http://farm8.staticflickr.com/7173/6811440445_429fe3cf98.jpg" width="500" height="423" alt="FOlk followed by 15 or more of folk on reuters twitter journliasts list, size by eigenvector centrality" /></a></p>
<p>So why bother with this? Because journalists provide a filter onto the way the world is reported to us through the media, and as a result the perspective we have of the world as portrayed through the media.  If we see journalists as providing independent fairwitness services, then having some sort of idea about the extent to which they are sourcing their information severally, or from a common pool, can be handy. In the above diagram, for example, I try to highlight common sources (folk followed by at least 15 of the journalists on the Twitter list). But I could equally have got a feeling for the range of sources by producing a much larger and sparser graph, such as all the folk followed by journalists on the list, or folk followed by only 1 person on the list (40,000 people or so in all &#8211; see below), or by 2 to 5 people on the list&#8230;</p>
<p><a href="http://www.flickr.com/photos/psychemedia/6811635317/" title="Photo Sharing"><img src="http://farm8.staticflickr.com/7165/6811635317_4c81a789a2.jpg" width="500" height="441" alt="The twitterverse as directly and publicly followed by folk on the Reuters Journalists twitter list" /></a></p>
<p>Friends lists are one sort of filter every Twitter user has onto the content been shared on Twitter, and something that&#8217;s easy to map. There are other views of course &#8211; the list of people mentioning a user is readily available to every Twitter user, and it&#8217;s easy enough to set up views around particular hashtags or search terms. Grabbing the journalists associated with one or more particular tags, and then mapping their friends (or, indeed, followers) is also possible, as is grabbing the follower lists for one or more journalists and then looking to see who the friends of the followers are, thus positioning the the journalist in the social media environment as perceived by their followers.</p>
<p>I&#8217;m not sure that value Reuters sees in the stream of tweets from the folk on its Twitter journalists lists, or the Twitter networks they have built up, but the friend lenses at least we can try to map out. And via the bipartite user/tag graph, it also becomes trivial for us to find journalists with interests in Facebook and advertising, for example&#8230;</p>
<p>PS for associated techniques related to the <em>emergent social positioning</em> of hashtags and shared links on Twitter, see <a href="http://blog.ouseful.info/2012/01/08/socially-positioning-sherlock/">Socially Positioning #Sherlock and Dr John Watson’s Blog&#8230;</a> and <a href="http://blog.ouseful.info/2012/01/26/social-media-interest-maps-of-newsnight-and-bbcqt-twitterers/">Social Media Interest Maps of Newsnight and BBCQT Twitterers</a>. For a view over @skynews Twitter friends, and how they connect, see <a href="http://emergentsocialpositioning.posterous.com/visualising-how-skynews-twitter-friends-conne">Visualising How @skynews&#8217; Twitter Friends Connect</a>.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ouseful.wordpress.com/6962/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ouseful.wordpress.com/6962/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ouseful.wordpress.com/6962/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ouseful.wordpress.com/6962/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ouseful.wordpress.com/6962/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ouseful.wordpress.com/6962/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ouseful.wordpress.com/6962/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ouseful.wordpress.com/6962/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ouseful.wordpress.com/6962/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ouseful.wordpress.com/6962/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ouseful.wordpress.com/6962/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ouseful.wordpress.com/6962/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ouseful.wordpress.com/6962/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ouseful.wordpress.com/6962/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;blog=325417&amp;post=6962&amp;subd=ouseful&amp;ref=&amp;feed=1" width="1" height="1" /><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/ouseful?a=3jIObOe0yhw:luZ6HKn3lYU:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/ouseful?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=3jIObOe0yhw:luZ6HKn3lYU:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=3jIObOe0yhw:luZ6HKn3lYU:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/ouseful?i=3jIObOe0yhw:luZ6HKn3lYU:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=3jIObOe0yhw:luZ6HKn3lYU:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/ouseful?i=3jIObOe0yhw:luZ6HKn3lYU:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=3jIObOe0yhw:luZ6HKn3lYU:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/ouseful?i=3jIObOe0yhw:luZ6HKn3lYU:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=3jIObOe0yhw:luZ6HKn3lYU:cGdyc7Q-1BI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=cGdyc7Q-1BI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/ouseful/~4/3jIObOe0yhw" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.ouseful.info/2012/02/03/journalist-filters-on-twitter-the-reuters-view/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Tony Hirst</media:title>
		</media:content>

		<media:content url="http://farm8.staticflickr.com/7030/6811403673_65a245e938.jpg" medium="image">
			<media:title type="html">Reuters journalists twitter directory cotags</media:title>
		</media:content>

		<media:content url="http://farm8.staticflickr.com/7166/6811417129_1da8b8329e.jpg" medium="image">
			<media:title type="html">Reuters twitter journalists list via cotags, sized by betweenness centrality</media:title>
		</media:content>

		<media:content url="http://farm8.staticflickr.com/7156/6811428169_4c0727687e.jpg" medium="image">
			<media:title type="html">Reuters twitter journalists friend connections sized by betweenness centrality</media:title>
		</media:content>

		<media:content url="http://farm8.staticflickr.com/7173/6811440445_429fe3cf98.jpg" medium="image">
			<media:title type="html">FOlk followed by 15 or more of folk on reuters twitter journliasts list, size by eigenvector centrality</media:title>
		</media:content>

		<media:content url="http://farm8.staticflickr.com/7165/6811635317_4c81a789a2.jpg" medium="image">
			<media:title type="html">The twitterverse as directly and publicly followed by folk on the Reuters Journalists twitter list</media:title>
		</media:content>
	<creativeCommons:license>http://creativecommons.org/licenses/by/2.0/</creativeCommons:license><feedburner:origLink>http://blog.ouseful.info/2012/02/03/journalist-filters-on-twitter-the-reuters-view/</feedburner:origLink></item>
		<item><title>Links for 2012-02-02 [del.icio.us]</title><link>http://feedproxy.google.com/~r/ouseful/~3/syq6zzonObQ/feedthru</link><pubDate>Fri, 03 Feb 2012 00:00:00 PST</pubDate><guid isPermaLink="false">http://del.icio.us/psychemedia/feedthru#2012-02-02</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="http://www.deloitte.com/view/en_US/us/Services/consulting/technology-consulting/technology-2012/index.htm"&gt;Deloitte | Technology Trends 2012 | Deloitte Consulting LLP&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;img src="http://feeds.feedburner.com/~r/ouseful/~4/syq6zzonObQ" height="1" width="1"/&gt;</description><feedburner:origLink>http://del.icio.us/psychemedia/feedthru#2012-02-02</feedburner:origLink></item><item>
		<title>Several Takes on the Notion of “Data Laundering”</title>
		<link>http://feedproxy.google.com/~r/ouseful/~3/ioWYtaNQd8c/</link>
		<comments>http://blog.ouseful.info/2012/02/03/several-takes-on-the-definition-of-data-laundering/#comments</comments>
		<pubDate>Fri, 03 Feb 2012 00:03:56 +0000</pubDate>
		<dc:creator>Tony Hirst</dc:creator>
				<category><![CDATA[Anything you want]]></category>
		<category><![CDATA[Policy]]></category>
		<category><![CDATA[data laundering]]></category>

		<guid isPermaLink="false">http://blog.ouseful.info/?p=6957</guid>
		<description>Picking up on Sleight of Hand and Data Laundering in Evidence Based Policy Making and Paul Bradshaw&amp;#8217;s response that we should maybe follow the data, here&amp;#8217;s a quick summary of several competing conceptualisations of &amp;#8220;data laundering&amp;#8221;. The first relates to the usage in the sense of &amp;#8220;[o]bscuring, removing, or fabricating the provenance of illegally obtained [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;amp;blog=325417&amp;amp;post=6957&amp;amp;subd=ouseful&amp;amp;ref=&amp;amp;feed=1" width="1" height="1" /&gt;</description>
			<content:encoded><![CDATA[<p>Picking up on <a href="http://blog.ouseful.info/2012/02/01/sleight-of-hand-and-data-laundering-in-evidence-based-policy-making/">Sleight of Hand and Data Laundering in Evidence Based Policy Making</a> and Paul Bradshaw&#8217;s response that we should maybe <a href="http://onlinejournalismblog.com/2012/02/01/data-laundering/"><em>follow the data</em></a>, here&#8217;s a quick summary of several competing conceptualisations of &#8220;data laundering&#8221;.</p>
<p>The first relates to the usage in the sense of &#8220;[o]bscuring, removing, or fabricating the provenance of illegally obtained data such that it may be used for lawful purposes&#8221; (<a href="http://sectorprivate.wordpress.com/2009/03/16/my-definition-of-data-laundering-as-inspired-by-william-gibson-from-mona-lisa-overdrive/">[SectorPrivate's] definition of Data Laundering – as inspired by William Gibson from Mona Lisa Overdrive</a>).</p>
<p>SectorPrivate cites this example from a 2005 Privacy conference paper by Thilo Weichert (<a href="http://www.privacyconference2005.org/fileadmin/PDF/weichert_e.pdf">Privacy and Data Protection in federal police cooperation</a>):</p>
<blockquote><p>As working in joint committees naturally is a more informal cooperation, supervision as regards data protection is practically impossible. It is not ensured that personal data transmissions are put down in a protocol being checked. Therefore, it is usually impossible to find out the point of origin of specific information, whether it was obtained lawfully and how its utilisation is limited. In this context one can even talk about <em>data laundering facility</em>: data obtained unlawfully can be passed <em>across the table</em> and be processed without complaints by the receiver in a now cleaned form and can thereupon be passed back.</p></blockquote>
<p>A more recent reference in <a href="http://www.thinkmind.org/index.php?view=article&amp;articleid=sec_v2_n23_2009_13">ThinkMind // International Journal On Advances in Security, volume 2, numbers 2 and 3, 2009</a> on <em>Design Patterns for a Systemic Privacy Protection</em> identifies the following:</p>
<blockquote><p>Problem Situation 4 – Data laundering. Companies are paying a lot of money for personal and group proﬁles and there are market actors in position to sell them.<br />
This is clearly against data protection principles. This phenomenon is known as &#8216;data laundering&#8217;. Similar to money laundering, data laundering aims to make illegally obtained personal data look as if they were obtained legally, so that they can be used to target customers.</p></blockquote>
<p>This example is also referred to from an EU Sixth Framework Information Scoiety Technologies deliverable &#8211; <a href="http://is.jrc.es/pages/TFS/documents/SWAMI_D3_3July-final.pdf">Safeguards in a World of Ambient Intelligence (SWAMI) Threats, Vulnerabilities and Safeguards in Ambient Intelligence Deliverable D3 3 July 2006</a> which cites the source as the <a href="http://is.jrc.ec.europa.eu/pages/TFS/documents/SWAMI_D2_scenarios_Final_ESvf_003.pdf">second SWAMI deliverable</a>. SWAMI-D2 describes the process of data laundering as follows: &#8220;Via a large number of transactions and operations, the illegal origin (illegal collection) of personal data can be camouflaged&#8221;.</p>
<p>The third deliverable goes on to make the following recommendation:</p>
<blockquote><p>A means to prevent data laundering could be an obligation imposed on those who buy or<br />
otherwise acquire databases, profiles and vast amounts of personal data, to check diligently<br />
the legal origin of the data. If the buyer does not check the origin and/or the legality of the<br />
databases and profiles, he could be considered equal to a receiver of stolen goods and thus<br />
held liable for illegal data processing. An obligation could also be created which would<br />
require buyers to notify the national data protection officers when personal data(bases) are<br />
acquired. Persons or companies involved or assisting in data laundering could be made<br />
subject to criminal sanctions.</p></blockquote>
<p>The SWAMI reports thus situate <em>data laundering</em> in the context of invasions privacy and/or contraventions to data protection legislation. State sponsored, rather than evil criminal mafia initiated, usage of illegally acquired data (eg <a href="http://boingboing.net/2010/01/25/us-govt-data-launder.html">US gov&#8217;t data-laundering: using corporate databases to get around privacy law</a>) also falls into a broadly similar area of data protection/privacy law abuse.</p>
<p>The term &#8220;data laundering&#8221; also appears to have varied usage in the sense of data cleaning (aka data cleansing), (eg <a href="http://www2.sas.com/proceedings/sugi26/p099-26.pdf">Quick and Dirty Data Laundering: A Scalable Solution for Range Checking Data</a>, <a href="http://onlinelibrary.wiley.com/doi/10.1002/(SICI)1099-128X(199609)10:5/6%3C453::AID-CEM444%3E3.0.CO;2-P/abstract">Data laundering by target rotation in chemistry-based oil exploration</a>).</p>
<p>The sense in which I first came across the term was whilst discussing a <em>data laundry</em> process that could replace metadata records or fields with metadata records in library catalogues that are tainted with commercial license restrictions with data of equivalent of higher quality, known provenance and open license terms (<a href="http://blog.ouseful.info/2011/08/09/open-data-processes-the-open-metadata-laundry/">Open Data Processes: the Open Metadata Laundry</a>).</p>
<p>The notion I was going for in <a href="http://blog.ouseful.info/2012/02/01/sleight-of-hand-and-data-laundering-in-evidence-based-policy-making/">Sleight of Hand and Data Laundering in Evidence Based Policy Making</a> is different again. Whilst it shares the SWAMI characterisation insofar as it relates to the practice of removing provenance traces from a data set, it does not assume that the data was acquired illegally and it also differs in the purpose to which the laundered data is applied. In the sense I intended, the data is legal but of low or unverified quality, contains a significant bias, or whose provenance may lead to a conflict of interest arising from the use to which the data is to be put. The laundering is there not to remove traces of the illegal provenance of the data, but to mask the original provenance with a provenance, authority or veneer of quality associated with another agent, such that the data becomes accepted &#8220;at face value&#8221; with the imprimateur of an independent trusted party. The second part of my take on data laundering was the use to which the laundered data might be put. Specifically, having been laundered of its dubious provenance, and remarqued with a stamp of independent and/or trusted authority, the data would continue to make it&#8217;s way through a policy development process with the intention that it would influence the policy decision in favour of the outcome preferred by the agent who insinuated the original data into the data laundering chain.</p>
<p>Compare this with the WIkipedia description of <em>money laundering</em>: &#8220;Money laundering often occurs in three steps: first, cash is introduced into the financial system by some means (&#8216;placement&#8217;), the second involves carrying out complex financial transactions in order to camouflage the illegal source (&#8216;layering&#8217;), and the final step entails acquiring wealth generated from the transactions of the illicit funds (&#8216;integration&#8217;).&#8221;</p>
<p>I would contend that there are thus several different sorts of <em>data malpractice</em> that we might term as <em>data laundering</em> and that one of the tasks facing a Fourth Estate might be to clarify and chase down these various abuses of process whether they occur in the corporate world, academia, the public sector or in government itself.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ouseful.wordpress.com/6957/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ouseful.wordpress.com/6957/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ouseful.wordpress.com/6957/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ouseful.wordpress.com/6957/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ouseful.wordpress.com/6957/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ouseful.wordpress.com/6957/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ouseful.wordpress.com/6957/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ouseful.wordpress.com/6957/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ouseful.wordpress.com/6957/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ouseful.wordpress.com/6957/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ouseful.wordpress.com/6957/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ouseful.wordpress.com/6957/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ouseful.wordpress.com/6957/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ouseful.wordpress.com/6957/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;blog=325417&amp;post=6957&amp;subd=ouseful&amp;ref=&amp;feed=1" width="1" height="1" /><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/ouseful?a=ioWYtaNQd8c:GOc6llzv3TM:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/ouseful?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=ioWYtaNQd8c:GOc6llzv3TM:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=ioWYtaNQd8c:GOc6llzv3TM:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/ouseful?i=ioWYtaNQd8c:GOc6llzv3TM:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=ioWYtaNQd8c:GOc6llzv3TM:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/ouseful?i=ioWYtaNQd8c:GOc6llzv3TM:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=ioWYtaNQd8c:GOc6llzv3TM:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/ouseful?i=ioWYtaNQd8c:GOc6llzv3TM:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=ioWYtaNQd8c:GOc6llzv3TM:cGdyc7Q-1BI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=cGdyc7Q-1BI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/ouseful/~4/ioWYtaNQd8c" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.ouseful.info/2012/02/03/several-takes-on-the-definition-of-data-laundering/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Tony Hirst</media:title>
		</media:content>
	<creativeCommons:license>http://creativecommons.org/licenses/by/2.0/</creativeCommons:license><feedburner:origLink>http://blog.ouseful.info/2012/02/03/several-takes-on-the-definition-of-data-laundering/</feedburner:origLink></item>
		<item>
		<title>A Tinkerer’s Toolbox…</title>
		<link>http://feedproxy.google.com/~r/ouseful/~3/admSuF5k8zM/</link>
		<comments>http://blog.ouseful.info/2012/02/02/a-tinkerers-toolbox/#comments</comments>
		<pubDate>Thu, 02 Feb 2012 12:23:58 +0000</pubDate>
		<dc:creator>Tony Hirst</dc:creator>
				<category><![CDATA[Anything you want]]></category>
		<category><![CDATA[OBU]]></category>
		<category><![CDATA[OpenPlatform]]></category>
		<category><![CDATA[OU2.0]]></category>

		<guid isPermaLink="false">http://blog.ouseful.info/?p=6950</guid>
		<description>A couple of days ago, I ran a sort of repeated, 3 hour, Digital Sandbox workshop session to students on the Goldsmiths&amp;#8217; MA/MSc in Creating Social Media (thanks to @danmcquillan for the invite and the #castlondon students for being so tolerant and engaged ;-) I guess the main theme was how messy tinkering can be, [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;amp;blog=325417&amp;amp;post=6950&amp;amp;subd=ouseful&amp;amp;ref=&amp;amp;feed=1" width="1" height="1" /&gt;</description>
			<content:encoded><![CDATA[<p>A couple of days ago, I ran a sort of repeated, 3 hour, <em>Digital Sandbox</em> workshop session to students on the Goldsmiths&#8217; <a href="http://www.gold.ac.uk/pg/ma-creating-social-media/">MA/MSc in Creating Social Media</a> (thanks to @danmcquillan for the invite and the #castlondon students for being so tolerant and engaged ;-)</p>
<p>I guess the main theme was how messy tinkering can be, and how simple ideas often don&#8217;t work as you expect them to, often requiring hacks, workarounds and alternative approaches to get things working at all, even if not reliably (which is to say: some of the demos borked;-)</p>
<p>Anyway&#8230; the topics covered were broadly:</p>
<p>1) getting data into a form where we can make it flow, as demonstrated by &#8220;my hit&#8221;, which shows how to screenscrape tabular data from a Wikipedia page using Google spreadsheets, republish it as CSV (eventually!), pull it into a Yahoo pipe and geocode it, then publish it as a KML feed that can be rendered in a Google map and embedded in an arbitrary web page.</p>
<p>2) getting started with Gephi as a tool for visualising and interactively having a conversation with a network represented data set.</p>
<p>To support <em>post hoc</em> activities, I had a play with <a href="http://delicious.com/stacks/view/CROBXt">a Delicious stack</a> as a way of aggregating a set of tutorial like blog posts I had laying around that were related to each of the activities:</p>
<p><a href="http://delicious.com/stacks/view/CROBXt" title="Photo Sharing"><img src="http://farm8.staticflickr.com/7009/6806149911_2355c7a0e4.jpg" width="500" height="496" alt="Delicious stack" /></a></p>
<p>I&#8217;d been quite dismissive of Delicious stacks when they first launched (see, for example, <a href="http://blog.ouseful.info/2011/09/27/rediscovering-playlists/">Rediscovering playlists</a>), but I&#8217;m starting to see how they might actually be quite handy as a way of bootstrapping my way into a set of uncourses and/or ebooks around particular apps and technologies. There&#8217;s nothing particularly new about being able to build ordered sets of resources, of course, but the interesting thing for me is that even if I don&#8217;t get as far as editing a set of posts into a coherent mini-guide, a well ordered stack may itself provide a useful guide to a particular application, tool, set of techniques or topic.</p>
<p>As to why a literal repackaging of blog posts around a particular tool or technology as an ebook may not be such a good idea in and of itself, see Martin Belam&#8217;s post describing his experiences editing a couple of <em>Guardian Shorts</em>*: <a href="http://www.currybet.net/cbet_blog/2011/11/doctor-who-resurrection-of-the-doctor-guardian-ebook.php">&#8220;Who’s Who: The Resurrection of the Doctor&#8221;: Doctor Who ebook confidential</a> and <a href="http://www.currybet.net/cbet_blog/2012/01/guardian-facebook-ebook.php">Editing the Guardian’s Facebook ebook</a>&#8230;</p>
<p>* One of the things I&#8217;ve been tracking lately is engagement by the news media in alternative ways of trying to sell their content. A good example of this is the Guardian, who have been repackaging edited collections of (medium and long form) articles on a particular theme as &#8220;<a href="http://www.amazon.co.uk/s/ref=ntt_athr_dp_sr_1?_encoding=UTF8&amp;search-alias=digital-text&amp;field-author=The%20Guardian&amp;tag=ouseful-21">Guardian Shorts</a>&#8220;. So for example, there are e-book article collection wrappers around the breaking of the <a href="http://www.amazon.co.uk/Phone-Hacking-Guardian-Shorts-ebook/dp/B005G0JGZY/ref=sr_1_6?s=digital-text&amp;ie=UTF8&amp;qid=1328181720&amp;sr=1-6&amp;tag=ouseful-21">phone hacking story</a>, or <a href="http://www.amazon.co.uk/Reading-Riots-Investigating-Englands-ebook/dp/B006LLOCII/ref=sr_1_5?s=digital-text&amp;ie=UTF8&amp;qid=1328181720&amp;sr=1-5&amp;tag=ouseful-21">investigating last year&#8217;s UK riots</a>. If you want a <a href="http://www.amazon.co.uk/Jazz-Orleans-generation-Guardian-ebook/dp/B006NXTXW4/ref=sr_1_7?s=digital-text&amp;ie=UTF8&amp;qid=1328181720&amp;sr=1-7&amp;tag=ouseful-21">quick guide to jazz</a> or an overview of the <a href="http://www.amazon.co.uk/Facts-are-Sacred-Guardian-ebook/dp/B006PI9PQG/ref=pd_sim_kinc_2&amp;tag=ouseful-21">Guardian datastore approach to data journalism</a>, they have those too. (Did I get enough affiliate links in there, do you think?!;-)</p>
<p>This rethinking of how to aggregate, reorder and repackage content into saleable items is something that may benefit content producing universities. This is particularly true in the case of the OU, of course, where we have been producing content for years, and recently making it publicly available through a variety of channels, such as <a href="http://openlearn.open.ac.uk">OpenLearn</a>, or, err, <a href="http://www.open.edu/openlearn/">the other OpenLearn</a>, via <a href="http://www.open.edu/itunes/">iTunesU</a>, or <a href="http://www.youtube.com/user/TheOpenUniversity/">YouTube</a>, <a href="http://pipes.yahoo.com/ouseful/oubbcclips">OU/BBC co-productions</a> and so on. It&#8217;s also interesting to note how the OU is also providing content (<a href="http://www.whatdotheyknow.com/">under some sort of commercial agreement&#8230;?</a>) to other publishers/publications, such as the <a href="http://www.newscientist.com/blogs/nstv/2012/01/physics-in-a-minute-the-twin-paradox.html">New Scientist</a>:</p>
<p><a href="http://www.newscientist.com/blogs/nstv/2012/01/physics-in-a-minute-the-twin-paradox.html" title="OU youtube ads being in New Scientist context by psychemedia, on Flickr"><img src="http://farm8.staticflickr.com/7168/6640555687_7660fe3702_z.jpg" width="496" height="640" alt="OU youtube ads being in New Scientist context"></a></p>
<p>There are other opportunities too, of course, such as Martin Weller&#8217;s suggestion that  <a href="http://nogoodreason.typepad.co.uk/no_good_reason/2012/02/why-its-time-for-the-university-press-again.html">it&#8217;s time for the rebirth of the university press</a>, or, from another of Martin&#8217;s posts, the creation of &#8220;special issue open access journal collections&#8221; (<a href="http://nogoodreason.typepad.co.uk/no_good_reason/2011/12/launching-meta-edtech-journal.html">Launching Meta EdTech Journal</a>), as well as things like <a href="http://blog.ouseful.info/2010/03/05/the-university-expert-press-room-cop15/">The University Expert Press Room</a> which provides a channel for thematic content around a news area and which complements very well, in legacy terms, the sort of model being pursued via Guardian Shorts?</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ouseful.wordpress.com/6950/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ouseful.wordpress.com/6950/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ouseful.wordpress.com/6950/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ouseful.wordpress.com/6950/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ouseful.wordpress.com/6950/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ouseful.wordpress.com/6950/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ouseful.wordpress.com/6950/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ouseful.wordpress.com/6950/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ouseful.wordpress.com/6950/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ouseful.wordpress.com/6950/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ouseful.wordpress.com/6950/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ouseful.wordpress.com/6950/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ouseful.wordpress.com/6950/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ouseful.wordpress.com/6950/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;blog=325417&amp;post=6950&amp;subd=ouseful&amp;ref=&amp;feed=1" width="1" height="1" /><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/ouseful?a=admSuF5k8zM:zJRbvnpcV-U:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/ouseful?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=admSuF5k8zM:zJRbvnpcV-U:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=admSuF5k8zM:zJRbvnpcV-U:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/ouseful?i=admSuF5k8zM:zJRbvnpcV-U:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=admSuF5k8zM:zJRbvnpcV-U:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/ouseful?i=admSuF5k8zM:zJRbvnpcV-U:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=admSuF5k8zM:zJRbvnpcV-U:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/ouseful?i=admSuF5k8zM:zJRbvnpcV-U:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=admSuF5k8zM:zJRbvnpcV-U:cGdyc7Q-1BI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=cGdyc7Q-1BI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/ouseful/~4/admSuF5k8zM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.ouseful.info/2012/02/02/a-tinkerers-toolbox/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Tony Hirst</media:title>
		</media:content>

		<media:content url="http://farm8.staticflickr.com/7009/6806149911_2355c7a0e4.jpg" medium="image">
			<media:title type="html">Delicious stack</media:title>
		</media:content>

		<media:content url="http://farm8.staticflickr.com/7168/6640555687_7660fe3702_z.jpg" medium="image">
			<media:title type="html">OU youtube ads being in New Scientist context</media:title>
		</media:content>
	<creativeCommons:license>http://creativecommons.org/licenses/by/2.0/</creativeCommons:license><feedburner:origLink>http://blog.ouseful.info/2012/02/02/a-tinkerers-toolbox/</feedburner:origLink></item>
		<item>
		<title>Sleight of Hand and Data Laundering in Evidence Based Policy Making</title>
		<link>http://feedproxy.google.com/~r/ouseful/~3/Wrw7NOH4kqg/</link>
		<comments>http://blog.ouseful.info/2012/02/01/sleight-of-hand-and-data-laundering-in-evidence-based-policy-making/#comments</comments>
		<pubDate>Wed, 01 Feb 2012 11:26:00 +0000</pubDate>
		<dc:creator>Tony Hirst</dc:creator>
				<category><![CDATA[Anything you want]]></category>
		<category><![CDATA[Policy]]></category>
		<category><![CDATA[ddj]]></category>

		<guid isPermaLink="false">http://blog.ouseful.info/?p=6940</guid>
		<description>I&amp;#8217;ve still to make this year&amp;#8217;s New Year&amp;#8217;s Resolution, but one of the things that I thing I&amp;#8217;d like to spend more time getting my head round is the notion of &amp;#8220;evidence based policy making&amp;#8221; (e.g. Is Evidence-Based Government Possible?. As far as I can tell, this is often caricatured as either involving Googling around [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;amp;blog=325417&amp;amp;post=6940&amp;amp;subd=ouseful&amp;amp;ref=&amp;amp;feed=1" width="1" height="1" /&gt;</description>
			<content:encoded><![CDATA[<p>I&#8217;ve still to make this year&#8217;s New Year&#8217;s Resolution, but one of the things that I thing I&#8217;d like to spend more time getting my head round is the notion of &#8220;evidence based policy making&#8221; (e.g. <a href="http://www.nationalschool.gov.uk/policyhub/downloads/JerryLeeLecture1202041.pdf">Is Evidence-Based Government Possible?</a>.</p>
<p>As far as I can tell, this is often caricatured as either involving Googling around a policy area using ministerially obvious Google terms and referencing whatever&#8217;s in the top 5 hits, or taking a policy decision then looking for selective evidence to support that decision, along with contrary evidence against competing alternatives; (in a related area of evidence based practice, see for example: <a href="http://www.leeds.ac.uk/educol/documents/00001819.htm">Some Questions about Evidence-based Practice in Education</a>. If you have other examples in a similar vein, please let me know&#8230; #lookingForAnEvidenceBase Also e.g. the idea of <a href="http://en.wikipedia.org/wiki/Policy-based_evidence_making"><em>policy based evidence making</em></a> [h/t Jon Warbrick];-)</p>
<p>One of the suspicions I have is that &#8220;evidence&#8221; inherits the authority associated with the most reputable source associated with it when we wish to call on it to justify it, (and possibly as a complement to that, the least reputable source if we wish to discount it?)</p>
<p>So for example, in his <em>Networker</em> Observer column last weekend, John Naughton describes a presentation given to a technology conference by Facebook&#8217;s chief operating officer, Sheryl Sandberg, that pre-empted a European commission announcement on privacy:</p>
<blockquote><p>Sandberg made claims about the economic benefits of privacy abuse that defy parody. For example, she unveiled a report that Facebook had commissioned from Deloitte, a consultancy firm, which estimated that Facebook – an outfit with a global workforce of about 3,000 – indirectly helped create 232,000 jobs in Europe in 2011 and enabled more than $32bn in revenues.</p>
<p>Inspection of the &#8220;report&#8221; confirms one&#8217;s suspicion that you couldn&#8217;t make this stuff up. Or, rather, only an international consulting firm could make it up. Interestingly, Deloitte itself appears to be ambivalent about it. &#8220;The information contained in the report&#8221;, it cautions, &#8220;has been obtained from Facebook Inc and third party sources that are clearly referenced in the appropriate sections of the report. Deloitte has <em>neither sought to corroborate this information nor to review its overall reasonableness</em>. Further, any results from the analysis contained in the report are reliant on the information available at the time of writing the report and should not be relied upon in subsequent periods.&#8221; (Emphasis added by JN.)</p>
<p>Accordingly, continues Deloitte, &#8220;no representation or warranty, express or implied, is given and no responsibility or liability is or will be accepted by or on behalf of Deloitte or by any of its partners, employees or agents or any other person as to the accuracy, completeness or correctness of the information contained in this document or any oral information made available and any such liability is expressly disclaimed&#8221;.</p></blockquote>
<p>In this case, the Deloitte report was used <em>as evidence</em> by Facebook to <em>demonstrate</em> a particular economic benefit made possible by Facebook&#8217;s activities. The consultancy firms caveats were ignored, (including the fact that the data may in part at least have come from Facebook itself), in reporting this claim. So: this is <em>data laundering</em>, right? We have some dodgy evidence, about which we&#8217;re biased, so we give it to an &#8220;independent&#8221; consultant who re-reports it, albeit with caveats, that we can then report, minus the caveats. Lovely, clean evidence. Our lobbyists can then go to a lazy policy researcher and take this scrubbed evidence, referencing it as finding in the Deloitte report, so that it can make it&#8217;s way into a policy briefing. Or that&#8217;s how I imagine it, any way..</p>
<p>John&#8217;s take was in a similar vein:</p>
<blockquote><p>The sole purpose of &#8220;reports&#8221; such as this is to impress or intimidate politicians and regulators, many of whom still seem unaware of the extent to which international consulting firms are used by corporations to lend an aura of empirical respectability to hogwash.</p></blockquote>
<p><em>Quite so.</em> ;-) I think my concerns go further though &#8211; not only is the Deloitte cachet used to bludgeon evidence-poor audiences into submission, it may also perniciously make it&#8217;s way into documents further up the policy development ladder where only the findings, and none of the caveats (including the dodgy provenance of the data) are disclosed.</p>
<p>So here are a couple of things for the data journalists to take away, maybe?</p>
<p>1) there may be stories to be told about the way other people have sourced and used their data. Where one report quotes data from another, treat it with as much suspicion as you would hearsay&#8230; <em>Check with the source.</em></p>
<p>2) when developing your own data stories, keep really good tabs on where the data&#8217;s come from <em>and be suspicious about it</em>. If you can be, be open with republishing the data, or links to it.</p>
<p>PS if you have other examples of data provenance laundering, please add a link as a comment to this post:-)</p>
<p>PPS see also <a href="http://sunlightfoundation.com/blog/2012/01/30/sopa-lobbyin/">How SOPA and PIPA did and didn’t change how Washington lobbying works</a>: &#8220;The political scientist E.E. Schattschneider once called politics “the mobilization of bias.” By this, he meant something both simple and profound. All political battles are fights between competing interests, he noted, but political outcomes are almost always determined by the bias of those paying attention to the conflict. The trick is to make sure you mobilize the crowd that will cheer for you.&#8221;</p>
<p>PPPS A bit of history relating to the &#8220;data laundry&#8221; idea, originally in the context of scrubbing rights tainted records from library catalogue metadata: <a href="http://blog.ouseful.info/2011/08/09/open-data-processes-the-open-metadata-laundry/">http://blog.ouseful.info/2011/08/09/open-data-processes-the-open-metadata-laundry/</a></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ouseful.wordpress.com/6940/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ouseful.wordpress.com/6940/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/ouseful.wordpress.com/6940/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ouseful.wordpress.com/6940/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/ouseful.wordpress.com/6940/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/ouseful.wordpress.com/6940/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/ouseful.wordpress.com/6940/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/ouseful.wordpress.com/6940/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/ouseful.wordpress.com/6940/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ouseful.wordpress.com/6940/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/ouseful.wordpress.com/6940/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ouseful.wordpress.com/6940/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/ouseful.wordpress.com/6940/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ouseful.wordpress.com/6940/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&amp;blog=325417&amp;post=6940&amp;subd=ouseful&amp;ref=&amp;feed=1" width="1" height="1" /><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/ouseful?a=Wrw7NOH4kqg:kHfTbTVov6U:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/ouseful?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=Wrw7NOH4kqg:kHfTbTVov6U:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=2mJPEYqXBVI" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=Wrw7NOH4kqg:kHfTbTVov6U:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/ouseful?i=Wrw7NOH4kqg:kHfTbTVov6U:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=Wrw7NOH4kqg:kHfTbTVov6U:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/ouseful?i=Wrw7NOH4kqg:kHfTbTVov6U:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=Wrw7NOH4kqg:kHfTbTVov6U:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/ouseful?i=Wrw7NOH4kqg:kHfTbTVov6U:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/ouseful?a=Wrw7NOH4kqg:kHfTbTVov6U:cGdyc7Q-1BI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=cGdyc7Q-1BI" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/ouseful/~4/Wrw7NOH4kqg" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.ouseful.info/2012/02/01/sleight-of-hand-and-data-laundering-in-evidence-based-policy-making/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="" medium="image">
			<media:title type="html">Tony Hirst</media:title>
		</media:content>
	<creativeCommons:license>http://creativecommons.org/licenses/by/2.0/</creativeCommons:license><feedburner:origLink>http://blog.ouseful.info/2012/02/01/sleight-of-hand-and-data-laundering-in-evidence-based-policy-making/</feedburner:origLink></item>
	<item><title>Links for 2012-01-29 [del.icio.us]</title><link>http://feedproxy.google.com/~r/ouseful/~3/_4l0vbgBYQs/feedthru</link><pubDate>Mon, 30 Jan 2012 00:00:00 PST</pubDate><guid isPermaLink="false">http://del.icio.us/psychemedia/feedthru#2012-01-29</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="http://matthieu-totet.fr/Koumin/tools/naoyun/"&gt;Naoyun | Twitter2gephi streaming api&lt;/a&gt;&lt;br/&gt;
"Naoyun is a software that create a bridge between Twitter and Gephi with their own Stream API. You can "follow" hashtags, words or user on twitter and display them as a network in Gephi in real time."&lt;/li&gt;
&lt;li&gt;&lt;a href="http://stakeholders.ofcom.org.uk/consultations/bbc-coverage-2012-olympics/"&gt;Ofcom | BBC Proposals for coverage of the London 2012 Olympic Games&lt;/a&gt;&lt;br/&gt;
So what do we think would make for a sensible *media* strategy around the games? Something inclusive, socially driven, that embraces our increasing desire to document our lives and activities, perhaps? Err, erm...&lt;/li&gt;
&lt;/ul&gt;&lt;img src="http://feeds.feedburner.com/~r/ouseful/~4/_4l0vbgBYQs" height="1" width="1"/&gt;</description><feedburner:origLink>http://del.icio.us/psychemedia/feedthru#2012-01-29</feedburner:origLink></item><item><title>Links for 2012-01-27 [del.icio.us]</title><link>http://feedproxy.google.com/~r/ouseful/~3/m3WKczFIpcI/feedthru</link><pubDate>Sat, 28 Jan 2012 00:00:00 PST</pubDate><guid isPermaLink="false">http://del.icio.us/psychemedia/feedthru#2012-01-27</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="http://digital.cabinetoffice.gov.uk/2012/01/27/what-does-google-is-the-homepage-mean-irl/"&gt;What does &amp;ldquo;Google is the homepage&amp;rdquo; mean IRL? | Government Digital Service&lt;/a&gt;&lt;br/&gt;
So what does "Google is your homepage" really mean? Here's an example...&lt;/li&gt;
&lt;/ul&gt;&lt;img src="http://feeds.feedburner.com/~r/ouseful/~4/m3WKczFIpcI" height="1" width="1"/&gt;</description><feedburner:origLink>http://del.icio.us/psychemedia/feedthru#2012-01-27</feedburner:origLink></item><item><title>Links for 2012-01-25 [del.icio.us]</title><link>http://feedproxy.google.com/~r/ouseful/~3/Y6kJE9qy54I/feedthru</link><pubDate>Thu, 26 Jan 2012 00:00:00 PST</pubDate><guid isPermaLink="false">http://del.icio.us/psychemedia/feedthru#2012-01-25</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="http://complexdiagrams.com/properties"&gt;Properties and Best Uses of Visual Encodings &amp;laquo; Complex Diagrams&lt;/a&gt;&lt;br/&gt;
Building blocks for creating visualisations - what's good for what...&lt;/li&gt;
&lt;li&gt;&lt;a href="http://stakeholders.ofcom.org.uk/consultations/bbc-coverage-2012-olympics/"&gt;Ofcom | BBC Proposals for coverage of the London 2012 Olympic Games&lt;/a&gt;&lt;br/&gt;
So what do we think would make for a sensible *media* strategy around the games? Something inclusive, socially driven, that embraces our increasing desire to document our lives and activities, perhaps? Err, erm...&lt;/li&gt;
&lt;/ul&gt;&lt;img src="http://feeds.feedburner.com/~r/ouseful/~4/Y6kJE9qy54I" height="1" width="1"/&gt;</description><feedburner:origLink>http://del.icio.us/psychemedia/feedthru#2012-01-25</feedburner:origLink></item><item><title>Links for 2012-01-20 [del.icio.us]</title><link>http://feedproxy.google.com/~r/ouseful/~3/f-wqMJMWhyE/feedthru</link><pubDate>Sat, 21 Jan 2012 00:00:00 PST</pubDate><guid isPermaLink="false">http://del.icio.us/psychemedia/feedthru#2012-01-20</guid><description>&lt;ul&gt;
&lt;li&gt;&lt;a href="http://matthieu-totet.fr/Koumin/tools/naoyun/"&gt;Naoyun | Twitter2gephi streaming api&lt;/a&gt;&lt;br/&gt;
"Naoyun is a software that create a bridge between Twitter and Gephi with their own Stream API. You can "follow" hashtags, words or user on twitter and display them as a network in Gephi in real time."&lt;/li&gt;
&lt;/ul&gt;&lt;img src="http://feeds.feedburner.com/~r/ouseful/~4/f-wqMJMWhyE" height="1" width="1"/&gt;</description><feedburner:origLink>http://del.icio.us/psychemedia/feedthru#2012-01-20</feedburner:origLink></item></channel>
</rss>

