<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0"> <channel><title>JISC CETIS MASHe</title> <link>http://mashe.hawksey.info</link> <description>The musing of Martin Hawksey (EdTech Explorer)</description> <lastBuildDate>Fri, 18 May 2012 10:43:30 +0000</lastBuildDate> <language /> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3.2</generator>  <atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/MASHe" /><feedburner:info uri="mashe" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://mashe.hawksey.info/?pushpress=hub" /><creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license><feedburner:emailServiceId>MASHe</feedburner:emailServiceId><feedburner:feedburnerHostname>http://feedburner.google.com</feedburner:feedburnerHostname><item><title>Using Google Spreadsheets to dashboard project/course blog feeds #oerri</title><link>http://feedproxy.google.com/~r/MASHe/~3/A0aUABcHcCk/</link> <comments>http://mashe.hawksey.info/2012/05/using-google-spreadsheets-to-dashboard-projectcourse-blog-feeds-oerri/#comments</comments> <pubDate>Thu, 17 May 2012 20:12:06 +0000</pubDate> <dc:creator>Martin Hawksey</dc:creator> <category><![CDATA[Blogs]]></category> <category><![CDATA[Google Apps]]></category> <category><![CDATA[Google Apps Script]]></category> <category><![CDATA[JISC CETIS]]></category> <category><![CDATA[oerri]]></category> <guid isPermaLink="false">http://mashe.hawksey.info/?p=13331</guid> <description><![CDATA[In this post I’ll show I’ve create a quick dashboard in Google Spreadsheets which takes a list of blog RSS feeds and filters for specific tags/categories. Whilst demonstrated this with the JISC OERRI projects the same technique could be used in other scenarios, such as, as a way to track student blogs.]]></description> <content:encoded><![CDATA[<p>In the original <a
href="http://www.jisc.ac.uk/fundingopportunities/funding_calls/2011/11/oerrapidinnovation.aspx">JISC OER Rapid Innovation call</a> one of the stipulations due to the size and durations of grants is that the main reporting process is blog-based. Amber Thomas, who is the JISC Programme Manager for this strand and <a
href="http://fragmentsofamber.wordpress.com/2012/02/03/whyiblog/">a keen blogger herself</a>, has been a long supporter of projects adopting open practices, blogging progress as they go. Brian Kelly (UKOLN) has also an interest in this area with a some posts including <a
href="http://ukwebfocus.wordpress.com/2011/12/14/beyond-blogging-as-an-open-practice-what-about-associated-open-usage-data/">Beyond Blogging as an Open Practice, What About Associated Open Usage Data?</a></p><p>For the OERRI projects the proposal discussed at the start-up meeting was that projects adopt a taxonomy of tags to indicate keys posts (e.g. project plan, aims, outputs, nutshell etc.). For the final report projects would then compile all posts with specific tags and submit as a ms-word or pdf.</p><p>There are a number of advantages of this approach one of them, for people like me anyway, is it exposes machine readable data that can be used in a number of ways. In this post I’ll show I’ve create a quick dashboard in Google Spreadsheets which takes a list of blog RSS feeds and filters for specific tags/categories. Whilst demonstrated this with the OERRI projects the same technique could be used in other scenarios, such as, as a way to track student blogs. As part of this solution I’ll highlight some of the issues/affordances of different blogging platforms and introduce some future work to combine post content using a template structure.</p><p
align="center"><a
href="https://docs.google.com/spreadsheet/ccc?key=0AqGkLMU9sHmLdHl4dEJGYlZ3cEFnRDR1bUZ4T0Y2dVE#gid=0"><img
style="background-image: none; border-right-width: 0px; padding-left: 0px; padding-right: 0px; display: inline; border-top-width: 0px; border-bottom-width: 0px; border-left-width: 0px; padding-top: 0px" title="OERRI Project Post Directory" border="0" alt="OERRI Project Post Directory" src="http://mcdn.hawksey.info/wp-content/uploads/2012/05/image5.png" width="705" height="293" /></a> <br
/>Screenshot of <a
href="https://docs.google.com/spreadsheet/ccc?key=0AqGkLMU9sHmLdHl4dEJGYlZ3cEFnRDR1bUZ4T0Y2dVE#gid=0">OERRI post dashboard</a></p><h3></h3><h3>The OERRI Project Post Directory</h3><p>If you are not interested in how this spreadsheet was made and just&#160; want to grab a copy to use with your own set of projects/class blogs then just:</p><p
align="center"><strong>*** </strong><a
href="https://docs.google.com/spreadsheet/ccc?key=0AqGkLMU9sHmLdHl4dEJGYlZ3cEFnRDR1bUZ4T0Y2dVE#gid=0"><strong>open the OERRI Project Post Directory</strong></a><strong> ***</strong> <br
/>File &gt; Make a copy if you want your own editable version</p><p>The link to the document above is the one I’ll be developing throughout the programme so feel free to bookmark the link to keep track of what the projects are doing.</p><p>The way the spreadsheet is structured is the tags/categories the script uses to filter posts is in cells D2:L2 and urls are constructed from the values in columns O-Q. The basic technique being used here is building urls that look for specific posts and returning links (made pretty with some conditional formatting).</p><h3>Blogging platforms used in OERRI</h3><p>So how do we build a url to look for specific posts? With this technique it comes down to whether the blogging platform supports tag/category filtering so lets first look at the platforms being used in OERRI projects.</p><p><img
style="background-image: none; border-right-width: 0px; margin: 0px 0px 4px 8px; padding-left: 0px; padding-right: 0px; display: inline; float: right; border-top-width: 0px; border-bottom-width: 0px; border-left-width: 0px; padding-top: 0px" title="chart1" border="0" alt="chart1" align="right" src="http://mcdn.hawksey.info/wp-content/uploads/2012/05/chart1.png" width="412" height="299" />This chart (right) breaks down the blogging platforms. You’ll see the most (12 of 15) are using WordPress in two flavours, ‘shared’, indicating that the blog is also a personal or team blog containing other posts not related to OERRI and ‘dedicated’, setup entirely for the project.</p><p>The 3 other platforms are <a
href="http://www.medev.ac.uk/blog/oer-rapid-innovation-supoerglue/2012/mar/8/oer-rapid-innovation-supoerglue-and-ridlr/">2 MEDEV blogs</a> and the <a
href="http://cloudworks.ac.uk/cloud/view/6196">OUs project on Cloudworks</a>. I’m not familiar with the MEDEV platform and only know a bit about cloudworks so for now I’m going to ignore these and concentrate on the WordPress blogs.</p><h3>WordPress and Tag/Category Filtering</h3><p>One of the benefits of WordPress is you can can an RSS feed for almost everything by adding /feed/ or ?feed=rss2 to urls (other platforms also support this, I a vague recollection of doing something similar in blogger(?)). For example, if you want a feed of all my Google Apps posts you can use <a
href="http://mashe.hawksey.info/category/google-apps/feed/">http://mashe.hawksey.info/category/google-apps/feed/</a><strong></strong>.</p><p>Even better is you can combine tags/categories with a ‘+’ operator so if you want a feed of all my Google Apps posts that are also categorised with Twitter you can use <a
href="http://mashe.hawksey.info/category/google-apps+twitter/feed/">http://mashe.hawksey.info/category/google-apps+twitter/feed/</a>.</p><p>So to get the Bebop ‘nutshell’ categorised post as a RSS item we can use: <a
title="http://bebop.blogs.lincoln.ac.uk/category/nutshell/feed/" href="http://bebop.blogs.lincoln.ac.uk/category/nutshell/feed/">http://bebop.blogs.lincoln.ac.uk/category/nutshell/feed/</a></p><p>Looking at one of the shared wordpress blogs to get the ‘nutshell’ from RedFeather you can use: <a
title="http://blogs.ecs.soton.ac.uk/oneshare/tag/redfeather+nutshell/feed/" href="http://blogs.ecs.soton.ac.uk/oneshare/tag/redfeather+nutshell/feed/">http://blogs.ecs.soton.ac.uk/oneshare/tag/redfeather+nutshell/feed/</a></p><h3>Using Google Spreadsheet importFeed formula to get a post url</h3><p>The <a
href="https://support.google.com/docs/bin/static.py?hl=en&amp;topic=25273&amp;page=table.cs&amp;tab=1240289">‘import’ functions in Google Spreadsheet</a> must be my favourites and I know lots of social media professionals who use them to pull data into a spreadsheet and produce reports for clients from the data. With <a
href="http://support.google.com/docs/bin/answer.py?hl=en&amp;answer=155181">importFeed</a> we can go and see if a blog post under a certain category exists and then return something back, in this case the post link. For my first iteration of this spreadsheet I used the formula below:</p><p><img
style="background-image: none; border-right-width: 0px; padding-left: 0px; padding-right: 0px; display: block; float: none; border-top-width: 0px; border-bottom-width: 0px; margin-left: auto; border-left-width: 0px; margin-right: auto; padding-top: 0px" title="importFeed formula" border="0" alt="importFeed formula" src="http://mcdn.hawksey.info/wp-content/uploads/2012/05/importFeed-flat.png" width="680" height="200" /></p><p>This works well but one of the drawback of importFeed is we can only have a maximum of 50 of them in one spreadsheet. With 15 projects and 9 tag/categories the maths doesn’t add up.</p><p>To get around this I switched to Google Apps Script (macros for Google Spreadsheets <a
href="http://mashe.hawksey.info/category/google-apps-script/">I write a lot about</a>). This doesn’t have an importFeed function built-in but I can do a <a
href="https://developers.google.com/apps-script/service_urlfetch">UrlFetch</a> and <a
href="https://developers.google.com/apps-script/service_xml">Xml parse</a>. <a
href="https://gist.github.com/2720726">Here’s the code</a> which does this (included in the template):</p><p> <script src="https://gist.github.com/2720726.js?file=gistfile1.js"></script> <p>Note this code also uses the <a
href="https://developers.google.com/apps-script/service_cache">Cache Service</a> to improve performance and make sure I don’t go over my <a
href="https://docs.google.com/macros/dashboard">UrlFetch quota</a>.</p><p>We can call this function like other spreadsheet formula using ‘=fetchUrlfromRSS(aUrl)’.</p><h3>Trouble at the tagging mill</h3><p>So we have a problem getting data from none WordPress blogs, which I’m quietly ignoring for now, the next problem is people not tagging/categorising posts correctly. For example, I can see <a
href="http://researchblogs.cs.bham.ac.uk/math-access/2012/04/30/access-to-math-in-a-nutshell/">Access to Math have 10 post including a ‘nutshell’</a> but none of these are tagged. From a machine side there’s not much I can do about this but at least from the dashboard I can spot something isn’t right.</p><h4>Tags for a template</h4><p>I’m sure once projects are politely reminded to tag posts they’ll oblige. One incentive might be to say if posts are tagged correctly then the code above could be easily added to to not just pull post links but the full post text which could then be used to generate the projects final submission.</p><h3></h3><h3>Summary</h3><p>So stay tuned to the <a
href="https://docs.google.com/spreadsheet/ccc?key=0AqGkLMU9sHmLdHl4dEJGYlZ3cEFnRDR1bUZ4T0Y2dVE#gid=0"><strong>OERRI Project Post Directory</strong></a><strong> </strong>spreadsheet to see if I can incorporate MEDEV and Cloudworks feeds, and also if I can create a template for final posts. Given Brian’s <a
href="http://ukwebfocus.wordpress.com/2011/12/14/beyond-blogging-as-an-open-practice-what-about-associated-open-usage-data/">post on usage data</a> mentioned at the beginning should I also be tracking post activity data on social networks or is that a false metric?</p><p>I’m sure there was something else but it has entirely slipped my mind …</p><p>BTW here’s the <a
href="http://www.google.com/reader/public/subscriptions/user%2F11609741331127149470%2Fbundle%2FOERRI">OPML file for the RSS feeds of the blogs that are live</a> (also <a
href="http://www.google.com/reader/bundle/user%2F11609741331127149470%2Fbundle%2FOERRI">visible here as a Google Reader bundle</a>)</p><div
id="crp_related"><h3>You also might like :</h3><ul><li><a
href="http://mashe.hawksey.info/2012/04/notes-on-generating-live-wordclouds-from-yahoo-pipes-using-d3-js/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13331');" class="crp_title">Notes on generating live wordclouds from Yahoo Pipes using D3.js</a></li><li><a
href="http://mashe.hawksey.info/2011/12/oer-visualisation-project-processing-a-resource-feed-to-find-frequency-using-google-spreadsheets-day-3-ukoer-ooher/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13331');" class="crp_title">OER Visualisation Project: Processing a resource feed to find frequency using Google Spreadsheets [day 3] #ukoer #ooher</a></li><li><a
href="http://mashe.hawksey.info/2012/03/googleplus-search-to-rss/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13331');" class="crp_title">Turning Google+ Search results into a RSS feed (for Google Reader)</a></li><li><a
href="http://mashe.hawksey.info/2012/01/google-refine-apps-script-integration/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13331');" class="crp_title">Integrating Google Spreadsheet/Apps Script with Google Refine to update existing spreadsheets</a></li><li><a
href="http://mashe.hawksey.info/2012/01/oer-visualisation-project-timelines-timelines-timelines-day-30-ukoer/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13331');" class="crp_title">OER Visualisation Project: Timelines, timelines, timelines [day 30] #ukoer</a></li></ul></div><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/MASHe?a=A0aUABcHcCk:imiZ2xLknQw:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/MASHe?i=A0aUABcHcCk:imiZ2xLknQw:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=A0aUABcHcCk:imiZ2xLknQw:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/MASHe?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=A0aUABcHcCk:imiZ2xLknQw:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/MASHe?i=A0aUABcHcCk:imiZ2xLknQw:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=A0aUABcHcCk:imiZ2xLknQw:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/MASHe?i=A0aUABcHcCk:imiZ2xLknQw:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=A0aUABcHcCk:imiZ2xLknQw:JUhcmGiK9AQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=JUhcmGiK9AQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=A0aUABcHcCk:imiZ2xLknQw:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/MASHe?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=A0aUABcHcCk:imiZ2xLknQw:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/MASHe?i=A0aUABcHcCk:imiZ2xLknQw:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=A0aUABcHcCk:imiZ2xLknQw:jQG8q3pvyYY"><img src="http://feeds.feedburner.com/~ff/MASHe?d=jQG8q3pvyYY" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=A0aUABcHcCk:imiZ2xLknQw:IqXOmDWthuQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=IqXOmDWthuQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/MASHe/~4/A0aUABcHcCk" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://mashe.hawksey.info/2012/05/using-google-spreadsheets-to-dashboard-projectcourse-blog-feeds-oerri/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://mashe.hawksey.info/2012/05/using-google-spreadsheets-to-dashboard-projectcourse-blog-feeds-oerri/</feedburner:origLink></item> <item><title>Analytics Reconnoitre: Notes on Open Solutions in Big Data from #esym12</title><link>http://feedproxy.google.com/~r/MASHe/~3/FGRFhdFaFv0/</link> <comments>http://mashe.hawksey.info/2012/05/analytics-reconnoitre-notes-on-open-solutions-in-big-data-from-esym12/#comments</comments> <pubDate>Tue, 15 May 2012 14:53:05 +0000</pubDate> <dc:creator>Martin Hawksey</dc:creator> <category><![CDATA[Analytics]]></category> <category><![CDATA[Data]]></category> <category><![CDATA[JISC CETIS]]></category> <category><![CDATA[#esym12]]></category> <category><![CDATA[#jiscar]]></category> <category><![CDATA[analytics]]></category> <guid isPermaLink="false">http://mashe.hawksey.info/?p=13312</guid> <description><![CDATA[Selected notes from the  Eduserv Symposium 2012: Big Data, Big Deal on the abundance of open source solutions and services in this area]]></description> <content:encoded><![CDATA[<p>A couple of weeks ago it was Big Data Week, <em>“a series of interconnected activities and conversations around the world across not only technology but also the commercial use case for Big Data”</em>.</p><blockquote><p><strong>big data</strong><sup><a
href="http://en.wikipedia.org/wiki/Big_data#cite_note-0">[1]</a></sup><sup><a
href="http://en.wikipedia.org/wiki/Big_data#cite_note-1">[2]</a></sup> consists of data sets that grow so large and complex that they become awkward to work with using on-hand database management tools. Difficulties include capture, storage,<sup><a
href="http://en.wikipedia.org/wiki/Big_data#cite_note-2">[3]</a></sup> search, sharing, analytics,<sup><a
href="http://en.wikipedia.org/wiki/Big_data#cite_note-3">[4]</a></sup> and visualizing – <a
href="http://en.wikipedia.org/wiki/Big_data">BY Wikipedia</a></p></blockquote><p>In O’Reilly Radar there was a piece on <a
href="http://radar.oreilly.com/2012/04/big-data-in-europe.html">Big data in Europe</a> which had Q&amp;A from Big Data Week founder/organizer Stewart Townsend, and Carlos Somohano both of whom are big in Big Data.</p><p>Maybe I’m being naïve but I was surprised that there was no reference to what universities/research sector is doing with handling and analysing large data sets. For example at the Sanger Institute alone each of their DNA sequencers are generating 1 terabyte (1024 gigabytes) of data a day, storing over 17 petabytes (17 million gigabytes) which is doubling every year.</p><p>Those figures trip off my tongue because last week I was at the <a
href="http://www.eduserv.org.uk/newsandevents/events/2012/symposium">Eduserv Symposium 2012: Big Data, Big Deal?</a> which had many examples of how institutions are dealing with ‘big data’. There were a couple of things I took away from this event like the prevalence of open source software as well as the number of vendors wrapping open source tools with their own systems to sell as service. Another clear message was a lack of data scientists who can turn raw data into information and knowledge.</p><p>As part of the Analytics Reconnoitre we are undertaking at JISC CETIS in this post I want to summarise some of the open source tools and ‘as a service’ offering in the Big Data scene.</p><p>[<strong>Disclaimer:</strong> I should say first I coming to this area cold. I’m not an information systems expert so what you’ll see here is a very top-level view more often than not me joining the dots from things I’ve learned 5 minutes ago. So if you’ve spot anything I’ve got wrong or bits I’m missing let me know]</p><h3>Open source as a Service</h3><div
style="float: right;"><blockquote><p><strong>some of the aaS’s</strong><br
/> CaaS – Cluster as a Service<br
/> IaaS – Infrastructure as a Service<br
/> SaaS – Software as a Service<br
/> PaaS – Platform as a Service</p></blockquote></div><p>I’ve already highlighted how the <a
href="http://mashe.hawksey.info/2012/05/analytics-reconnoitre-notes-on-r-in-education-and-industry/">open source R statistical computing environment is being used as an analytics layer</a>. Open source is alive and well in other parts of the infrastructure.  First up at the was Rob Anderson from Isilon Systems (division of EMC) talking about <a
href="http://www.eduserv.org.uk/newsandevents/events/2012/symposium/opening-keynote">Big Data and implications for storage</a>. Rob did a great job introducing Big Data and a couple of things I took away were the message that there is a real demand for talented ‘data scientists’ and getting organisations to think differently about data.</p><blockquote
class="twitter-tweet tw-align-center"><p>RT @<a
href="https://twitter.com/mariekeguy">mariekeguy</a>: <a
href="https://twitter.com/search/%2523esym12">#esym12</a> Rob Anderson &#8211; to sum up, big data issues &amp; problems are mainly related to organisational issues/structures &lt;+talent</p><p>— Martin Hawksey (@mhawksey) <a
href="https://twitter.com/mhawksey/status/200534137210290176" data-datetime="2012-05-10T10:33:58+00:00">May 10, 2012</a></p></blockquote><p>If you look some of the products/services EMC offer you’ll find <a
href="http://www.greenplum.com/community/downloads/">EMC Greenplum Database and HD Community Editions</a> (Greenplum are a set of products to handle ‘Big Data’). You’ll see that these include the open source <a
href="http://en.wikipedia.org/wiki/Hadoop">Apache Hadoop</a> ecosystem. If like me you’ve heard of Hadoop but don’t really understand what it is, here is a useful post on <a
href="http://bigdata.wordpress.com/2010/03/01/knowledge-is-precious/">Open source solutions for processing big data and getting Knowledge</a>. This highlights components of the Hadoop most of which appear in the Greenplum Community Edition (I was very surprised to see the NoSQL database <a
href="http://en.wikipedia.org/wiki/Cassandra_(database)">Cassandra</a> which is now part of Hadoop was originally developed by Facebook and released as open source code – more about NoSQL later).</p><h3>Open algorithms, machines and people</h3><p><a
class="thickbox" href="http://mcdn.hawksey.info/wp-content/uploads/2012/05/image4.png"><img
style="background-image: none; margin: 0px 0px 4px 8px; padding-left: 0px; padding-right: 0px; display: inline; float: right; padding-top: 0px; border-width: 0px;" title="amplab - state of the art" src="http://mcdn.hawksey.info/wp-content/uploads/2012/05/image_thumb.png" alt="amplab - state of the art" width="240" height="181" align="right" border="0" /></a>The use of open source in big data was also highlighted by <a
href="http://www.eduserv.org.uk/newsandevents/events/2012/symposium/closing-keynote">Anthony D Joseph Professor at the University of California, Berkeley in his talk</a>. Anthony was highlighting <a
href="http://amplab.cs.berkeley.edu/">UC Berkeley’s AMPLab</a> which is exploring “Making Sense at Scale” by tightly integrating <strong>a</strong>lgorithms, <strong>m</strong>achines and <strong>p</strong>eople (AMP). The slide (right) from Anthony’s presentation summaries what they are doing, combining 3 strands to solve big data problems.</p><p>They are achieving this by combining existing tools with new components. In the slide below you have the following pieces developed by AMPLab:</p><ul><li><a
href="http://incubator.apache.org/mesos/">Apache Mesos</a> – an open source cluster manager</li><li><a
href="http://spark-project.org/">Spark</a> – an open source interactive and interactive data analysis system</li><li><a
href="https://github.com/radlab/SCADS">SCADS</a> – consistency adjustable data store (license unknown)</li><li>PIQL – Performance (predictive) Insightful Query Language (<a
href="https://github.com/radlab/SCADS/tree/v2.1.3/piql">part of SCADS</a>. There’s also <a
href="https://github.com/radlab/PIQL-on-Rails">PIQL-on-RAILS plugin</a> MIT license)</li></ul><p><a
class="thickbox" href="http://mcdn.hawksey.info/wp-content/uploads/2012/05/ampslide-edit.png"><img
style="background-image: none; padding-left: 0px; padding-right: 0px; display: block; float: none; margin-left: auto; margin-right: auto; padding-top: 0px; border-width: 0px;" title="amplab - machines" src="http://mcdn.hawksey.info/wp-content/uploads/2012/05/ampslide-edit_thumb.png" alt="amplab - machines" width="500" height="375" border="0" /></a></p><p>In the Applications/tools box is: Advanced ML algorithms; Interactive data mining; Collaborative visualisation. I’m not entirely sure what these are but in Anthony’s presentation he mentioned more open source tools are required particularly in ‘new analysis environments’.</p><blockquote
class="twitter-tweet tw-align-center"><p>AMPlab gamifying and crowdsourcing analysis to judge accuracy of machine learning algorithms <a
href="https://twitter.com/search/%2523esym12">#esym12</a></p><p>— Matt Johnson (@mhj_work) <a
href="https://twitter.com/mhj_work/status/200615674295488512" data-datetime="2012-05-10T15:57:58+00:00">May 10, 2012</a></p></blockquote><p>Here are the real applications of AMPLab Anthony mentioned:</p><ul><li><a
href="http://traffic.berkeley.edu/">Mobile Millennium Project</a> (traffic monitoring)</li><li><a
href="http://www.urbansim.org">Microsimulation of urban development</a></li><li>Crowd based opinion formation &#8211; <a
href="http://www.state.gov/opinionspace/">Opinion Space</a> (broken site)</li></ul><p>[Another site mentioned by Anthony worth bookmarking/visiting is <a
href="http://datakind.org">DataKind</a> – ‘helping non-profits through pro bono data collections, analysis and visualisation’]</p><h4>OpenStack</h4><p>Another cloud/big data/open source tool I know of but not mentioned at the event is <a
href="http://en.wikipedia.org/wiki/OpenStack">OpenStack</a>. This was initially developed by commercial hosting service Rackspace and NASA (who it has been said are ‘the largest collector of data in human history’). Like Hadoop OpenStack is a collection of tools/projects rather than one product. OpenStack contains <a
href="http://www.openstack.org/projects/compute/">OpenStack Compute</a>, <a
href="http://www.openstack.org/projects/storage/">OpenStack Object Storage</a> and <a
href="http://www.openstack.org/projects/image-service/">OpenStack Image Service</a>.</p><h3>NoSQL</h3><blockquote><p>In <a
href="http://en.wikipedia.org/wiki/Computing">computing</a>, <strong>NoSQL</strong> is a class of <a
href="http://en.wikipedia.org/wiki/Database_management_system">database management system</a> identified by its non-adherence to the widely-used <a
href="http://en.wikipedia.org/wiki/Relational_database_management_system">relational database management system</a> (RDBMS) model … It does not use <a
href="http://en.wikipedia.org/wiki/SQL">SQL</a> as its query language … NoSQL database systems are developed to manage large volumes of data that do not necessarily follow a fixed schema – <a
href="http://en.wikipedia.org/wiki/NoSQL">BY wikipedia</a></p></blockquote><p>NoSQL came up in <a
href="http://www.eduserv.org.uk/newsandevents/events/2012/symposium/big-science-big-data">Simon Metson’s (University of Bristol), Big science, Big Data session</a>. This class of database is common in big data applications but Simon underlined that it’s not always the right tool for the job:</p><blockquote
class="twitter-tweet tw-align-center"><p>&#8220;When all you have is a hammer, everything looks like a nail&#8221; &#8211; great quote from Simon Metson <a
href="https://twitter.com/search/%2523esym12">#esym12</a></p><p>— Matt Johnson (@mhj_work) <a
href="https://twitter.com/mhj_work/status/200586914229796864" data-datetime="2012-05-10T14:03:41+00:00">May 10, 2012</a></p></blockquote><blockquote
class="twitter-tweet tw-align-center"><p>Ah, NoSQL isn&#8217;t a silver bullet. <a
href="https://twitter.com/search/%2523esym12">#esym12</a></p><p>— Brian Kelly (@briankelly) <a
href="https://twitter.com/briankelly/status/200586554324955136" data-datetime="2012-05-10T14:02:15+00:00">May 10, 2012</a></p></blockquote><blockquote
class="twitter-tweet tw-align-center"><p>&#8220;@<a
href="https://twitter.com/mhj_work">mhj_work</a>: Simon M: NoSQL may offer a more cost-effective solution for solving big-data solutions than SQL&#8221; stressing &#8220;may&#8221; <a
href="https://twitter.com/search/%2523esym12">#esym12</a>&#8221;</p><p>— Fiona Murphy (@DrFionaLM) <a
href="https://twitter.com/DrFionaLM/status/200587137962344448" data-datetime="2012-05-10T14:04:35+00:00">May 10, 2012</a></p></blockquote><p>This view is echoed by Nick Jackson (University of Lincoln) who did an ‘awesome’ introduction to MongoDB (one of the many open source NoSQL solutions) as part of the Managing Research Data Hack Data organised by DevCSI/JISC MRD. A strongly recommend you <a
href="http://devcsi.ukoln.ac.uk/2012/05/11/event-report-managing-research-data-hack-day/">look at the resources that came out of this event including other presentations from University of Bristol on data.bris</a>.</p><div
style="text-align: center;" align="center"><iframe
src="http://player.vimeo.com/video/41832762?color=ff9933" frameborder="0" width="600" height="337"></iframe><br
/> <a
href="http://vimeo.com/41832762">Introduction to MongoDB at DevCSI Managing Research Data Hack</a> from <a
href="http://vimeo.com/ukoln">UKOLN</a> on <a
href="http://vimeo.com">Vimeo</a>.</div><p>[BTW the <a
href="http://www.mongodb.org/display/DOCS/Comparing+Mongo+DB+and+Couch+DB">MongoDB site has a very useful page highlighting how it differs from another open source NoSQL solution CouchDB</a>. So even NoSQL solutions come in many flavours. Also <a
href="http://www.eduserv.org.uk/newsandevents/events/2012/symposium/jisc-and-big-data">Simon Hodson Programme Manager, JISC MRD gave a lightening talk on JISC and Big Data at the Eduserv event</a>]</p><h3>Summary</h3><p>The amount of open source solutions in this area is perhaps not surprising as the majority of the web (<a
href="http://news.netcraft.com/archives/2012/03/05/march-2012-web-server-survey.html">65% according to the last netcraft survey</a>) is run on the open source Apache server. It’s interesting to see that code is not only being contributed by the academic/research community but also companies like Facebook who deal with big data on a daily basis. Assuming the challenge isn’t technical it then becomes about organisations understanding what they can do with data and having the talent in place (data scientists) to turn data into ‘actionable insights’.</p><p><a
href="http://live.eduserv.org.uk/VOD/Andy_Powell_Introduction_Video_Archive.php">Here are videos of all the presentations (including links to slides where available)</a></p><p>BTW <a
href="https://docs.google.com/spreadsheet/ccc?key=0AqGkLMU9sHmLdDg4VHY3RERIUXExR2FJZGp5cGRuX1E#gid=82">Here is an archive of tweets from #esym12</a></p><p>For those of you who have made it this far through my dearth on links please feel free to now leave this site and watch some of the <a
href="http://www.greenplum.com/community/data-scientist-summit-2011">videos from the Data Scientist Summit 2011</a> (I’m still working my way through but there are some inspirational presentations).</p><p><strong>Update</strong> Sander van der Waal at OSS Watch who was also at #esym12 as also posted <a
href="http://osswatch.jiscinvolve.org/wp/2012/05/15/the-dominance-of-open-source-tools-in-big-data/">The dominance of open source tools in Big Data Published</a></p><div
id="crp_related"><h3>You also might like :</h3><ul><li><a
href="http://mashe.hawksey.info/2012/03/do-you-git-it-open-educational-resourcespractices-meets-software-version-control/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13312');" class="crp_title">Do you git it?: Open educational resources/practices meets software version control #ukoer</a></li><li><a
href="http://mashe.hawksey.info/2012/05/analytics-reconnoitre-notes-on-r-in-education-and-industry/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13312');" class="crp_title">Analytics Reconnoitre: Notes on R in education and industry</a></li><li><a
href="http://mashe.hawksey.info/2012/03/googleplus-search-to-rss/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13312');" class="crp_title">Turning Google+ Search results into a RSS feed (for Google Reader)</a></li><li><a
href="http://mashe.hawksey.info/2012/03/automatic-translation-of-tags-twitter-archives-using-google-apps-script-language-services/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13312');" class="crp_title">Automatic translation of TAGS Twitter archives using Google Apps Script &#8216;Language&#8217; services</a></li><li><a
href="http://mashe.hawksey.info/2012/04/google-docs-insert-cc-images/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13312');" class="crp_title">Google Docs defaults to searching for Creative Commons licensed images. Great, but could they do better?</a></li></ul></div><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/MASHe?a=FGRFhdFaFv0:hH2Z6JX1xB8:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/MASHe?i=FGRFhdFaFv0:hH2Z6JX1xB8:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=FGRFhdFaFv0:hH2Z6JX1xB8:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/MASHe?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=FGRFhdFaFv0:hH2Z6JX1xB8:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/MASHe?i=FGRFhdFaFv0:hH2Z6JX1xB8:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=FGRFhdFaFv0:hH2Z6JX1xB8:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/MASHe?i=FGRFhdFaFv0:hH2Z6JX1xB8:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=FGRFhdFaFv0:hH2Z6JX1xB8:JUhcmGiK9AQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=JUhcmGiK9AQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=FGRFhdFaFv0:hH2Z6JX1xB8:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/MASHe?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=FGRFhdFaFv0:hH2Z6JX1xB8:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/MASHe?i=FGRFhdFaFv0:hH2Z6JX1xB8:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=FGRFhdFaFv0:hH2Z6JX1xB8:jQG8q3pvyYY"><img src="http://feeds.feedburner.com/~ff/MASHe?d=jQG8q3pvyYY" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=FGRFhdFaFv0:hH2Z6JX1xB8:IqXOmDWthuQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=IqXOmDWthuQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/MASHe/~4/FGRFhdFaFv0" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://mashe.hawksey.info/2012/05/analytics-reconnoitre-notes-on-open-solutions-in-big-data-from-esym12/feed/</wfw:commentRss> <slash:comments>1</slash:comments> <feedburner:origLink>http://mashe.hawksey.info/2012/05/analytics-reconnoitre-notes-on-open-solutions-in-big-data-from-esym12/</feedburner:origLink></item> <item><title>Visual Analytics: Comparison of @SCOREProject and @UKOER (and template for making your own)</title><link>http://feedproxy.google.com/~r/MASHe/~3/Q_0qmk4GT7o/</link> <comments>http://mashe.hawksey.info/2012/05/visual-compare-twitter-accounts/#comments</comments> <pubDate>Thu, 10 May 2012 08:32:33 +0000</pubDate> <dc:creator>Martin Hawksey</dc:creator> <category><![CDATA[Analytics]]></category> <category><![CDATA[Google Apps]]></category> <category><![CDATA[Google Apps Script]]></category> <category><![CDATA[R]]></category> <category><![CDATA[Visualisation]]></category> <category><![CDATA[#ukoer]]></category> <guid isPermaLink="false">http://mashe.hawksey.info/?p=13306</guid> <description><![CDATA[Lou McGill from the JISC/HEA OER Programme Synthesis and Evaluation team recently contacted me as part of the OER Review asking if there was a way to analyse and visualise the Twitter followers of @SCOREProject and @ukoer. Having recently extracted data for the @jisccetis network of accounts I knew it was easy to get the [...]]]></description> <content:encoded><![CDATA[<p>Lou McGill from the JISC/HEA OER Programme Synthesis and Evaluation team recently contacted me as part of the <a
href="https://oersynth.pbworks.com/w/page/53072550/HEFCE-OER-Review">OER Review</a> asking if there was a way to analyse and visualise the Twitter followers of @SCOREProject and @ukoer. Having recently extracted data for the @jisccetis network of accounts I knew it was easy to get the information but make meaningful was another question.</p><p>There are a growing number of sites like <a
href="http://twiangulate.com">twiangulate.com</a> and <a
href="http://create.visual.ly">visual.ly</a> that make it easy to generate numbers and graphics. One of the limitations I find with these tools is they produce flat images and all opportunities for ‘visual analytics’ is lost.</p><table
border="0" cellspacing="0" cellpadding="2" width="680"><tbody><tr><td
valign="top" width="340" align="center"><a
href="http://twiangulate.com/search/ukoer-scoreproject/common_followers/table/my_friends-1/"><img
style="background-image: none; border-right-width: 0px; padding-left: 0px; padding-right: 0px; display: inline; border-top-width: 0px; border-bottom-width: 0px; border-left-width: 0px; padding-top: 0px" title="Click to see twiangulate comparison of SCOREProject and UKOER" border="0" alt="Click to see twiangulate comparison of SCOREProject and UKOER" src="http://mcdn.hawksey.info/wp-content/uploads/2012/05/image.png" width="184" height="260" /></a> <br
/>Twiangulate data</td><td
valign="top" width="340" align="center"><a
href="http://bit.ly/IXGTek"><img
style="background-image: none; border-right-width: 0px; padding-left: 0px; padding-right: 0px; display: inline; border-top-width: 0px; border-bottom-width: 0px; border-left-width: 0px; padding-top: 0px" title="Click to see visual.ly comparison of SCOREProject and UKOER" border="0" alt="Click to see visual.ly comparison of SCOREProject and UKOER" src="http://mcdn.hawksey.info/wp-content/uploads/2012/05/infographic20120508222708.jpg" width="158" height="262" /></a> <br
/>create infographics with visual.ly</td></tr></tbody></table><p>So here’s my take on the problem. A template constructed with free and open source tools that lets you visually explorer the @SCOREProject and @ukoer Twitter following.</p><p><a
href="http://hawksey.info/labs/compare/score-ukoer.html"><img
style="background-image: none; border-right-width: 0px; margin: 0px 0px 4px 8px; padding-left: 0px; padding-right: 0px; display: inline; float: right; border-top-width: 0px; border-bottom-width: 0px; border-left-width: 0px; padding-top: 0px" title="Comparison of @SCOREProject and @ukoer" border="0" alt="Comparison of @SCOREProject and @ukoer" align="right" src="http://mcdn.hawksey.info/wp-content/uploads/2012/05/made-in-aviary.png" width="185" height="260" /></a>In this post I’ll give my narrative on the SCOREProject/UKOER Twitter followership and give you the basic recipe for creating your own comparisons (I should say that the solution isn’t production quality, but I need to move onto other things so someone else can tidy up).</p><p>Let start with the output. Here’s <a
href="http://hawksey.info/labs/compare/score-ukoer.html">a page comparing the Twitter Following of SCOREProject and UKOER</a>. At the top each bubble represents someone who follows SCOREProject or UKOER (hovering over a bubble we can see who they are and clicking filters the summary table at the bottom).</p><h3>Bubble size matters</h3><p>There are three options to change how the bubbles are sized:</p><ul><li>Betweenness Centrality (a measure of the community bridging capacity); (see <a
href="http://blogs.cetis.ac.uk/sheilamacneill/2011/08/30/betweenness-centrality-helping-us-understand-our-networks/">Sheila’s post on this</a>)</li><li>In-Degree (how many other people who follower SCOREProject or ukoer also follow the person represented by the bubble); and</li><li>Followers count (how many people follower the person represented by the node</li></ul><p>Clicking on ‘Grouped’ button lets you see how bubble/people follow either the SCOREProject, UKOER or both. By switching between betweeness, degree and followers we can visually spot a couple of things:</p><ul><li><strong>Betweenness Centrality:</strong> SCOREProject has 3 well connected intercommunity bubbles @GdnHigherEd, @gconole and&#160; @A_L_T. UKOER has the SCOREProject following them which unsurprisingly makes them a great bridge to the SCOREProject community (if you are wondering where UKOER is as they don’t follow SCOREProject they don’t appear.</li><li><strong>In-Degree:</strong> Switching to In-Degree we can visually see that the overall volume of the UKOER group grows more despite the SCOREProject bubble in this group decreasing substantially. This suggests to me that the UKOER following is more interconnected</li><li><strong>Followers count: </strong>Here we see SCOREProject is the biggest winner thanks to being followed by @douglasi who has over 300,000 followers. So whilst SCOREProject is followed by less people than UKOER it has a potential greater reach if @douglasi ever retweeted a message.</li></ul><h3>Colourful combination</h3><p>Sticking with the grouped bubble view we can see different colour grouping within the clusters for SCOREProject, UKOER and both. The most noticeable being light green used to identify Group 4 which has 115 people people following SCOREProject compared to 59 following UKOER. The groupings are created using <a
href="http://arxiv.org/abs/cond-mat/0603718">community structure detection algorithm proposed Joerg Reichardt and Stefan Bornholdt</a>. To give a sense of who these sub-groups might represent individual wordclouds have been generated based on the individual Twitter profile descriptions. Clicking on a word within these clouds filters the table. So for example you can explore who has used the term manager in their twitter profile (I have to say the update isn’t instant but it’ll get there.&#160;</p><p><img
style="background-image: none; border-right-width: 0px; padding-left: 0px; padding-right: 0px; display: block; float: none; border-top-width: 0px; border-bottom-width: 0px; margin-left: auto; border-left-width: 0px; margin-right: auto; padding-top: 0px" title="wordclouds" border="0" alt="wordclouds" src="http://mcdn.hawksey.info/wp-content/uploads/2012/05/image1.png" width="700" height="196" /></p><h3>Behind the scenes</h3><p>The bubble chart is coded in d3.js and based on <a
href="http://vallandingham.me/bubble_charts_in_d3.html">Animated Bubble Chart by Jim Vallandingham</a>. The modifications I made were to allow bubble resizing (<a
href="https://github.com/mhawksey/d3-twitter-compare/blob/master/coffee/vis.coffee#L37-44">lines 37-44</a>). This also required handling the bubble charge slightly differently (<a
href="https://github.com/mhawksey/d3-twitter-compare/blob/master/coffee/vis.coffee#L118">line 118</a>). I got the idea of using the bubble chart for comparison from a Twitter Abused post <a
href="http://twitterabused.com/2012/04/24/rape-culture-twitter-abuse/">Rape Culture and Twitter Abuse</a>. It also made sense to reuse Jim’s template which uses the <a
href="http://twitter.github.com/bootstrap/">Twitter Bootstrap</a>. The wordclouds are also rendered using d3.js by using the <a
href="http://www.jasondavies.com/wordcloud/">d3.wordcloud extension by Jason Davies</a>. Finally the table at the bottom is rendered using the <a
href="https://developers.google.com/chart/">Google Visualisation API/Google Chart Tools</a>.</p><p>All the components play nicely together although the performance isn’t great. If I have more time I might play with the load sequencing, but it could be I’m just asking too much of things like the Google Table chart rendering 600 rows.&#160;</p><h3>How to make your own</h3><p>I should say that this recipe probably won’t work for accounts with over 5,000 followers. It also involves using R (in my case <a
href="http://www.rstudio.org/">RStudio</a>). R is used to do the network analysis/community detection side. You can <a
href="https://github.com/mhawksey/d3-twitter-compare/blob/master/r-processing-script.R">download a copy of the script here</a>. There’s probably an easier recipe that skips this part worth revisiting.</p><ol><li>We start with taking a copy of <a
href="https://docs.google.com/spreadsheet/ccc?key=0AqGkLMU9sHmLdE9HSDNOMmxneFpUZHdZd2pnS1BUb1E&amp;newcopy">Export Twitter Friends and Followers v2.1.2 [Network Mod]</a> (as featured in <a
href="http://mashe.hawksey.info/2012/03/notes-on-extracting-the-jisc-cetis-twitter-follower-network/">Notes on extracting the JISC CETIS twitter follower network</a>).</li><li>Authenticate the spreadsheet with Twitter (instructions in the spreadsheet) and then get the followers if the accounts you are interested in using the Twitter &gt; Get followers menu option&#160;</li><li>Once you’ve got the followers run Twitter &gt; Combine follower sheets Method II</li><li>Move to the Vertices sheet and sort the data on the friends_count column</li><li>In batches of around 250 rows select values from the id_str column and run TAGS Advanced &gt; Get friend IDs – this will start populating the friends_ids column with data. For users with over 5,000 friends reselect their id_str and rerun the menu option until the ‘next_cursor’ equals 0&#160; <br
/><img
style="background-image: none; border-bottom: 0px; border-left: 0px; padding-left: 0px; padding-right: 0px; display: inline; border-top: 0px; border-right: 0px; padding-top: 0px" title="next cursor position" border="0" alt="next cursor position" src="http://mcdn.hawksey.info/wp-content/uploads/2012/05/image2.png" width="295" height="175" /></li><li>Next open the Script editor and open the TAGS4 file and then Run &gt; setup.</li><li>Next select Publish &gt; Publish as a service… and allow anyone to invoke the service anonymously. Copy the service URL and paste it into the R script downloaded earlier (also add the spreadhsheet key to the R script and within your spreadsheet File &gt; Publish to the web&#160; <br
/><img
style="background-image: none; border-bottom: 0px; border-left: 0px; padding-left: 0px; padding-right: 0px; display: inline; border-top: 0px; border-right: 0px; padding-top: 0px" title="publish as service window" border="0" alt="publish as service window" src="http://mcdn.hawksey.info/wp-content/uploads/2012/05/image3.png" width="617" height="379" /></li><li>Run <a
href="https://github.com/mhawksey/d3-twitter-compare/blob/master/r-processing-script.R">the R script</a>! &#8230;&#160; and fingers crossed everything works.</li></ol><p>The <a
href="https://github.com/mhawksey/d3-twitter-compare">files used in the SCOREProject/UKOER can be downloaded from here</a>. Changes you’ll need to make are adding the output csv files to the data folder, changing references in js/gtable.js and js/wordcloud.js and the labels used in coffee/coffee.vis</p><p>So there you go. I’ve spent way too much of my own time on this and haven’t really explained what is going on. Hopefully the various commenting in the source code removes some of the magic (I might revisit the R code as in some ways I think it deserves a post on its own. If you have any questions or feedback leave them in the comments ;)</p><div
id="crp_related"><h3>You also might like :</h3><ul><li><a
href="http://mashe.hawksey.info/2011/12/oer-visualisation-project-beginnings-of-linking-data-from-prod-to-google-spreadsheet-and-early-fruit-day-8-ukoer/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13306');" class="crp_title">OER Visualisation Project: Beginnings of linking data from PROD to Google Spreadsheet and early fruit [day 8] #ukoer</a></li><li><a
href="http://mashe.hawksey.info/2012/02/oer-visualisation-project-the-heart-and-pulse-of-ukoer-day-40/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13306');" class="crp_title">OER Visualisation Project: The heart and pulse of #ukoer [day 40]</a></li><li><a
href="http://mashe.hawksey.info/2012/03/notes-on-extracting-the-jisc-cetis-twitter-follower-network/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13306');" class="crp_title">Notes on extracting the JISC CETIS twitter follower network</a></li><li><a
href="http://mashe.hawksey.info/2011/09/twitter-network-analysis-and-visualisation-ii-nodexl/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13306');" class="crp_title">Twitter network analysis and visualisation II: NodeXL &#8211; Getting started with the @WiredUK friends network</a></li><li><a
href="http://mashe.hawksey.info/2011/12/oer-visualisation-project-processing-a-resource-feed-to-find-frequency-using-google-spreadsheets-day-3-ukoer-ooher/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13306');" class="crp_title">OER Visualisation Project: Processing a resource feed to find frequency using Google Spreadsheets [day 3] #ukoer #ooher</a></li></ul></div><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/MASHe?a=Q_0qmk4GT7o:J7kTsMhZjdc:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/MASHe?i=Q_0qmk4GT7o:J7kTsMhZjdc:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Q_0qmk4GT7o:J7kTsMhZjdc:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/MASHe?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Q_0qmk4GT7o:J7kTsMhZjdc:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/MASHe?i=Q_0qmk4GT7o:J7kTsMhZjdc:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Q_0qmk4GT7o:J7kTsMhZjdc:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/MASHe?i=Q_0qmk4GT7o:J7kTsMhZjdc:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Q_0qmk4GT7o:J7kTsMhZjdc:JUhcmGiK9AQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=JUhcmGiK9AQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Q_0qmk4GT7o:J7kTsMhZjdc:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/MASHe?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Q_0qmk4GT7o:J7kTsMhZjdc:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/MASHe?i=Q_0qmk4GT7o:J7kTsMhZjdc:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Q_0qmk4GT7o:J7kTsMhZjdc:jQG8q3pvyYY"><img src="http://feeds.feedburner.com/~ff/MASHe?d=jQG8q3pvyYY" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Q_0qmk4GT7o:J7kTsMhZjdc:IqXOmDWthuQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=IqXOmDWthuQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/MASHe/~4/Q_0qmk4GT7o" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://mashe.hawksey.info/2012/05/visual-compare-twitter-accounts/feed/</wfw:commentRss> <slash:comments>4</slash:comments> <feedburner:origLink>http://mashe.hawksey.info/2012/05/visual-compare-twitter-accounts/</feedburner:origLink></item> <item><title>Analytics Reconnoitre: Notes on R in education and industry</title><link>http://feedproxy.google.com/~r/MASHe/~3/uIeUL9Xog0k/</link> <comments>http://mashe.hawksey.info/2012/05/analytics-reconnoitre-notes-on-r-in-education-and-industry/#comments</comments> <pubDate>Wed, 09 May 2012 10:46:52 +0000</pubDate> <dc:creator>Martin Hawksey</dc:creator> <category><![CDATA[Analytics]]></category> <category><![CDATA[JISC CETIS]]></category> <category><![CDATA[#jiscar]]></category> <category><![CDATA[analytics]]></category> <guid isPermaLink="false">http://mashe.hawksey.info/?p=13290</guid> <description><![CDATA[Whilst putting together a report on the broad landscape/trends in analytics service and data providers it’s been interesting to note how one particular analytics tools, R, keeps pinging on my radar. I thought it would be useful to loosely join these together and share.]]></description> <content:encoded><![CDATA[<p>As part of my role at JISC CETIS I’ve been asked to contribute to our ‘Analytics Reconnoitre’ which is a JISC commissioned project looking at the data and analytics landscape. One of my first tasks is to report on the broad landscape and trends in analytics service and data providers. Whilst I’m still putting this report together it’s been interesting to note how one particular analytics tools, R, keeps pinging on my radar. I thought it would be useful to loosely join these together and share.</p><h3>Before R, the bigger ‘data science’ picture&#160;</h3><p>Before I go into R there is some more scene setting required. As part of the Analytics Reconnoitre Adam Cooper (JISC CETIS) has already published <a
href="http://blogs.cetis.ac.uk/adam/2012/04/27/analytics-and-big-data-reflections-from-the-teradata-universe-conference-2012/">Analytics and Big Data &#8211; Reflections from the Teradata Universe Conference 2012</a> and <a
href="http://blogs.cetis.ac.uk/adam/2012/05/02/making-sense-of-analytics/">Making Sense of “Analytics”</a>.</p><p>The Analytics and Big Data post is an excellent summary of the Teradata Universe event and Adam is also able to note some very useful thoughts on ‘What this Means for Post-compulsory Education’. This includes identifying pathways for education to move forward with business intelligence and analytics. One of these I particularly liked was:</p><blockquote><p><strong>Experiment with being more analytical at craft-scale <br
/></strong>Rather than thinking in terms of infrastructure or major initiatives, get some practical value with the infrastructure you have. Invest in someone with &quot;data scientist&quot; skills as master crafts-person and give them access to all data but don&#8217;t neglect the value of developing apprentices and of developing wider appreciation of the capabilities and limitations of analytics.</p></blockquote><p>[I’m biased towards this path because it encapsulates a lot of what I aspire to be. The craft model was one <a
href="http://joss.blogs.lincoln.ac.uk/2012/02/25/learning-a-craft/">introduced to me by Joss Winn at this year’s Dev8D</a> and coming for a family of craftsmen it makes me more comfortable to think I’m continuing the tradition in some way.]</p><p>Here are Adams observations and reflections on ‘data science’ from the same bog post:</p><blockquote><p>&quot;Data Scientist&quot; is a term which seems to be capturing the imagination in the corporate big data and analytics community but which has not been much used in our community.</p><p>A facetious definition of data scientist is &quot;a business analyst who lives in California&quot;. Stephen Brobst gave his distinctions between data scientist and business analyst in his talk. His characterisation of a business analyst is someone who: is interested in understanding the answers to a business question; uses BI tools with filters to generate reports. A data scientist, on the other hand, is someone who: wants to know what the question should be; embodies a combination of curiosity, data gathering skills, statistical and modelling expertise and strong communication skills. Brobst argues that the working environment for a data scientist should allow them to self-provision data, rather than having to rely on what is formally supported in the organisation, to enable them to be inquisitive and creative.</p><p>Michael Rappa from the Institute for Advanced Analytics doesn&#8217;t mention curiosity but offers a similar conception of the skill-set for a data scientist in an <a
href="http://www.forbes.com/sites/danwoods/2012/03/05/what-is-a-data-scientist-michael-rappa-north-carolina-state-university/3/">interview in Forbes magazine</a>. The Guardian Data Blog has also reported on various <a
href="http://www.guardian.co.uk/news/datablog/2012/mar/02/data-scientist">views of what comprises a data scientist</a> in March 2012, following the Strata Conference.</p><p>While it can be a sign of hype for new terminology to be spawned, the distinctions being drawn by Brobst and others are appealing to me because they are putting space between mainstream practice of business analysis and some arguably more effective practices. As universities and colleges move forward, we should be cautious of adopt the prevailing view from industry &#8211; the established business analyst role with a focus on reporting and descriptive statistics &#8211; and miss out on a set of more effective practices. Our lack of baked-in BI culture might actually be a benefit if it allows us to more quickly adopt the data scientist perspective alongside necessary management reporting. Furthermore, our IT environment is such that self-provisioning is more tractable.</p></blockquote><h3>R in data science and in business</h3><p>For those that don’t know <a
href="http://www.r-project.org/">R is an open source statistical programming language</a>. If you want more background about the development of R the Information Age cover this in their piece <a
href="http://www.information-age.com/channels/information-management/features/2099883/putting-the-r-in-analytics.thtml">Putting the R in analytics</a>. An important thing to note, which is covered in the story, is R was developed by two academics at University of Auckland and continues to have a very strong and active academic community supporting it. Whilst initially used as an academic tool the article highlights how it is being adopted by the business sector.</p><p>I originally picked up the Information Age post via the Revolutions blog (hosted by <a
href="http://www.revolutionanalytics.com/">Revolution Analytics</a>) in the post <a
href="http://blog.revolutionanalytics.com/2012/04/information-age-feature-article-on-r.html">Information Age: graduates driving industry adoption of R</a>, which includes one of the following quotes from Information Age:</p><blockquote><p>This popularity in academia means that R is being taught to statistics students, says Matthew Aldridge, co-founder of UK- based data analysis consultancy Mango Solutions. “We&#8217;re seeing a lot of academic departments using R, versus SPSS which was what they always used to teach at university,” he says. “That means a lot of students are coming out with R skills.”</p><p>Finance and accounting advisory Deloitte, which uses R for various statistical analyses and to visualise data for presentations, has found this to be the case. “Many of the analytical hires coming out of school now have more experience with R than with SAS and SPSS, which was not the case years ago,” says Michael Petrillo, a senior project lead at Deloitte&#8217;s New York branch.</p></blockquote><p>Revolutions have picked up other stories related to R in big data and analytics. Two I have bookmarked are <a
href="http://blog.revolutionanalytics.com/2012/05/making-sense-of-big-data-requires-more-than-software.html">Yes, you need more than just R for Big Data Analytics</a> in which Revolutions editor David Smith underlines that having tools like R aren’t enough and a wider data science approach is needed because <em>“it combines the tool expertise with statistical expertise and the domain expertise required to understand the problem and the data applicable to it” . </em></p><p>Smith also reminds use that:</p><blockquote><p>The R software is just one piece of software ecosystem — an analytics stack, if you will — of tools used to analyze Big Data. For one thing R isn&#8217;t a data store in its own right: you also need a data layer where R can access structured and unstructured data for analysis. (For example, see how you can <a
href="http://www.revolutionanalytics.com/news-events/free-webinars/2012/r-and-hadoop-equals-big-data-analytics/">use R to extract data from Hadoop</a> in the slides from today&#8217;s webinar by Antonio Piccolboni.) At the analytics layer, you need statistical algorithms that work with Big Data, like those in Revolution R Enterprise. And at the presentation layer, you need the ability to embed the results of the analysis in reports, BI tools, or data apps.</p></blockquote><p>[Revolutions also has a comprehensive list of <a
href="http://blog.revolutionanalytics.com/2012/02/r-in-the-enterprise.html">R integrated throughout the enterprise analytics stack</a> which includes vendor integrations from IBM, Oracle, SAP and more]</p><p>The second post from Revolutions is <a
href="http://blog.revolutionanalytics.com/2012/05/r-and-foursquares-recommendation-engine.html">R and Foursquare&#8217;s recommendation engine</a> which is another graphic illustration of how R is being used in the business sector separately from vendor tools.</p><h3>Closing thoughts</h3><p>At this point it’s worth highlighting another of Adam’s thoughts on directions for academia in Analytics and Big Data:</p><blockquote><p><strong>Don&#8217;t focus on IT infrastructure (or tools) <br
/></strong>Avoid the temptation (and sales pitches) to focus on IT infrastructure as a means to get going with analytics. While good tools are necessary, they are not the right place to start.</p></blockquote><p>I agree about not being blinkered by specific tools and as pointed out earlier R can only ever be just one piece of software in the ecosystem and any good data scientist will use the right tool for the job. It’s interesting to see an academic tool being adopted, and arguable driving, part of the commercial sector. Will academia follow where they have led – if you see what I mean?</p><div
id="crp_related"><h3>You also might like :</h3><ul><li><a
href="http://mashe.hawksey.info/2012/03/google-analytics-rolling-out-social-network-activity-streams-paradata-heaven/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13290');" class="crp_title">Google Analytics rolling out social network activity streams: Paradata heaven?</a></li><li><a
href="http://mashe.hawksey.info/2012/05/analytics-reconnoitre-notes-on-open-solutions-in-big-data-from-esym12/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13290');" class="crp_title">Analytics Reconnoitre: Notes on Open Solutions in Big Data from #esym12</a></li><li><a
href="http://mashe.hawksey.info/2012/02/cetis12-social-network-analysis-data-visualisation-past-present-and-future-ukoer-infovis-presentation/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13290');" class="crp_title">#CETIS12 Social Network Analysis &amp; Data Visualisation: Past, Present and Future #ukoer #infovis Presentation</a></li><li><a
href="http://mashe.hawksey.info/2012/03/tracking-activity-diigo-site-bookmarks-in-a-google-spreadsheet/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13290');" class="crp_title">Tracking activity: Diigo site bookmark activity data in a Google Spreadsheet (who is saving your stuff)</a></li><li><a
href="http://mashe.hawksey.info/2012/03/combine-twitter-and-google-analytics-data-to-find-your-top-content-distributors/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13290');" class="crp_title">Using Google Spreadsheets to combine Twitter and Google Analytics data to find your top content distributors</a></li></ul></div><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/MASHe?a=uIeUL9Xog0k:kn0bmy5D5uQ:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/MASHe?i=uIeUL9Xog0k:kn0bmy5D5uQ:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=uIeUL9Xog0k:kn0bmy5D5uQ:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/MASHe?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=uIeUL9Xog0k:kn0bmy5D5uQ:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/MASHe?i=uIeUL9Xog0k:kn0bmy5D5uQ:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=uIeUL9Xog0k:kn0bmy5D5uQ:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/MASHe?i=uIeUL9Xog0k:kn0bmy5D5uQ:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=uIeUL9Xog0k:kn0bmy5D5uQ:JUhcmGiK9AQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=JUhcmGiK9AQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=uIeUL9Xog0k:kn0bmy5D5uQ:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/MASHe?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=uIeUL9Xog0k:kn0bmy5D5uQ:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/MASHe?i=uIeUL9Xog0k:kn0bmy5D5uQ:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=uIeUL9Xog0k:kn0bmy5D5uQ:jQG8q3pvyYY"><img src="http://feeds.feedburner.com/~ff/MASHe?d=jQG8q3pvyYY" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=uIeUL9Xog0k:kn0bmy5D5uQ:IqXOmDWthuQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=IqXOmDWthuQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/MASHe/~4/uIeUL9Xog0k" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://mashe.hawksey.info/2012/05/analytics-reconnoitre-notes-on-r-in-education-and-industry/feed/</wfw:commentRss> <slash:comments>4</slash:comments> <feedburner:origLink>http://mashe.hawksey.info/2012/05/analytics-reconnoitre-notes-on-r-in-education-and-industry/</feedburner:origLink></item> <item><title>What I’ve starred this month: April 28, 2012</title><link>http://feedproxy.google.com/~r/MASHe/~3/gB8zGWcILfw/</link> <comments>http://mashe.hawksey.info/2012/04/what-ive-starred-this-month-april-28-2012/#comments</comments> <pubDate>Sat, 28 Apr 2012 17:43:54 +0000</pubDate> <dc:creator>Martin Hawksey</dc:creator> <category><![CDATA[Starred]]></category> <guid isPermaLink="false">http://mashe.hawksey.info/?p=13260</guid> <description><![CDATA[Here&#039;s some posts which have caught my attention this month: Study on the barriers that prevent researchers from publishing their code &#124; Software Sustainability Institute &#8211; April 27, 2012 FOI Signals on Useful Open Data? « OUseful.Info, the blog… &#8211; April 24, 2012 MSDN Blogs &#8211; April 24, 2012 Big data in Europe &#8211; O&#8217;Reilly [...]]]></description> <content:encoded><![CDATA[<p>Here&#039;s some posts which have caught my attention this month:<ul><li><a
href="http://software.ac.uk/news/2012-04-27-interesting-study-barriers-prevent-researchers-publishing-their-code">Study on the barriers that prevent researchers from publishing their code | Software Sustainability Institute</a> &#8211; <em>April 27, 2012 </em></li><li><a
href="http://blog.ouseful.info/2012/04/24/foi-signals-on-useful-open-data/?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed:+ouseful+(OUseful+Info)">FOI Signals on Useful Open Data? « OUseful.Info, the blog…</a> &#8211; <em>April 24, 2012 </em></li><li><a
href="http://blogs.msdn.com/b/ukhe/archive/2012/04/24/understanding-the-costs-of-google-apps.aspx">MSDN Blogs</a> &#8211; <em>April 24, 2012 </em></li><li><a
href="http://radar.oreilly.com/2012/04/big-data-in-europe.html?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed:+oreilly/radar/atom+(O'Reilly+Radar)">Big data in Europe &#8211; O&#8217;Reilly Radar</a> &#8211; <em>April 23, 2012 </em></li><li><a
href="http://www.emergingedtech.com/2012/04/exploring-the-khan-academys-use-of-learning-data-and-learning-analytics">Exploring the Khan Academy’s use of Learning Data and Learning Analytics | Emerging Education Technology</a> &#8211; <em>April 23, 2012 </em></li><li><a
href="http://nogoodreason.typepad.co.uk/no_good_reason/2012/04/the-role-of-respectable-idiots.html?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed:+TheEdTechie+(The+Ed+Techie)">The Ed Techie: The role of respectable idiots</a> &#8211; <em>April 20, 2012 </em></li><li><a
href="http://blog.ouseful.info/2012/04/19/from-paywalls-and-attention-walls-to-data-disclosure-walls-and-survey-walls/?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed:+ouseful+(OUseful+Info)">From Paywalls and Attention Walls to Data Disclosure Walls and Survey Walls « OUseful.Info, the blog…</a> &#8211; <em>April 19, 2012 </em></li><li><a
href="http://bavatuesdays.com/john-cleese-on-creativity">John Cleese on Creativity | bavatuesdays</a> &#8211; <em>April 13, 2012 </em></li><li><a
href="http://evidenceframework.org/?p=1886">Evidence Framework for Innovation and Excellence in Education » Blog Archive » Big Data – Avalanche? Flood? Tsunami? What does big data mean for educators? » Evidence Framework for Innovation and Excellence in Education</a> &#8211; <em>April 11, 2012 </em></li><li><a
href="http://bavatuesdays.com/give-the-chronicle-some-love">Give the Chronicle Some Love | bavatuesdays</a> &#8211; <em>April 5, 2012 </em></li><li><a
href="http://edudemic.com/2012/04/grades-2-0-how-learning-analytics-are-changing-the-teachers-role">Grades 2.0: How Learning Analytics Are Changing The Teacher’s Role | Edudemic</a> &#8211; <em>April 3, 2012 </em></li><li><a
href="http://ukwebfocus.wordpress.com/2012/04/02/guest-post-being-openly-selfish-and-making-oer-work-for-you">Guest Post: Being Openly Selfish and Making “OER” Work for You « UK Web Focus</a> &#8211; <em>April 2, 2012 </em></li><li><a
href="http://www.surf.nl/en/publicaties/Pages/Users_narcissism_control.aspx">Users, narcissism and control – tracking the impact of scholarly publications in the 21st century</a> &#8211; <em>April 2, 2012 </em></li><li><a
href="http://blog.ouseful.info/2012/03/30/university-funding-a-wider-view/?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed:+ouseful+(OUseful+Info)">University Funding – A Wider View « OUseful.Info, the blog…</a> &#8211; <em>March 31, 2012 </em></li><li><a
href="http://scienceoftheinvisible.blogspot.com/2012/03/putting-w-in-dik.html?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed:+SOTI+(Science+of+the+Invisible)">Science of the Invisible: Putting the W in the DIK</a> &#8211; <em>March 31, 2012 </em></li><li><a
href="http://drbadgr.wordpress.com/2012/03/29/backchannel-and-notes-for-gtauk-how-to-get-the-best-from-g">Backchannel and notes for #GTAUK : how to get the best from G+? « DrBadgr</a> &#8211; <em>March 31, 2012 </em></li><li><a
href="http://cogdogblog.com/2012/03/28/no-mooc-respect">No MOOC Respect &#8211; CogDogBlog</a> &#8211; <em>March 29, 2012 </em></li></ul><div
style="text-align: right"><em>Automatically generated from <a
href="http://www.diigo.com/user/mhawksey/starred" target="_blank">my Diigo Starred Items</a>.</em></div><div
id="crp_related"><h3>You also might like :</h3><ul><li><a
href="http://mashe.hawksey.info/2012/03/jisc-cetis-your-public-servant-once-more/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13260');" class="crp_title">Good news: Got a full-time job at JISC CETIS</a></li><li><a
href="http://mashe.hawksey.info/2012/03/googleplus-search-to-rss/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13260');" class="crp_title">Turning Google+ Search results into a RSS feed (for Google Reader)</a></li><li><a
href="http://mashe.hawksey.info/2012/05/analytics-reconnoitre-notes-on-r-in-education-and-industry/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13260');" class="crp_title">Analytics Reconnoitre: Notes on R in education and industry</a></li><li><a
href="http://mashe.hawksey.info/2012/04/generating-an-opml-rss-bundle-from-a-page-of-links-using-google-spreadsheets/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13260');" class="crp_title">Generating an OPML RSS bundle from a page of links using Google Spreadsheets</a></li><li><a
href="http://mashe.hawksey.info/2012/04/google-docs-insert-cc-images/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13260');" class="crp_title">Google Docs defaults to searching for Creative Commons licensed images. Great, but could they do better?</a></li></ul></div><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/MASHe?a=gB8zGWcILfw:74TpDLiTloY:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/MASHe?i=gB8zGWcILfw:74TpDLiTloY:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=gB8zGWcILfw:74TpDLiTloY:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/MASHe?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=gB8zGWcILfw:74TpDLiTloY:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/MASHe?i=gB8zGWcILfw:74TpDLiTloY:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=gB8zGWcILfw:74TpDLiTloY:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/MASHe?i=gB8zGWcILfw:74TpDLiTloY:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=gB8zGWcILfw:74TpDLiTloY:JUhcmGiK9AQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=JUhcmGiK9AQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=gB8zGWcILfw:74TpDLiTloY:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/MASHe?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=gB8zGWcILfw:74TpDLiTloY:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/MASHe?i=gB8zGWcILfw:74TpDLiTloY:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=gB8zGWcILfw:74TpDLiTloY:jQG8q3pvyYY"><img src="http://feeds.feedburner.com/~ff/MASHe?d=jQG8q3pvyYY" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=gB8zGWcILfw:74TpDLiTloY:IqXOmDWthuQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=IqXOmDWthuQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/MASHe/~4/gB8zGWcILfw" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://mashe.hawksey.info/2012/04/what-ive-starred-this-month-april-28-2012/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://mashe.hawksey.info/2012/04/what-ive-starred-this-month-april-28-2012/</feedburner:origLink></item> <item><title>Generating an OPML RSS bundle from a page of links using Google Spreadsheets</title><link>http://feedproxy.google.com/~r/MASHe/~3/fYhurcTs_FI/</link> <comments>http://mashe.hawksey.info/2012/04/generating-an-opml-rss-bundle-from-a-page-of-links-using-google-spreadsheets/#comments</comments> <pubDate>Fri, 27 Apr 2012 12:36:44 +0000</pubDate> <dc:creator>Martin Hawksey</dc:creator> <category><![CDATA[Google Apps]]></category> <category><![CDATA[How-to]]></category> <category><![CDATA[Visualisation]]></category> <guid isPermaLink="false">http://mashe.hawksey.info/?p=13255</guid> <description><![CDATA[A post demonstrating how you can generate a OPML RSS bundle from a webpage containing a list of blog urls]]></description> <content:encoded><![CDATA[<p>This morning I finished listening to <a
href="http://datastori.es/episode-5-how-to-learn-data-visualization-with-andy-kirk/">Episode 5 of Data Stories: How To Learn Data Visualization</a>. Data Stories is a bi-weekly podcast on data visualisation produced by <a
href="http://enrico.bertini.me/">Enrico Bertini</a> and <a
href="http://moritz.stefaner.eu/">Moritz Stefaner</a>, episode 5 also featuring <a
href="http://visualisingdata.com/">Andy Kirk</a>. For anyone interested in Data Visualisation I’d highly recommend you give it a listen.</p><p>Like many others I’m at the beginning of my data visualisation journey, one of the things this episode highlighted was there is a whole world of data visualisation experts out there that I’ve yet to start <strike>stealing</strike> learning from. Fortunately today another Visualisation expert, Nathan Yan (FlowingData), posted his list of <a
href="http://flowingdata.com/2012/04/27/data-and-visualization-blogs-worth-following/">Data and visualization blogs worth following</a>. Perfect!</p><p>I could’ve gone through the list and individually subscribed to each of the blogs feeds but I’m lazy (so lazy that a 15 minute hack has turned into a 3 hour write-up &lt;sigh&gt;) and just wanted to dump them into my Google Reader. This is a problem Tony Hirst has encountered&#160; in <a
href="http://blog.ouseful.info/2010/10/23/feed-detection-from-blog-url-lists-with-opml-output/">Feed-detection From Blog URL Lists, with OPML Output</a>. One thing that is not clear is how Tony got his two column CSV of source urls. There are various tools Tony could have used to do this. Here’s my take on converting a page of blog urls into an OPML bundle.</p><h3>Step 1 Extracting blogs urls: Method 1 using Scraper Chrome Extension</h3><p
align="center"><em>“Scraper is a Google Chrome extension for getting data out of web pages and into spreadsheets.”</em>&#160;</p><p>Chrome users can <a
href="https://chrome.google.com/webstore/detail/mbigbapnjcgaffohmbkdlecaccepngjd">grab a copy of Scraper here</a>. Once installed if you go to <a
href="http://flowingdata.com/2012/04/27/data-and-visualization-blogs-worth-following/">Nathan Yan&#8217;s Data and visualization blogs worth following</a> and right-click on&#160; the first link in the list and select ‘Scrape similar’</p><p><img
style="background-image: none; border-right-width: 0px; padding-left: 0px; padding-right: 0px; display: block; float: none; border-top-width: 0px; border-bottom-width: 0px; margin-left: auto; border-left-width: 0px; margin-right: auto; padding-top: 0px" title="scraper menu" border="0" alt="scraper menu" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image19.png" width="379" height="285" />In the window that opens you should get something similar to the one below. Scraper has two options for identifying the parts of the page you want to extract, XPath or JQuery Selectors. Both of these have similar coding structures but for this example I’m going to stick with XPath. XPath basically provides a way to identify parts of the XML/HTML structure and extract content (if you are not familiar with <a
href="http://www.w3schools.com/xpath/xpath_syntax.asp">XPath the w3schools is a great starting point</a>).</p><p><img
style="background-image: none; border-right-width: 0px; padding-left: 0px; padding-right: 0px; display: block; float: none; border-top-width: 0px; border-bottom-width: 0px; margin-left: auto; border-left-width: 0px; margin-right: auto; padding-top: 0px" title="Scraper dialog" border="0" alt="Scraper dialog" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image20.png" width="684" height="338" /></p><p>In this example Scraper should default to ‘//div[1]/div[2]/ul[1]/li/a’. Here’s a quick explanation of how I read this query.&#160; Because it starts with // it will select <em>“nodes in the document from the current node that match the selection no matter where they are”</em> for me this is the trigger to read the query from right to left as we are matching an endpoint pattern. So:</p><blockquote><p>match all &lt;a&gt; in all &lt;li&gt; in first &lt;ul&gt; of second &lt;div&gt; (&lt;div class=”entry-content”&gt; of first &lt;div&gt; (&lt;div class=&quot;entry&quot;&gt;)</p></blockquote><p>this give use the links from the first block of bullet point. We want the links from all of the bullet points lists so the pattern we want is</p><blockquote><p>match first &lt;a&gt; in all &lt;li&gt; in all &lt;ul&gt; of second &lt;div&gt; of first &lt;div&gt;</p></blockquote><p>So basically we need to switch a to a[1] and ul[1] to ul e.g. ‘//div[1]/div[2]/ul/li/a[1]’. Edit the XPath query and in the columns section beneath change the order by clicking and dragging so that @href/URL comes first. Clicking on the ‘Scrape’ button to get a new preview which should now contain a list of 37 urls. Click on Export to Google Docs … You are now ready to move to Step 2 Auto-discovering feed urls below.</p><h3>Step 1 Extracting blogs urls: Method 2 using Google Spreadsheet importXML function</h3><p>Another way to get this data is to directly scrape it using Google Spreadsheets using the <a
href="https://support.google.com/docs/bin/answer.py?hl=en&amp;answer=155184">importXML function</a>. This function also uses XPath to extract parts of a webpage so we can reuse the query used in Method 1 but get the data straight into a spreadsheet (it’s also a live link so if Nathan adds a new link the spreadsheet will automatically update to include this). Let give it a go.</p><p>Create a new spreadsheet and in cells A1 to B3 enter the column heading Link, Title and Url. Next in cell A2 enter:</p><blockquote><p>=ImportXML(&quot;http://flowingdata.com/2012/04/27/data-and-visualization-blogs-worth-following/&quot;,&quot;//div[1]/div[2]/ul/li/a[1]/@href&quot;)</p></blockquote><p>Note the addition of @href. This is included to extract the href attribute in the &lt;a&gt;. You should now have similar list of 37 urls from Nathan’s post.&#160; To get titles we could enter another importXML function in cell B2 using the XPath ‘//div[1]/div[2]/ul/li/a[1]’ which will extract the text between &lt;a&gt;&lt;/a&gt;. Another way is to actual scrape the data from the target url. So in cell B2 enter:</p><blockquote><p>=ImportXML(A2,&quot;//title&quot;)</p></blockquote><p>So this will go to the url in A2 (<a
title="http://infosthetics.com/" href="http://infosthetics.com/">http://infosthetics.com/</a>) and extract anything wrapped in &lt;title&gt;</p><p>Now select cell B2 and fill the column down to get titles for all the urls. Finally we need to select the entire B column and Copy/Paste <strong>values</strong> only. The reason we do this is Google Spreadsheets only allows 50 importXML function per spreadsheet and we’ll need 37 more to get the RSS feeds for these sites.</p><p><img
style="background-image: none; border-right-width: 0px; padding-left: 0px; padding-right: 0px; display: block; float: none; border-top-width: 0px; border-bottom-width: 0px; margin-left: auto; border-left-width: 0px; margin-right: auto; padding-top: 0px" title="Copy Paste as Values" border="0" alt="Copy Paste as Values" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image21.png" width="427" height="404" /></p><h3>Step 2 Auto-discovering feed urls</h3><p>Initially i tried using <a
href="http://blog.ouseful.info/2010/10/22/feed-autodetection-with-yql/">Feed Autodiscovery With YQL</a> with importXML using an XPath of &quot;//link/@href&quot; but I was not getting any results. So instead decided to auto-detect the feed directly using importXML. In cell C2 enter:</p><blockquote><p>=ImportXML(A2,&quot;/html/head/link[@rel='alternate'][1]/@href&quot;)</p></blockquote><p>This time the XPath starts at the XML tree root (&lt;html&gt;) looks in the &lt;head&gt; for the first link with the attribute rel=’alternative’. From <a
href="http://blog.ouseful.info/2010/10/22/feed-autodetection-with-yql/">Tony’s post</a>:</p><blockquote><p>Remember, feed autodiscovery relies on web page containing the following construction in the HTML <em>&lt;head&gt;</em>element: <br
/><em>&lt;link rel=”alternate” type=”application/rss+xml” href=”FEED_URL” title=”FEED_NAME” /&gt;</em></p></blockquote><p>[I tried using //link[@rel=&quot;alternate&quot; and @type=&quot;application/rss+xml&quot;] but Google Spreadsheet didn’t like it, instead grabbing the first rel=alternate link]</p><p>Fill cell C2 down the rest of the column to get RSS feeds for the other urls. You’ll notice that there’s a #N/A for <a
title="http://neoformix.com/" href="http://neoformix.com/">http://neoformix.com/</a> this is because their feed isn’t auto-discoverable. Visiting their site there is a XML link (<a
title="http://neoformix.com/index.xml" href="http://neoformix.com/index.xml">http://neoformix.com/index.xml</a>) that we can just paste into our spreadsheet (tiding data is a usual processes in data visualisation).</p><h3>Step 3 Generating an OPML bundle</h3><p><a
class="thickbox" href="http://opml-generator.appspot.com/"><img
style="background-image: none; border-right-width: 0px; margin: 0px 0px 4px 8px; padding-left: 0px; padding-right: 0px; display: inline; float: right; border-top-width: 0px; border-bottom-width: 0px; border-left-width: 0px; padding-top: 0px" title="Spreadsheet -&gt; OPML Generator" border="0" alt="Spreadsheet -&gt; OPML Generator" align="right" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image22.png" width="369" height="279" /></a>You should now have <a
href="https://docs.google.com/spreadsheet/ccc?key=0AqGkLMU9sHmLdHp3WHZibmp3OVNMdGU3M0RPUXVpa3c">a spreadsheet like this one</a> with 3 columns of data (if you used the Scraper extension in step 1/method 1 you’ll need to make sure your columns are headed Link, Title and Url for the next step). Next to turn our spreadsheet of feeds into an OPML bundle. Fortunately this step has been made super easy by using the <a
href="http://opml-generator.appspot.com/">Spreadsheet -&gt; OPML Generator</a>. Just follow the instructions on this site an seconds later you’ve got:</p><p><a
href="http://www.google.com/reader/public/subscriptions/user%2F11609741331127149470%2Fbundle%2FFlowingData's%20DataVis%20Fav%20Blogs">OPML File of Nathan Yau’s recommended Data and Visualisation Blogs</a>&#160;</p><p>And because I’ve imported these into Google Reader here’s <a
href="http://www.google.com/reader/bundle/user%2F11609741331127149470%2Fbundle%2FFlowingData's%20DataVis%20Fav%20Blogs">an aggregated page of their posts</a>.</p><p><strong>Update:</strong></p><p>Tony Hirst said:</p><blockquote
class="twitter-tweet tw-align-center"><p>@<a
href="https://twitter.com/mhawksey">mhawksey</a> there&#8217;s also the new Yahoo pipes XPath Fetch block&#8230; <a
title="http://bit.ly/JEJtLz" href="http://t.co/TfgF2UOZ">bit.ly/JEJtLz</a></p><p> — Tony Hirst (@psychemedia) <a
href="https://twitter.com/psychemedia/status/195856441691750401" data-datetime="2012-04-27T12:46:29+00:00">April 27, 2012</a></p></blockquote><p> <script src="//platform.twitter.com/widgets.js" charset="utf-8"></script><br
/><blockquote
class="twitter-tweet tw-align-center"><p>@<a
href="https://twitter.com/mhawksey">mhawksey</a> so for example: list of feed URLs <a
title="http://bit.ly/Jr51dj" href="http://t.co/aQvBs3Eg">bit.ly/Jr51dj</a></p><p> — Tony Hirst (@psychemedia) <a
href="https://twitter.com/psychemedia/status/195857708354768896" data-datetime="2012-04-27T12:51:31+00:00">April 27, 2012</a></p></blockquote><p> <script src="//platform.twitter.com/widgets.js" charset="utf-8"></script>I said:<br
/><blockquote
class="twitter-tweet tw-align-center" data-in-reply-to="195857708354768896"><p>@<a
href="https://twitter.com/psychemedia">psychemedia</a> :`s</p><p> — Martin Hawksey (@mhawksey) <a
href="https://twitter.com/mhawksey/status/195858089042391041" data-datetime="2012-04-27T12:53:01+00:00">April 27, 2012</a></p></blockquote><p> <script src="//platform.twitter.com/widgets.js" charset="utf-8"></script> <p>and the how to</p><p
align="center"><a
title="Feed autodiscovery scraper - based on http://bit.ly/IDuJLu CC-BY psychemedia, on Flickr" href="http://www.flickr.com/photos/psychemedia/6972388556/"><img
alt="Feed autodiscovery scraper - based on http://bit.ly/IDuJLu CC-BY psychemedia" src="http://farm9.staticflickr.com/8146/6972388556_a1cd67e388_z.jpg" width="640" height="515" /></a></p><div
id="crp_related"><h3>You also might like :</h3><ul><li><a
href="http://mashe.hawksey.info/2012/03/googleplus-search-to-rss/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13255');" class="crp_title">Turning Google+ Search results into a RSS feed (for Google Reader)</a></li><li><a
href="http://mashe.hawksey.info/2012/03/tracking-activity-diigo-site-bookmarks-in-a-google-spreadsheet/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13255');" class="crp_title">Tracking activity: Diigo site bookmark activity data in a Google Spreadsheet (who is saving your stuff)</a></li><li><a
href="http://mashe.hawksey.info/2012/01/google-refine-apps-script-integration/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13255');" class="crp_title">Integrating Google Spreadsheet/Apps Script with Google Refine to update existing spreadsheets</a></li><li><a
href="http://mashe.hawksey.info/2012/03/automatic-translation-of-tags-twitter-archives-using-google-apps-script-language-services/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13255');" class="crp_title">Automatic translation of TAGS Twitter archives using Google Apps Script &#8216;Language&#8217; services</a></li><li><a
href="http://mashe.hawksey.info/2012/01/free-and-rebuild-the-tweets-export-twapperkeeper-archives-using-google-refine/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13255');" class="crp_title">Free (and rebuild) the tweets! Export TwapperKeeper archives using Google Refine</a></li></ul></div><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/MASHe?a=fYhurcTs_FI:lsGJLKgFDUc:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/MASHe?i=fYhurcTs_FI:lsGJLKgFDUc:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=fYhurcTs_FI:lsGJLKgFDUc:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/MASHe?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=fYhurcTs_FI:lsGJLKgFDUc:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/MASHe?i=fYhurcTs_FI:lsGJLKgFDUc:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=fYhurcTs_FI:lsGJLKgFDUc:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/MASHe?i=fYhurcTs_FI:lsGJLKgFDUc:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=fYhurcTs_FI:lsGJLKgFDUc:JUhcmGiK9AQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=JUhcmGiK9AQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=fYhurcTs_FI:lsGJLKgFDUc:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/MASHe?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=fYhurcTs_FI:lsGJLKgFDUc:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/MASHe?i=fYhurcTs_FI:lsGJLKgFDUc:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=fYhurcTs_FI:lsGJLKgFDUc:jQG8q3pvyYY"><img src="http://feeds.feedburner.com/~ff/MASHe?d=jQG8q3pvyYY" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=fYhurcTs_FI:lsGJLKgFDUc:IqXOmDWthuQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=IqXOmDWthuQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/MASHe/~4/fYhurcTs_FI" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://mashe.hawksey.info/2012/04/generating-an-opml-rss-bundle-from-a-page-of-links-using-google-spreadsheets/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://mashe.hawksey.info/2012/04/generating-an-opml-rss-bundle-from-a-page-of-links-using-google-spreadsheets/</feedburner:origLink></item> <item><title>Google Docs defaults to searching for Creative Commons licensed images. Great, but could they do better?</title><link>http://feedproxy.google.com/~r/MASHe/~3/Z958P9pcxDk/</link> <comments>http://mashe.hawksey.info/2012/04/google-docs-insert-cc-images/#comments</comments> <pubDate>Thu, 26 Apr 2012 15:04:35 +0000</pubDate> <dc:creator>Martin Hawksey</dc:creator> <category><![CDATA[Creative Commons]]></category> <category><![CDATA[Google Apps]]></category> <category><![CDATA[JISC CETIS]]></category> <guid isPermaLink="false">http://mashe.hawksey.info/?p=13245</guid> <description><![CDATA[A feature of Google Docs I wasn’t aware of, that the Insert Image Search automatically filters for Creative Commons released pictures]]></description> <content:encoded><![CDATA[<p>A retweet yesterday by Amber Thomas (<a
href="http://twitter.com/ambrouk">@ambrouk</a>) of Anna Armstrong (<a
href="https://twitter.com/frenchdisko">@frenchdisko</a><strong></strong>) alerted me to a feature of Google Docs I wasn’t aware of, that the Insert Image Search automatically filters for Creative Commons released pictures:</p><blockquote
class="twitter-tweet tw-align-center"><p>Google Docs and Creative Commons images: Did you know&#8230; In a Google Doc, if you insert a picture using a Google&#8230; <a
title="http://bit.ly/JKnSB9" href="http://t.co/ktPwKLmD">bit.ly/JKnSB9</a></p><p>— Anna Armstrong (@frenchdisko) <a
href="https://twitter.com/frenchdisko/status/195053889958064128" data-datetime="2012-04-25T07:37:25+00:00">April 25, 2012</a></p></blockquote><p><img
style="background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border-width: 0px;" title="Insert image in Google Docs" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image18.png" alt="Insert image in Google Docs" width="700" height="451" border="0" /></p><p>Fantastic I thought. A way for staff to create open resources with millions of pictures to choose from and reuse with no more effort than if they were inserting any other image. Such a feature obviously doesn’t come without it’s health warnings. Clicking on ‘<a
href="http://support.google.com/docs/bin/answer.py?hl=en-GB&amp;answer=179622&amp;p=docs_image_search">Learn more</a>’ we can see:</p><p
style="padding-bottom: 8px; padding-left: 36px; padding-right: 10px; background: url('//www.google.com/help/hc/images/hc_icon_pretty_warning_global.gif') #f9edbe no-repeat 10px 7px; padding-top: 8px; border-radius: 1px; -moz-border-radius: 1px; border: #ddd 1px solid;"><strong><br
/> Before reusing content that you&#8217;ve found, you should verify that its licence is legitimate and check the exact terms of reuse stated in the licence.</strong> For example, most licences require that you give credit to the image creator when reusing an image. Google has no way of knowing whether the licence is legitimate, so we aren&#8217;t making any representation that the content is actually lawfully licensed.</p><p>I can appreciate that Google’s search technology isn’t going to be 100% reliable in detecting which license is being used in an existing work, but wouldn’t it be great if when you inserted the image Google also gave their ‘best guess’ of the license for you to check and edit if necessary.</p><p
align="center"><a
href="http://mcdn.hawksey.info/wp-content/uploads/2012/04/GoogleDocsEmbed.jpg"><img
style="background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border: 0px;" title="A better way for Google Docs to embed?" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/GoogleDocsEmbed_thumb.jpg" alt="A better way for Google Docs to embed?" width="684" height="531" border="0" /></a><br
/> [This graphic includes <a
href="http://www.flickr.com/photos/emburke/906975297/">Japanese light bulb</a> - <a
href="http://creativecommons.org/licenses/by/2.0/">CC-BY</a> Elliot Burke]</p><p>Or am I just being naïve about this whole thing?</p><p>PS I don’t know if something has gone horrible wrong with Google image indexing, but when in Insert Image Search I enter ‘lightbulb site:.flickr.com’ the thumbnails don’t always match the actual image.</p><p><a
class="thickbox" href="http://mcdn.hawksey.info/wp-content/uploads/2012/04/thumbwhat.jpg"><img
style="background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border-width: 0px;" title="thumbwhat" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/thumbwhat_thumb.jpg" alt="thumbwhat" width="684" height="296" border="0" /></a></p><div
id="crp_related"><h3>You also might like :</h3><ul><li><a
href="http://mashe.hawksey.info/2011/09/embedding-mp3-audio-files-into-google-sites/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13245');" class="crp_title">Embedding .mp3 audio files into Google Sites</a></li><li><a
href="http://mashe.hawksey.info/2012/04/pitch-for-out-of-hours-project-develop-a-chrome-extension-to-include-creative-commons-license-in-embed-code/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13245');" class="crp_title">Pitch for out-of-hours project: Develop a Chrome extension to include Creative Commons license in embed code</a></li><li><a
href="http://mashe.hawksey.info/2012/04/guest-post-on-creative-commons-uk-adding-creative-commons-to-your-rss-feeds-in-feedburner/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13245');" class="crp_title">Guest Post on Creative Commons UK: Adding Creative Commons to your RSS feeds in Feedburner</a></li><li><a
href="http://mashe.hawksey.info/2012/04/topsy-media-timeline-v2/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13245');" class="crp_title">Revisiting: Experiment to dynamically timeline media posted on Twitter using Topsy and Timeline</a></li><li><a
href="http://mashe.hawksey.info/2012/01/sending-free-sms-messages-to-students-by-mashing-up-google-calendar-notifications/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13245');" class="crp_title">Sending free SMS messages to students by mashing up Google Calendar notifications</a></li></ul></div><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/MASHe?a=Z958P9pcxDk:Zx5wMZVE3IM:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/MASHe?i=Z958P9pcxDk:Zx5wMZVE3IM:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Z958P9pcxDk:Zx5wMZVE3IM:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/MASHe?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Z958P9pcxDk:Zx5wMZVE3IM:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/MASHe?i=Z958P9pcxDk:Zx5wMZVE3IM:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Z958P9pcxDk:Zx5wMZVE3IM:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/MASHe?i=Z958P9pcxDk:Zx5wMZVE3IM:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Z958P9pcxDk:Zx5wMZVE3IM:JUhcmGiK9AQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=JUhcmGiK9AQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Z958P9pcxDk:Zx5wMZVE3IM:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/MASHe?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Z958P9pcxDk:Zx5wMZVE3IM:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/MASHe?i=Z958P9pcxDk:Zx5wMZVE3IM:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Z958P9pcxDk:Zx5wMZVE3IM:jQG8q3pvyYY"><img src="http://feeds.feedburner.com/~ff/MASHe?d=jQG8q3pvyYY" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=Z958P9pcxDk:Zx5wMZVE3IM:IqXOmDWthuQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=IqXOmDWthuQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/MASHe/~4/Z958P9pcxDk" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://mashe.hawksey.info/2012/04/google-docs-insert-cc-images/feed/</wfw:commentRss> <slash:comments>1</slash:comments> <feedburner:origLink>http://mashe.hawksey.info/2012/04/google-docs-insert-cc-images/</feedburner:origLink></item> <item><title>JISC OER Rapid Innovation: Technical roundup and possible directions #oerri</title><link>http://feedproxy.google.com/~r/MASHe/~3/G5HLUG7i0Kc/</link> <comments>http://mashe.hawksey.info/2012/04/jisc-oer-rapid-innovation-technical-roundup-start/#comments</comments> <pubDate>Wed, 25 Apr 2012 15:22:39 +0000</pubDate> <dc:creator>Martin Hawksey</dc:creator> <category><![CDATA[JISC CETIS]]></category> <category><![CDATA[OER]]></category> <category><![CDATA[Rapid Innovation]]></category> <category><![CDATA[oerri]]></category> <guid isPermaLink="false">http://mashe.hawksey.info/?p=13228</guid> <description><![CDATA[As the JISC OER Rapid Innovation projects have either started or will start very soon, mainly for my own benefit, I thought it would be useful to quickly summarise the the technical choices and challenges.]]></description> <content:encoded><![CDATA[<p>As the <a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation.aspx">JISC OER Rapid Innovation</a> projects have either started or will start very soon, mainly for my own benefit, I thought it would be useful to quickly summarise the the technical choices and challenges.</p><h3>Attribute Images &#8211; University of Nottingham</h3><p>Building on the Xpert search engine which has a searchable index of over 250,000 open educational resources, Nottingham are planning a tool to embed CC license information into images.</p><blockquote><p>The Attribute Images project will extend the Xpert Attribution service by creating a new tool that allows users to upload images, either from their computer or from the web and have a Creative Commons attribution statement embedded in the images. … It will provide an option for the user to upload the newly attributed images to Flickr through the Flickr API … In addition it will have an API allowing developers to make use of the service in other sites.</p></blockquote><p>From <a
href="http://comms.nottingham.ac.uk/learningtechnology/2012/03/26/attribute-images-new-jisc-funded-rapid-innovation-project/">the projects first post</a> when they talk about ‘embedding’ CC statements it appears to be visible watermarking. It’ll be interesting if the project explore <a
href="http://wiki.creativecommons.org/XMP">the Creative Commons recommended Adobe Extensible Metadata Platform (XMP)</a> to embed license information into the image data. Something they might want to test is if the Flickr upload preserves this data when resizing. <a
href="http://wiki.creativecommons.org/Developers">Creative Commons also have a range of tools to integrate license selection</a> so it’ll be interesting to see if these are used or if there are compatibility issues.</p><p><a
href="http://comms.nottingham.ac.uk/learningtechnology/category/attribute-images/">Attribute Images Blog</a><br
/> <a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/attributeimages.aspx">Read more about Attribute Images on the JISC site</a></p><h3>Bebop – University of Lincoln</h3><p>Bebop is looking to help staff at Lincoln centralise personal resource creation activity from across platforms into a single stream.</p><blockquote><p>This project will undertake research and development into the use of <a
href="http://buddypress.org/">BuddyPress</a> as an institutional academic profile management tool which aggregates teaching and learning resources as well as research outputs held on third-party websites into the individual’s BuddyPress profile. … This project will investigate and develop BuddyPress so as to integrate (‘consume’) third-party feeds and APIs into BuddyPress profiles and, furthermore, investigate the possibility of BuddyPress being used as a ‘producer application’ of data for re-publishing on other institutional websites and to third-party web services.</p></blockquote><p>In a recent project post asking <a
href="http://bebop.blogs.lincoln.ac.uk/2012/03/27/where-are-the-oers/">Where are the OERs?</a> you can get an idea of the 3rd party APIs they will be looking at which includes Jorum/DSpace, YouTube, Slideshare etc. Talking to APIs isn’t a problem, after all that is what they are designed to do, and having developed plugins on WordPress/BuddyPress myself is a great platform to work on. The main technical challenge is more likely to be doing this on scale and the variability in the type of data returned. It’ll also be interesting if Bebop can be built with flexibility in mind (creating it’s own APIs so that it can be used on other platforms) – looks like the project is <a
href="http://bebop.blogs.lincoln.ac.uk/2012/04/20/rssatom-endpoints-for-user-profiles-on-oer-hosting-sites/">going down aggregating the RSS endpoint point route</a>.</p><p><a
href="http://bebop.blogs.lincoln.ac.uk/">Bebop Blog</a><br
/> <a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/Bebop.aspx">Ream more about Bebop on the JISC site</a></p><h3>Breaking Down Barriers: Building a GeoKnowledge Community with OER</h3><blockquote><p>The proposed project aims to Build a GeoKnowledge Community at Mimas by utilising existing technologies (DSpace) and services (Landmap/Jorum). The aim of the use case is to open-up 50% (8 courses) of the Learning Zone through Creative Commons (CC) Attribution Non-Commercial Share Alike (BY-NC-SA) license as agreed already with authors. A further aim is to transfer the hosting of the ELOGeo repository to Jorum from Nottingham (letter of support provided by University of Nottingham) and create a GeoKnowledge Community site embedded in Jorum using the DSpace API and linking the repository to the Landmap Learning Zone. … The technical solution in developing a specific community site within Jorum will be transferable to other communities that may have a similar requirement in the future.</p></blockquote><p>Still don’t feel I have an entire handle on the technical side of this project, but its early days and already the project is producing a <a
href="http://mimasgeoknowledge.wordpress.com/">steady stream of posts on their blog</a>. One for me to revisit.</p><p><a
href="http://mimasgeoknowledge.wordpress.com/">Break Down Barriers Blog</a><br
/> <a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/breakingdownbarriers.aspx">Read more about Breaking Down Barriers on the JISC site</a></p><h4>CAMILOE (Collation and Moderation of Intriguing Learning Objects in Education)</h4><blockquote><p>This project reclaims and updates 1800 quality assured evidence informed reviews of education research, guidance and practice that were produced and updated between 2003 and 2010 and which are now archived and difficult to access. … These resources were classified using a wide range of schemas including Dublin core, age range, teaching subject, resource type, English Teaching standard and topic area but are no longer searchable or browsable by these categories. … Advances in <strong>Open Educational Resources </strong>(OER) technologies provide an opportunity to make this resource useful again for the academics who created it. These tools include enhanced meta tagging schemas for journal documents, academic proofing tools, repositories for dissemination of OER resources, and open source software for journal moderation and para data concerning resource use.</p></blockquote><p>So a lot of existing records to get into shape and put in something that makes them accessible again. Not only that, if you <a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/camiloe.aspx">look at the project overview</a> you can see usage statistics play an important part. CAMILOE is also <a
href="http://jlernexperiment.wordpress.com/2012/04/03/jlern-and-the-jisc-oer-rapid-innovation-projects/">one of the projects interested in depositing information into the UK Learning Registry node setup as part of the JLeRN Experiment</a>.</p><p>Having dabbled with <a
href="http://mashe.hawksey.info/2011/12/google-refining-jorum-ukoer/">using Google Refine to get Jorum UKOER records into a different shape</a> I wonder if the project will go down this route, or given the number and existing shape manually re-index them. I’d be very surprised if RSS or OAI-PMH didn’t make an appearance.</p><p><a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/camiloe.aspx">Read more about CAMILOE on the JISC site</a></p><h3>Improving Accessibility to Mathematical Teaching Resources</h3><blockquote><p>Making digital mathematical documents fully accessible for visually impaired students is a major challenge to offer equal educational opportunities. … In this project we now want to turn our current program, that is the result of our research, into an assistive technology tool. … According to the identified requirements we will adapt and embed our tool into an existing open source solution for editing markup to allow post-processing of recognised and translated documents for correction and further editing. We will also add facilities to our tool to allow for suitable subject specific customisation by expert users. … In addition to working with accessibility support officers we also want to enable individual learners to employ the tool by making it available firstly via a web interface and finally for download under a Creative Commons License.</p></blockquote><p>The project is building on their existing tool <a
href="http://www.cs.bham.ac.uk/research/groupings/reasoning/sdag/maxtract.php">Maxtract</a> which turns mathematical formula in pdf documents into other formats including full text descriptions, which are more screen reader friendly (<a
href="http://researchblogs.cs.bham.ac.uk/math-access/2012/04/18/how-does-maxtract-work/">a post with more info on how it works</a>). So turning</p><p><a
class="thickbox" href="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image16.png"><img
style="background-image: none; padding-left: 0px; padding-right: 0px; display: block; float: none; margin-left: auto; margin-right: auto; padding-top: 0px; border-width: 0px;" title="example equation" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image_thumb6.png" alt="example equation" width="212" height="65" border="0" /></a></p><p>into:</p><p><em>1 divided by square root of 2 pi integral sub R e to the power of minus x to the power of 2 slash 2 dx = 1 .</em></p><p>The other formats the tool already supports are PDF annotated with LaTeX and XHTML. The project is partnering with <a
href="http://www.jisctechdis.ac.uk/">JISC TechDis</a> to gather specific user requirements.</p><p><a
href="http://researchblogs.cs.bham.ac.uk/math-access/">Improving Accessibility to Mathematics Blog</a><br
/> <a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/accessibilitytomathematics.aspx">Read more about Improving Accessibility to Mathematics on the JISC site</a></p><h3>Linked Data Approaches to OERs</h3><blockquote><p>This project extends MIT’s Exhibit tool to allow users to construct bundles of OERs and other online content around playback of online video. … This project takes a linked data approach to aggregation of OERS and other online content in order  to improve the ‘usefulness’ of online resources for education. The outcome will be an open-source application which uses linked data approaches to present a collection of pedagogically related resources, framed within a narrative created by either the teacher or the students. The ‘collections’ or ‘narratives’ created using the tool will be organised around playback of rich media, such as audio or video, and will be both flexible and scaleable.</p></blockquote><p><a
href="http://simile-widgets.org/exhibit/">MIT’s Exhibit tool</a>, particularly the timeline aspect, was something I used in the <a
href="http://mashe.hawksey.info/2012/01/oer-visualisation-project-timelines-timelines-timelines-day-30-ukoer/">OER Visualisation Project</a>. The project has already produced some videos demonstrating a prototype that uses a timecode to control what is displayed (<a
href="http://ensembleljmu.wordpress.com/2012/04/13/first-prototype/">First prototype!</a>, <a
href="http://ensembleljmu.wordpress.com/2012/04/16/prototype-2/">Prototype #2</a> and <a
href="http://ensembleljmu.wordpress.com/2012/04/16/prototype-2-part-two/">Prototype #2 (part two)</a>). I’m still not entirely sure what ‘linked data approaches’ will be so it’ll be interesting to see how that shapes ups.</p><p><a
href="http://ensembleljmu.wordpress.com/">Linked Data Approaches to OERs Blog</a><br
/> Read more about Linked Data Approaches to OERs on the JISC site &lt;- not on the site yet</p><h3>Portfolio Commons</h3><blockquote><p>… seeks to provide free and open source software tools that can easily integrate open educational practices (the creation, use and sharing of OERs) into the daily routines of learners and teachers … This project proposes to create a free open source plugin for Mahara that will enable a user to select content from their Mahara Portfolio, licence it with a Creative Commons licence of their choosing, create metadata and make a deposit directly into their chosen repositories using the SWORD protocol</p></blockquote><p>The <a
href="http://en.wikipedia.org/wiki/SWORD_(Protocol)">SWORD Protocol</a>, which was developed with funding by JISC, has a healthy eco system of compliant repositories, clients and code libraries, so the technical challenge on that part is getting it wired up as a plugin for Mahara. <a
href="http://wiki.creativecommons.org/Web_Integration">Creative Commons also have a range of tools to integrate license selection for web applications</a>. It’ll be interesting to see if these are used.</p><p>When I met the project manager, John Casey, in London recently I also mentioned, given the arts background, of this project that scoping whether integrating with the Flickr API would be useful. Given that the Attribute Images project mentioned above is looking at this part the ideal scenario might be to link the Mahara plugin to a Attribute Images API, but timings might prevent that.</p><p><a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/portfoliocommons.aspx">Read more about Portfolio Commons on the JISC site</a></p><h3>Rapid Innovation Dynamic Learning Maps-Learning Registry (RIDLR)</h3><blockquote><p>Newcastle University’s Dynamic Learning Maps system (developed with JISC funding) is now embedded in the MBBS curriculum, and now being taken up in Geography and other subject areas … In RIDLR we will test the release of contextually rich paradata via the JLeRN Experiment to the Learning Registry and harvest back paradata about prescribed and additional personally collected resources used within and to augment the MBBS curriculum, to enhance the experience of teachers and learners. We will develop open APIs to harvest and release paradata on OER from end-users (bookmarks, tags, comments, ratings and reviews etc) from the Learning Registry and other sources for specific topics, within the context of curriculum and personal maps.</p></blockquote><p>The technical challenge here is getting data into and out of the Learning Registry, it’ll be interesting to see what APIs they come up with. It’ll also be interesting to see what data they can get and if it’s usable within Dynamic Learning Maps. <a
href="http://www.medev.ac.uk/blog/oer-phase-3-blog/2012/mar/8/oer-rapid-innovation-supoerglue-and-ridlr/">More information including a use case for this project has been posted here</a>.</p><p><a
href="http://www.medev.ac.uk/blog/oer-rapid-innovation-ridlr-and-supoerglue/">RIDLR and SupOERGlue Blog</a><br
/> <a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/ridlr.aspx">Read more about RIDLR on the JISC site</a></p><h3>RedFeather (Resource Exhibition and Discovery)</h3><blockquote><p>RedFeather (Resource Exhibition and Discovery) is a proposed lightweight repository server-side script that fosters best practice for OER, it can be dropped into any website with PHP, and which enables appropriate metadata to be assigned to resources, creates views in multiple formats (including HTML with in-browser previews, RSS and JSON), and provides instant tools to submit to Xpert and Jorum, or migrate to full repository platforms via SWORD.</p></blockquote><p>The above quote nicely summarises the technical headlines. In <a
href="http://blogs.ecs.soton.ac.uk/oneshare/2012/03/14/who-is-redfeather-for/">a recent blog post the team illustrate how RedFeather might be used in a couple of use cases</a>. The core component appears to be creating a single file (coded in PHP which is a server side scripting language) and transferring files/resources to a web server. It’ll be interesting to see if the project explore different deployments, for example, packaging FedFeather on a portable web server (server on a usb stick), or maybe deploy on <a
href="http://scraperwiki.com">Scraperwiki</a> (a place in the cloud where you can execute PHP), or looking at how other cloud/3rd party services could be used. <strong>Update: </strong>I forgot to mention the <a
href="http://code.google.com/p/oer-roadmap/wiki/OERPubAPI">OERPubAPI </a>which is built on the SWORD v2. The interesting part that I&#8217;m watching closely is whether this API will provide a means to publish to none SWORDed repositories like YouTube, Flickr and Slideshare.</p><p><a
href="http://blogs.ecs.soton.ac.uk/oneshare/tag/redfeather/">RedFeather Blog</a><br
/> <a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/RedFeather.aspx">Read more about RedFeather on the JISC site</a></p><h3>Sharing Paradata Across Widget Stores (SPAWS)</h3><blockquote><p>We will use the Learning Registry infrastructure to share paradata about Widgets across multiple Widget Stores, improving the information available to users for selecting widgets and improving discovery by pooling usage information across stores.</p></blockquote><p>For more detail on what paradata will be included <a
href="http://scottbw.wordpress.com/2012/04/16/spaws-in-a-nutshell/">the SPAWS nutshell post</a> says:</p><blockquote><p>each time a user visits a store and writes a review about a particular widget/gadget, or rates it, or embeds it, that information can potentially be syndicated to other stores in the network</p></blockquote><p>There’s not much for me to add about the technical side of this project as Scott has already posted <a
href="http://scottbw.wordpress.com/2012/04/02/sharing-reviews-and-ratings-between-educational-web-app-stores/">a technical overview</a> and gone into <a
href="http://scottbw.wordpress.com/2012/04/05/spaws-initial-code-hands-on-with-the-learning-registry/">more detail about the infrastructure and some initial code</a>.</p><p><a
href="http://scottbw.wordpress.com/tag/oerri">SPAWS Blog</a><br
/> <a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/sharingparadata.aspx">Read more about SPAWS on the JISC site</a></p><h3>SPINDLE: Increasing OER discoverability by improved keyword metadata via automatic speech to text transcription</h3><blockquote><p>SPINDLE will create linguistic analysis tools to filter uncommon spoken words from the automatically generated word-level transcriptions that will be obtained using Large Vocabulary Continuous Speech Recognition (LVCSR) software. SPINDLE will use this analysis to generate a keyword corpus for enriching metadata, and to provide scope for indexing inside rich media content using HTML5.</p></blockquote><p>Enhancing the discoverability of audio/media is something I’m very familiar with having <a
href="http://mashe.hawksey.info/2010/12/making-ripples-in-a-big-pond-optimising-videos-with-an-ititle-twitter-track/">used tweets to index videos</a>. My enthusiasm for this area took a knock with I discovered Mike Wald’s <a
href="http://synote.org">Synote system</a> which uses IBM’s ViaScribe to extract annotations from video/audio. There’s a lot of overlap between Synote and SPINDLE which is why it was good to see them talking to each other at the programme start-up meeting. As far as I’m aware JISC funding for Synote ended in 2009 (but has just been refunded for a mobile version) so now is a good time to look at how open source LVCSR software can be used in a scenario where accuracy for accessibility as an assistive technology is being replaced by best guess to improve accessibility in terms of discoverability.</p><p>In terms of the technical side it will be interesting to see if SPINDLE looks at the <a
href="http://html5doctor.com/video-subtitling-and-webvtt/">WebVTT</a> which seems to be finding its way at the W3C and does include an option for metadata (the issue might be that ‘V’ in WebVTT stands for video). Something that I hope doesn’t put SPINDLE off looking at WebVTT is the lack of native browser support (although it is on the way) There are <a
href="http://html5doctor.com/video-subtitling-and-webvtt/#browsers">some JavaScript libraries you can use to handle WebVTT</a>.  It’ll also be interesting if there is a chance to compare (or highlight existing research) comparing an open source offering like <a
href="http://cmusphinx.sourceforge.net/html/cmusphinx.php">Sphinx</a> with commercial (e.g. ViaScribe)</p><p><a
href="http://blogs.oucs.ox.ac.uk/openspires/category/spindle/">SPINDLE Blog</a><br
/> <a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/spindle.aspx">Read more about SPINDLE on the JISC site</a></p><h3>SupOERGlue</h3><blockquote><p>SuperOERGlue will pilot the integration of OER Glue with Newcastle University’s Dynamic Learning Maps, enabling easy content creation and aggregation from within the learning and teaching support environments, related to specific topics. … Partnering with Tatemae to use OER Glue, which harvests OER from around the world and has developed innovative ways for academics and learners to aggregate customised learning packages constructed of different OER, will enable staff and students to create their own personalised resource mashups which are directly related to specific topics in the curriculum.</p></blockquote><p><a
href="http://www.tatemae.com/projects.html">Tatemae have a track record</a> of working with open educational resources and courseware including developing <a
href="http://www.oerglue.com/">OER Glue</a>. There’s not a huge amount for me to say on the technical side. I did notice that OER Glue currently only works on Google Chrome web browser. Having worked in a number of institutions where installing extra software in a chore it’ll be interesting to see if this causes a problem. <a
href="http://www.medev.ac.uk/blog/oer-phase-3-blog/2012/mar/8/oer-rapid-innovation-supoerglue-and-ridlr/">More information including a use case for this project has been posted here</a>. <strong>Update: </strong>Related to RedFeather update I wondering if SupOERGlue will be looking at <a
href="http://oerpub.org/">OERPub</a> (<em>“An architecture for remixable Open Educational Resources (OER)”</em>)as a framework to republish OER.</p><p><a
href="http://www.medev.ac.uk/blog/oer-rapid-innovation-ridlr-and-supoerglue/">RIDLR and SupOERGlue Blog</a><br
/> <a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/supoerglue.aspx">Read more about SupOERGlue on the JISC site</a></p><h3>Synote Mobile</h3><blockquote><p>Synote Mobile will meet the important user need to make web-based OER recordings easier to access, search, manage, and exploit for learners, teachers and others. …This project will create a new mobile HTML5 version of Synote able to replay Synote recordings on any student’s mobile device capable of connecting to the Internet. The use of HTML5 will overcome the need to develop multiple device-specific applications. The original version of Synote displays the recording, transcript, notes and slide images in four different panels which uses too much screen area for a small mobile device. Synote Mobile will therefore be designed to display captions and notes and images simultaneously ‘over’ the video. Where necessary existing Synote recordings will be converted into an appropriate format to be played by the HTML5 player. Success will be demonstrated by tests and student evaluations using Synote recordings on their mobile devices.</p></blockquote><p>I’ve already mentioned Synote in relation to SPINDLE. Even though it’s early <a
href="http://access.ecs.soton.ac.uk/blog/synotemobile/2012/04/04/research-streaming-videos-onto-mobile-phones/">the project is already documenting a number of their technical challenges</a>. This includes reference to <a
href="http://www.longtailvideo.com/html5/">LongTail’s State of HTML5 Video report</a> and a related post on <a
href="http://www.saltwebsites.com/2012/html5-video-subtitle-wrap">Salt Websites</a>. The later references WebVTT and highlights some libraries that can be used. Use of javascript libraries gets around the lack of &lt;track&gt; support in browsers, but as the LongTail State of the HTML5 video report states:</p><blockquote><p>The element [&lt;track&gt;] is brand new, but every browser vendor is working hard to support it. This is especially important for mobile, since developers cannot use JavaScript to manually draw captions over a video element there.</p></blockquote><p>The report goes on to say:</p><blockquote><p>Note the HTML5 specification defines an alternative approaches to loading captions. It leverages video files with embedded text tracks. iOS supports this today (without API support), but no other browser has yet committed to implement this mechanism. Embedded text tracks are easier to deploy, but harder to edit and make available for search.</p></blockquote><p>Interesting times for Synote Mobile and potentially an opportunity for the sector to learn a lot of lessons about creating accessible mobile video.</p><p><a
href="http://access.ecs.soton.ac.uk/blog/synotemobile/">Synote Mobile Blog</a><br
/> <a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/synotemobile.aspx">Read more about Synote Mobile on the JISC site</a></p><h3>Track OER</h3><blockquote><p>The project aims to look at two ways to reduce tensions between keeping OER in one place and OER spreading and transferring. If we can find out more about where OER is being used then we can continue to gather the information that is needed and help exploit the openness of OER. … The action of the project will be to develop software that can help track open educational resources. The software will be generic in nature and build from existing work developed by BCCampus and MIT, however a key step in this project is to provide an instantiation of the tracking on the Open University’s OpenLearn platform. … The solution will build on earlier work, notably by OLnet fellow Scott Leslie (BCCampus) and JISC project <em>CaPRéT </em>led by Brandon Muramatsu (MIT project partner in <em>B2S</em>).</p></blockquote><p>At the programme start-up meeting talking to Patrick McAndrew, who is leading this project, part one of the solution is to include a unique Creative Commons License icon which is hosted on OU servers which when called by a resource reuse some content leaves a trace (<a
href="http://www.olnet.org/node/149">option 3 in the suggested solutions here</a>). This technique is well established and one I first came across when using the <a
href="http://clustrmaps.com/">ClustrMaps service</a> which uses a map of your website visitors as a hit counter (ClustrMaps was developed by <a
href="http://www.linkedin.com/in/marceisenstadt">Marc Eisenstadt</a> Emeritus Professor at the Open University – small world ;). It looks like <a
href="http://piwik.org/">Piwiki</a> is going to be used to handle/dashboard the web analytics, which is an open source alternative to Google Analytics. The second solution is extending the CETIS funded <a
href="http://capret.mitoeit.org/">CaPRéT</a> developed by Brandon Muramatsu &amp; Co. at MIT which uses JavaScript to track when a user copies and pastes some text. It’ll be interesting if Track OER can port the CaPReT backend to Piwiki (BTW Pat Lockley has posted how to do <a
href="http://blogs.oucs.ox.ac.uk/openspires/2012/04/04/oer-copy-tracking-using-google-analytics/">OER Copy tracking using Google Analytics</a>, which uses similar techniques).</p><p><a
href="http://cloudworks.ac.uk/tag/view/trackoer">Track OER Blog</a><br
/> <a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/trackoer.aspx">Read more about Track OER on the JISC site</a></p><h3>Xerte Experience Now Improved: Targeting HTML5 (XENITH)</h3><blockquote><p>Xerte Online Toolkits is a suite of tools in widespread use by teaching staff to create interactive learning materials. This project will develop the functionality for Xerte Online Toolkits to deliver content as HTML5. Xerte Online Toolkits creates and stores content as XML, and uses the Flash Player to present content. There is an increasing need for Xerte Online Toolkits to accommodate a wider range of delivery devices and platforms.</p></blockquote><p>Here’s a page with more information about <a
href="http://www.nottingham.ac.uk/xerte/toolkits.htm">Xerte Online Toolkits</a>, here’s <a
href="http://www.nottingham.ac.uk/toolkits/play_38">an example toolkit</a> and the <a
href="http://hosting.gmodules.com/ig/gadgets/file/108150762089462716664/xerte-example.xml">source xml used to render it</a> (view source). The issue with tis I haven’t seen the detail for the XENITH project, but something I initially thought about  was whether they would use <a
href="http://en.wikipedia.org/wiki/XSL_Transformations">XSLT (Extensible Stylesheet Language Transformations)</a>, but wondered if this would be a huge headache when converting their Flash player. Another possible solution I recently came across is <a
href="http://www.jangaroo.net/">jangaroo</a>:</p><blockquote><p>Jangaroo is an Open Source project building developer tools that adopt the power of <a
href="http://livedocs.adobe.com/specs/actionscript/3/as3_specification.html">ActionScript 3</a> to create high-quality JavaScript frameworks and applications. Jangaroo is <a
href="http://www.jangaroo.net/home/1240/license.html">released under the Apache License, Version 2.0</a>.</p></blockquote><p>This includes<em>“let your existing ActionScript 3 application run in the browser without a Flash plugin”</em> . It’ll be interesting to see the solution the project implements.</p><p><a
href="http://comms.nottingham.ac.uk/learningtechnology/category/xenith/">XENITH Blog</a><br
/> <a
href="http://www.jisc.ac.uk/whatwedo/programmes/ukoer3/rapidinnovation/xenith.aspx">Read more about XENITH on the JISC site</a></p><p>BTW here’s the <a
href="http://www.google.com/reader/public/subscriptions/user%2F11609741331127149470%2Fbundle%2FOERRI">OPML file for the RSS feeds of the blogs that are live</a> (also <a
href="http://www.google.com/reader/bundle/user%2F11609741331127149470%2Fbundle%2FOERRI">visible here as a Google Reader bundle</a>)</p><p>So which of these projects interests you the post? If you are on one of the projects do my technical highlights look right or have I missed something important?</p><div
id="crp_related"><h3>You also might like :</h3><ul><li><a
href="http://mashe.hawksey.info/2012/03/do-you-git-it-open-educational-resourcespractices-meets-software-version-control/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13228');" class="crp_title">Do you git it?: Open educational resources/practices meets software version control #ukoer</a></li><li><a
href="http://mashe.hawksey.info/2012/05/using-google-spreadsheets-to-dashboard-projectcourse-blog-feeds-oerri/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13228');" class="crp_title">Using Google Spreadsheets to dashboard project/course blog feeds #oerri</a></li><li><a
href="http://mashe.hawksey.info/2010/10/learning-and-teaching-innovation-grants-for-fe-swani/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13228');" class="crp_title">Learning and Teaching Innovation Grants for FE (SWaNI)</a></li><li><a
href="http://mashe.hawksey.info/2012/04/pitch-for-out-of-hours-project-develop-a-chrome-extension-to-include-creative-commons-license-in-embed-code/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13228');" class="crp_title">Pitch for out-of-hours project: Develop a Chrome extension to include Creative Commons license in embed code</a></li><li><a
href="http://mashe.hawksey.info/2011/12/40-days-to-let-you-see-the-impact-of-the-oer-programme-ukoer-day-1/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13228');" class="crp_title">40 days to let you see the impact of the OER Programme #ukoer [day 1]</a></li></ul></div><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/MASHe?a=G5HLUG7i0Kc:L5cf5pmqURU:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/MASHe?i=G5HLUG7i0Kc:L5cf5pmqURU:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=G5HLUG7i0Kc:L5cf5pmqURU:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/MASHe?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=G5HLUG7i0Kc:L5cf5pmqURU:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/MASHe?i=G5HLUG7i0Kc:L5cf5pmqURU:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=G5HLUG7i0Kc:L5cf5pmqURU:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/MASHe?i=G5HLUG7i0Kc:L5cf5pmqURU:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=G5HLUG7i0Kc:L5cf5pmqURU:JUhcmGiK9AQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=JUhcmGiK9AQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=G5HLUG7i0Kc:L5cf5pmqURU:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/MASHe?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=G5HLUG7i0Kc:L5cf5pmqURU:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/MASHe?i=G5HLUG7i0Kc:L5cf5pmqURU:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=G5HLUG7i0Kc:L5cf5pmqURU:jQG8q3pvyYY"><img src="http://feeds.feedburner.com/~ff/MASHe?d=jQG8q3pvyYY" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=G5HLUG7i0Kc:L5cf5pmqURU:IqXOmDWthuQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=IqXOmDWthuQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/MASHe/~4/G5HLUG7i0Kc" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://mashe.hawksey.info/2012/04/jisc-oer-rapid-innovation-technical-roundup-start/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://mashe.hawksey.info/2012/04/jisc-oer-rapid-innovation-technical-roundup-start/</feedburner:origLink></item> <item><title>Guest Post on Creative Commons UK: Adding Creative Commons to your RSS feeds in Feedburner</title><link>http://feedproxy.google.com/~r/MASHe/~3/TkxjQEZ78tY/</link> <comments>http://mashe.hawksey.info/2012/04/guest-post-on-creative-commons-uk-adding-creative-commons-to-your-rss-feeds-in-feedburner/#comments</comments> <pubDate>Tue, 24 Apr 2012 09:06:47 +0000</pubDate> <dc:creator>Martin Hawksey</dc:creator> <category><![CDATA[Creative Commons]]></category> <category><![CDATA[JISC CETIS]]></category> <guid isPermaLink="false">http://mashe.hawksey.info/?p=13211</guid> <description><![CDATA[Today I have a guest post on the Creative Commons UK blog on Adding Creative Commons to your RSS feeds. The post outlines how you can add a link and text to a regionalised Creative Commons license to your RSS feed. The solution isn’t ideal and makes the best of a bad world, but hopefully [...]]]></description> <content:encoded><![CDATA[<p><img
style="background-image: none; margin: 0px 0px 4px; padding-left: 0px; padding-right: 0px; display: inline; float: right; padding-top: 0px; border: 0px;" title="image" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image15.png" alt="Adding some custom Flare" width="360" height="220" align="right" border="0" /></p><p>Today I have a <a
href="http://www.creativecommons.org.uk/blog/tabid/84/EntryId/21/Adding-Creative-Commons-to-your-RSS-feeds.aspx">guest post on the Creative Commons UK blog on Adding Creative Commons to your RSS feeds</a>. The post outlines how you can add a link and text to a regionalised Creative Commons license to your RSS feed.</p><p>The solution isn’t ideal and makes the best of a bad world, but hopefully it is useful if only to show you how you can add things to the end of your Feedburner feed.</p><p>The process of documenting this solution surfaced a number of issues which I’m keen to explore further. Here are a couple listed here more as a note to myself to follow-up:</p><ul><li>variation in the availability of CC licenses types (CC-BY, CC-BY-SA etc) on 3rd party service like YouTube, Flickr and Slideshare</li><li>generic CC licensing over regionalised versions – 3rd party services offering option to CC license usually default to generic (here’s <a
href="http://mcdn.hawksey.info/wp-content/uploads/2012/04/CC-BY-2.5-Comparison-Generic-and-Scotland.docx" target="_blank">a Word Doc comparing the CC-BY 2.5 with the CC-BY 2.5 Scotland</a>&nbsp;- afraid you&#8217;ll need to open in Word in Review &#8211; Final: Show markup mode).</li><li>human versus machine readable licenses – using the <a
href="http://openattribute.com/">OpenAttribute Extension</a> I wasn’t able to detect machine readable licences on YouTube and Flickr (Slideshare worked), but Flickr includes CC licenses information for items in a users RSS version of their photostream (I notice in the <a
href="http://www.creativecommons.org.uk/blog/tabid/84/EntryId/20/OpenAttribute-Making-Creative-Commons-Attribution-Easier.aspx">OpenAttribute post on the Creative Commons blog</a> that in the video CC is detected in Flickr).</li><li>discoverability of user activity RSS feeds from 3rd party services -&nbsp; Joss Winn has started <a
href="http://bebop.blogs.lincoln.ac.uk/2012/04/20/rssatom-endpoints-for-user-profiles-on-oer-hosting-sites/">documenting some endpoints as part of the Bebob project</a></li><li>who is consuming CC licensed RSS feeds outside of education? – this question was prompted by James Burke at Creative Commons UK (@<a
href="http://www.twitter.com/deburca">deburca</a>).</li></ul><p>So a number of things to think about.</p><p>[The guest post also led me to posting a <a
href="http://mashe.hawksey.info/2012/04/pitch-for-out-of-hours-project-develop-a-chrome-extension-to-include-creative-commons-license-in-embed-code/">Pitch for out-of-hours project: Develop a Chrome extension to include Creative Commons license in embed code</a> – no interest yet so it’s gone to the bottom of my list ;)]</p><p>[Mainly so I've got a record I've now included the original guest post below]</p><hr
style="border: 1px solid #ccc; display:block"/><p><img
style="background-image: none; margin: 0px 0px 4px; padding-left: 0px; padding-right: 0px; display: inline; float: right; padding-top: 0px; border-width: 0px;" title="image" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image17.png" alt="Creative Commons License Tool" width="353" height="209" align="right" border="0" />So you’ve decided to ‘share your knowledge and creativity with the world’ licensing your blog content using Creative Commons. Having chosen the license for your needs, next you need to mark your work so that other know how they can remix or reuse your content. The <a
href="http://creativecommons.org/choose/">Creative Commons licensing tool</a> helps you select the license you need but also generates the code for your website.</p><p>Visitors to your website will see how your work is licensed, but what about people who read your content using different ways? What about people who subscribe to your content using RSS feeds (unsure about RSS? <a
href="http://www.commoncraft.com/rss_plain_english">Here it is explained in plain English</a>)? There are a couple of ways to include your CC licenses in your RSS feed.&nbsp; You can, for example, manually copy and paste your license code to the end of every blog post, or your blogging platform may include options to insert a custom footnote which could include the license link.</p><h3>Burning a CC license</h3><p>As the RSS feed for <a
href="http://mashe.hawksey.info/">my blog</a> is already distributed using Google’s Feedburner service I use the built in options to insert a human and machine readable Creative Commons license into my feed. As these options aren’t immediately apparent here’s <a
href="#turningOnCC">a quick guide for turning them on</a>.</p><p>One of the drawbacks of this solution is that while you have control over the licensing level (CC-BY, CC-BY-SA etc.) there is no way to select the regionalised license to match your legal jurisdiction. If like me this is important to you here’s how you can do it.</p><h3>Creating a custom Creative Commons ‘flare’</h3><p>The way I’m going to show you to do this is using Feedburner’s FeedFlare option, which if you’ve just setup Feedburner’s built-in CC licensing you’ll know is an option to add links to the end of your RSS feed items. As part of FeedFlare you can <a
href="https://developers.google.com/feedburner/feedflare_dev_guide">create your own custom ‘flares’</a>.</p><p>Custom FeedFlares are written in an XML format. Don’t worry if you don’t know what this is, it’s not important for this guide, what is important is having somewhere Feedburner can read this custom XML file. Fortunately Google can help with this problem too. By adding <a
href="http://www.google.com/ig/adde?moduleurl=gge.xml">this Google Gadget Editor</a> to your iGoogle page you can write and publish some XML for Feedburner to read (Google account is required).</p><p>Once the editor is installed paste the text below replacing the ‘hello world’ example.</p><p>&lt;FeedFlareUnit&gt;<br
/> &lt;SCRIPT/&gt;<br
/> &lt;Catalog&gt;<br
/> &lt;Title&gt;Attribution CC-BY Martin Hawksey&lt;/Title&gt;<br
/> &lt;Description&gt; Displays my Creative Commons CC-BY license. &lt;/Description&gt;<br
/> &lt;/Catalog&gt;<br
/> &lt;FeedFlare&gt;<br
/> &lt;Text&gt;CC-BY Martin Hawksey&lt;/Text&gt;<br
/> &lt;Link href=&#8221;http://creativecommons.org/licenses/by/2.5/scotland/&#8221; rel=&#8221;license&#8221;/&gt;<br
/> &lt;/FeedFlare&gt;<br
/> &lt;/FeedFlareUnit&gt;</p><p>Before saving there are a couple of changes we need to make.</p><ol><li>Edit &lt;text&gt; tag to match your chosen CC license and attribution (this is the text that will appear at the end of your post)</li><li>Change theurlto match the address of the license you are using</li></ol><p>You can now save the xml using any filename you like. Copy the link address of your file (right click on the file name hyperlink in the top right of the gadget editor)</p><p><img
style="background-image: none; padding-left: 0px; padding-right: 0px; display: block; float: none; margin-left: auto; margin-right: auto; padding-top: 0px; border-width: 0px;" title="image" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image9.png" alt="Copy link for xml" width="491" height="330" border="0" /></p><p>Open the FeedFlare options for your feed which are in the ‘Optimize’ tab for your feed in Feedburner. In the ‘(Enter or paste a Flare Unit URL)’ box under Personal FeedFlare paste the url copied from the Gadget Editor.</p><p><img
style="background-image: none; padding-left: 0px; padding-right: 0px; display: block; float: none; margin-left: auto; margin-right: auto; padding-top: 0px; border-width: 0px;" title="image" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image10.png" alt="Add custom feedflare" width="422" height="206" border="0" /></p><p>Once it’s added tick the box next to it in the Feed column and scroll down to save your settings. Your feed should now show how the post is licensed with a clickable link to the full license text.</p><p><img
style="background-image: none; padding-left: 0px; padding-right: 0px; display: block; float: none; margin-left: auto; margin-right: auto; padding-top: 0px; border-width: 0px;" title="image" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image11.png" alt="Custom CC-BY link in feed " width="271" height="217" border="0" /></p><p>One final thing you might want to do is decide if you want to keep the ‘Creative Commons’ option on in the Publicize tab of Feedburner. This option adds generic license details to the machine readable part of your feed, not the regionalised version you are using. On my feed I have deactivated it because I’m not using my feed to submit content to other repositories and a machine readable CC license is on my website pages.</p><p><img
style="background-image: none; padding-left: 0px; padding-right: 0px; display: block; float: none; margin-left: auto; margin-right: auto; padding-top: 0px; border-width: 0px;" title="image" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image12.png" alt="CC License in feed metadata" width="544" height="121" border="0" /></p><p>Of course all of this hassle would go away if Feedburner allowed users to choose their jurisdiction. In fact looking across other services I regularly use like YouTube, Flickr and Slideshare where Creative Commons licenses can be applied it is always generic. Should these services not be looking to take it to the next logical step and providing regionalised licensing options?</p><h3></h3><h3>Footnote: Turning on Creative Commons Licensing in Feedburner<a
name="turningOnCC"></a></h3><p>From your <a
href="http://feedburner.google.com/fb/a/myfeeds">My Feeds in Feedburner</a> click on the feed you want to edit.</p><h4>Adding a human readable license</h4><ol><li>Click on he ‘Optimize’ tab and then on FeedFlare from the Services list</li><li>Tick the box in the feed column next to ‘View Creative Commons License’</li><li>Click on ‘Activate/Save’</li></ol><h4>Adding a machine readable license</h4><ol><li>Click on the ‘Publicize’ tab and the ‘Creative Commons’ from the Services list</li><li>Chose the licensing level</li><li>Click ‘Activate/Save’</li></ol><div
id="crp_related"><h3>You also might like :</h3><ul><li><a
href="http://mashe.hawksey.info/2012/04/pitch-for-out-of-hours-project-develop-a-chrome-extension-to-include-creative-commons-license-in-embed-code/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13211');" class="crp_title">Pitch for out-of-hours project: Develop a Chrome extension to include Creative Commons license in embed code</a></li><li><a
href="http://mashe.hawksey.info/2012/04/google-docs-insert-cc-images/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13211');" class="crp_title">Google Docs defaults to searching for Creative Commons licensed images. Great, but could they do better?</a></li><li><a
href="http://mashe.hawksey.info/2011/09/embedding-mp3-audio-files-into-google-sites/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13211');" class="crp_title">Embedding .mp3 audio files into Google Sites</a></li><li><a
href="http://mashe.hawksey.info/2012/01/sending-free-sms-messages-to-students-by-mashing-up-google-calendar-notifications/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13211');" class="crp_title">Sending free SMS messages to students by mashing up Google Calendar notifications</a></li><li><a
href="http://mashe.hawksey.info/2010/06/jorumopen-a-quick-overview/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13211');" class="crp_title">JorumOpen: A Quick Overview and Competition Details</a></li></ul></div><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/MASHe?a=TkxjQEZ78tY:p24u4LK8o6A:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/MASHe?i=TkxjQEZ78tY:p24u4LK8o6A:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=TkxjQEZ78tY:p24u4LK8o6A:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/MASHe?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=TkxjQEZ78tY:p24u4LK8o6A:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/MASHe?i=TkxjQEZ78tY:p24u4LK8o6A:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=TkxjQEZ78tY:p24u4LK8o6A:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/MASHe?i=TkxjQEZ78tY:p24u4LK8o6A:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=TkxjQEZ78tY:p24u4LK8o6A:JUhcmGiK9AQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=JUhcmGiK9AQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=TkxjQEZ78tY:p24u4LK8o6A:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/MASHe?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=TkxjQEZ78tY:p24u4LK8o6A:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/MASHe?i=TkxjQEZ78tY:p24u4LK8o6A:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=TkxjQEZ78tY:p24u4LK8o6A:jQG8q3pvyYY"><img src="http://feeds.feedburner.com/~ff/MASHe?d=jQG8q3pvyYY" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=TkxjQEZ78tY:p24u4LK8o6A:IqXOmDWthuQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=IqXOmDWthuQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/MASHe/~4/TkxjQEZ78tY" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://mashe.hawksey.info/2012/04/guest-post-on-creative-commons-uk-adding-creative-commons-to-your-rss-feeds-in-feedburner/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <feedburner:origLink>http://mashe.hawksey.info/2012/04/guest-post-on-creative-commons-uk-adding-creative-commons-to-your-rss-feeds-in-feedburner/</feedburner:origLink></item> <item><title>Pitch for out-of-hours project: Develop a Chrome extension to include Creative Commons license in embed code</title><link>http://feedproxy.google.com/~r/MASHe/~3/pVSZDuBdxOw/</link> <comments>http://mashe.hawksey.info/2012/04/pitch-for-out-of-hours-project-develop-a-chrome-extension-to-include-creative-commons-license-in-embed-code/#comments</comments> <pubDate>Sat, 21 Apr 2012 22:58:14 +0000</pubDate> <dc:creator>Martin Hawksey</dc:creator> <category><![CDATA[Creative Commons]]></category> <guid isPermaLink="false">http://mashe.hawksey.info/?p=13206</guid> <description><![CDATA[Update: Looking at OpenAttribute which looks like it does some of this. I’ll probably spend my time on something different instead&#160; Right now it feels like I’m getting more ideas than time to develop them. So that I don’t waste more time with ill thought out ideas like ‘spreadembed’ (what was I thinking ;), this [...]]]></description> <content:encoded><![CDATA[<p><strong>Update:</strong> Looking at <a
href="http://openattribute.com/">OpenAttribute</a> which looks like it does some of this. I’ll probably spend my time on something different instead&#160;</p><p>Right now it feels like I’m getting more ideas than time to develop them. So that I don’t waste more time with ill thought out ideas like ‘<a
href="http://mashe.hawksey.info/2011/10/spreadembed/">spreadembed’</a> (what was I thinking ;), this time instead of disappearing into my creative dark space I thought I would pitch my idea here and if there is enough interest give it priority over my long list of other things to do. So here’s the pitch:</p><h3>Chrome extension to include Creative Commons license in embed code</h3><p>More services like YouTube, Flickr and Slideshare are giving people the option to include a Creative Commons license with their work. This is great because it means it’s easier for people to reuse the work and also it raises the profile of releasing work under Creative Commons (i.e. every time you see a CC icon you are advertising a different way to share your work).</p><h4>The problem</h4><p>The problem is often when you use these services the sites terms and conditions like:</p><blockquote><p>by submitting Content to YouTube, you hereby grant YouTube a worldwide, non-exclusive, royalty-free, sublicenseable and transferable license to use, reproduce, distribute, prepare derivative works of, display, and perform the Content in connection with the Service and YouTube&#8217;s (and its successors&#8217; and affiliates&#8217;) business, including without limitation for promoting and redistributing part or all of the Service (and derivative works thereof) in any media formats and through any media channels. <a
href="http://www.youtube.com/static?gl=EN&amp;template=terms">YouTube Terms Section 6.C</a></p></blockquote><p>aka ‘we can use your content how we like’, which often includes not including any CC license (if used) in the embed code. This has resulted in projects like <a
href="http://www.nottingham.ac.uk/xpert/attribution/">Xpert Attribution tool</a> and <a
href="http://www.imagecodr.org/">ImageCodr</a> which provide a way for people to grab embed content with the CC attribution as shown below:</p><table
border="0" cellspacing="0" cellpadding="2" width="680"><tbody><tr><td
valign="bottom" width="340"><p
align="center"><a
title="UKOER Twitter Community by m.hawksey, on Flickr" href="http://www.flickr.com/photos/mhawksey/6840994573/"><img
alt="UKOER Twitter Community" src="http://farm8.staticflickr.com/7146/6840994573_1fe5b0e9ec_n.jpg" width="320" height="208" /></a> <br
/>Embedded image from Flickr</p></td><td
valign="bottom" width="340"><p
align="center"><img
src="http://www.nottingham.ac.uk/xpert/attribution/pictureattrib/mangle.php?url=http://farm5.static.flickr.com/7146/6840994573_1fe5b0e9ec_b.jpg&amp;original_url=http://www.flickr.com/13151915@N00/6840994573/&amp;license=flickr_4&amp;flickr_id=6840994573&amp;size=toolkits&amp;picturename=UKOER%20Twitter%20Community" width="320" height="232" /> <br
/>Embedded image from Flickr using Xpert</p></td></tr></tbody></table><p>Xpert and ImageCodr are great tools but they rely on you going to their site to get the attributed embed code.</p><h4>The solution – guerrilla CC embed</h4><p>A better solution would of course be the host service doing this at source but this <a
href="http://www.flickr.com/groups/flickrideas/discuss/72157606429706091/">thread on Flickr is typical</a> in terms of this being a very low priority if at all. My idea (and it may have been done before – if so leave a note to put me out of my misery) is to develop a browser extension or bookmarklet that when you are getting the embed code from the source site (Flickr, YouTube and Slideshare would be my starters) it automatically modifies the existing code it to include Creative Commons license and attribution.</p><p>For example, here’s how the YouTube embed code looks now:</p><p><img
style="background-image: none; border-bottom: 0px; border-left: 0px; padding-left: 0px; padding-right: 0px; display: inline; border-top: 0px; border-right: 0px; padding-top: 0px" title="image" border="0" alt="Existing YouTube embed code" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image13.png" width="687" height="377" />&#160;&#160;&#160;</p><p>But with my magic black box users would automatically get something like:</p><p><img
style="background-image: none; border-bottom: 0px; border-left: 0px; padding-left: 0px; padding-right: 0px; display: inline; border-top: 0px; border-right: 0px; padding-top: 0px" title="image" border="0" alt="Proposed YouTube embed code" src="http://mcdn.hawksey.info/wp-content/uploads/2012/04/image14.png" width="687" height="377" /></p><p>So to recap your browsing for a video/image/presentation, find one you like and click on the websites embed button. Doing so appends and CC attribution to the embed.</p><p>So what do you think, is this a worthwhile project for me to spend my free time on? Would you be willing to help (it could be anything from code, promotion or testing)?</p><p>Over to you ;)</p><div
id="crp_related"><h3>You also might like :</h3><ul><li><a
href="http://mashe.hawksey.info/2012/04/guest-post-on-creative-commons-uk-adding-creative-commons-to-your-rss-feeds-in-feedburner/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13206');" class="crp_title">Guest Post on Creative Commons UK: Adding Creative Commons to your RSS feeds in Feedburner</a></li><li><a
href="http://mashe.hawksey.info/2012/04/google-docs-insert-cc-images/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13206');" class="crp_title">Google Docs defaults to searching for Creative Commons licensed images. Great, but could they do better?</a></li><li><a
href="http://mashe.hawksey.info/2011/05/work-in-progress-google-spreadsheetsites-flexible-event-booking-form/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13206');" class="crp_title">[Work in Progress] Creating a framework for custom form interfaces using Google Apps Script</a></li><li><a
href="http://mashe.hawksey.info/2009/12/if-outlook-was-my-idea/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13206');" class="crp_title">If Outlook was my idea &hellip;</a></li><li><a
href="http://mashe.hawksey.info/2011/09/embedding-mp3-audio-files-into-google-sites/" rel="bookmark" onClick="javascript: pageTracker._trackPageview('#!/related_click_13206');" class="crp_title">Embedding .mp3 audio files into Google Sites</a></li></ul></div><div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/MASHe?a=pVSZDuBdxOw:YnDPBdOZEBc:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/MASHe?i=pVSZDuBdxOw:YnDPBdOZEBc:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=pVSZDuBdxOw:YnDPBdOZEBc:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/MASHe?d=yIl2AUoC8zA" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=pVSZDuBdxOw:YnDPBdOZEBc:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/MASHe?i=pVSZDuBdxOw:YnDPBdOZEBc:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=pVSZDuBdxOw:YnDPBdOZEBc:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/MASHe?i=pVSZDuBdxOw:YnDPBdOZEBc:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=pVSZDuBdxOw:YnDPBdOZEBc:JUhcmGiK9AQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=JUhcmGiK9AQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=pVSZDuBdxOw:YnDPBdOZEBc:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/MASHe?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=pVSZDuBdxOw:YnDPBdOZEBc:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/MASHe?i=pVSZDuBdxOw:YnDPBdOZEBc:gIN9vFwOqvQ" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=pVSZDuBdxOw:YnDPBdOZEBc:jQG8q3pvyYY"><img src="http://feeds.feedburner.com/~ff/MASHe?d=jQG8q3pvyYY" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/MASHe?a=pVSZDuBdxOw:YnDPBdOZEBc:IqXOmDWthuQ"><img src="http://feeds.feedburner.com/~ff/MASHe?d=IqXOmDWthuQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/MASHe/~4/pVSZDuBdxOw" height="1" width="1"/>]]></content:encoded> <wfw:commentRss>http://mashe.hawksey.info/2012/04/pitch-for-out-of-hours-project-develop-a-chrome-extension-to-include-creative-commons-license-in-embed-code/feed/</wfw:commentRss> <slash:comments>1</slash:comments> <feedburner:origLink>http://mashe.hawksey.info/2012/04/pitch-for-out-of-hours-project-develop-a-chrome-extension-to-include-creative-commons-license-in-embed-code/</feedburner:origLink></item> </channel> </rss><!-- Dynamic page generated in 3.371 seconds. --><!-- Cached page generated by WP-Super-Cache on 2012-05-18 14:40:04 -->

