<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2enclosuresfull.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:media="http://search.yahoo.com/mrss/" xmlns:yt="http://gdata.youtube.com/schemas/2007" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">
   <channel>
      <title>JISC-CRIG Planet</title>
      <description>Selected feeds from around the web searching for a single tag and aggregated into a single feed. The tag (CRIG) is for the Common Repository Interface Group as funded by the Joint Information Systems Committee.  We are improving repositories in the Higher and Further Education Sector over the next eighteen months, til Sept 2009.</description>
      <link>http://pipes.yahoo.com/pipes/pipe.info?_id=IOhBPNqn3BG4kFuCJZhxuA</link>
      <pubDate>Sat, 11 Jul 2009 13:55:35 -0700</pubDate>
      <generator>http://pipes.yahoo.com/pipes/</generator>
      <itunes:explicit>no</itunes:explicit><itunes:subtitle>Selected feeds from around the web searching for a single tag and aggregated into a single feed. The tag (CRIG) is for the Common Repository Interface Group as funded by the Joint Information Systems Committee. We are improving repositories in the Higher </itunes:subtitle><itunes:summary>Selected feeds from around the web searching for a single tag and aggregated into a single feed. The tag (CRIG) is for the Common Repository Interface Group as funded by the Joint Information Systems Committee. We are improving repositories in the Higher and Further Education Sector over the next eighteen months, til Sept 2009.</itunes:summary><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/jisc-crig" type="application/rss+xml" /><feedburner:browserFriendly></feedburner:browserFriendly><item>
         <title>Reflections on the Repositories and Preservation Programme</title>
         <link>http://infteam.jiscinvolve.org/2009/07/06/reflections-on-the-repositories-and-preservation-programme/</link>
         <description>&lt;p&gt;Some of you will recall that back in May, we invited those of you who participated in the JISC Repositories and Preservation Programme #rpmeet (2006-2009) to come along to the Aston Business School to reflect on what the programme achieved, what knowledge it generated, what value it delivered and what we learnt from it.&lt;/p&gt;
&lt;p&gt; Judging by the feedback we got and our own perceptions of the event, it was a highly productive meeting and certainly gave us JISC programme managers food for thought. We thought it would be useful to combine those thoughts with some of the helpful comments we received in the feedback forms and share them with the community in the hope that we might continue to generate commentary and opinion about some of the themes that were raised at the meeting.&lt;/p&gt;
&lt;p&gt; The full document is available on the JISC website at: &lt;a rel="nofollow" target="_blank" href="http://bit.ly/nJQW2"&gt;http://bit.ly/nJQW2&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;The document (5 pages) is in 6 sections and refers to a total of 38 issues. These issues are clustered in sections as follows:&lt;/p&gt;
&lt;p&gt;1. Meeting Planning&lt;/p&gt;
&lt;p&gt;2. Programme Design&lt;/p&gt;
&lt;p&gt;3. Dissemination Issues&lt;/p&gt;
&lt;p&gt;4. Knowledge&lt;/p&gt;
&lt;p&gt;5. Strategic Ideas&lt;/p&gt;
&lt;p&gt;6. Potential Areas of Work&lt;/p&gt;
&lt;p&gt;To give you an idea of the content, the last two sections follow &amp;#8230;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&lt;font face="Arial"&gt;5. Strategic Ideas&lt;/font&gt;&lt;/strong&gt;&lt;font face="Arial"&gt; &lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font face="Arial"&gt;* A small working group might be convened around the subject of ‘research information management’.&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font face="Arial"&gt;* Some further work is required to secure the sustainability of software outputs, where those outputs are designed to be more substantial offerings to the community than simply trials or demonstrators.&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font face="Arial"&gt;* Corporate business requirements (rather than research and learning imperatives) can also drive the success of a repository. They can help clarify the aims of repository work by showcasing implementations, workflows and the amassing of content. There are a number of drivers acting upon institutional repositories and this diversity should be supported.&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font face="Arial"&gt;* There will be an increased requirement to link research outputs with data and this will need collaboration between JISC, Research Councils and HEI’s.&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font face="Arial"&gt;* The top 3 issues discussed in the ideas room, in order of popularity were: &lt;/font&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;font face="Arial"&gt;how do we increase the content in repositories? &lt;/font&gt;&lt;/li&gt;
&lt;li&gt;&lt;font face="Arial"&gt;what does a successful repository look like?&lt;/font&gt;&lt;/li&gt;
&lt;li&gt;&lt;font face="Arial"&gt;what features are missing from repositories?&lt;/font&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;font face="Arial"&gt;Delegates ideas to address these issues were captured in writing and on video and represent important input for JISC in scoping further work.&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font face="Arial"&gt;* Discussion of the preservation and curation of learning materials largely hinges around roles and responsibilities. The drivers to preserve in this area are precautionary rather than for the purposes of active re-use. Preservation specialists may profit from directing their attention to those with administrative rather than teaching responsibilities.&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font face="Arial"&gt;* Text-mining is potentially a powerful core tool for repositories.&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font face="Arial"&gt;* Further exploration is required around the business models and benefits of individual and shared model repositories using cloud based strategies.&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font face="Arial"&gt;* Further development may be needed for repository solutions that support rich local requirements and further integration with the Web.&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font face="Arial"&gt;* Some raised awareness of the issues around quality assessment of repository content may be valuable. National data centres do not accept everything they are offered but institutional repositories operate within a diversity of contexts. It may be sensible for some IR’s to accept all deposits whilst others may be selective. Assessing the quality of research outputs is a complex problem.&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&lt;font face="Arial"&gt;6. Potential Areas of Work&lt;/font&gt;&lt;/strong&gt;&lt;font face="Arial"&gt; &lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font face="Arial"&gt;* Demonstrating the value of embedding the repository into the workflow of research and learning is very important, rather than simply relying on a consensus opinion that it makes sense to do so.&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font face="Arial"&gt;* The startup and enhancement element of the programme has, in some cases, had a demonstrable impact on the status of staff within institutions (i.e. project staff have been made permanent). It may be useful to try and find out how participation in the programme has affected the career and professional prospects of those involved; this is also an issue that could be explored more generally across programmes.&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font face="Arial"&gt;* The repository roadmap is an important document that will support further phases of strategic input; the issue of meaningful measures and metrics regarding proportion, quality, use and other impact is a particular challenge that needs to be further addressed.&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font face="Arial"&gt;* More work needs to be commissioned to help institutions set up internal systems for managing learning materials.&lt;/font&gt;&lt;/p&gt;</description>
         <author>Neil Grindley</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/07/06/reflections-on-the-repositories-and-preservation-programme/</guid>
         <pubDate>Mon, 06 Jul 2009 07:31:01 -0700</pubDate>
      <enclosure url="http://bit.ly/nJQW2" length="37565" type="application/pdf" /><media:content url="http://bit.ly/nJQW2" fileSize="37565" type="application/pdf" /><itunes:subtitle> Some of you will recall that back in May, we invited those of you who participated in the JISC Repositories and Preservation Programme #rpmeet (2006-2009) to come along to the Aston Business School to reflect on what the programme achieved, what knowledg</itunes:subtitle><itunes:author>Neil Grindley</itunes:author><itunes:summary> Some of you will recall that back in May, we invited those of you who participated in the JISC Repositories and Preservation Programme #rpmeet (2006-2009) to come along to the Aston Business School to reflect on what the programme achieved, what knowledge it generated, what value it delivered and what we learnt from it.  Judging by the feedback we got and our own perceptions of the event, it was a highly productive meeting and certainly gave us JISC programme managers food for thought. We thought it would be useful to combine those thoughts with some of the helpful comments we received in the feedback forms and share them with the community in the hope that we might continue to generate commentary and opinion about some of the themes that were raised at the meeting.  The full document is available on the JISC website at: http://bit.ly/nJQW2 The document (5 pages) is in 6 sections and refers to a total of 38 issues. These issues are clustered in sections as follows: 1. Meeting Planning 2. Programme Design 3. Dissemination Issues 4. Knowledge 5. Strategic Ideas 6. Potential Areas of Work To give you an idea of the content, the last two sections follow &amp;#8230; 5. Strategic Ideas  * A small working group might be convened around the subject of ‘research information management’. * Some further work is required to secure the sustainability of software outputs, where those outputs are designed to be more substantial offerings to the community than simply trials or demonstrators. * Corporate business requirements (rather than research and learning imperatives) can also drive the success of a repository. They can help clarify the aims of repository work by showcasing implementations, workflows and the amassing of content. There are a number of drivers acting upon institutional repositories and this diversity should be supported. * There will be an increased requirement to link research outputs with data and this will need collaboration between JISC, Research Councils and HEI’s. * The top 3 issues discussed in the ideas room, in order of popularity were: how do we increase the content in repositories? what does a successful repository look like? what features are missing from repositories? Delegates ideas to address these issues were captured in writing and on video and represent important input for JISC in scoping further work. * Discussion of the preservation and curation of learning materials largely hinges around roles and responsibilities. The drivers to preserve in this area are precautionary rather than for the purposes of active re-use. Preservation specialists may profit from directing their attention to those with administrative rather than teaching responsibilities. * Text-mining is potentially a powerful core tool for repositories. * Further exploration is required around the business models and benefits of individual and shared model repositories using cloud based strategies. * Further development may be needed for repository solutions that support rich local requirements and further integration with the Web. * Some raised awareness of the issues around quality assessment of repository content may be valuable. National data centres do not accept everything they are offered but institutional repositories operate within a diversity of contexts. It may be sensible for some IR’s to accept all deposits whilst others may be selective. Assessing the quality of research outputs is a complex problem. 6. Potential Areas of Work  * Demonstrating the value of embedding the repository into the workflow of research and learning is very important, rather than simply relying on a consensus opinion that it makes sense to do so. * The startup and enhancement element of the programme has, in some cases, had a demonstrable impact on the status of staff within institutions (i.e. project staff have been made permanent). It may be useful to try and find out how participation in the programme has affected the career and professional prospects of </itunes:summary></item>
      <item>
         <title>Direct from MS Word to DSpace via SWORD</title>
         <link>http://feedproxy.google.com/~r/stuartlewis/~3/mUx976tduIE/</link>
         <description>As a member of the SWORD project, it has been a great seeing Microsoft&amp;#8217;s External Research group integrate SWORD into Word 2007, their Zentity repository, and their online journal hosting system. There is a good overview of this work in a presentation given by Pablo Fernicola at the Open Repositories 2009 conference entitled &amp;#8216;Connecting Authors and Repositories [...]</description>
         <guid isPermaLink="false">http://blog.stuartlewis.com/?p=379</guid>
         <pubDate>Sat, 04 Jul 2009 01:23:12 -0700</pubDate>
         <content:encoded><![CDATA[<p>As a member of the <a rel="nofollow" target="_blank" href="http://www.swordapp.org/">SWORD project</a>, it has been a great seeing <a rel="nofollow" target="_blank" href="http://research.microsoft.com/en-us/collaboration/about/default.aspx">Microsoft&#8217;s External Research group</a> integrate SWORD into <a rel="nofollow" target="_blank" href="http://www.microsoft.com/word/">Word 2007</a>, their <a rel="nofollow" target="_blank" href="http://research.microsoft.com/en-us/projects/zentity/">Zentity</a> repository, and their <a rel="nofollow" target="_blank" href="http://journal.mssandbox.net/">online journal hosting system</a>. There is a good overview of this work in a presentation given by <a rel="nofollow" target="_blank" href="http://research.microsoft.com/en-us/people/pablofe/">Pablo Fernicola</a> at the <a rel="nofollow" target="_blank" href="http://or09.library.gatech.edu/">Open Repositories 2009</a> conference entitled &#8216;<a rel="nofollow" target="_blank" href="http://research.microsoft.com/en-us/um/redmond/projects/authoring/090518-pablofe-connecting%20authors%20and%20repositories%20through%20sword.pptx">Connecting Authors and Repositories Through SWORD</a>&#8216;.</p>
<p>This blog post is about the functionality I have added to <a rel="nofollow" target="_blank" href="http://www.dspace.org/">DSpace</a> to allow it to accept deposits from within Microsoft Word using SWORD.</p>
<p>If you are unaware of the authoring add-in, then before reading the rest of this blog, take a look at Pablo&#8217;s YouTube video &#8216;Integrating with repositories and journal submissions&#8217; at <a rel="nofollow" target="_blank" href="http://www.youtube.com/watch?v=2_M2gfUyVzU">http://www.youtube.com/watch?v=2_M2gfUyVzU</a>. The video explains the authoring add-in, so I&#8217;ll not duplicate that information in this blog post. The rest of this post explains how I extended DSpace to work with the add-in&#8230;</p>
<p>In order for DSpace to be able to ingest a package, it needs an ingester that understands the format and knows how to unpack it and extract the metadata and file(s). In the case of .docx files created by Microsoft Word, it needs to know how to extract the metadata from within the file, and to archive the file as-is. This is a pretty easy task as a .docx file is actually just a zip file (try renaming it from .docx to .zip and then take a peek inside!). So I wrote an ingester than unzips the file, extracts the <a rel="nofollow" target="_blank" href="http://www.nlm.nih.gov/tsd/cataloging/metafilenew.html">NLM metadata</a> that the add-in inserted in the file, and then creates a new DSpace item with that metadata. Finally it adds the complete .docx file as a bitstream for people to download.</p>
<p>Some of the metadata such as the authors identities are held in the .docx file is held in the customXml/item*.xml files, and other parts such as the article title and abstract are held in the actual document contents in word/document.xml. The ingester extracts these values for use in the new DSpace item.</p>
<pre>
&lt;w:t&gt;Add an S to Microsoft Word and you get SWORD&lt;/w:t&gt;
</pre>
<pre>
&lt;my:name.&gt;
&lt;my:name.content-type.datatypeattribute.attribute.&gt;&lt;/my:name.content-type.datatypeattribute.attribute.&gt;
&lt;my:name.name-style.datatypeattribute.attribute.&gt;&lt;/my:name.name-style.datatypeattribute.attribute.&gt;
&lt;my:surname.&gt;Lewis&lt;/my:surname.&gt;
&lt;my:given-names.&gt;Stuart&lt;/my:given-names.&gt;
&lt;/my:name.&gt;
</pre>
<p>I then configured the DSpace ingesters to use the docx ingester when it encountered .docx files:</p>
<blockquote><p>plugin.named.org.dspace.content.packager.PackageIngester = &#92;<br />
org.dspace.content.packager.PDFPackager = Adobe PDF, PDF, &#92;<br />
org.dspace.content.packager.DSpaceMETSIngester = METS, &#92;<br />
org.dspace.content.packager.DSpaceDocxIngester = DOCX</p></blockquote>
<p>I then configured the SWORD package to expose the fact that it supported .docx files in its SWORD service document:</p>
<blockquote><p>sword.accept-packaging.Docx.identifier = application/vnd.openxmlformats-officedocument.wordprocessingml.document<br />
sword.accept-packaging.Docx.q = 1.0</p></blockquote>
<p>Finally the DSpace SWORD interface needed to know which packager to use for .docx files based on their MIME type:</p>
<blockquote><p>plugin.named.org.dspace.sword.SWORDIngester = &#92;<br />
org.dspace.sword.SWORDMETSIngester = http://purl.org/net/sword-types/METSDSpaceSIP &#92;<br />
org.dspace.sword.SimpleFileIngester = SimpleFileIngester &#92;<br />
org.dspace.sword.DocxIngester = application/vnd.openxmlformats-officedocument.wordprocessingml.document</p></blockquote>
<p>All that is needed to use this is a copy of the authoring add-in (<a rel="nofollow" target="_blank" href="http://research.microsoft.com/en-us/projects/authoring/">http://research.microsoft.com/en-us/projects/authoring/</a>), and a suitable formatted template for the repository that you wish to deposit the document into (<a rel="nofollow" target="_blank" href="http://blog.stuartlewis.com/wp-content/uploads/2009/06/dspace-swordapp-org.docx">dspace-swordapp-org.docx</a>). The template is preconfigured to deposit directly into the <a rel="nofollow" target="_blank" href="http://dspace.swordapp.org/">DSpace SWORD demo repository</a> which I have upgraded with the new code to accept .docx deposits. Feel free to create an account in that repository, install the add-in, load the template, and try out a deposit!</p>
<p>This complete end to end process allows you to create Word templates, and to mark them up with required and optional fields. It also allows you to embed details of the SWORD deposit repository URL (so the users do not need to know what it is) within the template for easy deposit. This could be used for example for a journal editor to provide a template <em>and</em> a deposit location for new paper submissions all-in-one. And this use case could be extended: for example if a faculty member wants all their students to submit an assignment with a template, they could do so and use the repository as the end point rather than a traditional VLE. And unlike a VLE, the repository will probably provide search and indexing facilities across the deposited documents. I&#8217;m sure as this tool gets used more, there will be a lot of new ideas for how it can be used.</p>
<p>Comments welcome! <img src='http://blog.stuartlewis.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley'/> </p>]]></content:encoded>
      <enclosure url="http://research.microsoft.com/en-us/um/redmond/projects/authoring/090518-pablofe-connecting%20authors%20and%20repositories%20through%20sword.pptx" length="70525" type="application/vnd.openxmlformats-officedocument.presentationml.presentation" /><media:content url="http://research.microsoft.com/en-us/um/redmond/projects/authoring/090518-pablofe-connecting%20authors%20and%20repositories%20through%20sword.pptx" fileSize="70525" type="application/vnd.openxmlformats-officedocument.presentationml.presentation" /><itunes:subtitle>As a member of the SWORD project, it has been a great seeing Microsoft&amp;#8217;s External Research group integrate SWORD into Word 2007, their Zentity repository, and their online journal hosting system. There is a good overview of this work in a presentati</itunes:subtitle><itunes:summary>As a member of the SWORD project, it has been a great seeing Microsoft&amp;#8217;s External Research group integrate SWORD into Word 2007, their Zentity repository, and their online journal hosting system. There is a good overview of this work in a presentation given by Pablo Fernicola at the Open Repositories 2009 conference entitled &amp;#8216;Connecting Authors and Repositories [...]</itunes:summary></item>
      <item>
         <title>Do library catalogues and repositories talk to each other?</title>
         <link>http://infteam.jiscinvolve.org/2009/07/02/do-library-catalogues-and-repositories-talk-to-each-other/</link>
         <description>&lt;p&gt;The Centre for Digital Library Research at the University of Strathclyde is currently investigating the links between university library catalogues and digital repositories as part of a JISC funded study.&lt;/p&gt;
&lt;p&gt;Can library users find resources in their university&amp;#8217;s digital repository through the library catalogue? Do library catalogues and repositories share records for the same items?&lt;/p&gt;
&lt;p&gt;If catalogues and repositories generally aren&amp;#8217;t linked in these sorts of ways at the moment, could they be?&lt;/p&gt;
&lt;p&gt;These are some of the issues being explored by the study. The team are surveying repository managers and others about now - so, if you receive a request to respond to their online survey, your response would be much appreciated.&lt;/p&gt;
&lt;p&gt;For further information about the study, please visit the &lt;a rel="nofollow" target="_blank" href="http://cdlr.strath.ac.uk/ocris/index.htm"&gt;project&amp;#8217;s Web site&lt;/a&gt; and the project&amp;#8217;s &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/projects/scholcommslinks.aspx"&gt;Web page on the JISC site&lt;/a&gt;.&lt;/p&gt;</description>
         <author>Ben Wynne</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/07/02/do-library-catalogues-and-repositories-talk-to-each-other/</guid>
         <pubDate>Thu, 02 Jul 2009 06:27:38 -0700</pubDate>
      </item>
      <item>
         <title>How do people use electronic information resources?</title>
         <link>http://infteam.jiscinvolve.org/2009/07/02/how-do-people-use-electronic-information-resources/</link>
         <description>&lt;p&gt;Research funded by the JISC, RIN and others over recent years has helped to increase understanding of how students and researchers use electronic information resources. Analysis of Web logs - such as the work done for the &lt;a rel="nofollow" target="_blank" href="http://www.jiscebooksproject.org/deep-log-analysis"&gt;e-Books Observatory Study&lt;/a&gt; by CIBER at UCL - has proved a fruitful line of inquiry.&lt;/p&gt;
&lt;p&gt;A new study - which has now been underway for a few months (so apologies for this late post) - seeks to add to this evidence through detailed observation of how individual students and researchers in Business and Economics use a number of information resources in their area (such as Business Source Premier).&lt;/p&gt;
&lt;p&gt;The aim is to observe how individuals react to and use particular interfaces and then to explore those behaviours through structured interviews.&lt;/p&gt;
&lt;p&gt;The work is being conducted by Middlesex University and is being complemented by an analysis of Web logs for a selection of Business and Economics e-books and e-journals by CIBER.&lt;/p&gt;
&lt;p&gt;A report of the findings is expected during the autumn.&lt;/p&gt;
&lt;p&gt;For further information, please visit the &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/inf11/userbehaviourbusandecon.aspx"&gt;project Web page&lt;/a&gt; on the JISC Web site.&lt;/p&gt;</description>
         <author>Ben Wynne</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/07/02/how-do-people-use-electronic-information-resources/</guid>
         <pubDate>Thu, 02 Jul 2009 06:08:28 -0700</pubDate>
      </item>
      <item>
         <title>From Search Engine to Twitter Optimisation</title>
         <link>http://ukwebfocus.wordpress.com/2009/06/29/from-search-engine-to-twitter-optimisation/</link>
         <description>Workshops on Search Engine Optimisation (SEO)
As described on the JISC Digitisation blog the Strategic Content Alliance (SCA) are running a series of free workshops entitled &amp;#8220;Improve your online presence&amp;#8220;. The workshop series, which will be held in June and July in London, Edinburgh, Cardiff and Belfast, is being coordinated by Netskills. The workshops will &amp;#8220;introduce [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ukwebfocus.wordpress.com&amp;blog=497535&amp;post=2841&amp;subd=ukwebfocus&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://ukwebfocus.wordpress.com/?p=2841</guid>
         <pubDate>Mon, 29 Jun 2009 02:57:55 -0700</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><h2>Workshops on Search Engine Optimisation (SEO)</h2>
<p>As <a rel="nofollow" target="_blank" href="http://digitisation.jiscinvolve.org/2009/06/05/improve-your-online-presence-free-workshops/">described on the JISC Digitisation blog</a> the Strategic Content Alliance (SCA) are running a series of <strong>free</strong> workshops entitled &#8220;<a rel="nofollow" title="Permanent Link to "Improve your online presence: free workshops"" target="_blank" href="http://sca.jiscinvolve.org/2009/06/03/improve-your-online-presence-free-workshops/">Improve your online presence</a>&#8220;. The workshop series, which will be held in June and July in London, Edinburgh, Cardiff and Belfast, is being coordinated by Netskills. The workshops will &#8220;<em>introduce simple and inexpensive search engine optimisation techniques to improve your online presence, web visibility and website traffic</em>&#8220;. I will be contributing to the workshop content by running a session on the role of the Social Web in enhancing access to scholarly and cultural content.</p>
<h2>The Potential of Twitter</h2>
<p>The potential of Twitter was recently discussed in a post entitled <a rel="nofollow" id="posttitle9670870828" target="_blank" href="http://www.guardian.co.uk/technology/blog/2009/may/31/twitter-suggested-users">How much is it worth to be one of Twitter&#8217;s suggested users?</a> which was published in the Guardian&#8217;s Technology blog. As described in this post, being included in Twitter&#8217;s Suggested Users List can boost one&#8217;s numbers of followers, and thus traffic to links included in the tweets being published.</p>
<p>Coincidentally on Friday 5<sup>th</sup> June 2009, whilst accessing this blog&#8217;s administrators interface in order to delete one or two spam comments which had failed to be detected by the Akisimet spam filter, I noticed that the top three referrers for the day were from the Twitter Web site (from Twitter.com, twitter.com/home and twitter.com/Twitter_tops). On further investigation I discovered that a page on the Twitter Web site which provides links to resources about use of Twitter <a rel="nofollow" target="_blank" href="http://twitter.com/Twitter_Tips/status/2042873204">had included the following link</a> to a post on this blog:</p>
<p style="padding-left:30px;"><em>What is Twitter? It’s An Interactive Business Card: </em><a rel="nofollow" target="_blank" href="http://cli.gs/YL6R4D"><em>http://cli.gs/YL6R4D</em></a><em> &#8211;Share this article: </em><a rel="nofollow" target="_blank" href="http://bit.ly/180g9w"><em>http://bit.ly/180g9w</em></a></p>
<p>Now although this link resulted in driving the most traffic to the blog in over 3 weeks, this was disappointing to me. I had been after evidence that Twitter can provide successful in driving traffic to arbitrary resources, rather than just traffic to an article about the Twitter service.</p>
<p><img class="alignright size-full wp-image-2848" title="Referrer statistics for UKOLN's Cultural Heritage blog, May 2009" src="http://ukwebfocus.files.wordpress.com/2009/06/cultural-heritage-blog-referrers-may-2009.png?w=276&#038;h=161" alt="Referrer statistics for UKOLN's Cultural Heritage blog, May 2009" width="276" height="161"/>However a better example was provided by the blog statistics for <a rel="nofollow" target="_blank" href="http://blogs.ukoln.ac.uk/cultural-heritage/">UKOLN&#8217;s Cultural Heritage blog</a>. As illustrated the statistics for May 2009 showed that, after Google, the second most popular Web site for driving traffic to the blog was Twitter.</p>
<p>In this particular example the most popular post in the month was one on <a rel="nofollow" target="_blank" href="http://blogs.ukoln.ac.uk/cultural-heritage/2009/05/21/explaining-the-risks-and-opportunities-framework/">Explaining the Risks and Opportunities Framework</a>- a blog post which was <a rel="nofollow" target="_blank" href="http://twitter.com/briankelly/status/1872225192">announced on Twitter</a> at 08.55 on 21<sup>st</sup> May:</p>
<p style="padding-left:30px;"><em>Blog post explaining the Risks &amp; Opportunities Framework published at </em><a rel="nofollow" target="_blank" href="http://tinyurl.com/p72kld"><em>http://tinyurl.com/p72kld</em></a></p>
<p>Evidence, it would seem, that Twitter can enhance the visibility of one&#8217;s Web content and therefore provide an example I can use in the workshop. But what of the dangers of using Twitter in this way? Might not Twitter followers resent being used as fodder for marking materials? Isn&#8217;t there a danger of killing the goose that lays the golden eggs?</p>
<h2>Twitter Optimisation</h2>
<p>Although some people regard Twitter as being essentially an informal communications channel and a tool for community building we can now observe that it is being used for a much wider variety of purposes. But what are the emerging best practices which one should adopt in order to optimise Twitter&#8217;s potential to maximise access to &#8217;stuff&#8217; out there, as opposed to engaging with one&#8217;s Twitter community?</p>
<p style="padding-left:30px;"><strong>Keep it short</strong>: Perhaps the best advice is to keep your tweets short to allow other to retweet (RT) the message, perhaps including their own comments.</p>
<p style="padding-left:30px;"><strong>Acknowledge the limitations</strong>: If you do intend to use Twitter as a one -way publishing mechanism (as, for example, <a rel="nofollow" target="_blank" href="http://twitter.com/mla_gov">the MLA does</a>) then you need to recognise that you should not expect to gain the benefits which fans of Twitter, as described in a post entitled &#8220;<a rel="nofollow" target="_blank" href="http://electronicmuseum.org.uk/2009/02/06/the-person-is-the-point/">The person is the point</a>&#8221; by Mike Ellis, feel they gain from its use as an individual.</p>
<p style="padding-left:30px;"><strong>Consider publishing a policy</strong>: You may also wish to consider having a policy covering your use of Twitter, as described in a recent post on &#8220;<a rel="nofollow" title="Permanent Link to Emerging Best Practices For Institutional Use of Twitter" target="_blank" href="http://blogs.ukoln.ac.uk/cultural-heritage/2009/06/08/emerging-best-practices-for-institutional-use-of-twitter/">Emerging Best Practices For Institutional Use of Twitter</a>&#8220;.</p>
<p style="padding-left:30px;"><strong>Think about your followers</strong>: If you are using Twitter as an individual but also wish to promote areas of your work you will need to consider the balance between engagement (chatting with your mates), support (helping your mates), requests (asking your mates for held) and dissemination (telling your mates what you&#8217;ve being doing and what you&#8217;re proud of). This was an area I addressed in a post on &#8220;<a rel="nofollow" target="_blank" href="http://ukwebfocus.wordpress.com/2009/02/23/twitter-can-pimp-up-your-stuff-but-should-it/">Twitter Can Pimp Up Your Stuff – But Should It?</a>&#8220;.</p>
<p>And if you&#8217;re still sceptical that Twitter has any significant role in delivering traffic to a Web site I&#8217;d suggest you read the TechCrunch article &#8220;<a rel="nofollow" target="_blank" href="http://www.techcrunch.com/2009/06/14/for-techcrunch-twitter-traffic-a-statistical-breakdown/">For TechCrunch, Twitter = Traffic (A Statistical Breakdown)</a>&#8220;.</p> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/ukwebfocus.wordpress.com/2841/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ukwebfocus.wordpress.com/2841/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/ukwebfocus.wordpress.com/2841/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ukwebfocus.wordpress.com/2841/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/ukwebfocus.wordpress.com/2841/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ukwebfocus.wordpress.com/2841/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/ukwebfocus.wordpress.com/2841/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ukwebfocus.wordpress.com/2841/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/ukwebfocus.wordpress.com/2841/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ukwebfocus.wordpress.com/2841/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ukwebfocus.wordpress.com&blog=497535&post=2841&subd=ukwebfocus&ref=&feed=1"/></div>]]></content:encoded>
         <media:content url="http://0.gravatar.com/avatar/27731abff266f585f006998f65c74be9?s=96&amp;amp;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
            <media:title>Brian Kelly (UK Web Focus)</media:title>
         </media:content>
         <media:content url="http://ukwebfocus.files.wordpress.com/2009/06/cultural-heritage-blog-referrers-may-2009.png" medium="image">
            <media:title>Referrer statistics for UKOLN's Cultural Heritage blog, May 2009</media:title>
         </media:content>
         <category>Twitter</category>
      </item>
      <item>
         <title>Sharing citations?</title>
         <link>http://infteam.jiscinvolve.org/2009/06/26/sharing-citations/</link>
         <description>&lt;p&gt;An international workshop in Amsterdam in March, funded by JISC, SURF and DRIVER, discussed work needed to improve interoperability between repositories. Four areas of work were focused upon:&lt;br /&gt;
- citation services&lt;br /&gt;
- interoperable identification systems&lt;br /&gt;
- repository handshaking (interoperable deposit systems), and&lt;br /&gt;
- repository organisation (supporting repositories around the world)&lt;br /&gt;
There&amp;#8217;s more information and an update here:&lt;br /&gt;
&lt;a rel="nofollow" target="_blank" href="http://repinf.pbworks.com/"&gt;http://repinf.pbworks.com/&lt;/a&gt;&lt;br /&gt;
There is also an option to sign up to this wiki / community.&lt;/p&gt;
&lt;p&gt;Specifically now, there is a proposal for internationally coordinated work to enable repositories of OA research papers to share the citations therein. This is currently out for public review, and your comments would be most welcome. Please see the proposal here:&lt;br /&gt;
&lt;a rel="nofollow" target="_blank" href="http://repinf.pbworks.com/Citation-Services-draft-project-proposal"&gt;http://repinf.pbworks.com/Citation-Services-draft-project-proposal&lt;/a&gt;&lt;br /&gt;
There are some 15-20 funders of repositories infrastructure around the world (like JISC in the UK) also looking at this proposal, with a view to funding the work it describes.&lt;/p&gt;
&lt;p&gt;Thanks&lt;br /&gt;
Neil&lt;/p&gt;</description>
         <author>Neil Jacobs</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/06/26/sharing-citations/</guid>
         <pubDate>Fri, 26 Jun 2009 05:46:34 -0700</pubDate>
      </item>
      <item>
         <title>Towards the academic library of the future - expressions of interest by 8 July 09</title>
         <link>http://infteam.jiscinvolve.org/2009/06/26/the-future-of-the-academic-library-expressions-of-interest-by-8-july-09/</link>
         <description>&lt;p&gt;We&amp;#8217;re working in partnership with SCONUL, RIN, RLUK and the British Library on an initiative:&lt;/p&gt;
&lt;p&gt;•	to gather and analyse evidence about current and future trends that are likely to have an impact on the future of academic and research libraries in the UK; and&lt;br /&gt;
•	to formulate strategic responses to help libraries respond creatively to key changes in the wider environment, so that they continue to develop and sustain effective levels and types of information services to support students and researchers in the HE and related sectors. &lt;/p&gt;
&lt;p&gt;We&amp;#8217;re currently seeking expressions of interest to lead and manage the activity which will use a variety of futures tools and techniques, such as horizon-scanning; trend and driver analysis; scenario analysis; attractiveness grids; and/or value chain analysis. Prospective consultants may wish to suggest other approaches. Whatever approaches are adopted, a key feature of the programme should be that seeks the active participation of as wide a range as possible of people in the library and information sectors, along with users and other stakeholders from across the higher education and research sectors, and their engagement in helping to&lt;br /&gt;
•	identify, review and assess key trends and underlying drivers likely to affect scholarly libraries, and their use by students and researchers in the years ahead;&lt;br /&gt;
•	formulate and test scenarios to assess current policies, processes and services, the likely impact of future developments, and the issues and choices facing decision-makers; and&lt;br /&gt;
•	draft a series of authoritative papers setting out possible futures for library and information services in supporting institutional teaching, learning and research strategies, and the steps required if those futures are to be realised.&lt;/p&gt;
&lt;p&gt;The success of the initiative will depend on securing contributions from people with a wide range of views and perspectives who are prepared to experiment with ideas, to challenge existing assumptions, to ask awkward questions, and to test possible impacts. &lt;/p&gt;
&lt;p&gt;The contract let will be funded up to a level of £120,000 (exlcuding VAT) and will run from September 2009 to April 2011.&lt;/p&gt;
&lt;p&gt;Expressing interest:&lt;br /&gt;
Expressions of interest should simply take the form of an email indicating a name and full contact details. Following the outcome of this call, the RIN will issue an ITT to those who have expressed an interest. A full project specification and tendering requirements will then be issued.&lt;/p&gt;
&lt;p&gt;The deadline for expressions of interest is 8 July 2009.&lt;/p&gt;
&lt;p&gt;Expressions of interest should be directed to Aaron Griffiths (aaron.griffiths@rin.ac.uk). Please also contact Aaron is you have any questions relating to this work. &lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://www.rin.ac.uk/library-future-call"&gt;&lt;a rel="nofollow" target="_blank" href="http://www.rin.ac.uk/library-future-call"&gt;Further information&lt;/a&gt;.&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/campaigns/librariesofthefuture.aspx"&gt;JISC has recently taken forward a campaign to help raise some of the related issues.&lt;/a&gt;&lt;/p&gt;</description>
         <author>Rachel Bruce</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/06/26/the-future-of-the-academic-library-expressions-of-interest-by-8-july-09/</guid>
         <pubDate>Thu, 25 Jun 2009 18:45:47 -0700</pubDate>
      </item>
      <item>
         <title>Zentity @ the &amp;ldquo;Repository Systems&amp;rdquo; workshop and Glastonbury</title>
         <link>http://savas.me/blog/976</link>
         <description>&lt;div&gt;&lt;p&gt;I am writing this while on a plane to London and then to Frankfurt. I am going to Germany to visit relatives for a day, close to Manheim, before heading to Munich to give a talk about &lt;a rel="nofollow" target="_blank" href="http://research.microsoft.com/en-us/projects/zentity/"&gt;Zentity&lt;/a&gt; at the “&lt;a rel="nofollow" target="_blank" href="http://escience.mpg.de/thema092_en.html"&gt;Repository Systems&lt;/a&gt;” workshop. Then, on Thursday I am going to flight back to London and make my way for this year’s &lt;a rel="nofollow" target="_blank" href="http://www.glastonburyfestivals.co.uk/"&gt;Glastonbury&lt;/a&gt; :-) Woooo hooo! I can’t wait!&lt;/p&gt; &lt;p&gt;Unfortunately, &lt;a rel="nofollow" target="_blank" href="http://www.cs.man.ac.uk/~carole/"&gt;Carole&lt;/a&gt; is not going to be there this year :-( but I am hoping to catch up with &lt;a rel="nofollow" target="_blank" href="http://www.soton.ac.uk/~dder/"&gt;David&lt;/a&gt;. I am soooooo looking forward to all the bands, the dancing, the comedy, the circus, the clubs, the ethnic food, the atmosphere, the vibe. I hope it’s not going to rain and that I will not damage my camera before I even get there, like I did last year.&lt;/p&gt;&lt;/div&gt;</description>
         <guid isPermaLink="false">http://savas.me/blog/976</guid>
         <pubDate>Tue, 23 Jun 2009 19:36:13 -0700</pubDate>
      </item>
      <item>
         <title>Zentity @ the &amp;ldquo;Repository Systems&amp;rdquo; workshop and Glastonbury</title>
         <link>http://savas.me/blog/976</link>
         <description>&lt;div&gt;&lt;p&gt;I am writing this while on a plane to London and then to Frankfurt. I am going to Germany to visit relatives for a day, close to Manheim, before heading to Munich to give a talk about &lt;a rel="nofollow" target="_blank" href="http://research.microsoft.com/en-us/projects/zentity/"&gt;Zentity&lt;/a&gt; at the “&lt;a rel="nofollow" target="_blank" href="http://escience.mpg.de/thema092_en.html"&gt;Repository Systems&lt;/a&gt;” workshop. Then, on Thursday I am going to flight back to London and make my way for this year’s &lt;a rel="nofollow" target="_blank" href="http://www.glastonburyfestivals.co.uk/"&gt;Glastonbury&lt;/a&gt; :-) Woooo hooo! I can’t wait!&lt;/p&gt; &lt;p&gt;Unfortunately, &lt;a rel="nofollow" target="_blank" href="http://www.cs.man.ac.uk/~carole/"&gt;Carole&lt;/a&gt; is not going to be there this year :-( but I am hoping to catch up with &lt;a rel="nofollow" target="_blank" href="http://www.soton.ac.uk/~dder/"&gt;David&lt;/a&gt;. I am soooooo looking forward to all the bands, the dancing, the comedy, the circus, the clubs, the ethnic food, the atmosphere, the vibe. I hope it’s not going to rain and that I will not damage my camera before I even get there, like I did last year.&lt;/p&gt;&lt;/div&gt;</description>
         <guid isPermaLink="false">http://savas.me/blog/976</guid>
         <pubDate>Tue, 23 Jun 2009 19:36:13 -0700</pubDate>
      </item>
      <item>
         <title>Zentity @ the &amp;ldquo;Repository Systems&amp;rdquo; workshop and Glastonbury</title>
         <link>http://savas.me/blog/976</link>
         <description>&lt;div&gt;&lt;p&gt;I am writing this while on a plane to London and then to Frankfurt. I am going to Germany to visit relatives for a day, close to Manheim, before heading to Munich to give a talk about &lt;a rel="nofollow" target="_blank" href="http://research.microsoft.com/en-us/projects/zentity/"&gt;Zentity&lt;/a&gt; at the “&lt;a rel="nofollow" target="_blank" href="http://escience.mpg.de/thema092_en.html"&gt;Repository Systems&lt;/a&gt;” workshop. Then, on Thursday I am going to flight back to London and make my way for this year’s &lt;a rel="nofollow" target="_blank" href="http://www.glastonburyfestivals.co.uk/"&gt;Glastonbury&lt;/a&gt; :-) Woooo hooo! I can’t wait!&lt;/p&gt; &lt;p&gt;Unfortunately, &lt;a rel="nofollow" target="_blank" href="http://www.cs.man.ac.uk/~carole/"&gt;Carole&lt;/a&gt; is not going to be there this year :-( but I am hoping to catch up with &lt;a rel="nofollow" target="_blank" href="http://www.soton.ac.uk/~dder/"&gt;David&lt;/a&gt;. I am soooooo looking forward to all the bands, the dancing, the comedy, the circus, the clubs, the ethnic food, the atmosphere, the vibe. I hope it’s not going to rain and that I will not damage my camera before I even get there, like I did last year.&lt;/p&gt;&lt;/div&gt;</description>
         <guid isPermaLink="false">http://savas.me/blog/976</guid>
         <pubDate>Tue, 23 Jun 2009 15:36:13 -0700</pubDate>
      </item>
      <item>
         <title>Launch of ‘The Edgeless University’: a new Demos report</title>
         <link>http://ukwebfocus.wordpress.com/2009/06/23/launch-of-the-edgeless-university-a-new-demos-report/</link>
         <description>A report entitled &amp;#8220;The Edgeless University: why Higher Education Must Embrace Technology&amp;#8221; was launched earlier today. As described on the JISC Web site:
The Edgeless University argues that technology in higher education is not just about virtual learning environments, but is increasingly central to the way institutions provide learning and facilitate research. Technology is making research [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ukwebfocus.wordpress.com&amp;blog=497535&amp;post=3015&amp;subd=ukwebfocus&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://ukwebfocus.wordpress.com/?p=3015</guid>
         <pubDate>Tue, 23 Jun 2009 05:48:55 -0700</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><p>A report entitled &#8220;<a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/events/2009/06/demoslaunch.aspx">The Edgeless University: why Higher Education Must Embrace Technology</a>&#8221; was launched earlier today. As described on the JISC Web site:</p>
<p style="padding-left:30px;"><strong>The Edgeless University </strong>argues that technology in higher education is not just about virtual learning environments, but is increasingly central to the way institutions provide learning and facilitate research. Technology is making research and learning possible in new places, often outside of institutions. Far from undermining them, this is creating exciting opportunities for universities to demonstrate and capitalise on their value so will take strategic leadership from inside institutions, new connections with a growing world of informal learning, and a commitment to openness and collaboration. This is the radical role of <em>The Edgeless University</em>.</p>
<p>I haven&#8217;t yet had a chance to fully absorb this 90 page report but there were a number of aspects to the report which reflect my areas of interest. I should first disclose, however, that I contributed to the report (Peter Bradwell, author of this DEMOS report, was aware of my work in this area and invited me to give my views).</p>
<p style="padding-left:30px;"><strong>The need for fundamental changes in the higher educational sector</strong>: The report describes the comment made by one participant at a roundtable meeting who described the current predicament of the higher education sector: ‘<em>This seminar feels a bit like sitting with a group of record industry executives in 1999</em>’. The report went on to say &#8220;<em>It is no use lamenting the golden age of universities (or record companies). The goals of the two ‘industries’ remain the same, but they must refocus on how to achieve them. Society’s aspirations for the sector remain the same. The challenge for institutions is to find the way to do it.</em>&#8220;</p>
<p style="padding-left:30px;"><strong>The need to understand changing student expectations</strong>: The report quoted an interviewee who said &#8220;<em>Technology is part of people’s daily life in a university, I would say everywhere except in the classroom</em>&#8221; in order to illustrate the need for institutions to &#8220;<em>get better at understanding exactly what it is these students need</em>&#8221; .</p>
<p style="padding-left:30px;"><strong>New tools to support teaching</strong>: It was interesting to note that the report, in a section on how social media tools can support collaborative teaching described Michael Wesch&#8217;s work at the University of Kansas in the US in using using online tools for collaborative and team-based student coursework including tools such as sites such as Netvibes, Yahoo Pipes and Diigo. Although I&#8217;m pleased to see Web 2.0 tools being highlighted in the report, it was somewhat strange to see a US-based example of use of these fairly mainstream tools. Aren&#8217;t there similar examples to be found in UK HEIs?</p>
<p style="padding-left:30px;"><strong>&#8220;</strong><strong>A renewed commitment to openness</strong><strong>&#8220;</strong>: The report includes a section with this title. The opening quotation for the section &#8220;<em>Science is as much about conversations in corridors as it is about papers in journals</em>&#8221; strikes me as summarising the benefits which the Social Web can provide for the research community. However this section seems to focus more on the ease of access provided by tools such as Scribd and iTunesU rather than the issues if open acces and open data.</p>
<p style="padding-left:30px;"><strong>&#8220;Experimentation and investment</strong><strong></strong><strong>&#8220;</strong>: I was p[particularly pleased to see that JISC Developer Happy Days’ (Dev8D) being mentioned as an "event brought together communities of coders and users from educational software and beyond" with the aim of "<em>mix[ing] people interested in civic society with those who have the skills to develop tools to encourage social change</em>&#8220;. Dave Flanders (now of JISC) will be pleased to see that his work in bringing together a set of developers has been appreciated in this report.</p>
<p>A few weeks ago the &#8220;<a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/publications/documents/heweb2.aspx">Higher Education in a Web 2.0 World</a>&#8221; report was published. And today we see another report which provides a similar top-down view on the importance of Web 2.0 in higher education. If you encounter resistance to change from senior managers in your institution I&#8217;d suggest you beat them over the head with these two report until they realise that Web 2.0 is changing the higher educational environment.</p> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/ukwebfocus.wordpress.com/3015/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ukwebfocus.wordpress.com/3015/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/ukwebfocus.wordpress.com/3015/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ukwebfocus.wordpress.com/3015/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/ukwebfocus.wordpress.com/3015/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ukwebfocus.wordpress.com/3015/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/ukwebfocus.wordpress.com/3015/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ukwebfocus.wordpress.com/3015/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/ukwebfocus.wordpress.com/3015/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ukwebfocus.wordpress.com/3015/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ukwebfocus.wordpress.com&blog=497535&post=3015&subd=ukwebfocus&ref=&feed=1"/></div>]]></content:encoded>
         <media:content url="http://0.gravatar.com/avatar/27731abff266f585f006998f65c74be9?s=96&amp;amp;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
            <media:title>Brian Kelly (UK Web Focus)</media:title>
         </media:content>
      </item>
      <item>
         <title>I2: Survey</title>
         <link>http://feedproxy.google.com/~r/technosophia/~3/9AGGicyXiVs/</link>
         <description>[Series] Near the end of my strawman post, I wrote:The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week. It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey. We completed [...]</description>
         <guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=363</guid>
         <pubDate>Sat, 20 Jun 2009 11:30:53 -0700</pubDate>
         <content:encoded><![CDATA[
<p>[<a rel="nofollow" target="_blank" href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p> <p>Near the end of my <a rel="nofollow" target="_blank" href="http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/">strawman post</a>, I wrote:<blockquote>The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week. It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey. </blockquote></p> <p>We completed the survey late last week and began distributing it. Here&#039;s what we sent out:<blockquote><p>The NISO I2 Working Group is surveying repository managers to determine the current practices and needs of the repository community regarding institutional identifiers. We value your time and your input in the process to create a standard for a new institutional identifier. We hope that you will complete the survey which should take less than 15 minutes. The survey will remain open through Monday, July 6th.</p> <p>
Here is a link to the survey:
<a rel="nofollow" target="_blank" href="http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d">http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d</a></p> <p>
Please feel free to share this message with other interested parties.</p> <p></blockquote></p> <p>First we used <a rel="nofollow" target="_blank" href="http://www.surveymonkey.com/">Survey Monkey</a> to send the survey link to approximately one-hundred repository managers that the subgroup identified. Our process for identifying repository managers involved pulling together a list of prominent repositories from subgroup members, and then gathering more from <a rel="nofollow" target="_blank" href="http://www.opendoar.org/">OpenDOAR</a>, &#034;an authoritative directory of academic open access repositories.&#034; Then subgroup members were encouraged to share the survey link with colleagues, and post it far and wide via blogs, listservs, and <a rel="nofollow" target="_blank" href="http://twitter.com/mjgiarlo/status/2230486784">tweets</a>. The listservs we targeted were: <a rel="nofollow" target="_blank" href="http://www.jiscmail.ac.uk/archives/jisc-repositories.html">JISC-REPOSITORIES</a>, <a rel="nofollow" target="_blank" href="http://metadatalibrarians.monarchos.com/">metadataLibrarians</a>, <a rel="nofollow" target="_blank" href="http://groups.google.com/group/digital-curation">digital-curation</a>, <a rel="nofollow" target="_blank" href="https://arl.org/Lists/SPARC-IR/">SPARC-IR</a>, <a rel="nofollow" target="_blank" href="http://mailman.anu.edu.au/mailman/listinfo/ir-net">ir-net</a>, <a rel="nofollow" target="_blank" href="http://www.lsoft.com/SCRIPTS/WL.EXE?SL1=REPOMAN-L&amp;H=LISTSERV.INDIANA.EDU">REPOMAN-L</a>, <a rel="nofollow" target="_blank" href="http://larch.palinet.org/archives/palinet-ir-l.html">PALINET-IR-L</a>, <a rel="nofollow" target="_blank" href="http://mailman.mit.edu/mailman/listinfo/dspace-general">dspace-general</a>, <a rel="nofollow" target="_blank" href="https://lists.sourceforge.net/lists/listinfo/fedora-commons-users">fedora-commons-users</a>, <a rel="nofollow" target="_blank" href="http://dublincore.org/groups/identifiers/">DC-IDENTIFIERS</a>, and <a rel="nofollow" target="_blank" href="http://dewey.library.nd.edu/mailing-lists/code4lib/">code4lib</a>.</p> <p>I&#039;ve already received a few responses and have gotten useful feedback. Two of the hardest questions to answer so far have been: &#034;What is an institutional identifier?&#034; and &#034;What is a repository?&#034;</p> <dl>
<dt><strong>Institutional identifier</strong></dt>
<dd><p>An institutional identifier is defined as a symbol or code that uniquely identifies an institution. Domain-specific examples of existing identifiers include SAN, IPEDS, GLN, MARC Org Code, and ISIL. Another example might be a Handle prefix or ARK name authority assigning number.</p></dd>
<dt><strong>Repository</strong></dt>
<dd><p>Institutional repositories and subject repositories like arxiv.org are clearly &#039;repositories&#039;, but beyond that it is a somewhat ill-defined term. One might look to the <a rel="nofollow" target="_blank" href="http://www.cnri.reston.va.us/k-w.html">Kahn-Wilensky architecture</a>, or the <a rel="nofollow" target="_blank" href="http://public.ccsds.org/publications/archive/650x0b1.pdf">OAIS reference model (PDF)</a>, or even <a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Digital_repository">Wikipedia</a> for definitions, but it&#039;s not clear that even the authorities agree on what constitutes a repository.</p><p>It&#039;s a system. It&#039;s network-accessible and typically has a web interface of some sort. Files and groups of files sometimes known as objects tend to be deposited in them, perhaps for some combination of management, access, or preservation. Many run Fedora, DSpace, and ePrints, and factor heavily in scholarly communication. Some are document-centric. Some will accept anything. To some, a learning management system may be a repo. To others, a content management system may fit.</p>
<p>My background is in academia so my own definition is somewhat based in that context, but I wouldn&#039;t say the term is necessarily limited to that context. There are other NISO I2 scenarios for library workflows and electronic resources, so it&#039;s safe to assume that repository does not mean ILS or OPAC or ERP system. My hope is that folks have their own working definitions of the term and can decide for themselves what it means.</p></dd>
</dl> <p>We&#039;ve given folks a little over two weeks to respond to the survey, so the constant I2 drum-beating will quiet down for a while around here. I am very interested in what sorts of responses we get from the survey. Fun times!</p> <p>Oh, and perhaps it goes without saying, but if you&#039;re a repository owner, manager, expert, developer, or stakeholder with an interest in identifiers, please feel free to take the <a rel="nofollow" target="_blank" href="http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d">survey</a>!</p>
<div class="feedflare">
<a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=9AGGicyXiVs:0SR0L0APkaQ:bcOpcFrp8Mo"><img src="http://feeds.feedburner.com/~ff/technosophia?d=bcOpcFrp8Mo" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=9AGGicyXiVs:0SR0L0APkaQ:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/technosophia?d=dnMXMwOfBR0" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=9AGGicyXiVs:0SR0L0APkaQ:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/technosophia?i=9AGGicyXiVs:0SR0L0APkaQ:D7DqB2pKExk" border="0"></a>
</div><img src="http://feeds.feedburner.com/~r/technosophia/~4/9AGGicyXiVs" height="1" width="1"/>]]></content:encoded>
      <enclosure url="http://public.ccsds.org/publications/archive/650x0b1.pdf" length="654750" type="application/pdf" /><media:content url="http://public.ccsds.org/publications/archive/650x0b1.pdf" fileSize="654750" type="application/pdf" /><itunes:subtitle>[Series] Near the end of my strawman post, I wrote:The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week. It will be interesting to see if the requirements we have thus far identified still obtain in light </itunes:subtitle><itunes:summary>[Series] Near the end of my strawman post, I wrote:The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week. It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey. We completed [...]</itunes:summary></item>
      <item>
         <title>A new climate for meteorological publishing?</title>
         <link>http://infteam.jiscinvolve.org/2009/06/19/a-new-climate-for-meteorological-publishing/</link>
         <description>&lt;p&gt;***This is a guest post from Fiona Hewer on behalf of the JISC OJIMS project***&lt;/p&gt;
&lt;p&gt;Scientific information about weather and climate change is being scrutinised more than ever to meet the need for advice to policy-makers on greenhouse gas emissions and their consequences. The Royal Meteorological Society (RMetS), a world-leading publisher, and the British Atmospheric Data Centre, a NERC repository, have been working on new ways to provide scientific quality assurance and access to meteorological data, to meet this need, and the wider requirements of the meteorological sciences.&lt;/p&gt;
&lt;p&gt;The internet provides many ways to give access to data and information on-line, from online journals and institutional repositories to project and personal websites. &amp;#8221;Overlay Journals&amp;#8221; are websites that can sit above all these sources, collating information on a particular topic and providing quality control information, such as through a peer-review process. So, for users with an interest in a particular topic, an overlay journal gives wide, quality-controlled access from a single web site.&lt;/p&gt;
&lt;p&gt;The &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/reppres/sue/ojims.aspx" title="OJIMS project"&gt;OJIMS project&lt;/a&gt; (Overlay Journal Infrastructure for Meteorological Sciences) aimed to investigate overlay journal mechanics, create an open-access repository and evaluate business models for potential overlay journals. It has been funded mainly by the &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/reppres.aspx" title="Repositories and preservation programme"&gt;JISC Repositories and Preservation Programme&lt;/a&gt; (&lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/reppres/sue.aspx" title="SUE strand"&gt;Repository Start-up and Enhancement Strand&lt;/a&gt;) and also supported by &lt;a rel="nofollow" target="_blank" href="http://www.nerc.ac.uk/" title="NERC"&gt;NERC&lt;/a&gt;. The project has raised awareness of the potential benefits and obstacles to digital publishing in the meteorological community, including scientists and organisations in the public and private sectors.&lt;/p&gt;
&lt;p&gt;The project activities included; surveys of scientists and stakeholder organisations; review of outputs from other JISC projects such as &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/reppres/tools/rioja.aspx" title="RIOJA project"&gt;RIOJA&lt;/a&gt; and other work on open-access; identification of the benefits, risks and costs of two overlay journal scenarios; development of a document editor for overlay journals; and creation of a new document repository.&lt;/p&gt;
&lt;p&gt;Fiona Hewer of &lt;a rel="nofollow" target="_blank" href="http://www.fionasredkite.co.uk" title="Fiona's Red Kite"&gt;Fiona&amp;#8217;s Red Kite&lt;/a&gt; was engaged by RMetS to evaluate the business models. It evaluated the technologies and business cases associated with new overlay journals. It is hoped that this will lead to the publication of a data journal in the near future.&lt;/p&gt;
&lt;p&gt;The OJIMS project outputs have been used to inform NERC’s information strategy.&lt;/p&gt;
&lt;p&gt;The full reports can be &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/reppres/sue/ojims.aspx" title="OJIMS project"&gt;downloaded here&lt;/a&gt;. More information is available on the &lt;a rel="nofollow" target="_blank" href="http://proj.badc.rl.ac.uk/ojims" title="OJIMS project website"&gt;project website&lt;/a&gt; including a link to the CEDA-docs demonstration repository.&lt;/p&gt;</description>
         <author>Andy McGregor</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/06/19/a-new-climate-for-meteorological-publishing/</guid>
         <pubDate>Fri, 19 Jun 2009 03:37:31 -0700</pubDate>
      </item>
      <item>
         <title>Participate in JISC Landscape Study</title>
         <link>http://infteam.jiscinvolve.org/2009/06/18/participate-in-jisc-landscape-study/</link>
         <description>&lt;p&gt;Apologies, firstly, to Ann Chapman and Rosemary Russell over at UKOLN that I haven&amp;#8217;t done this post a bit earlier. They are both currently working on a landscape study to understand how people use Web 2.0 tools and services as part of the &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/reppres/sharedservices.aspx"&gt;shared infrastructure services (SIS) programme&lt;/a&gt; to both set the current work in context and also to understand what we need to do into the future.&lt;/p&gt;
&lt;p&gt;There is more information on the UKOLN blog for the project at http://blogs.ukoln.ac.uk/jisc-sis-landscape/ so if you are working with Web 2.0 tools and services please visit and fill in the survey.&lt;/p&gt;</description>
         <author>James Farnhill</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/06/18/participate-in-jisc-landscape-study/</guid>
         <pubDate>Thu, 18 Jun 2009 04:27:53 -0700</pubDate>
      </item>
      <item>
         <title>UPEI Robertson Library Wins CAUBO Award for VRE</title>
         <link>http://feedproxy.google.com/~r/typepad/mleggott/loomware/~3/R64s_jOcVTQ/upei-robertson-library-wins-caubo-award-for-vre.html</link>
         <description>&lt;div&gt;&lt;p&gt;The announcement was made this week about the Robertson Library winning the Atlantic Regional CAUBO award for our Virtual Research Environment, or VRE. The text from the award: &lt;blockquote&gt;
&lt;p&gt;The Virtual Research Environment (VRE) is an innovative software system for stewarding research data regardless of the type of data or discipline. The VRE combines a best-practice web-based content management system (Drupal) with a state-of-the-art data repository system (Fedora) via an open source application suite (Islandora) developed at UPEI. Together, these components help solve one of the most important and challenging aspects of any research-intensive institution: the reliable stewardship of research data throughout the life-cycle of a research program and beyond. &lt;/p&gt; &lt;p&gt;The VRE provides a wealth of social software tools and techniques to provide a highly-functional environment that encourages collaboration. It allows users to create workflows that can transform data into more usable formats and ensure that accessible versions of research assets are available to collaborators and/or the broader public. Over 50 research groups currently use the VRE system, representing all disciplines and research approaches, including interdisciplinary projects with international communities of interest. Examples include the Mollusc Health Lab, Marine Natural Products Lab, Marxism and Psychology Research Group, Advancing Interdisciplinary Research in Singing, and the L.M. Montgomery Institute. The VRE software is also the basis of the library&amp;rsquo;s digital collections, including newspapers, magazines, books, audio and video. &lt;/p&gt; &lt;p&gt;The flexible architecture of the VRE system means that any digital file can be stored, described and accessed in a variety of ways, accommodating the requirements of individual researchers, while building a generalized framework which adapts to more uses. By bringing the core philosophies of the open source community to the research effort, the VRE also provides a transformative landscape on which to build UPEI&amp;rsquo;s research excellence and outreach. By distributing the open source software behind the VRE to the larger community, these same benefits will be available to the larger Canadian and international research communities. Emerging partnerships with other institutions (e.g., University of New Brunswick) and vendors (Sun Microsystems Inc.) point to the success of the VRE and related development efforts at UPEI&amp;rsquo;s Robertson Library. &lt;/p&gt;
&lt;/blockquote&gt; The other winners are available from the &lt;a rel="nofollow" target="_blank" href="http://www.caubo.ca/awards/documents/QP_Rich_UM_Summer_09_E.pdf"&gt;CAUBO site&lt;/a&gt;. You can get more information on the VRE and the software that sits behind it at our &lt;a rel="nofollow" target="_blank" href="http://vre.upei.ca/dev/"&gt;VRE&lt;/a&gt; and &lt;a rel="nofollow" target="_blank" href="http://vre.upei.ca/dev/islandora"&gt;Islandora&lt;/a&gt; sites.
&lt;/div&gt;
&lt;div class="feedflare"&gt;
&lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=R64s_jOcVTQ:omzR8pg6Y0Q:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?d=yIl2AUoC8zA" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=R64s_jOcVTQ:omzR8pg6Y0Q:F7zBnMyn0Lo"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?i=R64s_jOcVTQ:omzR8pg6Y0Q:F7zBnMyn0Lo" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=R64s_jOcVTQ:omzR8pg6Y0Q:V_sGLiPBpWU"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?i=R64s_jOcVTQ:omzR8pg6Y0Q:V_sGLiPBpWU" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=R64s_jOcVTQ:omzR8pg6Y0Q:EpLpB3ZkKWg"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?d=EpLpB3ZkKWg" border="0"&gt;&lt;/a&gt;
&lt;/div&gt;</description>
         <author>mleggott</author>
         <guid isPermaLink="false">tag:typepad.com,2003:post-68161411</guid>
         <pubDate>Tue, 16 Jun 2009 04:39:08 -0700</pubDate>
      <enclosure url="http://www.caubo.ca/awards/documents/QP_Rich_UM_Summer_09_E.pdf" length="2216456" type="application/pdf" /><media:content url="http://www.caubo.ca/awards/documents/QP_Rich_UM_Summer_09_E.pdf" fileSize="2216456" type="application/pdf" /><itunes:subtitle> The announcement was made this week about the Robertson Library winning the Atlantic Regional CAUBO award for our Virtual Research Environment, or VRE. The text from the award: The Virtual Research Environment (VRE) is an innovative software system for s</itunes:subtitle><itunes:author>mleggott</itunes:author><itunes:summary> The announcement was made this week about the Robertson Library winning the Atlantic Regional CAUBO award for our Virtual Research Environment, or VRE. The text from the award: The Virtual Research Environment (VRE) is an innovative software system for stewarding research data regardless of the type of data or discipline. The VRE combines a best-practice web-based content management system (Drupal) with a state-of-the-art data repository system (Fedora) via an open source application suite (Islandora) developed at UPEI. Together, these components help solve one of the most important and challenging aspects of any research-intensive institution: the reliable stewardship of research data throughout the life-cycle of a research program and beyond. The VRE provides a wealth of social software tools and techniques to provide a highly-functional environment that encourages collaboration. It allows users to create workflows that can transform data into more usable formats and ensure that accessible versions of research assets are available to collaborators and/or the broader public. Over 50 research groups currently use the VRE system, representing all disciplines and research approaches, including interdisciplinary projects with international communities of interest. Examples include the Mollusc Health Lab, Marine Natural Products Lab, Marxism and Psychology Research Group, Advancing Interdisciplinary Research in Singing, and the L.M. Montgomery Institute. The VRE software is also the basis of the library&amp;rsquo;s digital collections, including newspapers, magazines, books, audio and video. The flexible architecture of the VRE system means that any digital file can be stored, described and accessed in a variety of ways, accommodating the requirements of individual researchers, while building a generalized framework which adapts to more uses. By bringing the core philosophies of the open source community to the research effort, the VRE also provides a transformative landscape on which to build UPEI&amp;rsquo;s research excellence and outreach. By distributing the open source software behind the VRE to the larger community, these same benefits will be available to the larger Canadian and international research communities. Emerging partnerships with other institutions (e.g., University of New Brunswick) and vendors (Sun Microsystems Inc.) point to the success of the VRE and related development efforts at UPEI&amp;rsquo;s Robertson Library. The other winners are available from the CAUBO site. You can get more information on the VRE and the software that sits behind it at our VRE and Islandora sites. </itunes:summary></item>
      <item>
         <title>Which Will Last Longer: Hero.ac.uk or Facebook?</title>
         <link>http://ukwebfocus.wordpress.com/2009/06/15/which-will-last-longer-hero-ac-uk-or-facebook/</link>
         <description>A Hero For Our Sector
One of the real strengths of the UK higher education sector is the way in which we can work together as a sector, meaning that the whole is geater than the sum of the individual parts. This is undoubtedly true of JISC (which is envied in the higher education and research [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ukwebfocus.wordpress.com&amp;blog=497535&amp;post=2905&amp;subd=ukwebfocus&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://ukwebfocus.wordpress.com/?p=2905</guid>
         <pubDate>Mon, 15 Jun 2009 05:40:53 -0700</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><h2>A Hero For Our Sector</h2>
<p><img class="size-medium wp-image-2909 alignright" title="Hero home page (from Internet Archive)" src="http://ukwebfocus.files.wordpress.com/2009/06/hero-20090615.png?w=300&#038;h=233" alt="Hero home page (from Internet Archive)" width="300" height="233"/>One of the real strengths of the UK higher education sector is the way in which we can work together as a sector, meaning that the whole is geater than the sum of the individual parts. This is undoubtedly true of JISC (which is envied in the higher education and research sectors around the world) but also applies elsewhere. One example of this is <strong>Hero</strong>: &#8220;<em>the official gateway to universities, colleges and research organisations in the UK</em>&#8220;: a gateway funded by the various funding bodies (HEFCE, SHEFC, HEFCW and DENI) and supported by other higher educational agencies and by the high educational institutions themselves (and note that I was involved in the technical advisory group for the &#8220;HE Mall&#8221; as it was originally called.</p>
<p>Indeed will a service such as Hero, why would higher educational institutions wish to use other channels for online marketing, particularly social networking service such as Facebook which, despite its popularity are, in some circles, regarded with suspicion in not hostility?</p>
<h2>Our Hero Is Dead &#8230;</h2>
<p>Alas for those who believe that the sector should own its marketing channels, the Hero.ac.uk service <a rel="nofollow" target="_blank" href="http://www.universitiesandtheweb.com/heroacuk-to-close-tomorrow/">was closed on 4<sup>th</sup> June 2009</a> (and the image shown above was taken from <a rel="nofollow" target="_blank" href="http://web.archive.org/web/20080210162357rn_1/www.hero.ac.uk/uk/home/index.cfm">Hero&#8217;s most recent entry in the Internet Archive</a>, from 10<sup>th</sup> February 2008) I should disclose that last year I was interviewed by a consultant who had been appointed in order to identify future directions for the service, including whether the service was viable. I pointed out the flaws in the Hero service: it did not have the community aspects which potential new students might expect and it was a &#8216;walled garden&#8217; &#8211; information could be uploaded to the service but there were no easy ways of getting the data out again. &#8220;<em>Make &#8216;Hero 2.0&#8242; a trusted service which could host structured institutional data</em>&#8220;, I suggested &#8220;<em>and provide APIs to allow developers elsewhere to add value to the service</em>&#8220;. But this did not happen.</p>
<h2>&#8230; Long Live a New Hero?</h2>
<p>If the managed service to promote UK higher educational institutions is too costly to provide, why don&#8217;t we appropriate popular social networking services to fulfil this role? This is an idea inspired by a Tony Hirst&#8217;s post on &#8220;<a rel="nofollow" target="_blank" href="http://ouseful.wordpress.com/2009/06/07/appropriating-technology/">Appropriating technology</a>&#8221; which he described as &#8220;<em>appropriating technologies that might have been designed for other purposes in order to use them in an educational context</em>&#8221; but I would replace &#8216;<em>educational context</em>&#8216; by &#8216;<em>marketing context</em>&#8216;.</p>
<p>And, if we&#8217;re honest, isn&#8217;t Facebook the new Hero? It can provide the popular service for hosting institutional marketing materials. And it can provide the community aspects which Hero failed to provide. Admittedly it may be a &#8216;walled garden&#8217; &#8211; but then so was Hero, so nothing is being lost.</p>
<p>But if we wish to use Facebook in this way, don&#8217;t we as a sector need to identify the best practices for making use of Facebook, including minimising the risks associated with the service? And shouldn&#8217;t we be exploring the benefits which might be gained by working collaboratively?</p>
<p>Some initial thoughts on this:</p>
<p style="padding-left:30px;"><strong>Institutional URL</strong>: As mentioned in my recent post on &#8220;<a rel="nofollow" target="_blank" href="http://ukwebfocus.wordpress.com/2009/06/13/have-you-claimed-your-personal-and-institutional-facebook-vanity-url/">Have You Claimed Your Personal And Institutional Facebook Vanity URL?</a>&#8221; we are seeing Facebook URLs being minted as a single string (edgehilluniversity) and words separated by dots (aberystwyth.university). We might wish to consider whether there are advantages in seeking agreement on the form of the name &#8211; perhaps even using an institutional domain name in the URL (e.g. www.facebook.com/www.bath.ac.uk). However it is probably too late to do anything about this (which arguably demonstrates the failure in having not had such discussions previously).</p>
<p style="padding-left:30px;"><strong>Trademark disputes</strong>: We&#8217;ll want to avoid the possibility of trademark disputes. Might we see one between <a rel="nofollow" target="_blank" href="http://www.lmu.ac.uk/">Leeds Metropolitan University</a> and say, <a rel="nofollow" target="_blank" href="http://www.lmu.edu/">Loyola Marymount University</a> over www.facebook.com/lmu?</p>
<p style="padding-left:30px;"><strong>Ownership of Facebook resource</strong>: Who has access to the institutional Facebook account in your institution? And what if they&#8217;ve left or you can&#8217;t find the owner? The information should be regarded as a valuable institutional resource and ownership should be managed appropriately.</p>
<p style="padding-left:30px;"><strong>Workflow processes</strong>: There&#8217;s a need to establish effective workflow processes for information provide on the institutional Facebook page. Ideally information would be hosted elsewhere and automatically updated in Facebook though use of, for example, an RSS application in your Facebook page.</p>
<p style="padding-left:30px;"><strong>Will Facebook pages enhance or diminish Google Juice</strong>: Might not institutional content which is replicated on Facebook pages diminish institutional &#8216;Google juice&#8217; <a rel="nofollow" target="_blank" href="http://blog.paulwalk.net/2009/06/14/heis-get-facebook-fever-again/">as my colleague Paul Walk has suggested</a>? Or, alternatively, might content held in popular services such as Facebook and Wikipedia (and previously, to a lesser extent, Hero) held to increase traffic to the institutional Web site? Indeed if such replication of content is felt to be counter-productive, shouldn&#8217;t institutions try to prevent Web sites having links to their content rather than seeking to maximise such links?</p>
<p style="padding-left:30px;"><strong>Facebook Terms and Conditions</strong>: It would be useful to gain a better understanding of the Faceboom terms and conditions and the implications for an organisation&#8217;s pages in order to inform appropriate risk management approach. If the concern is that Facebook will claim ownership of marking material provides, is that really of concern?</p>
<p style="padding-left:30px;"><strong>Explore Possibilities for Facebook Applications</strong>: Might there be benefits in developing Facebook applications to make the UK HEI pages more appealling?</p>
<p>But have we, in the UK, missed the boat? Looking at <a rel="nofollow" target="_blank" href="http://www.eduwebconference.com/index.php/overview/">the timetable for the forthcoming Eduweb 2009 conference</a> I notice sessions on topics such as &#8220;<em>Facebook &#8212; a case study of building virtual relationships</em>&#8220;, &#8220;<em>Cheap, Fast, &amp; Out of Control: Brand management &amp; recruitment..</em>&#8221; and &#8220;<em>Recruiting and Marketing in the Web 2.0 World</em>&#8220;. We&#8217;ve nothing along these lines <a rel="nofollow" target="_blank" href="http://iwmw.ukoln.ac.uk/iwmw2009/programme/">planned for IWMW 2009</a> &#8211; but as the <a rel="nofollow" target="_blank" href="http://iwmw.ukoln.ac.uk/iwmw2009/barcamp/">bar camp sessions</a> can be submitted at the workshop itself, perhaps there&#8217;s an opportunity to build on these ideas?</p>
<p>Oh, and if you think it is inappro[oriate for an organisation to make use of a social network in this way, look at what companies such as <a rel="nofollow" target="_blank" href="http://www.facebook.com/starbucks">Starbucks</a> and <a rel="nofollow" target="_blank" href="http://www.facebook.com/McDonalds">McDonalds</a> are doing on Facebook.</p> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/ukwebfocus.wordpress.com/2905/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ukwebfocus.wordpress.com/2905/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/ukwebfocus.wordpress.com/2905/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ukwebfocus.wordpress.com/2905/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/ukwebfocus.wordpress.com/2905/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ukwebfocus.wordpress.com/2905/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/ukwebfocus.wordpress.com/2905/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ukwebfocus.wordpress.com/2905/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/ukwebfocus.wordpress.com/2905/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ukwebfocus.wordpress.com/2905/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ukwebfocus.wordpress.com&blog=497535&post=2905&subd=ukwebfocus&ref=&feed=1"/></div>]]></content:encoded>
         <media:content url="http://0.gravatar.com/avatar/27731abff266f585f006998f65c74be9?s=96&amp;amp;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
            <media:title>Brian Kelly (UK Web Focus)</media:title>
         </media:content>
         <media:content url="http://ukwebfocus.files.wordpress.com/2009/06/hero-20090615.png?w=300" medium="image">
            <media:title>Hero home page (from Internet Archive)</media:title>
         </media:content>
         <category>Web2.0</category>
      </item>
      <item>
         <title>I2: Strawman</title>
         <link>http://feedproxy.google.com/~r/technosophia/~3/NiQ73giN8nE/</link>
         <description>[Series] In the prior I2 post, I wrote about the requirements the repositories subgroup has come up with for an institutional identifier standard (with the hope that our findings re: repositories could be generalized to other scenarios). Image by PhOtOnQuAnTiQuE via Flickr My strawman proposal of sorts is to explore how well linked data patterns [...]</description>
         <guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=346</guid>
         <pubDate>Sat, 13 Jun 2009 16:02:36 -0700</pubDate>
         <content:encoded><![CDATA[
<p style="text-align:left;"></p> <p style="text-align:left;">[<a rel="nofollow" target="_blank" href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p> <p style="text-align:left;">In the prior I2 post, I wrote about the requirements the repositories subgroup has come up with for an institutional identifier standard (with the hope that our findings re: repositories could be generalized to other scenarios).</p> <div class="zemanta-img zemanta-action-dragged" style="margin:1em;display:block;text-align:left;">
<div><dl class="wp-caption alignright" style="width:250px;"> <dt class="wp-caption-dt"><a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/67968452@N00/3272712288"><img title="PhotonQ-Tim Berners Lee on Linked Data at TED" src="http://farm4.static.flickr.com/3449/3272712288_2ef843a4b7_m.jpg" alt="PhotonQ-Tim Berners Lee on Linked Data at TED" width="240" height="180"/></a></dt> <dd class="wp-caption-dd zemanta-img-attribution" style="font-size:0.8em;">Image by <a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/67968452@N00/3272712288">PhOtOnQuAnTiQuE</a> via Flickr</dd> </dl></div>
</div> <p style="text-align:left;">My strawman proposal of sorts is to explore how well <a rel="nofollow" target="_blank" href="http://linkeddata.org/">linked data patterns</a> fit this problem space. Linked data, briefly, is a way to expose and link data on the web in a more semantically meaningful way, and is often summarized using the four principles put forward by <a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Tim_Berners-Lee">Tim Berners-Lee</a>:</p> <blockquote style="text-align:left;">
<ol> <li>Use URIs as names for things</li> <li>Use HTTP URIs so that people can look up those names.</li> <li>When someone looks up a URI, provide useful information.</li> <li>Include links to other URIs. so that they can discover more things.</li>
</ol>
</blockquote> <p style="text-align:left;">That&#039;s the crux of it. Linked data takes well-known patterns on the web (linking, dereferencing, etc.) and applies them to data, which in this case could be metadata for identifying institutions.</p> <p style="text-align:left;">Let&#039;s examine each of the requirements and the applicability of linked data thereto.</p> <ol style="text-align:left;"> <li><em>Should be agnostic to type of institution, e.g., libraries, museums, personal collections, historical societies</em>: The web is already agnostic to type of institution. HTTP URIs do not favor one type of institution over another.<br/></li> <li><em>Should handle varying institutional granularity, e.g., institution-level, campus-level, division-level, unit-level</em>: HTTP <a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Uniform_Resource_Identifier">URIs</a> are flexible in this regard. Hierarchy, should one wish it to be surfaced in the identifier, may be encoded in either a DNS hostname or the path appended to the DNS name. One can imagine a URI like &#034;http://department.division.institution.tld/unit/subunit&#034; or &#034;http://institution.tld/campus/office/individual&#034;. <br/><br/>Hierarchy needn&#039;t be surfaced in the identifier if one favors opacity, in which case &#034;http://registry.tld/xnjsdasd&#034; would suffice as an identifier, and may instead be entirely reflected in the (RDF) representation returned by dereferencing the URI.<br/></li> <li><em>Should handle linking among institutions and subordinate units</em>: Linked data handles linking via well-known HTTP mechanisms, referenced in the fourth principle of linked data. Unlike the HTTP link, which has limited semantics, linked data links are semantically rich and extensible.<br/></li> <li><em>Should express different sorts of relationships among these institutions and units</em>: The &#034;useful information&#034; in the third principle of linked data is typically provided by an RDF representation, which is itself a list of assertions. These assertions, or triples, consist of subjects, predicates, and objects. The ability to express the relationships in this requirement is limited only by the availability of vocabularies that contain sets of predicates and classes for subjects and objects. Think of the predicates as elements defined within a metadata standard, e.g., <a rel="nofollow" target="_blank" href="http://dublincore.org/documents/dces/">Dublin Core</a> &#034;creator&#034;, <a rel="nofollow" target="_blank" href="http://www.loc.gov/standards/mods/">MODS</a> &#034;relatedItem&#034;, and so forth. Vocabularies that contain these predicates and classes are growing and evolving daily, and should there not be a vocabulary that contains the relationship one wishes to express, it is fairly easy to create a custom vocabulary. <br/><br/>The ability to mix and match vocabularies provides an expressiveness that is often not found in document-based metadata formats and the flexibility to express radically different relationships on a per-industry or per-institution basis. This latter point is important as the I2 group has identified both core metadata elements for identifying institutions of different types and additional elements for specific types of institutions. Why re-invent a new metadata format or schema when all one needs to express may already be contained in others?<br/></li> <li><em>Should relate to existing relevant identifiers and registries</em>: Same as requirement#4. Linked data is all about expressing relationships between things, e.g., institutions, identifiers, registries, etc.<br/></li> <li><em>Should be globally unique</em>: HTTP URIs are guaranteed to be globally unique by virtue of the distributed <a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Domain_name_system">DNS</a> system and hierarchical naming within each HTTP service.<br/></li> <li><em>Should be actionable</em>: HTTP URIs provide dereferenceability/actionability via the well-known HTTP protocol.<br/></li> <li><em>Should enable retrieval of metadata sufficient to identify the institution, which may vary widely by institution</em>: HTTP URIs are actionable per requirement #7 and the metadata returned is flexible per requirement #4.<br/></li> <li><em>Should accommodate changes as institutions come and go and re-organize and be able to relate defunct institutions to new ones</em>: Linked data patterns provide for redirecting from defunct representations (institutional identifiers) to new ones via HTTP redirects. One may also add assertions to institutional metadata such as owl:sameAs, for instance, which says that the institution identified by the given URI is the same as another institution identified by another URI.<br/></li>
</ol> <p>This seems like a compelling path to follow for the I2 standard.</p> <p>The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week. It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey. If so, I would like to explore the idea of linked data for institutional identifiers a bit more.</p>
<div class="feedflare">
<a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=NiQ73giN8nE:VvOKl12eAaE:bcOpcFrp8Mo"><img src="http://feeds.feedburner.com/~ff/technosophia?d=bcOpcFrp8Mo" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=NiQ73giN8nE:VvOKl12eAaE:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/technosophia?d=dnMXMwOfBR0" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=NiQ73giN8nE:VvOKl12eAaE:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/technosophia?i=NiQ73giN8nE:VvOKl12eAaE:D7DqB2pKExk" border="0"></a>
</div><img src="http://feeds.feedburner.com/~r/technosophia/~4/NiQ73giN8nE" height="1" width="1"/>]]></content:encoded>
      </item>
      <item>
         <title>The JISC SIS Landscape Study</title>
         <link>http://ukwebfocus.wordpress.com/2009/06/10/the-jisc-sis-landscape-study/</link>
         <description>The JISC is funding a landscape study on the UK HE sector use of content, communication and social networking services developed by commercial companies (or, perhaps more accurately, outside of the JISC sector).
As we know although JISC has developed a number of services specifically for use within the UK higher and further education sector (e.g. Jorum, JISCmail, [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ukwebfocus.wordpress.com&amp;blog=497535&amp;post=2869&amp;subd=ukwebfocus&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://ukwebfocus.wordpress.com/?p=2869</guid>
         <pubDate>Wed, 10 Jun 2009 02:00:17 -0700</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><p>The JISC is funding a landscape study on the <strong>UK HE sector use of content, communication and social networking services</strong> developed by commercial companies (or, perhaps more accurately, outside of the JISC sector).</p>
<p>As we know although <a rel="nofollow" target="_blank" href="http://blogs.ukoln.ac.uk/jisc-sis-landscape/jisc-servicesjisc-services/">JISC has developed a number of services</a> specifically for use within the UK higher and further education sector (e.g. Jorum, JISCmail, etc.) people within the sector are increasingly using services developed outside the sector, either in addition to &#8211; or in some cases instead of &#8211; JISC-provided services.</p>
<p>Since evidence of this usage is fragmented and often anecdotal, the JISC SIS Landscape study aims to provide a snapshot of the current situation in the UK.</p>
<p>My colleagues Ann Chapman and Rosemary Russell are leading this work and have set up the <a rel="nofollow" target="_blank" href="http://blogs.ukoln.ac.uk/jisc-sis-landscape/">JISC SIS Landscape Study blog</a> to facilitate their work. We welcome contributions to this blog in order to collate evidence on how such services are being used within the sector. Please note that JISC are primarily interested in use of such services within the UK higher and further education sectors. If you are outside this sector, feel free to contribute but please make it clear in your comments the sector you work or study in.</p> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/ukwebfocus.wordpress.com/2869/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ukwebfocus.wordpress.com/2869/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/ukwebfocus.wordpress.com/2869/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ukwebfocus.wordpress.com/2869/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/ukwebfocus.wordpress.com/2869/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ukwebfocus.wordpress.com/2869/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/ukwebfocus.wordpress.com/2869/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ukwebfocus.wordpress.com/2869/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/ukwebfocus.wordpress.com/2869/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ukwebfocus.wordpress.com/2869/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ukwebfocus.wordpress.com&blog=497535&post=2869&subd=ukwebfocus&ref=&feed=1"/></div>]]></content:encoded>
         <media:content url="http://0.gravatar.com/avatar/27731abff266f585f006998f65c74be9?s=96&amp;amp;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
            <media:title>Brian Kelly (UK Web Focus)</media:title>
         </media:content>
         <category>General</category>
      </item>
      <item>
         <title>I2: Requirements</title>
         <link>http://feedproxy.google.com/~r/technosophia/~3/qGu7adSk3-g/</link>
         <description>[Series] The I2 IR scenario subgroup approached the issue of institutional identifiers in repositories by first brainstorming about the various issues, problems, and sticking points that make identifiers in this space (and elsewhere) such a complex topic. Folks on the subgroup are repository managers or are otherwise involved with or knowledgeable about the repository space, [...]</description>
         <guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=327</guid>
         <pubDate>Sun, 07 Jun 2009 14:14:56 -0700</pubDate>
         <content:encoded><![CDATA[
<p>[<a rel="nofollow" target="_blank" href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p> <p>The I2 IR scenario subgroup approached the issue of institutional identifiers in repositories by first brainstorming about the various issues, problems, and sticking points that make identifiers in this space (and elsewhere) such a complex topic. Folks on the subgroup are repository managers or are otherwise involved with or knowledgeable about the repository space, so the brainstorming exercise yielded a good number of concerns. </p> <p>The purpose of the exercise was to enumerate concerns and issues that could inform a draft survey to be administered to repository managers and experts around the globe in different organizational contexts: libraries, subject disciplines, archives, historical societies, etc. The purpose of the survey is to get an idea of the use cases and constraints around institutional identifiers in these different repository contexts, the assumption being that we ought to have requirements grounded in real world usage before we go off building a standard.</p> <p>I will note here that the subgroup has worked up a draft survey that has just recently been reviewed by a small group of folks who know about survey design, and we hope to administer the survey to the aforementioned <em>Reporati</em> this week[1]. Which is to say that I don&#039;t yet have a strong grasp of the use cases out there in the wild, and this series should be construed as my own premature cognitive fumblings. But let&#039;s assume for now that what we learn from the survey results matches our initial brainstorming exercise. </p> <p>Here is a slightly modified and boiled down version of the concerns and issues the subgroup came up with for a potential institutional identifier standard, which resembles a set of minimum requirements:</p> <ol> <li>Should be agnostic to type of institution, e.g., libraries, museums, personal collections, historical societies</li> <li>Should handle varying institutional granularity, e.g., institution-level, campus-level, division-level, unit-level</li> <li>Should handle linking among institutions and subordinate units</li> <li>Should express different sorts of relationships among these institutions and units</li> <li>Should relate to existing relevant identifiers and registries</li> <li>Should be globally unique</li> <li>Should be actionable</li> <li>Should enable retrieval of metadata sufficient to identify the institution, which may vary widely by institution</li> <li>Should accommodate changes as institutions come and go and re-organize and be able to relate defunct institutions to new ones</li>
</ol> <p>I doubt the list is exhaustive; I am almost certain we will uncover all sorts of tangly and esoteric use cases that add requirements. I expect it. Why else would we be gathering to discuss the need for an institutional identifier if it were a solved problem or a simple one? [2]</p> <p>Nevertheless, looking at the above list, the task we&#039;ve taken on starts to feel less onerous. And thinking about identifier systems constrained by the list of concerns, the mind starts to cook up all sorts of possible solutions. I&#039;ll share one in the next post in this series, a strawman proposal of sorts, and how it addresses each of these requirements.</p>
<h5>Notes</h5><ol class="footnotes"><li id="footnote_0_327" class="footnote">We will also x-post to repo-related mailing lists as well, and some of us may blog or tweet about it. My inclination is to cast as wide a net as possible so as not to miss important use cases. We can always scope things out later on, but it&#039;s useful to be inclusive at this point lest our own assumptions carry the group forward.</li><li id="footnote_1_327" class="footnote">The cynical among you might have interesting answers to this question.</li></ol><br/>
<hr/><div class="feedflare">
<a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=qGu7adSk3-g:TKGUQwcgjZg:bcOpcFrp8Mo"><img src="http://feeds.feedburner.com/~ff/technosophia?d=bcOpcFrp8Mo" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=qGu7adSk3-g:TKGUQwcgjZg:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/technosophia?d=dnMXMwOfBR0" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=qGu7adSk3-g:TKGUQwcgjZg:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/technosophia?i=qGu7adSk3-g:TKGUQwcgjZg:D7DqB2pKExk" border="0"></a>
</div><img src="http://feeds.feedburner.com/~r/technosophia/~4/qGu7adSk3-g" height="1" width="1"/>]]></content:encoded>
      </item>
      <item>
         <title>jiscri - The results</title>
         <link>http://infteam.jiscinvolve.org/2009/06/05/jiscri-the-results/</link>
         <description>&lt;p&gt;We have been notifying and sending out grant letters to bidders to the &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/fundingopportunities/funding_calls/2009/03/309ricall.aspx" title="JISC rapid innovation"&gt;JISC Rapid Innovation call&lt;/a&gt;, also known as &lt;a rel="nofollow" target="_blank" href="http://search.twitter.com/search?q=jiscri" title="jiscri twitter search"&gt;jiscri&lt;/a&gt;, this week and are now able to release some information about the call.&lt;/p&gt;
&lt;p&gt;We got 94 bids for the call and have ended up funding 33 projects at a cost of £1.1m. Originally we had allocated less funding to this call but in response to the number and quality of bids we received it was agreed we were able to increase the level of funding available for these projects. So thank you to those that submitted proposals and showed interest in this programme activity.&lt;/p&gt;
&lt;p&gt;The 33 projects still need to sign and return their grant letters before the projects are confirmed so we can&amp;#8217;t give a full run down of them yet but will list them on this blog and the JISC website as soon as we can. We were able to fund projects in each of the priority areas specified in the call:
&lt;ul&gt;
&lt;li&gt;Mashups&lt;/li&gt;
&lt;li&gt;Aggregating tags&lt;/li&gt;
&lt;li&gt;Semantic web and linked data&lt;/li&gt;
&lt;li&gt;Data search&lt;/li&gt;
&lt;li&gt;Visualisation&lt;/li&gt;
&lt;li&gt;Personalisation&lt;/li&gt;
&lt;li&gt;Mobile technologies&lt;/li&gt;
&lt;li&gt;Lightweight shared infrastructure services&lt;/li&gt;
&lt;li&gt;User interfaces&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The projects will start in June and run for 6 months. We will be having a meeting for these projects and people interested in them in early September. An announcement about this should come from the &lt;a rel="nofollow" target="_blank" href="http://blog.iedemonstrator.org/" title="IE Demonstrator blog"&gt;IE demonstrator&lt;/a&gt; soon. The IE demonstrator will be showcasing the work of these projects so it is a good RSS feed to grab if you are interested in jiscri.&lt;/p&gt;</description>
         <author>Andy McGregor</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/06/05/jiscri-the-results/</guid>
         <pubDate>Fri, 05 Jun 2009 04:43:56 -0700</pubDate>
      </item>
      <item>
         <title>UK open data, open government</title>
         <link>http://feedproxy.google.com/~r/ScienceLibraryPad/~3/Mcd6S2Ic8BA/uk-open-data-open-government.html</link>
         <description>&lt;div&gt;&lt;p&gt;I was sorely tempted to title this "would uk like some data, guv?"&lt;/p&gt;&lt;p&gt;The UK government is picking up the challenges issued in the excellent &lt;a rel="nofollow" target="_blank" href="http://scilib.typepad.com/science_library_pad/2009/02/uk-power-of-information-taskforce-report-beta.html"&gt;Power of Information Taskforce report&lt;/a&gt;.&lt;/p&gt;&lt;p&gt;Via &lt;a rel="nofollow" target="_blank" href="http://friendfeed.com/andypowe11/8f56a751/open-government-data"&gt;Andy Powell&lt;/a&gt; in my FriendFeed, I find a &lt;cite&gt;Guardian&lt;/cite&gt; article &lt;a rel="nofollow" target="_blank" href="http://www.guardian.co.uk/technology/2009/jun/04/free-our-data"&gt;Free our data: UK set to follow successful US data method&lt;/a&gt; &lt;/p&gt;&lt;blockquote&gt;&lt;p&gt; Now the UK government has picked up on the idea, and in a post on the Cabinet Office blog Richard Stirling is &lt;a rel="nofollow" target="_blank" href="http://blogs.cabinetoffice.gov.uk/digitalengagement/post/2009/05/22/Information-and-how-to-make-it-useful.aspx"&gt;asking the British public how a UK version of the US site should be implemented&lt;/a&gt;. "What characteristics would be most useful to you - feeds (ATOM or RSS) or bulk download by FTP?," he asks. "Should this be an index or a repository? Should this serve particular types of data eg XML, JSON or RDF?" &lt;/p&gt;&lt;p&gt; Although there is a list of dozens of the &lt;a rel="nofollow" target="_blank" href="http://wiki.rewiredstate.org/?page=APIs"&gt;UK government's published data sources&lt;/a&gt; there is no clear pan-governmental approach to making data available. The proposal has been received with pleasure by a number of web developers and would-be data users, although it is not clear how free people would be to use the data commercially. &lt;/p&gt;&lt;/blockquote&gt;&lt;p&gt; Richard Stirling is writing in the UK Cabinet Office Digital Engagement blog&lt;/p&gt;&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://blogs.cabinetoffice.gov.uk/digitalengagement/"&gt;http://blogs.cabinetoffice.gov.uk/digitalengagement/&lt;/a&gt; &lt;/p&gt;&lt;p&gt;At some point I will no longer be saying things like "yes, that's an official gov.uk blog" but... well, it is. &lt;/p&gt;&lt;p&gt; The four themes they list on their &lt;a rel="nofollow" target="_blank" href="http://blogs.cabinetoffice.gov.uk/digitalengagement/page/About.aspx"&gt;about&lt;/a&gt; page: open information, open feedback, open conversation, open innovation.&lt;/p&gt;&lt;p&gt;A more extensive extract of what Richard Stirling asks in his posting &lt;a rel="nofollow" target="_blank" href="http://blogs.cabinetoffice.gov.uk/digitalengagement/post/2009/05/22/Information-and-how-to-make-it-useful.aspx"&gt;Information and how to make it useful&lt;/a&gt; :&lt;/p&gt;&lt;blockquote&gt;&lt;p&gt; Any solution must support open standards and would ideally be open source, but there are a couple of other questions we are pondering at the moment: &lt;/p&gt;&lt;ul&gt; &lt;li&gt; What characteristics would be most useful to you – feeds (ATOM or RSS) or bulk download by e.g. FTP, etc?&lt;/li&gt; &lt;li&gt; Should this be an index or a repository?&lt;/li&gt; &lt;li&gt; Should this serve particular types of data e.g. XML, JSON or RDF?&lt;/li&gt; &lt;li&gt; What examples should we be looking at (beyond data.gov e.g.http://ideas.welcomebackstage.com/data)?&lt;/li&gt; &lt;li&gt; Does this need its own domain, or should it sit on an existing supersite (e.g. http://direct.gov.uk)&lt;/li&gt; &lt;/ul&gt; &lt;/blockquote&gt; &lt;p&gt;There are already 19 substantive comments, and he indicates they are also monitoring Twitter for the hashtags &lt;a rel="nofollow" target="_blank" href="http://search.twitter.com/search?q=%23poit"&gt;#poit&lt;/a&gt; (Power of Information Taskforce) and &lt;a rel="nofollow" target="_blank" href="http://search.twitter.com/search?q=%23opendata"&gt;#opendata&lt;/a&gt;&lt;br&gt; &lt;/p&gt;&lt;p&gt;There is a new Director of Digital Engagement, Andrew Stott, according to his official Twitter feed, &lt;a rel="nofollow" target="_blank" href="http://twitter.com/dirdigeng"&gt;@DirDigEng&lt;/a&gt; , he was scheduled to start in his position yesterday. &lt;/p&gt;&lt;p&gt; Sometimes I feel like a certain country often considered to be between the UK and the US is missing out on this whole official open data, blogging, twitter thing... &lt;/p&gt;&lt;p&gt; If anyone were to want someone to start blogging officially about government open data in a certain northern neighbour of the US, I am available...&lt;/p&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=yIl2AUoC8zA" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:F7zBnMyn0Lo"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?i=Mcd6S2Ic8BA:_kmCJo89dWU:F7zBnMyn0Lo" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:qj6IDK7rITs"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=qj6IDK7rITs" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:EpLpB3ZkKWg"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=EpLpB3ZkKWg" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:V-t1I-SPZMU"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=V-t1I-SPZMU" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:XAVGb8Xj5zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=XAVGb8Xj5zA" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:dnMXMwOfBR0"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=dnMXMwOfBR0" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:V_sGLiPBpWU"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?i=Mcd6S2Ic8BA:_kmCJo89dWU:V_sGLiPBpWU" border="0"&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/ScienceLibraryPad/~4/Mcd6S2Ic8BA" height="1" width="1"/&gt;</description>
         <author>Richard Akerman</author>
         <guid isPermaLink="false">tag:typepad.com,2003:post-67650285</guid>
         <pubDate>Thu, 04 Jun 2009 12:47:26 -0700</pubDate>
      </item>
      <item>
         <title>Web Services and repositories</title>
         <link>http://infteam.jiscinvolve.org/2009/06/03/web-services-and-repositories/</link>
         <description>&lt;p&gt;I attended a workshop on June 2 on the use of Web Services to enable interoperability between repositories, repository services and other systems. The workshop was organised by the &lt;a rel="nofollow" target="_blank" href="http://www.ethos.ac.uk"&gt;Ethos project&lt;/a&gt; (Electronic Theses Online). Most attendees were from a repository manager or developer background.&lt;/p&gt;
&lt;p&gt;[Update: the presentations (including audio recording) at this event are &lt;a rel="nofollow" target="_blank" href="http://www.ethos.ac.uk/0031_Web_Services_Day.html"&gt;now available&lt;/a&gt;]&lt;/p&gt;
&lt;p&gt;As a number of speakers noted, the term &amp;#8216;Web Services&amp;#8217; is a very broad one. The main focus of the day was on the use of a number of specific protocols and approaches to provide &amp;#8217;services&amp;#8217;: &lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://www.swordapp.org"&gt;SWORD&lt;/a&gt; for depositing items in repositories&lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://www.loc.gov/standards/sru/"&gt;SRU&lt;/a&gt; to search for and retrieve items&lt;/p&gt;
&lt;p&gt;And &lt;a rel="nofollow" target="_blank" href="http://standards.jisc.ac.uk/catalogue/REST.phtml"&gt;REST&lt;/a&gt; for passing data between servers.&lt;/p&gt;
&lt;p&gt;Using such &amp;#8217;services&amp;#8217; enables repository services to be used from within environments other than the repository itself (so, if you wanted to deposit an item in a repository from within a research management application of some kind, for example). It also enables repositories to use other systems&amp;#8217; services. One example given of this was look up of file format information from the National Archives&amp;#8217; &lt;a rel="nofollow" target="_blank" href="http://www.nationalarchives.gov.uk/PRONOM/Default.aspx"&gt;PRONOM&lt;/a&gt; database of file formats.&lt;/p&gt;
&lt;p&gt;From discussion, the general view appeared to be that Web Services do have a role to play in aiding integration of repositories with other systems and avoiding &amp;#8217;silos&amp;#8217;. However, achieving such integration raises issues such as:&lt;/p&gt;
&lt;p&gt;- having sufficient access to technical expertise&lt;br /&gt;
- ensuring good communication between repository managers and developers&lt;br /&gt;
- focussing on real user needs.&lt;/p&gt;
&lt;p&gt;None of these are easy issues to address. Two of the speakers came from institutions which are members of the &lt;a rel="nofollow" target="_blank" href="http://www.nls.uk/sdlc/"&gt;Scottish Digital Library Consortium&lt;/a&gt; and they noted the value of libraries &amp;#8216;clubbing together&amp;#8217; to share available technical expertise and resources.&lt;/p&gt;</description>
         <author>Ben Wynne</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/06/03/web-services-and-repositories/</guid>
         <pubDate>Wed, 03 Jun 2009 04:12:20 -0700</pubDate>
      </item>
      <item>
         <title>The Great Divide : A Rant</title>
         <link>http://feedproxy.google.com/~r/JimDowning/~3/LtzNUxj2Pf0/</link>
         <description>There&amp;#8217;s a good posting on the Open Science Project called Scientific Software Wants To Be Free. I generally agree with their manifesto, we&amp;#8217;ve kicked similar ideas around the pubs of Cambridge for a while, without such concrete suggestions. In the comments, Geoff Hutchison makes an excellent point : - I think these are also key points: Software releases [...]</description>
         <guid isPermaLink="false">http://wwmm.ch.cam.ac.uk/blogs/downing/?p=323</guid>
         <pubDate>Thu, 28 May 2009 02:06:06 -0700</pubDate>
         <content:encoded><![CDATA[<p>There&#8217;s a good posting on the Open Science Project called <a rel="nofollow" target="_blank" href="http://www.openscience.org/blog/?p=270">Scientific Software Wants To Be Free</a>. I generally agree with their manifesto, we&#8217;ve kicked similar ideas around the pubs of Cambridge for a while, without such concrete suggestions.</p> <p>In the comments, <a rel="nofollow" target="_blank" href="http://www.openscience.org/blog/?p=270#comment-19043">Geoff Hutchison makes an excellent point</a> : -</p> <blockquote>
<p>I think these are also key points:</p> <ul><li>Software releases also rarely garner wide recognition, and without an accompanying paper to cite, the work cannot be paid back in citation currency.</li>
<li>Additionally, emphasizing one’s work on software or publishing methods papers carries a risk of being perceived as a programmer ﬁrst and scientist second, falling on the wrong side of the technician/scientist divide.</li></ul> <p>I have been told to be sure to publish “real chemistry” more, even though I want to get “citation currency” for open projects. After all, without publications or citations, I can’t get grant support for these activities either.</p> <p>So is the trick to get grants for “real research” that have “broader impacts” in software development?</p></blockquote> <p>I sympathize with Geoff. I have friends and colleagues whose academic careers have been made difficult, jeopardized or even ended because they gained technology expertise in order to make their research more effective, and ended up being perceived as too techie to be &#8220;proper scientists&#8221;.</p> <p>The problem, as I see it, is the very existence of a &#8220;scientist/technician&#8221; divide. I&#8217;m beginning to suspect that the problems software engineers have with existing metric systems based on journal articles are more the symptom than the cause of a lack of respect and recognition afforded to technical staff in academia, and that fixing it through software journals and the like will be a temporary palliative cure.</p> <p>Perhaps I&#8217;m just getting more bolshy the more streaks of grey appear, but perhaps academia needs to change to embrace software engineering, rather than software engineering twisting itself out of shape to fit in with the anachronisms of scientific research.</p> <p>Admittedly, I&#8217;m pretty short on &#8220;how&#8221;. The publication / citation model clearly isn&#8217;t working so well for researchers any more - perhaps fundamental changes there will help. Funding of the sort JISC, and the e-Science programme certainly help provide support and some credibility, but you still need a Principal Investigator, and if they&#8217;re academic they risk being dragged toward the divide for their pains. Post-grad training could also help to change the culture from the ground up. The JISC-CRIG Dev8D initiative is a band-aid for this very problem; an extra-mural support network to give developers some well-deserved recognition they don&#8217;t necessarily receive in their own institutions. Beyond vague ideas and short term fixes, I&#8217;ve got nothing. </p> <p>It&#8217;s an important issue though, because so much of modern research depends on software development, and when (not if) the software / technical talent walks out of the door to work in industrial research (where it is respected), how will academia compete?</p><img src="http://feeds.feedburner.com/~r/JimDowning/~4/LtzNUxj2Pf0" height="1" width="1"/>]]></content:encoded>
      </item>
      <item>
         <title>IMS's three-pronged strategy</title>
         <link>http://www.cetis.ac.uk/members/scott/blogview?entry=20090525100440</link>
         <description>&lt;p&gt;Two weeks ago I attended the IMS Learning Impact conference in Barcelona; this was one of the first IMS events I'd attended in some time, and I've tried to put together some thoughts on the IMS technical strategy. &lt;/p&gt; &lt;p&gt;Now a lot of discussion around IMS is about its policies, politics, membership, processes and so on, but I'm not going into any of that here; for now I want to focus on specifications. (Note also this is a personal reflection, and doesn't necessarily represent the view of JISC or CETIS)&lt;/p&gt; &lt;p&gt;IMS currently sets out its stall as offering three key "products" in its specification portfolio; together they make up its "&lt;a rel="nofollow" target="_blank" href="http://www.imsglobal.org/digitallearningservices.html"&gt;Digital Learning Services Standards&lt;/a&gt;" portfolio.&lt;/p&gt; &lt;h4&gt;Common Cartridge&lt;/h4&gt; &lt;p&gt;The first product is the &lt;a rel="nofollow" target="_blank" href="http://www.imsglobal.org/cc/index.html"&gt;IMS Common Cartridge&lt;/a&gt;. I remember this originating in discussions way back as to whether there should be a SCORM-like profile of standards (packaging, metadata, runtime and so on) that was better suited to Higher Education. Its come a long way since, and has gathered a lot of influential support. However, ultimately it is still a means of putting a bunch of web pages into a zip file to import into an LMS, which seems increasingly an odd thing to do. &lt;/p&gt; &lt;p&gt;CC does add some interesting capabilities - it adds the QTI (Question and Test Interoperability) specification to the profile, enabling automated assessments along with content (something ADL was considering for SCORM a few years back but never got around to), and it also adds a capability for adding "tools" using a version of the IMS Learning Tools Interoperability specification (more on which later). However it misses out things like SCORM's tracking functionality and CMI runtime, which have been a major selling point of SCORM.&lt;/p&gt; &lt;p&gt;Overall CC is an odd mix (one of my colleagues referred to it as the "curate's egg" specification) but is quite likely to gain some traction with publishers and commercial LMSs. But what will the impact be? I'm not convinced a market for common cartridges will open up in HE in the same manner as occurred in commercial training with SCORM; they really are quite different. However, IMS is putting a lot of effort into marketing CC to get adoption, so I remain open minded. &lt;/p&gt; &lt;p&gt;For the moment the key question is whether OER and CC are a natural fit - clearly the UK's Open University considers it an option.&lt;/p&gt; &lt;h4&gt;Learner Information Services&lt;/h4&gt; &lt;p&gt;IMS Learner Information Services is the latest revision of the IMS Enterprise specification, which was one of the first IMS specifications and the first I was personally involved in. IMS LIS, like its predecessor, is focussed on the connection between an LMS and a student record system; this means being able to provide groups and users to the LMS and to handle reporting back for things like final grades or tracking data. LIS extends Enterprise with new services for areas like course structures (a bit similar to &lt;a rel="nofollow" target="_blank" href="http://xcri.org"&gt;XCRI&lt;/a&gt;, though not compatible with it) and continues the service-oriented approach that started with IMS Enterprise Services 1.0. &lt;/p&gt; &lt;p&gt;However, IMS LIS also plans to add batch-file bindings that would also enable REST services; this is important, as some of the most successful applications of IMS Enterprise have focussed on simple REST services and batch processing, and I'm glad to see IMS is recognising this and providing official support.&lt;/p&gt; &lt;h4&gt;Learning Tools Interoperability&lt;/h4&gt; &lt;p&gt;IMS LTI v2.0 is a specification for enabling LMS's to include external applications running in iFrames that can communicate with the host LMS for things like user information. (There isn't a page for v2.0 yet, but there is a description of LTI &lt;a rel="nofollow" target="_blank" href="http://www.imsglobal.org/activities.html"&gt;on this page&lt;/a&gt;)&lt;/p&gt; &lt;p&gt;If you've been following this blog you'll know that I've been contributing to the W3C Widgets familiy of specifications, and have been keen to point out that the IMS specification is rather similar, begging the question as to why IMS is bothering to reinvent this particular wheel. Part of the reason is historical - IMS LTI started earlier than W3C's activity (although that itself is based on earlier technologies, such as Apple Dashboard Widgets and Yahoo! Konfabulator) and so has had time to diverge from common practice. Its also harder to backtrack on the legacy of its "version 1.0" which was really an exploration of a possible common extension mechanism for (back then) WebCT and Blackboard (and later Sakai). Another part of the reason is that many IMS members are not really that well connected into web standards generally, and even though I personally prompted the IMS working group to look into Widgets they never really managed to connect it with what they were working on.&lt;/p&gt; &lt;p&gt;I think IMS is putting a lot of pressure on LTI to succeed, although I think its fundamentally misconceived. For example, the current IMS LTI 2.0 document set consists of 20 word documents covering everything from inter-widget messaging to APIs for passing "outcomes" from widgets to the LMS - most of it involving lots of SOAP and WSDL. Using the W3C Widgets specification I reckon this could be whittled down to one or two very basic documents for things like common education vocabularies and how to use Widgets with an open API on the LMS end (or more likely, just IMS LIS with a REST binding). Already I've seen a lot of European projects working with W3C Widgets and Google OpenSocial to deliver IMS-LTI-like functionality; we've also been working with Sakai 3 and Moodle to integrate both Widgets and OpenSocial applications. Perhaps the main value IMS could contribute would be to sort out the REST APIs that Widgets could call to enable tracking.&lt;/p&gt; &lt;p&gt;Its unlikely that IMS would embrace W3C for LTI for a number of reasons, mostly that the whole approach doesn't just invalidate most of the effort on IMS LTI 2.0, it also calls into question the whole "DIgital Learning Services" strategy: if we used web standards, plus a REST API for things like cohorts, would we even need IMS Common Cartridge or IMS LTI &lt;em&gt;at all&lt;/em&gt;? From this perspective, the IMS strategy seems less about opening up education systems so much as supporting an education technology silo into which a few suppliers can offer services with limited external competition. After all, who apart from a few established players in the HE sector is going to implement those 20 IMS LTI documents?&lt;/p&gt; &lt;h4&gt;The Missing Specifications&lt;/h4&gt; &lt;p&gt;As well as what is IN the IMS strategy, its worth considering for a moment what's OUT of the strategy:&lt;/p&gt; &lt;p&gt;&lt;b&gt;IMS Learning Design&lt;/b&gt; is one of the most widely discussed specifications in a European context (or certainly seems to be in the circles I move in) and yet is completely absent. It had some discussion in relation to a K-12 profile of Common Cartridge, but I don't think this progressed anywhere. Its especially unusual that IMS isn't promoting any further development or marketing for LD, given that quite a few of the entrants to its own Learning Impact awards were using LD.&lt;/p&gt; &lt;p&gt;&lt;b&gt;IMS ePortfolio&lt;/b&gt; was a relatively recent specification, but was released with some fundamental problems. So far there has been insufficient interest from members to fix it, and CETIS has switched its efforts to looking into practical interoperability between ePortfolio applications using Atom feeds (see &lt;a rel="nofollow" target="_blank" href="http://wiki.cetis.ac.uk/Portfolio_interoperability_prototyping"&gt;PIOP&lt;/a&gt;).&lt;/p&gt; &lt;p&gt;&lt;b&gt;IMS QTI 2.x&lt;/b&gt; is largely completed, but was recently withdrawn by IMS and then reinstated after a number of complaints. I think its fair to say IMS doesn't really know what to do with it, given that IMS CC uses the older 1.x version of QTI.&lt;/p&gt; &lt;p&gt;Finally, IMS has a number of specifications that really should be retired - IMS LIP, IMS SSP, IMS RLI, IMS VDEX come to mind.&lt;/p&gt; &lt;h4&gt;The Rub?&lt;h4&gt; &lt;p&gt;Overall the IMS specifications seem oddly out of step with the wider web. There is still an adherence to SOAP and WSDL doctrine that has slid rapidly into the Trough of Despond elsewhere. Where REST APIs are considered, they are a bit of an add-on rather than at the core - IMS specifications are not based on web architecture, irrespective of binding. And, most critically, there is a big gap between IMS and web standards, as evidenced by the disconnect between IMS LTI 2.0 and W3C Widgets (or even Google OpenSocial). This was pointed out at the plenary of the event by Mark Stiles (who chairs the JISC-CETIS board as well as being on the IMS board of directors): IMS really needs to work more closely with horizontal standards rather than build a silo.&lt;/p&gt; &lt;p&gt;I think IMS LIS is worth a look when it comes out, as is Common Cartridge (though in the short term we can easily convert Cartridges to the more commonly-supported formats such as SCORM 1.2 and using the &lt;a rel="nofollow" target="_blank" href="http://blogs.cetis.ac.uk/sheilamacneill/2008/10/10/content-transcoder-demonstration/"&gt;Content Transcoder&lt;/a&gt;). However I'd give IMS LTI 2.0 a miss.&lt;/p&gt; &lt;p&gt;But more generally, is IMS too old-fashioned to remain relevant? Or is its focus on rather "unfashionable" areas of education technology a good bet for longer-term survival? A lot will depend on the success - or otherwise - of the current three-spec strategy&lt;/p&gt;&lt;/h4&gt;&lt;/h4&gt;</description>
         <author>Scott Wilson</author>
         <guid isPermaLink="false">tag:cetis.ac.uk,2009-05-25:20090525100440</guid>
         <pubDate>Sun, 24 May 2009 17:00:00 -0700</pubDate>
      </item>
      <item>
         <title>Open Repositories 2009 Day 4, Group 2</title>
         <link>http://feedproxy.google.com/~r/typepad/mleggott/loomware/~3/yeCDg7KShSw/open-repositories-2009-day-4-group-2.html</link>
         <description>&lt;div&gt;&lt;p&gt;Matthias Razum (FIZ Karlsruhe) gave &lt;em&gt;A Closer Look at Fedora's Ingest Performance&lt;/em&gt;. The group used vanilla hardware with single processor and 2 GB RAM to look at ingest speeds and optimization options. The ingest consisted of about 4.9 million objects/500 million triples (PDFs from the patent database, which took 3 weeks to ingest) CPU was not really the limiting factor, it was I/O. There was no difference from JDK 1.5 to 1.6. There was no real difference between the various triplestores or no triplestore, meaning that using triples does not add significant overhead. The most promising areas of optimization were with Postgres tuning - they switched off Postgres's ability to respond when the machine goes down during an operation. This resulted in a highly significant change in ingest rates (130ish ms compared to 40ish ms). With MySQL tuning the InnoDB/MyISAM tables resulted in similar levels of performance improvement. Putting the DB on a separate machine, even with network overhead had a significant improvement as well. Other findings: there was absolutely no impact with a growing number of objects indicating the scalability of Fedora; combination of I/O (re database) and other tuning can see an improvement factor of 4. Another thing the group has considered is creating a number of Fedora instances and then merging the indexes later. Dan Davis provided an update on the work with Sun and highlighted the conclusions of the Karlsruhe work. They will be using the open source &lt;a rel="nofollow" target="_blank" href="http://grinder.sourceforge.net/"&gt;Grinder&lt;/a&gt; app to create a testbed for ongoing work in this area.&lt;/p&gt; &lt;p&gt;Gert Schmeltz Pedersen (Technical University of Denmark) spoke about &lt;em&gt;Fedora and GSearch in a Research Project about Integrated Search&lt;/em&gt;. Gert looked at integrating multiple Fedora/GSearch implementations in a federated search kind of opportunity. Zoned on this one - to much data on little slides.&lt;/p&gt; &lt;p&gt;Tom Cramer (Stanford University), Richard Green (University of Hull), Lynn McRae (Stanford University), Tim Sigmon (University of Virginia), Ross Wayland (University of Virginia) presented on &lt;em&gt;Case Studies in Repository Workflows: Three Approaches&lt;/em&gt;. This is critical stuff for the Fedora community - workflows are what it is all about and I think will an area of major activity for the next couple of years. One nice thing about the new Hydra project is the intention to build a standard workflow tool and the fact that the 3 partners are each using a different framework for building workflows means they have a greater chance of coming up with something cool. See: being different is good :-)&lt;/p&gt; &lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=yeCDg7KShSw:YmUzqA6GkBA:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?d=yIl2AUoC8zA" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=yeCDg7KShSw:YmUzqA6GkBA:F7zBnMyn0Lo"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?i=yeCDg7KShSw:YmUzqA6GkBA:F7zBnMyn0Lo" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=yeCDg7KShSw:YmUzqA6GkBA:V_sGLiPBpWU"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?i=yeCDg7KShSw:YmUzqA6GkBA:V_sGLiPBpWU" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=yeCDg7KShSw:YmUzqA6GkBA:EpLpB3ZkKWg"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?d=EpLpB3ZkKWg" border="0"&gt;&lt;/a&gt;
&lt;/div&gt;</description>
         <author>mleggott</author>
         <guid isPermaLink="false">tag:typepad.com,2003:post-67130277</guid>
         <pubDate>Thu, 21 May 2009 16:30:16 -0700</pubDate>
      </item>
      <item>
         <title>Agile Languages &amp; Fedora — Update from OR09</title>
         <link>http://feedproxy.google.com/~r/MediashelfBlog/~3/T6ptEsFdP3A/</link>
         <description>Leading up to this year's Open Repositories, it became clear that there was demand for a BOF (Birds of Feather) session focused on agile languages and Fedora. I pitched the idea in an email to a couple colleagues beforehand and then announced the BOF at my presentation on Monday morning. Rather than constricting it to Fedora, I billed it as Agile Languages and Repositories. About 30 people showed up. We all exchanged info and I set up some new collaboration email lists.</description>
         <guid isPermaLink="false">http://yourmediashelf.com/blog/?p=147</guid>
         <pubDate>Tue, 19 May 2009 22:41:42 -0700</pubDate>
         <content:encoded><![CDATA[<p>Leading up to this year&#8217;s <a rel="nofollow" title="Open Repositories 2009" target="_blank" href="https://or09.library.gatech.edu/">Open Repositories</a>, it became clear that there was demand for a <a rel="nofollow" title="wikipedia: Birds of a Feather (computing)" target="_blank" href="http://en.wikipedia.org/wiki/Birds_of_a_Feather_(computing)">BOF</a> (Birds of Feather) session focused on agile languages and Fedora. I pitched the idea in an email to a couple colleagues beforehand and then announced the BOF at <a rel="nofollow" title="Many Lightweight Views into Complex Repository Content: Enabling Rapid Application Development for Fedora Repositories" target="_blank" href="https://or09.library.gatech.edu/general114.php">my presentation</a> on Monday morning. Rather than constricting it to Fedora projects, I billed it as Agile Languages and Repositories. About 30 people showed up. The split was pretty even between Ruby, Python, and PHP developers. About a third seemed to be Java developers in the process of defecting. In addition to people doing stuff with Fedora, there were a handful of DSpace developers and possibly a couple who maintain ePrints repositories. </p>
<p>For the first half of the BOF we sat in mixed groups, eating our lunches and each talking about the work we do. We then split up by language (Ruby, Python, PHP) and discussed language-specific topics. For that second half I sat at the Ruby table where we talked about <a rel="nofollow" title="ActiveFedora" target="_blank" href="http://mediashelf.us/activefedora">ActiveFedora</a>, <a rel="nofollow" title="JRuby Home" target="_blank" href="http://jruby.codehaus.org">JRuby</a>, <a rel="nofollow" title="Jena Ruby bindings: accessing Jena&#x002019;s feature rich RDF api from Ruby" target="_blank" href="http://www.thewebsemantic.com/2008/01/05/jena-ruby-bindings-accessing-jenas-feature-rich-rdf-api-from-ruby/">RDF support for Ruby</a>, MODS support for Ruby, Solr (<a rel="nofollow" title="solr-ruby" target="_blank" href="http://wiki.apache.org/solr/solr-ruby">solr-ruby</a> and <a rel="nofollow" title="RSolr" target="_blank" href="http://github.com/mwmitchell/rsolr/tree/master">RSolr</a>), and how <a rel="nofollow" title="Blacklight OPAC Homepage" target="_blank" href="http://blacklightopac.org">Blacklight</a> fits into the mix. </p>
<p>I closed the conversation by asking if we should set up email lists for collaboration. It seemed reasonable to set up a general mailing list for the solutions community as well as a list specifically for people doing stuff with Ruby, Fedora repositories, and (most likely) ActiveFedora. I also resolved to encourage the creation of Python-oriented and PHP-oriented equivalents. For now I have created two lists on Google Groups. The first one, <a rel="nofollow" title="Fedora Commons Create Group" target="_blank" href="http://groups.google.com/group/fedora-commons-create">Fedora Commons Create</a>, is for general discourse about creating client applications for Fedora. The second, <a rel="nofollow" title="ActiveFedora Group" target="_blank" href="http://groups.google.com/group/active-fedora">ActiveFedora / Ruby + Fedora Commons</a>, is for Ruby-specific collaboration.</p>
<p>In the end, I was really pleased to realize that for the first time we had a substantial group of people interested in each of the main interpreted languages (Ruby, Python, PHP) and each group had at least one open source Fedora-based project to use as a starting point for their conversations. The Ruby group had <a rel="nofollow" title="ActiveFedora" target="_blank" href="http://mediashelf.us/activefedora">ActiveFedora</a>, the Python group had <a rel="nofollow" title="Less Talk, More Code" target="_blank" href="http://oxfordrepo.blogspot.com/">Ben O&#8217;Steen&#8217;s work</a> and <a rel="nofollow" title="Fedora and Django for an image repository: a new front-end" target="_blank" href="https://or09.library.gatech.edu/fedora69.php">Peter Herndon&#8217;s Django integration</a>, and the PHP/Drupal people had <a rel="nofollow" title="Islandora" target="_blank" href="http://vre.upei.ca/dev/islandora">Islandora</a> &amp; <a rel="nofollow" title="Fez" target="_blank" href="http://sourceforge.net/projects/fez/">Fez</a> to start from. </p>
<p>This was a comfortable step forward from <a rel="nofollow" title="Inroads to Application Development for Fedora Commons" target="_blank" href="http://yourmediashelf.com/blog/2008/05/14/inroads-to-application-development-for-fedora-commons/">the scenario as it was a year ago</a>.</p>
<table style="background-color:#fff;padding:5px;" border="0" cellspacing="0">
<tbody>
<tr>
<td><img src="http://groups.google.com/groups/img/3nb/groups_bar.gif" alt="Google Groups" width="132" height="26"/></td>
</tr>
<tr>
<td style="padding-left:5px;font-size:125%;"><strong>Fedora Commons Create</strong></td>
</tr>
<tr>
<td style="padding-left:5px;"><a rel="nofollow" target="_blank" href="http://groups.google.com/group/fedora-commons-create">Visit this group</a></td>
</tr>
</tbody>
</table>
<table style="background-color:#fff;padding:5px;" border="0" cellspacing="0">
<tbody>
<tr>
<td><img src="http://groups.google.com/groups/img/3nb/groups_bar.gif" alt="Google Groups" width="132" height="26"/></td>
</tr>
<tr>
<td style="padding-left:5px;font-size:125%;"><strong>ActiveFedora / Ruby + Fedora Commons</strong></td>
</tr>
<tr>
<td style="padding-left:5px;"><a rel="nofollow" target="_blank" href="http://groups.google.com/group/active-fedora">Visit this group</a></td>
</tr>
</tbody>
</table>
<img src="http://feeds.feedburner.com/~r/MediashelfBlog/~4/T6ptEsFdP3A" height="1" width="1"/>]]></content:encoded>
      </item>
      <item>
         <title>Auto paste of JISCBIDS spreadsheet marks.</title>
         <link>http://feedproxy.google.com/~r/DFFlanders/~3/BOSZnsoWIY4/</link>
         <description>So @cardcc suggested at #rpmeet that most everyone was using an offline template spreadsheet to mark #jiscbids and that it would be a real time saver if there was a way to auto copy and paste the offline spreadsheet in one go. Well I don&amp;#8217;t have a one go solution but I do have a [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&amp;blog=532341&amp;post=184&amp;subd=dfflanders&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://dfflanders.wordpress.com/?p=184</guid>
         <pubDate>Mon, 11 May 2009 13:52:22 -0700</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><p>So @cardcc suggested at #rpmeet that most everyone was using an offline template spreadsheet to mark #jiscbids and that it would be a real time saver if there was a way to auto copy and paste the offline spreadsheet in one go. Well I don&#8217;t have a one go solution but I do have a three step solution that I was able to piece together over the weekend (while marking bids).</p>
<p>DISCLAIMER: THIS IS ONLY A PROTOTYPE AND IS NOT THE OFFICIAL WAY TO SUBMIT MARKS FOR JISCBIDS, THIS IS ONLY AN PROTOTYPE TO SHOW HOW THIS MIGHT BE ACHIEVED. YOU STILL MUST SUBMIT BIDS USING THE CURRENT APPROVED FORM.</p>
<p>None the less, if you would like to help me tweak this form so that it could be useable in the future then please follow the below instructions:</p>
<p>1.)<a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/ccc?key=rw_g4TI9EwZ6fjZ00GXh_-A"> Download this Excel Spreadsheet</a>. This spreadsheet is the marking template. Use the template spreadsheet exactly as it is set up. If you change any of the headings rows or columns it will render the spreadsheet useless.</p>
<p>2.) Once you have completed filling out the template spreadsheet<a rel="nofollow" target="_blank" href="http://www.screencast.com/users/DFFlanders/folders/Jing/media/49858083-418a-439b-b468-0300658a35c1"> follow this screencast (w/audio) step by step</a> for uploading to GoogleDocs. The above screencast will also show you how to publish an RSS feed from the document. TO NOTE: this RSS feed must be a &#8220;cell&#8221; rss feed so the data can be sliced and diced in multiple way so that it is exposed on the backend system to multiple outputs. Eventually, I&#8217;ll enable it for just the simple url for the SS, for now please just make sure you are exposing the right RSS feed.</p>
<p>3.) Once you have the RSS feed<a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/viewform?hl=en&amp;formkey=clZMcDQ3eEZxZHlMVEt4NWtiUjVOUXc6MA.."> go to this form and fill it out</a>. Job done, well then there is a lot of back end processing that takes place to parse your spreadsheet into the various forms it needs to go to as part of the next step in the marking workflow.</p>
<p>Resources used in this work:</p>
<ul>
<li>http://docs.google.com/support/bin/answer.py?hl=en&amp;answer=75507y</li>
<li>jQuery and the joy of arrays <img src='http://s.wordpress.com/wp-includes/images/smilies/face-smile.png' alt=':)' class='wp-smiley'/> </li>
</ul> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/dfflanders.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dfflanders.wordpress.com/184/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/dfflanders.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dfflanders.wordpress.com/184/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/dfflanders.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dfflanders.wordpress.com/184/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/dfflanders.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dfflanders.wordpress.com/184/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/dfflanders.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dfflanders.wordpress.com/184/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&blog=532341&post=184&subd=dfflanders&ref=&feed=1"/></div>]]></content:encoded>
         <media:content url="http://0.gravatar.com/avatar/8cdc26501af3b7e571a4d23f0afbbf08?s=96&amp;amp;d=identicon&amp;amp;r=G" medium="image">
            <media:title>dfflanders</media:title>
         </media:content>
      </item>
      <item>
         <title>The &amp;#8216;library management system&amp;#8217;: a round up of some (fairly) recent initiatives</title>
         <link>http://infteam.jiscinvolve.org/2009/05/08/the-library-management-system-a-round-up-of-some-fairly-recent-initiatives/</link>
         <description>&lt;p&gt;This is rather a long post - but it is on a big subject!&lt;/p&gt;
&lt;p&gt;Last year saw the publication by the JISC and SCONUL of a major report on &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/resourcediscovery/libraryms.aspx"&gt;library management systems&lt;/a&gt; used in UK higher education. A lot has happened since. So, I thought it might be useful to post an update - even a long one - on some related recent JISC activities.&lt;/p&gt;
&lt;p&gt;The LMS report highlighted a number of issues to be addressed in order to make library resources as visible and accessible as possible in the digital environment, including:&lt;/p&gt;
&lt;p&gt;•	integrating library systems and other university systems so that relevant data can be shared between them and users can access services in different ways (within university portals, virtual learning environments, etc.)&lt;/p&gt;
&lt;p&gt;•	making library metadata (catalogue records) available to other services (such as web search engines and collaborative web tools) so that information about library resources can be easily found, used and personalised by users where and how they choose&lt;/p&gt;
&lt;p&gt;•	using library usage data (information about how many times an item has been borrowed and by what kind of user, for example) to indicate how individual library resources are being used, thereby providing a basis for user ‘rating’ for library resources – and doing this at a ‘network level’ (so not limited to one library) in order to have access to enough meaningful data&lt;/p&gt;
&lt;p&gt;•	improving the quality of search interfaces and reducing the number of different systems which users need to search to access the full range of resources available to them.&lt;/p&gt;
&lt;p&gt;JISC is just one partner in seeking to address these issues with vendors of library systems developing new discovery tools, &lt;a rel="nofollow" target="_blank" href="http://www.oclc.org/us/en/worldcatlocal/default.htm"&gt;OCLC’s recent announcement&lt;/a&gt; about using WorldCat as the basis for effectively a shared library management system and &lt;a rel="nofollow" target="_blank" href="http://www.sconul.ac.uk/news/lms_apr09"&gt;SCONUL&lt;/a&gt; investigating the potential for shared library management systems, for example.&lt;/p&gt;
&lt;p&gt;So, what has been happening on the JISC front?&lt;/p&gt;
&lt;p&gt;The &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/resourcediscovery/tile.aspx"&gt;TILE (Towards Implementation of Library 2.0 and the e-Framework)&lt;/a&gt; project has investigated the use of Web 2.0 tools by library services, outlined a possible new model (a ‘library domain model’) for how library services might fit in the wider information environment (to help inform further development of library systems) and proposed testing the extraction and combination of usage data from a number of university systems to provide the bases of user ratings for library resources.&lt;/p&gt;
&lt;p&gt;As you may know from some earlier posts, reports from the TILE project are now available on the JISC web site and there is a &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/events/2009/06/tile.aspx"&gt;workshop&lt;/a&gt; on the domain model planned for June 19. Further work is planned to test extracting and combining usage data as noted above.&lt;/p&gt;
&lt;p&gt;Meanwhile, EDINA and MIMAS are developing the search interfaces for SUNCAT (the national serials catalogue) and COPAC (the national research libraries’ catalogue) as part of the &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/resourcediscovery/d2dateandm.aspx"&gt;D2D (Discovery to Delivery)&lt;/a&gt; project. The work includes providing Web 2.0 and other tools to enable users to share metadata and to link from bibliographic records to ‘the thing itself’ (or a means of obtaining it). The project partly builds on the recommendations and findings of the &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/resourcediscovery/dpie1.aspx"&gt;DPIE (Development of Personalisation for the Information Environment)&lt;/a&gt; investigations of 2008.&lt;/p&gt;
&lt;p&gt;Continuing the theme of ‘joining up’, the JISC Scholarly Communications Working Group recently commissioned a &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/fundingopportunities/funding_calls/2009/02/opacs.aspx"&gt;study into the existing and potential links between library management systems and digital repositories&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;And the winning entry at the recent Dev8D Web Developers event was a &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/news/stories/2009/04/dev8d.aspx"&gt;prototype reading list system&lt;/a&gt; which aimed to make it easier to extract information for reading lists from catalogues, Amazon and elsewhere and then make the reading lists available in different applications. &lt;/p&gt;
&lt;p&gt;Following the major &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/resourcediscovery/googlegen.aspx"&gt;‘Google generation’ report&lt;/a&gt; of early 2008, efforts also continue to understand what information services people actually need with the launch of a new British Library/JISC &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/news/stories/2009/01/ygen.aspx"&gt;study of young researchers’ use of online and physical information environments&lt;/a&gt;. JISC’s Publishers’ Action Group has also commissioned an &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/inf11/userbehaviourbusandecon.aspx"&gt;observational study&lt;/a&gt; of how individual students and academic staff in Business and Economics use electronic information resources. &lt;/p&gt;
&lt;p&gt;So, there is a lot going on - and that is just within a JISC context. If nothing else, the activity in this area illustrates that the &amp;#8216;library management system&amp;#8217; is even more the subject of debate than it was when the JISC/SCONUL study was commissioned not so very long ago &amp;#8230; &lt;/p&gt;</description>
         <author>Ben Wynne</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/05/08/the-library-management-system-a-round-up-of-some-fairly-recent-initiatives/</guid>
         <pubDate>Fri, 08 May 2009 08:59:11 -0700</pubDate>
      </item>
      <item>
         <title>Government of Canada internal IT Innovation Campaign</title>
         <link>http://feedproxy.google.com/~r/ScienceLibraryPad/~3/iY41WW9r_9g/government-of-canada-internal-it-innovation-campaign.html</link>
         <description>&lt;div&gt;&lt;p&gt;The Government of Canada has opened what I would call an ideas market, a system to submit ideas, vote on them, and comment on them. The Campaign is internal only and for IT staff only. You can see a screenshot (by permission of the GC and the developers, &lt;a rel="nofollow" target="_blank" href="http://www.publivate.com/"&gt;Publivate&lt;/a&gt;) &lt;/p&gt; &lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/rakerman/3505237714/" title="GC Innovation Campaign - Mozilla Firefox 2009-05-05 15824 PM - edit by rakerman, on Flickr"&gt;&lt;img alt="[GC Innovation Campaign - Mozilla Firefox 2009-05-05 15824 PM - edit]" height="416" src="http://farm4.static.flickr.com/3568/3505237714_bdb8a7b0a0.jpg" width="500"&gt;&lt;/a&gt; &lt;/p&gt;&lt;p&gt;Rather than give a URL which most of you can't access anyway, I'll just suggest that if you're in the Government of Canada, you can find more information and the link to the site by searching for "innovation campaign" on GCPEDIA (which is also accessible within GC only). The site opened for submissions of ideas May 4, and will close May 29, as you can see from the countdown bar in the upper right.&lt;/p&gt;&lt;p&gt;Like GCPEDIA itself, I consider this a great development, showing a government willing to embrace risk, try new technologies, and draw upon the expertise of the community (about 18,000 federal IT specialists, in this case). I think this is a measured approach, and I certainly would expect that if successful, it will lead to more consultations more broadly both within the government as well as ones open to all Canadian citizens.&lt;/p&gt;&lt;p&gt;I think this approach is really effective for breaking silos and circulating information - in some of the ideas already, a few things being proposed turn out to already be available, people weren't just aware of them elsewhere in the government.&lt;/p&gt;&lt;p&gt;I'm using the hashtag &lt;a rel="nofollow" target="_blank" href="http://search.twitter.com/search?q=%23gcitic"&gt;#gcitic&lt;/a&gt; to discuss and ask questions about the campaign on Twitter.&lt;/p&gt;&lt;p&gt;UPDATE 2009-05-06: I should mention another innovation aspect of this site, which is that it is the first time I've seen machine translation use in an official way. Currently all Government of Canada websites must have all text in both official languages, which is usually done through manual translation. This makes it basically impossible to have a dynamic site with constant changes. If we're allowed to use machine translation, it will make it much easier to bring up e.g. public blogs. Now that being said, the translation engine they are using is Google Translate, which is ok for a free translator but is by no means perfect. I know there's tons of work being done on machine translation at NRC and elsewhere in the government - it would be nice if there was a standard machine translation service available that we could also use... hmm... I think I'll submit an idea... ENDUPDATE&lt;/p&gt;&lt;p&gt;Google uses this approach internally, their site is called simply Google Ideas.&lt;/p&gt; &lt;p&gt;&lt;img src="http://blogoscoped.com/files/tools-presentation/google-ideas.png"&gt;&lt;br&gt;(Screenshot via blogoscoped.com - &lt;a rel="nofollow" target="_blank" href="http://blogoscoped.com/archive/2008-03-12-n39.html"&gt;The Tools Google Uses Internally&lt;/a&gt;)&lt;/p&gt;&lt;p&gt;I have seen this approach used to some extent in the library community, the example I always point to is JISC &lt;a rel="nofollow" target="_blank" href="http://jiscrepository.ideascale.com/"&gt;Repository Ideas&lt;/a&gt; (which is still up, but no longer active). There are probably many others that I have missed.&lt;/p&gt;&lt;p&gt;The Obama administration has also used this approach a number of times, see e.g. my recent posting about the &lt;a rel="nofollow" target="_blank" href="http://scilib.typepad.com/science_library_pad/2009/04/recoverygov-crowdsources.html"&gt;National Dialogue to gather IT ideas for the Recovery.gov site&lt;/a&gt;.&lt;/p&gt;&lt;p&gt;Previously:&lt;br&gt;November 03, 2008 &lt;a rel="nofollow" target="_blank" href="http://scilib.typepad.com/science_library_pad/2008/11/government-of-c.html"&gt;Government of Canada launches official wiki for federal employees&lt;/a&gt;&lt;/p&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=yIl2AUoC8zA" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:F7zBnMyn0Lo"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?i=iY41WW9r_9g:mgGwmHOR4iQ:F7zBnMyn0Lo" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:qj6IDK7rITs"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=qj6IDK7rITs" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:EpLpB3ZkKWg"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=EpLpB3ZkKWg" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:V-t1I-SPZMU"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=V-t1I-SPZMU" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:XAVGb8Xj5zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=XAVGb8Xj5zA" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:dnMXMwOfBR0"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=dnMXMwOfBR0" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:V_sGLiPBpWU"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?i=iY41WW9r_9g:mgGwmHOR4iQ:V_sGLiPBpWU" border="0"&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/ScienceLibraryPad/~4/iY41WW9r_9g" height="1" width="1"/&gt;</description>
         <author>Richard Akerman</author>
         <guid isPermaLink="false">tag:typepad.com,2003:post-66409445</guid>
         <pubDate>Tue, 05 May 2009 09:44:10 -0700</pubDate>
      </item>
      <item>
         <title>cnewmark: Social media trend sweeping Washington?</title>
         <link>http://www.cnewmark.com/2009/04/social-media-trend-sweeping-washington.html</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_bb65750a1d89c821445fd965b1b3b66b</guid>
         <pubDate>Mon, 27 Apr 2009 20:27:06 -0700</pubDate>
      </item>
      <item>
         <title>CheTA PostDoc at NaCTeM</title>
         <link>http://feedproxy.google.com/~r/JimDowning/~3/ysGcn9QjydM/</link>
         <description>There&amp;#8217;s still a little time to apply for one of the two Postdoc positions on the CheTA project, our JISC-funded collaboration with NaCTeM at the University of Manchester, the Royal Society of Chemistry and Thomson Reuters. The first position is at NaCTeM. An advertisement for the second position, here in the Unilever Centre for Molecular [...]</description>
         <guid isPermaLink="false">http://wwmm.ch.cam.ac.uk/blogs/downing/?p=321</guid>
         <pubDate>Mon, 27 Apr 2009 00:46:28 -0700</pubDate>
         <content:encoded><![CDATA[There&#8217;s still a little time to apply for one of the two Postdoc positions on the CheTA project, our JISC-funded collaboration with NaCTeM at the University of Manchester, the Royal Society of Chemistry and Thomson Reuters. The <a rel="nofollow" target="_blank" href="http://www.manchester.ac.uk/aboutus/jobs/research/vacancy/index.htm?ref=153707">first position</a> is at NaCTeM. An advertisement for the second position, here in the Unilever Centre for Molecular Science Informatics, will be out shortly.<img src="http://feeds.feedburner.com/~r/JimDowning/~4/ysGcn9QjydM" height="1" width="1"/>]]></content:encoded>
      </item>
      <item>
         <title>Podcast notes: Mark Shuttleworth, Canonical, free software and wealth creation</title>
         <link>http://feedproxy.google.com/~r/DFFlanders/~3/Of4dkViCSog/</link>
         <description>Podcast notes: Mark Shuttleworth, Canonical, free software and wealth creation Originally uploaded by dff.jisc itc.conversationsnetwork.org/shows/detail3994.html
* The stimulus of innovation is disclosure: not only saying what but
how we created the innovation
* how can we drive innovation faster?
** architect platform to be extensible and imbeddable, e.g. mozilla addons
** platform tolerance, must embrace our enemies (windows) to embrace
all humans, [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&amp;blog=532341&amp;post=180&amp;subd=dfflanders&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://dfflanders.wordpress.com/2009/04/21/podcast-notes-mark-shuttleworth-canonical-free-software-and-wealth-creation/</guid>
         <pubDate>Tue, 21 Apr 2009 12:57:43 -0700</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><div style="float:right;margin-left:10px;margin-bottom:10px;">
<a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/36299951@N08/3463733906/" title="photo sharing"><img src="http://farm4.static.flickr.com/3594/3463733906_55a2ce54c2.jpg" alt="" style="border:solid 2px #000000;"/></a><br />
<br />
<span style="font-size:.9em;margin-top:0;"><br />
<a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/36299951@N08/3463733906/">Podcast notes: Mark Shuttleworth, Canonical, free software and wealth creation</a><br />
<br />
Originally uploaded by <a rel="nofollow" target="_blank" href="http://www.flickr.com/people/36299951@N08/">dff.jisc</a><br />
</span>
</div>
<p><a rel="nofollow" target="_blank" href="http://itc.conversationsnetwork.org/shows/detail3994.html">itc.conversationsnetwork.org/shows/detail3994.html</a></p>
<p>* The stimulus of innovation is disclosure: not only saying what but<br />
how we created the innovation<br />
* how can we drive innovation faster?<br />
** architect platform to be extensible and imbeddable, e.g. mozilla addons<br />
** platform tolerance, must embrace our enemies (windows) to embrace<br />
all humans, be benevolentt<br />
** bring in fresh blood, it is not neccessarily the core that makes<br />
things happen<br />
* methodology and software development processes<br />
** purpose of methodology is to harness talent<br />
** aggregate development by building interest<br />
** agile for community (not pair prog) by phil oconnor<br />
** architect by community and collaboration<br />
* permission free development to enable web services<br />
** tools must interoperate over the web, eg bugzilla, trac, launchpad<br />
** branching should be enabled to any commmunity<br />
* change in innovation drives economics<br />
** economic models: advertisement, hardware providers pay for<br />
innovation&#8230; not likely<br />
** pay for service provision?<br />
* can a community create the cutting edge experience&#8230; better than Apple?<br />
** "we should be building software that helps it users get laid"<br />
* conical investing in a beautiful experience.<br /></p> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/dfflanders.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dfflanders.wordpress.com/180/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/dfflanders.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dfflanders.wordpress.com/180/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/dfflanders.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dfflanders.wordpress.com/180/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/dfflanders.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dfflanders.wordpress.com/180/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/dfflanders.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dfflanders.wordpress.com/180/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&blog=532341&post=180&subd=dfflanders&ref=&feed=1"/></div>]]></content:encoded>
         <media:content url="http://0.gravatar.com/avatar/8cdc26501af3b7e571a4d23f0afbbf08?s=96&amp;amp;d=identicon&amp;amp;r=G" medium="image">
            <media:title>dfflanders</media:title>
         </media:content>
         <media:content url="http://farm4.static.flickr.com/3594/3463733906_55a2ce54c2.jpg" medium="image" />
         <category>Uncategorized</category>
      </item>
      <item>
         <title>Podcast notes: IT conversations, Anne Thomas Manes, ‘Is SOA Dead’</title>
         <link>http://feedproxy.google.com/~r/DFFlanders/~3/zaWLrtzcyUM/</link>
         <description>Podcast notes: IT conversations, Anne Thomas Manes, &amp;#8216;Is SOA Dead&amp;#8217; Originally uploaded by dff.jisc itc.conversationsnetwork.org/shows/detail4004.html
* We have been doing &amp;#34;Service Oriented Integration&amp;#34; which has been
focused on integration rather than architecture
* Intel paper on IT business valuation for metrics that effect bottom
line: how many people, how many hours, etc
* biggest impediment to SOA is cultural and political, not [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&amp;blog=532341&amp;post=179&amp;subd=dfflanders&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://dfflanders.wordpress.com/2009/04/21/podcast-notes-it-conversations-anne-thomas-manes-is-soa-dead/</guid>
         <pubDate>Tue, 21 Apr 2009 12:11:04 -0700</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><div style="float:right;margin-left:10px;margin-bottom:10px;">
<a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/36299951@N08/3462795633/" title="photo sharing"><img src="http://farm4.static.flickr.com/3583/3462795633_d2e3d5d228.jpg" alt="" style="border:solid 2px #000000;"/></a><br />
<br />
<span style="font-size:.9em;margin-top:0;"><br />
<a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/36299951@N08/3462795633/">Podcast notes: IT conversations, Anne Thomas Manes, &#8216;Is SOA Dead&#8217;</a><br />
<br />
Originally uploaded by <a rel="nofollow" target="_blank" href="http://www.flickr.com/people/36299951@N08/">dff.jisc</a><br />
</span>
</div>
<p><a rel="nofollow" target="_blank" href="http://itc.conversationsnetwork.org/shows/detail4004.html">itc.conversationsnetwork.org/shows/detail4004.html</a></p>
<p>* We have been doing "Service Oriented Integration" which has been<br />
focused on integration rather than architecture<br />
* Intel paper on IT business valuation for metrics that effect bottom<br />
line: how many people, how many hours, etc<br />
* biggest impediment to SOA is cultural and political, not technological<br />
** the question for those not obsessed with technology is: what is<br />
going to increase bottm line?<br />
* business managers and leaders won&#8217;t understand SOA, developers are<br />
beginning to understand and so are building it from the bottom up.<br />
SOA itself will very seldom ever happen as a top down political<br />
process.<br />
* we *as humans* suck at architecture.<br />
* enterprise architecture is completly different to application architecture.<br />
** enterprise architecture is about reducing cost.<br /></p> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/dfflanders.wordpress.com/179/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dfflanders.wordpress.com/179/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/dfflanders.wordpress.com/179/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dfflanders.wordpress.com/179/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/dfflanders.wordpress.com/179/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dfflanders.wordpress.com/179/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/dfflanders.wordpress.com/179/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dfflanders.wordpress.com/179/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/dfflanders.wordpress.com/179/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dfflanders.wordpress.com/179/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&blog=532341&post=179&subd=dfflanders&ref=&feed=1"/></div>]]></content:encoded>
         <media:content url="http://0.gravatar.com/avatar/8cdc26501af3b7e571a4d23f0afbbf08?s=96&amp;amp;d=identicon&amp;amp;r=G" medium="image">
            <media:title>dfflanders</media:title>
         </media:content>
         <media:content url="http://farm4.static.flickr.com/3583/3462795633_d2e3d5d228.jpg" medium="image" />
         <category>Uncategorized</category>
      </item>
      <item>
         <title>DSpace Google Summer of Code students 2009</title>
         <link>http://feedproxy.google.com/~r/stuartlewis/~3/yuPjOlsDZeg/</link>
         <description>For the third year in a row, the DSpace Foundation has been successful in being selected to take part in the annual Google Summer of Code (a.k. GSOC). This year we welcome back one past student, and three new students to work on the DSpace repository code to develop new features and to experiment a [...]</description>
         <guid isPermaLink="false">http://blog.stuartlewis.com/?p=297</guid>
         <pubDate>Mon, 20 Apr 2009 17:19:51 -0700</pubDate>
         <content:encoded><![CDATA[<p>For the third year in a row, the <a rel="nofollow" target="_blank" href="http://www.dspace.org/">DSpace Foundation</a> has been successful in being selected to take part in the annual <a rel="nofollow" target="_blank" href="http://code.google.com/soc/">Google Summer of Code</a> (a.k. GSOC). This year we welcome back one past student, and three new students to work on the DSpace repository code to develop new features and to experiment a bit.</p>
<p>The following projects have been accepted:</p>
<blockquote>
<ul>
<li><a rel="nofollow" target="_blank" href="http://socghop.appspot.com/student_project/show/google/gsoc2009/dspace/t124022224740">Andrius Blazinskas</a> Fedora DAO implementation for DSpace, beta release</li>
<li> <a rel="nofollow" target="_blank" href="http://socghop.appspot.com/student_project/show/google/gsoc2009/dspace/t124022224877">Gaurav Kejriwal</a> Collection Administration Enhancements</li>
<li><a rel="nofollow" target="_blank" href="http://socghop.appspot.com/student_project/show/google/gsoc2009/dspace/t124022225021">Ashly Markose</a> Report Generation Tool for DSpace</li>
<li><a rel="nofollow" target="_blank" href="http://socghop.appspot.com/student_project/show/google/gsoc2009/dspace/t124022225159">Bojan Suzic</a> DSpace REST webapp</li>
</ul>
</blockquote>
<p>Congratulations to Andrius, Gaurav, Ashly and Bojan!</p>]]></content:encoded>
      </item>
      <item>
         <title>DSpace 1.5.2 – What’s in it for me?</title>
         <link>http://feedproxy.google.com/~r/stuartlewis/~3/3pCP7qUq2JA/</link>
         <description>You may have seen the recent announcement saying that DSpace 1.5.2 is now released. When it comes to upgrading software, especially something as large and possibly critical as repository software, there is always a decision to be made about whether to upgrade or not. As one of the DSpace committers, I&amp;#8217;ve worked on some of [...]</description>
         <guid isPermaLink="false">http://blog.stuartlewis.com/?p=272</guid>
         <pubDate>Wed, 15 Apr 2009 00:38:18 -0700</pubDate>
         <content:encoded><![CDATA[<p>You may have seen the recent <a rel="nofollow" target="_blank" href="http://mailman.mit.edu/pipermail/dspace-general/2009-April/002665.html">announcement</a> saying that DSpace 1.5.2 is now released. When it comes to upgrading software, especially something as large and possibly critical as repository software, there is always a decision to be made about whether to upgrade or not. As one of the <a rel="nofollow" target="_blank" href="http://wiki.dspace.org/index.php/DSpaceContributors#Committers">DSpace committers</a>, I&#8217;ve worked on some of the changes in 1.5.2 so understand quite well what it contains. This blog post lists some of the more important changes in 1.5.2 (compared to 1.5.1) to hopefully help you decide whether to upgrade or not. The changes are split into broad categories to make the list easier to follow. The full list of changes can be seen <a rel="nofollow" target="_blank" href="http://dspace.svn.sourceforge.net/viewvc/dspace/tags/dspace-1.5.2/dspace/CHANGES?revision=3717&amp;view=markup">here</a>.</p>
<p><strong>Features</strong></p>
<ul>
<li><a rel="nofollow" target="_blank" href="http://www.swordapp.org/">SWORD</a> support has been upgraded to the new 1.3 version of SWORD.</li>
<li><a rel="nofollow" target="_blank" href="http://www.sitemaps.org/">Sitemaps.org</a> sitemaps, and <a rel="nofollow" target="_blank" href="http://www.robotstxt.org/">robots.txt</a> files have been added to the XML (Manakin) user interface.</li>
<li>DOI links can now be rendered in the JSP user interface.</li>
<li>The <a rel="nofollow" target="_blank" href="http://ethostoolkit.cranfield.ac.uk/tiki-index.php?page_ref_id=69">uketd_dc</a> OAI-PMH metadata format is now included as standard. This is required by UK institutions who want to expose their electronic theses to the <a rel="nofollow" target="_blank" href="http://www.ethos.ac.uk/">EThOS </a>service. No additional code is now required.</li>
<li>A new statistics collection sub-system has been written to help collect statistics in a better fashion.</li>
<li>A better PDF extraction utility (<a rel="nofollow" target="_blank" href="http://www.foolabs.com/xpdf/">xpdf</a>) can now be used to extract the fulltext out of some files where the current system (<a rel="nofollow" target="_blank" href="http://incubator.apache.org/pdfbox/">pdfbox</a>) fails, or uses too much memory.</li>
<li>The exporter tools will now export nested files correctly (e.g. where you have archived a website). It also now exports file descriptions that you may have set.</li>
<li>The exporter tool has a new option (-m or -migrate) which strips out any per-repository-specifics which would get re-created when re-ingested. This is useful if you pre-load content into a test repository, and then want to move them into a production repository without taking with them items their temporary handle, provenance information and deposit dates etc.</li>
</ul>
<p><strong>Security</strong></p>
<ul>
<li>Prior to 1.5.2 items that were restricted (anonymous access disabled) were still accessible via OAI-PMH, RSS feeds, and daily subscription emails. There are now configuration options to disable each of these.</li>
<li>Provenance information used to be included in the metadata that was exposed in the &lt;head&gt; element of the item web pages (for consumption by search engines and tools such as <a rel="nofollow" target="_blank" href="http://www.zotero.org/">Zotero</a>). The provenance information is not needed, and has been removed.</li>
</ul>
<p><strong>Authentication</strong></p>
<ul>
<li><a rel="nofollow" target="_blank" href="http://shibboleth.internet2.edu/">Shibboleth</a> support has been added.</li>
<li>Support for <a rel="nofollow" target="_blank" href="http://blog.stuartlewis.com/2008/08/18/test-ldap-service-upgraded-now-with-branches/">Hierarchical LDAP</a> servers has been added (where users are spread across branches of an LDAP tree, rather than existing all in the same branch).</li>
<li>Automatic groups for all LDAP and Password users have been added, allowing to create groups for all authenticated users, without having to add them to a group manually.</li>
<li>IP authentication now works for users who are not logged in, as well as for users who are logged in. Prior to 1.5.2, users had to first log in, before they could access collections or items that were protected by IP address. It now works in a better fashion so that, for example, internal users can access such materials without having to have an account.</li>
</ul>
<p><strong>Internationalisation</strong></p>
<ul>
<li>New or updated translations of the software have been added for Thai, Italian, Ukrainian and Greek.</li>
</ul>
<p><strong>Bug fixes</strong></p>
<p><strong></strong>There have been many bug fixes, but of note are:</p>
<ul>
<li>SWORD deposits are now a lot faster. For example a deposit of a 10MB PDF file on my development server used to take about a minute, it now takes 9 seconds.</li>
<li>Statistics now work correctly in the xml user interface. Prior to 1.5.2 they did work, but unfortunately looked in the wrong directory for the reports, so the reports did not show.</li>
<li>If you have multiple authentication methods available (e.g. LDAP and Password Authentication) you won&#8217;t suffer any more database connection leaks.</li>
<li>The handling of UTF-8 has been standardized throughout the xml user interface.</li>
<li>When non-admin users deposit items using the xml user interface, the admin menu options no longer appear immediately following a deposit. Non-admin users could never use these links, but they shouldn&#8217;t have appeared.</li>
</ul>
<p>For a neat little overview of the changes in 1.5.2, see this <a rel="nofollow" target="_blank" href="http://www.wordle.net/">Wordle</a> made using the <a rel="nofollow" target="_blank" href="http://dspace.svn.sourceforge.net/viewvc/dspace/branches/dspace-1_5_x/dspace/CHANGES?view=markup">CHANGES</a> file from DSpace:</p>
<p><a rel="nofollow" title="Wordle: DSpace 1.5.2" target="_blank" href="http://www.wordle.net/gallery/wrdl/749372/DSpace_1.5.2"><img class="aligncenter size-full wp-image-281" title="152wordle" src="http://blog.stuartlewis.com/wp-content/uploads/2009/04/152wordle.jpg" alt="Wordle: DSpace 1.5.2" width="507" height="286"/><br />
</a></p>]]></content:encoded>
      </item>
      <item>
         <title>Less talk, less code, more data - The Preserv2 Data Registry</title>
         <link>http://davetaz-blog.blogspot.com/2009/04/talk-code-data-preserv2-data-registry.html</link>
         <description>Yes, less talk more code (oxfordrepo.blogspot.com) is a good saying but i'm going to argue in this post that in fact we need more data! Having a ton of available services and a load of highly complex and well considered data models is all well and good but without data all of these services are useless; A repository is not a repository until it has something in it (Harnad). &lt;br /&gt;&lt;br /&gt;If we look outside of the repository community for a minute we find the web community we are accumulating a whole ton of data, wikipedia being the main point of reference here. Yet in the repository community we are not harnessing this open linked data model to enhance our data. &lt;br /&gt;&lt;br /&gt;I have been working in the area of digital preservation for a while now and the PRONOM file format registry (TNA UK) has been my friend for many years now and contains some valuable data. However I am concerned with the way I see it progressing. The main thing I use the PRONOM registry for is as a complement to DROID for file format information, and the data here is not even that complete. I am concerned however at the size of the new data model and the sheer effort which is going to be required to fill it with the data which it specifies. &lt;br /&gt;&lt;br /&gt;Why not looked to the linked data web to see how to tie a series of smaller systems together to make a much more powerful and easier to maintain one! &lt;br /&gt;&lt;br /&gt;This is where I have started with the preserv2 registry available at &lt;a rel="nofollow" target="_blank" href="http://p2-registry.ecs.soton.ac.uk/"&gt;http://p2-registry.ecs.soton.ac.uk/&lt;/a&gt;. &lt;br /&gt;&lt;br /&gt;The preserv2 registry is a semantic knowledge base (RDF triples based) with an SPARQL endpoint, RESTful services and a basic browser. Currently the data is focussed on file formats and is basically made up of the PRONOM database ported from a complex XML schema into simple RDF triples. On top of this i'm beginning to add data from dbpedia (wikipedia RDF'd) and making links between the PRONOM data and the dbpedia data! &lt;br /&gt;&lt;br /&gt;Already this is helping is ascertain a greater knowledge base and the cost of gathering and compiling this data is very low. Other than that the registry took me less than a week to construct! &lt;br /&gt;&lt;br /&gt;So "Go forth and make links" (Wendy Hall) is exactly what I'm now doing. With enough data you will be able to make complex OWL-S rules that can be used to deduce accurately facts such as formats which are at risk.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-3516378215565453655?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-3516378215565453655</guid>
         <pubDate>Wed, 08 Apr 2009 01:21:00 -0700</pubDate>
      </item>
      <item>
         <title>New job, New Zealand</title>
         <link>http://feedproxy.google.com/~r/stuartlewis/~3/ONw8ibkVUy4/</link>
         <description>This is a quick blog post to mark the fact that I have now moved to New Zealand. Up until the end of February I was working for Information Services at Aberystwyth University as a team leader, project manager, repository developer, and JISC project person. Following a week of holiday in the UK, I have [...]</description>
         <guid isPermaLink="false">http://blog.stuartlewis.com/?p=264</guid>
         <pubDate>Sun, 22 Mar 2009 20:33:48 -0700</pubDate>
         <content:encoded><![CDATA[<p>This is a quick blog post to mark the fact that I have now moved to New Zealand. Up until the end of February I was working for <a rel="nofollow" title="Information Services" target="_blank" href="http://www.inf.aber.ac.uk/">Information Services</a> at <a rel="nofollow" title="Aberystwyth University" target="_blank" href="http://www.aber.ac.uk/">Aberystwyth University</a> as a team leader, project manager, repository developer, and <a rel="nofollow" title="The JISC" target="_blank" href="http://www.jisc.ac.uk/">JISC</a> project person. Following a week of holiday in the UK, I have now moved to Auckland in New Zealand along with my wife and two young children.</p>
<p>I am now working as a programmer in the Digital Services Team at the <a rel="nofollow" target="_blank" href="http://www.library.auckland.ac.nz/">University of Auckland Library</a>. The team is a multi-disciplined group of people working on all things digital within the library. As the biggest university in New Zealand, this involves many dozens of projects and lots of new and exciting things. In the five days I have been working here already, I have already started working with EAD xsl files for DigiTool, extraction of XMD metadata from PDF files, DSpace item migration tools, and research management systems. Lots of new fun stuff to learn about!</p>
<p>No doubt there will be many more blog posts on my work as it develops further.</p>]]></content:encoded>
      </item>
      <item>
         <title>Account from OGF25 Repositories Workshop: Creating a Repository Standard?</title>
         <link>http://feedproxy.google.com/~r/MediashelfBlog/~3/kgenvp2ahUE/</link>
         <description>04 March 2009
Catania, Sicily, Italy
Open Grid Forum 25th Conference (OGF25) It&amp;#8217;s not entirely clear when I figured out that I was sitting on a standards body panel discussing the creation of a digital repository related standard. I&amp;#8217;m pretty sure it finally clicked sometime after the session was over, once I had consumed a couple glasses of [...]</description>
         <guid isPermaLink="false">http://yourmediashelf.com/blog/?p=71</guid>
         <pubDate>Fri, 20 Mar 2009 13:56:09 -0700</pubDate>
         <content:encoded><![CDATA[<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;">04 March 2009</p>
<p style="font:normal normal normal 12px/normal Helvetica;color:#000000;margin:0px;">Catania, Sicily, Italy</p>
<p style="font:normal normal normal 12px/normal Helvetica;color:#000000;margin:0px;">Open Grid Forum 25th Conference (OGF25)</p>
<p style="font:normal normal normal 12px/normal Helvetica;color:#000000;margin:0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">It&#8217;s not entirely clear when I figured out that I was sitting on a standards body panel discussing the creation of a digital repository related standard. I&#8217;m pretty sure it finally clicked sometime after the session was over, once I had consumed a couple glasses of wine.</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">I still don&#8217;t see what I contributed to the conversation, though the other participants assured me that my comments were useful. The experience reminds me of my friend, let&#8217;s call him Josh, a community organizer who was recently pulled onto one of the Obama administration&#8217;s advisory panels. Shortly after joining the advisory panel, Josh confessed that at the end of most calls he has to follow up with a friend and ask &#8220;Ok. So what exactly did we just decide and who is responsible for doing what?&#8221;</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">The panel discussion started by making observations that we&#8217;re all familiar with:</p>
<ul>
<li>the importance, and associated challenges, of unique identifiers and persistent URIs</li>
<li>search, retrieval, and management are separate concerns, each with appropriate standards associated (ie. SWORD, OpenSearch, etc.)</li>
<li>cloud computing is very different from cloud storage</li>
</ul>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">After this, I quickly found myself in completely unfamiliar waters when conversation abruptly turned to the creation of a standard for digital repositories. I thought &#8220;Pshaw. We don&#8217;t need Yet Another Standard. Where did this come from?&#8221; In fact, the whole field of repositories is so new that the prospect of a repository standard seems absurdly premature to me. Discussion on the panel honed in on the two obvious contenders for a standard: 1) metadata requirements, and 2) functionality profiles (a list of features necessary in order for a repository system to be deemed compliant and interoperable). From my perspective, repositories already swim in a glut of metadata standards (as well as non-standard, ad-hoc metadata) and, by nature, must embrace heterogeneous metadata. The second notion, that of functionality profiles, sounds like something that few will read and none will understand. To be honest, the entire discussion confused me. I did my best to contribute to the discussion where I could.</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">After the workshop ended, I had a chance to catch my breath and discuss the panel with a couple of people. Eventually, I came to look at the whole scenario from a different perspective and had a mild change of heart. In a discussion with Neil Chue Hong, a very smart guy from Edinburgh, I started thinking about all the informal conclusions that frame discussions between developers at conferences like Dev8D, Code4Lib and RepoCamp. I then thought about all the little architectural wins and failures that I see in software like Flickr, YouTube, Hulu and (woah) ABC&#8217;s full episode player. After all, these are repositories too. Within a few moments of pondering, an initial list of obvious basic guidelines shone through quite clearly.</p>
<ul>
<li>give permanent, unique URIs to all content you expose, even if you intend to limit access to that content based on geography or time of access</li>
<li>support linking with versioning or datetime info</li>
<li>expose a RESTful API</li>
<li>give preference to AtomPub</li>
<li>consider ORE when you need to express aggregations of data</li>
<li>provide linked data (RDF) endpoints</li>
</ul>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">Some open topics also seem like obvious fodder for discussion:</p>
<ul>
<li>what query language(s) to use in search APIs</li>
<li>navigating the difference between standards and interoperability</li>
<li>leveraging standards where possible (ie SWORD)</li>
</ul>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">These are merely the things that seem obvious to me right away. What would happen if we got the <a rel="nofollow" title="Inkdroid" target="_blank" href="http://inkdroid.org/">really</a> <a rel="nofollow" title="OxfordRepo Blog" target="_blank" href="http://oxfordrepo.blogspot.com/">smart</a> <a rel="nofollow" title="Dave Tarrant" target="_blank" href="http://www.ecs.soton.ac.uk/people/dct05r">people</a> talking in this vein? </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">I think this warrants further exploration and, strange though it is, I expect that the outputs of such exploration might resemble the stuff of standards bodies (be it a recommendation, a community document, or a standard). Possibly I have been infected with that odd standards-wonk bug, or possibly I&#8217;m just catching up with the rest of the world in acknowledging the inevitable.</p>
<img src="http://feeds.feedburner.com/~r/MediashelfBlog/~4/kgenvp2ahUE" height="1" width="1"/>]]></content:encoded>
      </item>
      <item>
         <title>Session Hopping, LinkedData, and Data APIs at OGF25 in Sicily</title>
         <link>http://feedproxy.google.com/~r/MediashelfBlog/~3/iAbuQz22MQQ/</link>
         <description>04 March 2009
Catania, Sicily, Italy
Open Grid Forum 25th Conference (OGF25) [Note: I'm posting my backlog of updates from the past 2 months of travel. An update specifically about the OGF Repositories Workshop will follow shortly] I made it to the conference center in Catania, Sicily a few hours before the OGF Repositories Workshop. Immediately upon arriving I met Nick [...]</description>
         <guid isPermaLink="false">http://yourmediashelf.com/blog/?p=69</guid>
         <pubDate>Fri, 20 Mar 2009 13:43:50 -0700</pubDate>
         <content:encoded><![CDATA[<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">04 March 2009</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">Catania, Sicily, Italy</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">Open Grid Forum 25th Conference (OGF25)</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">[Note: I'm posting my backlog of updates from the past 2 months of travel. An update specifically about the OGF Repositories Workshop will follow shortly]</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">I made it to the conference center in Catania, Sicily a few hours before the OGF Repositories Workshop. Immediately upon arriving I met Nick Ferguson, coordinator of the workshop, and had a nice chat with Neil Chue Hong about repositories, ORE, and grid computing vs. cloud computing. After that I was left to kill time until the workshop by sitting in on one of the OGF sessions. At first, I stepped into what I thought was the Earth Sciences session, but it turned out to be the Computational Chemistry session and went way over my head. I then passed through a handful of other random presentations before settling on a room where about 30 people were having a discussion about XQuery.</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">I soon discerned that this group was hammering out the spec for some sort of standard data systems interface. When I arrived, they had been debating the strengths and demerits of XPath/XQuery vs. SQL as a query language. The converation quickly stumbled into the pit of interoperability hell. Standard interjections abounded: &#8220;Some implementations won&#8217;t have that data to return&#8230;&#8221;, &#8220;you will have to expose user info in order to support that&#8230;&#8221; mumble mumble &#8220;&#8230; we didn&#8217;t do it that way because one unnamed vendor couldn&#8217;t support it&#8230;&#8221; I nearly laughed out loud when an attendee from the back of the room interrupted the discussion declaring &#8220;But in most situations, you should only be returning items owned by the current user.&#8221; </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">I still had no idea what data they were attempting to expose. (I later learned that it was the <a rel="nofollow" title="RUS-WG" target="_blank" href="http://www.to.infn.it/grid/accounting/NA5/rus-wg.html">RUS-WG</a>, who are defining a standard interface for retrieving<span class="Apple-style-span" style="font-style:italic;"> job usage records &#8230; </span>Obscure indeed.) The 90-minute discussion ended up having nearly nothing to do with the actual data these people want to work with. Instead, the conversation was entirely dominated by the travails of navigating the strange space of Data API design.</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">Meanwhile, serendipitously, I was using this downtime (and the conference wifi access) to finally read George Thomas&#8217;s <a rel="nofollow" target="_blank" href="http://george.thomas.name/omb/recovery.gov.pdf">slides about recovery.gov publishing open data</a>. Though I missed the presentation, the slides spell out the project&#8217;s intentions pretty clearly. They&#8217;re full of references to REST, ATOM, RDFa and the <a rel="nofollow" target="_blank" href="http://richard.cyganiak.de/2007/10/lod/">LOD cloud</a>. I experienced such a fascinating contrast between the exposition before my eyes and the discussion filtering in through my ears. In particular, one of Thomas&#8217;s slides jumped out at me. The slide, titled &#8220;Follow the dollar, not the person&#8221;, showed a semantic model for users, user groups, and posts in a bulletin-board style Community Forum system. It was totally readable, totally understandable, precise, flexible, and using an ontology that lends itself to re-use. </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">Over the past year, I have satirically placed a golden halo above &#8220;linked data&#8221; in my mind. As I sat in the RUS-WG session, light fell upon that halo and it glowed.</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">This experience, as well as consequent discussions at OGF, has left me with a distinct sense that there&#8217;s a pattern here. We are all, of our own accord and in our own little techno-fiefdoms, attempting to do the same things and running into the same challenges. I think that the previously obscure field of digital repositories has valuable perspective to provide and many pieces of wisdom to share in this domain. I hope to see more public discourse about these topics, and I know who to start prodding to speak up. Watch this space.</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">Post Script:</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">The morning following my OGF session-hopping experience, I realized that the track I had passed over, innocuously titled &#8220;HEP&#8221;, was a meeting of the <a rel="nofollow" title="Wikipedia Entry for HEP" target="_blank" href="http://en.wikipedia.org/wiki/Particle_physics">High Energy Physics</a> community. In particular, it was primarily a discussion about how they are going to handle processing the data outputs from the LHC experiments when they fire up <a rel="nofollow" title="Cern Large Hadron Collider Website" target="_blank" href="http://www.cern.ch/lhc">the collider</a> later this year. /me kicks himself for missing this.</p>
<img src="http://feeds.feedburner.com/~r/MediashelfBlog/~4/iAbuQz22MQQ" height="1" width="1"/>]]></content:encoded>
      <enclosure url="http://george.thomas.name/omb/recovery.gov.pdf" length="904104" type="application/pdf" /><media:content url="http://george.thomas.name/omb/recovery.gov.pdf" fileSize="904104" type="application/pdf" /><itunes:subtitle>04 March 2009 Catania, Sicily, Italy Open Grid Forum 25th Conference (OGF25) [Note: I'm posting my backlog of updates from the past 2 months of travel. An update specifically about the OGF Repositories Workshop will follow shortly] I made it to the confer</itunes:subtitle><itunes:summary>04 March 2009 Catania, Sicily, Italy Open Grid Forum 25th Conference (OGF25) [Note: I'm posting my backlog of updates from the past 2 months of travel. An update specifically about the OGF Repositories Workshop will follow shortly] I made it to the conference center in Catania, Sicily a few hours before the OGF Repositories Workshop. Immediately upon arriving I met Nick [...]</itunes:summary></item>
      <item>
         <title>We need people!</title>
         <link>http://oxfordrepo.blogspot.com/2009/03/we-need-people.html</link>
         <description>(UPDATE - Grrr.... seems that the concept of persistent URLs is lost on the admin - link below has been removed - see google cached copy &lt;a rel="nofollow" target="_blank" href="http://209.85.229.132/search?q=cache:YsdhcWzKWksJ:www.admin.ox.ac.uk/ps/oao/ar/ar3979j.shtml+http://www.admin.ox.ac.uk/ps/oao/ar/ar3979j.shtml&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk"&gt;here&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.admin.ox.ac.uk/ps/oao/ar/ar3979j.shtml"&gt;http://www.admin.ox.ac.uk/ps/oao/ar/ar3979j.shtml&lt;/a&gt; - job description.&lt;br /&gt;&lt;br /&gt;Essentially, we need smart people who are willing to join us to do good, innovative stuff; work that isn't by-the-numbers with room for initiative and ideas.&lt;br /&gt;&lt;br /&gt;Help us turn our digital repository into a digital library, it'll be fun! Well, maybe not fun, but it will be very interesting at least!&lt;br /&gt;&lt;br /&gt;bulletpoints: python/ruby frameworks, REST, a little SemWeb, ajax, jQuery, AMQP, Atom, JSON, RDF+RDFa, Apache WSGI deployment, VMs, linux, NFS, storage, RAID, etc.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-462847752063249912?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-462847752063249912</guid>
         <pubDate>Thu, 19 Mar 2009 00:46:00 -0700</pubDate>
      </item>
      <item>
         <title>Tip: Clojure REPL from mvn</title>
         <link>http://feedproxy.google.com/~r/JimDowning/~3/-HtMnU5p8xw/</link>
         <description>If you manage your dependencies using maven, you can get clojure by including it as a dependency. You can pick this up from the formos repository at http://tapestry.formos.com/maven-snapshot-repository/org/clojure/, or keep a local source copy of clojure and install locally. Make sure your pom has a build section that points to your clojure code in a scriptSourceDirectory element [...]</description>
         <guid isPermaLink="false">http://wwmm.ch.cam.ac.uk/blogs/downing/?p=298</guid>
         <pubDate>Wed, 11 Mar 2009 04:42:14 -0700</pubDate>
         <content:encoded><![CDATA[<p>If you manage your dependencies using maven, you can get clojure by including it as a dependency. You can pick this up from the formos repository at <a rel="nofollow" target="_blank" href="http://tapestry.formos.com/maven-snapshot-repository/org/clojure/">http://tapestry.formos.com/maven-snapshot-repository/org/clojure/</a>, or keep a local source copy of clojure and install locally.</p> <p>Make sure your pom has a build section that points to your clojure code in a scriptSourceDirectory element (and?) or in a resource element. You can then start a Repl with your dependencies on the classpath by running</p> <code>mvn exec:java -Dexec.mainClass=clojure.lang.Repl</code> <p>For extra credit, include jline:jline as an optional dependency in your pom and change your execution line to: -</p> <code>mvn exec:java -Dexec.mainClass=jline.ConsoleRunner -Dexec.args="clojure.lang.Repl"</code><img src="http://feeds.feedburner.com/~r/JimDowning/~4/-HtMnU5p8xw" height="1" width="1"/>]]></content:encoded>
      </item>
      <item>
         <title>Developer Happiness days - why happyness is important</title>
         <link>http://oxfordrepo.blogspot.com/2009/02/developer-happiness-days-why-happyness.html</link>
         <description>&lt;big&gt;&lt;big&gt;&lt;b&gt;Creativity and innovation&lt;/b&gt;&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;One of the defining qualities of a good innovative developer is creativity and a pragmatic attitude; someone with the '&lt;i&gt;rough consensus, running code&lt;/i&gt;' mentality that pervades good software innovation. This can be seen as the drive to experiment, to turn inspiration and ideas into real, running code or to pathfind by trying out different things. Innovation can often happen when talking about quite separate, seemingly unrelated things, even to the point that most of the time, the 'outcomes' of an interaction are impossible to pin down.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;&lt;b&gt;Play, vagueness and communication&lt;/b&gt;&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Creativity, inspiration, innovation, ideas, fun, and curiousity&lt;/b&gt; are all useful and important when developing software. These words convey concepts that do not thrive in situations that are purely scheduled, didactic, and teacher-pupil focussed. There needs to be an amount of '&lt;b&gt;&lt;i&gt;play&lt;/i&gt;&lt;/b&gt;' in the system (&lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Play_%28engineering%29"&gt;see 'Play'.&lt;/a&gt;) While this '&lt;i&gt;&lt;b&gt;play&lt;/b&gt;&lt;/i&gt;' is bad in a tightly regimented system, it is an essential part in a creative system, to allow for new things to develop, new ideas to happen and for 'random' interactions to take place.&lt;br /&gt;&lt;br /&gt;Alongside this notion of &lt;i&gt;&lt;b&gt;play&lt;/b&gt;&lt;/i&gt; in an event, there also needs to be an amount of blank space, a &lt;i&gt;&lt;b&gt;vagueness&lt;/b&gt;&lt;/i&gt; to the event. I think that we can agree that much of the usefulness of normal conferences comes from the 'coffee breaks' and 'lunch breaks', which are blank spaces of a sort. It is the recognition of this that is important and to factor it in more.&lt;br /&gt;&lt;br /&gt;Note that if a single developer could guess at how things should best be developed in the academic space, they would have done so by now. &lt;i&gt;Pre-compartmentalisation of ideas into 'tracks' can kill potential innovation stone-dead.&lt;/i&gt; The distinction between CMSs, repositories and VLE developers is purely semantic and it is detrimental for people involved in one space to not overhear the developments, needs, ideas and issues in another. It is especially counter-productive to further segregate by community, such as having simultaneous Fedora, DSpace and EPrints strands at an event.&lt;br /&gt;&lt;br /&gt;While the inherent and intended &lt;i&gt;&lt;b&gt;vagueness&lt;/b&gt;&lt;/i&gt; provides the potential for cross-fertilisation of ideas, and the room for &lt;i&gt;&lt;b&gt;play&lt;/b&gt;&lt;/i&gt; provides the space, the final ingredient is that of &lt;i&gt;&lt;b&gt;speech, or any communication that takes place with the same ease and at the same speed of speech&lt;/b&gt;&lt;/i&gt;. While some may find the 140 character limit on twitter or identi.ca a strange constraint, this provides a target for people to really think about what they wish to convey and keeps the dialogue from becoming a series of monologues - much like the majority of emails of mailing lists - and keeps it as a dialogue between people.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;&lt;b&gt;Communication and Developers&lt;/b&gt;&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;One of the dichotomies in the necessity of communication to development is that developers can be shy, initially preferring the false anonymity of textual communication to spoken words between real people. There is a need to provide means for people to break the ice, and to strike up conversations with people that they can recognise as being of like minds. Asking that people's public online avatars are changed to be pictures of them can help people at an event find those that they have been talking to online and to start talking, face to face.&lt;br /&gt;&lt;br /&gt;On a personal note, one of the most difficult things I have to do when meeting people out in real life is answer the question 'What do you do?' - it is much easier when I already know that the person asking the question has a technical background.&lt;br /&gt;&lt;br /&gt;And again, going back to the concept of compartmentalisation - &lt;i&gt;developers who only deal with developers and their managers/peers will build systems that work best for their peers and their managers.&lt;/i&gt; If these people are not the only users then they need to widen their communications. It is important for the developers that do not use their own systems to engage with the people who actually do. They should do this directly, without the potential for garbled dialogue via layers of protocol. This part needs managing in whatever space, both to avoid dominance by loud, disgruntled users and to mitigate anti-social behaviour. By and large, I am optimistic of this process, people tend to want to be thanked, and this simple &lt;i&gt;feedback loop&lt;/i&gt; can be used to help motivate. Making this feedback more disproportionate (a small 'thank you' can lead to great effects) and adding in the notion of &lt;i&gt;highscore&lt;/i&gt; can lead to all sorts of interaction and outcomes, most notably being the rapid reinforcement of any behaviour that led to a positive outcome.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;&lt;b&gt;Disproportionate feedback loops and Highscores drive human behaviour&lt;/b&gt;&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;I'll just digress quickly to cover what I mean be a &lt;b&gt;&lt;i&gt;disproportionate feedback loop&lt;/i&gt;&lt;/b&gt;: A disproportionate feedback loop is something that encourages a certain behaviour; the input to which is something small and inexpensive, in either time or effort but the output can be large and very rewarding. This pattern can be seen in very many interactions: playing the lottery, [good] video game controls, twitter and facebook, musical instruments, the 'who wants to be a millionaire' format, mashups, posting to a blog ('free' comments, auto rss updating, a google-able webpage for each post) etc.&lt;br /&gt;&lt;br /&gt;The &lt;i&gt;&lt;b&gt;natural drive for highscores&lt;/b&gt;&lt;/i&gt; is also worth pointing out. At first glance, is it as simple as considering its use in videogames? How about the concept of getting your '5 fruit and veg a day'? &lt;a rel="nofollow" target="_blank" href="http://www.5aday.nhs.uk/topTips/default.html"&gt;http://www.5aday.nhs.uk/topTips/default.html&lt;/a&gt; Running in a marathon against other people? Inbox Zero (&lt;a rel="nofollow" target="_blank" href="http://www.slideshare.net/merlinmann/inbox-zero-actionbased-email"&gt;http://www.slideshare.net/merlinmann/inbox-zero-actionbased-email&lt;/a&gt;), Learning to play different musical scores? Your work being rated highly online? An innovation of yours being commented on by 5 different people in quick succession? Highscores can be very good drivers for human behaviour, addictive to some personalities.&lt;br /&gt;&lt;br /&gt;Why not set up some software highscores? For example, in the world of repositories, how about 'Fastest UI for self-submission' - encouraging automatic metadata/datamining, a monthly prize for 'Most issue tickets handled' - to the satisfaction of those posting the tickets, and so on.&lt;br /&gt;&lt;br /&gt;It is very easy to over-metricise this - some will purposefully abstain from this and some metrics are truely misleading. In the 90s, there was a push to have lines of code added as a metric to productivity. The false assumption is that lines of code have anything to do with producitivity - code should be lean, but not too lean to maintain.&lt;br /&gt;&lt;br /&gt;So be very careful when adding means to record highscores - they should be flexible, and be fun - if they are no fun for the developers and/or the users, they become a pointless metric, more of an obstacle than a motivation.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;&lt;b&gt;The Dev8D event&lt;/b&gt;&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;People were free to roam and interact at the Dev8D event and there was no enforced schedule, but twitter and a loudhailer were used to make people aware of things that were going on. Talks and discussions were lined up prior to the event of course, but the event was organised on a wiki which all were free to edit. As experience has told us, the important and sometimes inspired ideas occur in relaxed and informal surroundings where people just talk and share information, such as in a typical social situation like having food and drink.&lt;br /&gt;&lt;br /&gt;As a specific example, look &lt;a rel="nofollow" target="_blank" href="http://oxfordrepo.blogspot.com/2009/02/tracking-conferences-at-dev8d-with.html"&gt;at the role of twitter at the event&lt;/a&gt;. Sam Easterby-Smith (&lt;a rel="nofollow" target="_blank" href="http://twitter.com/samscam%29"&gt;http://twitter.com/samscam)&lt;/a&gt; created a means to track 'developer happiness' and shared the tracking '&lt;a rel="nofollow" target="_blank" href="http://samscam.co.uk/happier/"&gt;Happyness-o-meter'&lt;/a&gt; site with us all. This unplanned development inspired me to relay the infomation back to twitter and similarly led to me running an operating system/hardware survey in a very similar fashion.&lt;br /&gt;&lt;br /&gt;To help break the ice and to encourage play, we instituted a number of ideas:&lt;br /&gt;&lt;br /&gt;A &lt;b&gt;wordcloud on each attendees badge&lt;/b&gt;, consisting of whatever we could find of their work online, be it their blog or similar so that it might provide a talking point, or allow people to spot people who write about things they might be interested in learning more about.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;The poker chip game&lt;/b&gt; - each attendee was given 5 poker chips at the start of the event, and it was encouraged that chips were to be traded for help, advice or as a way to convey a thank you. The goal was that the top 4 people ranked by amounts of chips at the end of the third day would receive a Dell mini 9 computer. The balance to this was that each chip was also worth a drink at the bar on that day too.&lt;br /&gt;&lt;br /&gt;We were well aware that we'd left a lot of play in this particular system, allowing for lotteries to be set up, people pooling their chips, and so on. As the sole purpose of this was to encourage people to interact, to talk and bargain with each other, and to provide that feedback loop I mentioned earlier, it wasn't too important how people got the chips as long as it wasn't underhanded. It was the interaction and the 'fun' that we were after. Just as an aside, Dave Flanders deserves the credit for this particular scheme.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;&lt;b&gt;Developer Decathlon&lt;/b&gt;&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;The basic concept of the &lt;a rel="nofollow" target="_blank" href="http://code.google.com/p/developerhappinessdays/wiki/DeveloperDecathlon"&gt;Developer Decathlon&lt;/a&gt; was also reusing these ideas of play and feedback: "&lt;a rel="nofollow" name="What_is_the_Developer_Decathlon?"&gt;The Developer Decathlon is a competition at dev8D that enables developers to come together face-to-face to do rapid prototyping of software ideas. [..] &lt;/a&gt; We help facilitate this at dev8D by providing both 'real users' and 'expert advice' on how to run these rapid prototyping sprints. [..] The 'Decathlon' part of the competition represents the '10 users' who will be available on the day to present the biggest issues they have with the apps they use and in turn to help answer developer questions as the prototypes applications are being created. The developers will have two days to work with the users in creating their prototype applications."&lt;br /&gt;&lt;br /&gt;The best two submissions will get cash prizes that go to the individual, not to the company or institution that they are affiliated with. The outcomes of which will be made public shortly, once the judging panel have done their work.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;Summary&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;To foster innovation and to allow for creativity in software development:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Having &lt;b&gt;play&lt;/b&gt; space is &lt;b&gt;important&lt;/b&gt;&lt;/li&gt;&lt;li&gt;Being &lt;b&gt;vague&lt;/b&gt; with aims and &lt;b&gt;flexible&lt;/b&gt; with outcomes is not a bad thing and is &lt;b&gt;vital&lt;/b&gt; for unexpected things to develop - &lt;i&gt;e.g. A project's outcomes should be under continual re-negotiation as a general rule, not as the exception.&lt;/i&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Encouraging&lt;/b&gt; and &lt;b&gt;enabling&lt;/b&gt; free and easy communication is &lt;b&gt;crucial&lt;/b&gt;.&lt;/li&gt;&lt;li&gt;Be aware of what drives people to do what they do. Push all feedback to be &lt;b&gt;as disproportionate as possible&lt;/b&gt;, allowing both developers and users to benefit, with only putting a relatively trivial amount of input in (this pattern affects web UIs, development cycles, team interaction, etc)&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Choose useful highscores&lt;/b&gt; and be prepared to ditch them or change them if they are no longer &lt;b&gt;fun and motivational&lt;/b&gt;.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-7889113205346661214?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-7889113205346661214</guid>
         <pubDate>Tue, 24 Feb 2009 22:09:00 -0800</pubDate>
      </item>
      <item>
         <title>Less Talk, More Code: Tracking conferences (at Dev8D) with python, twitter and tags</title>
         <link>http://oxfordrepo.blogspot.com/2009/02/tracking-conferences-at-dev8d-with.html</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_37a36fe6a393a4ab706eac1eebbaba13</guid>
         <pubDate>Mon, 23 Feb 2009 06:45:38 -0800</pubDate>
      </item>
      <item>
         <title>Handling Tabular data</title>
         <link>http://oxfordrepo.blogspot.com/2009/02/handling-tabular-data.html</link>
         <description>&lt;div&gt;&lt;b&gt;"Storage"&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;I put the s-word in quotes because the storing of the item is actually a very straightforward process - we have been dealing with storing tabular data for computation for a very long time now. Unfortunately, this also means that there are very many ways to capture, edit and present tables of information.&lt;br/&gt;&lt;br/&gt;One realisation to make with regards to preserving access to data coming from research is that there is a huge backlog of data in formats that we shall kindly call 'legacy'. Not only is there this issue, but data is being made with tools and systems that effectively 'trap' or lock-in a lot of this information - case in point being any research being recorded using Microsoft Access. While the tables of data can often be extracted with some effort, it is normally difficult to impossible to extract the implicit information; how tables interlink, how the Access Form adds information to the dataset, etc.&lt;br/&gt;&lt;br/&gt;It is this implicit knowledge that is the elephant in the room. Very many serialisations, such as SQL 'dumps', csv, xsl and so on, rely on implicit knowledge that is either related to the particulars of the application used to open it, or is actually highly domain specific.&lt;br/&gt;&lt;br/&gt;So, it is trivial and easy to specify a model for storing data, but without also encoding the implied information and without making allowances for the myriad of sources, the model is useless; it would be akin to defining the colour of storage boxes holding bric-a-brac. The datasets need to be characterised, and the implied information recorded in as good a way as possible.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Characterisation&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;The first step is to characterise the dataset that has been marked for archival and reuse. (Strictly, the best first step is to consult with the researcher or research team and help and guide them so that as much of the unsaid knowledge is known by all parties.)&lt;br/&gt;&lt;br/&gt;Some serialisations so a good job of this themselves, *SQL-based serialisations include basic data type information inside the table declarations themselves. As a pragmatic measure, it seems sensible to accept SQL-style table descriptions as a reasonable beginning. Later, we'll consider the implicit information that also needs to be recorded alongside such a declaration.&lt;br/&gt;&lt;br/&gt;Some others, such as CSV, leave it up to the parsing agent to guess at the type of information included. In these cases, it is important to find out or even deduce the type of data held in each column. Again, this data can be serialised in a SQL table declaration held alongside the original &lt;i&gt;unmodified&lt;/i&gt; dataset.&lt;br/&gt;&lt;br/&gt;(It is assumed that a basic data review will be carried out; does the csv have a consistent number of columns per row, is the version and operating system known for the MySQL that held the data, is there a PI or responsible party for the data, etc.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Implicit information&lt;br/&gt;&lt;/b&gt;&lt;br/&gt;Good teachers are right to point out this simple truth: "don't forget to write down the obvious!" It may seem obvious that all your data is latin-1 encoded, or that you are using a FAT32 filesystem, or even that you are running in a 32-bit environment, the painful truth is that we can't guarantee that these aspects won't affect how the data is held, accessed or stored. There may be systematic issues that we are not aware of, such as the problems with early versions of ZFS causing [, at the time, detected] data corruption, or MySQL truncating fields when serialised in a way that is not anticipated or discovered until later.&lt;br/&gt;&lt;br/&gt;In characterising the legacy sets of data, it is important to realise that there will be loss, especially with the formats and applications that blend presentation with storage. For example, it will require a major effort to attempt to recover the forms and logic bound into the various versions of MS Access. I am even aware of a major dataset, a highly researched dictionary of old english words and phrases, that the final output of which is a Macromedia Authorware application, and the source files are held by an unknown party (that is if they still exist at all) - the Joy of hiring Contractors. In fact, this warrants a slight digression:&lt;br/&gt;&lt;br/&gt;&lt;b&gt;The gap in IT support for research&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;If an academic researcher wishes to gain an external email account at their institution, there is an established protocol for this. Email is so commonplace, it sounds an easy thing to provide, but you need expertise, server hardware, multiuser configuration, adoption of certain access standards (IMAP, POP3, etc), and generally there are very few types of email (text or text with MIME attachments - NB the IM in MIME stands for Internet Mail)&lt;br/&gt;&lt;br/&gt;If a researcher has a need to store tables of data, where do they turn? They should turn to the same department, who will handle the heavy lifting of guiding standards, recording the implicit information and providing standard access APIs to the data. What the IT departments seem to be doing currently is - to carry on the metaphor - handing the researcher the email server software and telling them to get on with it, to configure it as they want. No wonder the resulting legacy systems are as free-form as they are.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Practical measures - Curation&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;Back to specifics now, consider that a set of data has been found to be important, research has been based on it, and it's been recognised that this dataset needs to be looked after. [This will illustrate the technical measures. Licencing, dialogue with the data owners, and other non-technical analysis and administration is left out, but assumed.]&lt;br/&gt;&lt;br/&gt;First task is to store the incoming data, byte-for-byte, as much as is possible - storing the iso image of the media the data is stored on, storing the SQL dump of a database, etc.&lt;br/&gt;&lt;br/&gt;Analyse the tables of data - record the base types of each column (text, binary, float, decimal, etc) apeing the syntax of a SQL table declaration, as well as trying to identify the key columns.&lt;br/&gt;&lt;br/&gt;Record the inter-table joins between primary and secondary keys, possibly by using a &lt;i&gt;"table.column SAMEAS table.column;"&lt;/i&gt; declaration after the table declarations.&lt;br/&gt;&lt;br/&gt;Likewise, attempt to add information concerning each column, information such as units or any other identifying material.&lt;br/&gt;&lt;br/&gt;Store this table description alongside the recorded tabular data source.&lt;br/&gt;&lt;br/&gt;Form a representation of this data in a well-known, current format such as a MySQL dump. For spreadsheets that are 'frozen', cells that are the results of embedded formula should be calculated and added as fixed values. It is important to record the environment, library and platform that these calculations are made with.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Table description as RDF &lt;/b&gt;(strictly, referencing cols/rows via the URI)&lt;br/&gt;&lt;br/&gt;One syntax I am playing around with is the notion that by appending sensible suffixes to the base URI for a dataset, we can unique specify a row, a column, a region or even a single cell. Simply put:&lt;br/&gt;&lt;br/&gt;http://datasethost/datasets/{data-id}#table/{table-name}/column/{column-id} to reference a whole column&lt;br/&gt;http://datasethost/datasets/{data-id}#table/{table-name}/row/{column-id} to reference a whole row, etc&lt;br/&gt;&lt;br/&gt;[The use of the # in the position it is in will no doubt cause debate. Suffice it to say, this is a pragmatic measure, as I suspect that an intermediary layer will have to take care of dereferencing a GET on these forms in any case.]&lt;br/&gt;&lt;br/&gt;The purpose for this is so that the tabular description can be made using common and established namespaces to describe and characterise the tables of data. Following on from a previous post on extending the BagIt protocol with an RDF manifest, this information can be included in said manifest, alongside the more expected metadata without disrupting or altering how this is handled.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;A possible content type for tabular data&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;By considering the base Fedora repository object model, or the BagIt model, we can apply the above to form a content model for a dataset:&lt;br/&gt;&lt;br/&gt;As a Fedora Object:&lt;br/&gt;&lt;br/&gt;&lt;ul&gt;&lt;li&gt;Original data in whatever forms or formats it arrives in (dsid prefix convention: DATA*)&lt;/li&gt;&lt;li&gt;Binary/textual serialisation in a well-understood format (dsid prefix convention: DERIV*)&lt;/li&gt;&lt;li&gt;'Manifest' of the contents (dsid convention: RELS-INT)&lt;/li&gt;&lt;li&gt;Connections between this dataset and other objects, like articles, etc as well as the RDF description of this item (RELS-EXT)&lt;/li&gt;&lt;li&gt;Basic description of dataset for interoperability (Simple dublin core - DC)&lt;/li&gt;&lt;/ul&gt;&lt;br/&gt;As a BagIt+RDF:&lt;br/&gt;&lt;br/&gt;Zip archive - &lt;br/&gt;&lt;ul&gt;&lt;li&gt;/MANIFEST (list of files and checksums)&lt;/li&gt;&lt;li&gt;/RDFMANIFEST (RELS-INT and RELS-EXT from above)&lt;/li&gt;&lt;li&gt;/data/* (original dataset files/disk images/etc)&lt;/li&gt;&lt;li&gt;/derived/* (normalised/re-rendered datasets in a well known format)&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Presentation - the important part&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;What is described above is the archival of the data. This is a form suited for discovery, but is not in a form suited for reuse. So, what is the possibility?&lt;br/&gt;&lt;br/&gt;BigTable (Google) or HBase (Hadoop) provides a platform where tabular data can be put in a scalable manner. In fact, I would go on to suggest that HBase should be a basic service offered by the IT department of any institution. By providing this database as a service, it should be easier to normalise, and to educate the academic users in a manner that is useful to them, not just to the archivist. Google spreadsheet is an extremely good example of how such a large, scalable database might be presented to the end-user.&lt;br/&gt;&lt;br/&gt;For archival sets with a good (RDF) description of the table, it should be possible to instantiate working versions of the tabular data on a scalable database platform like HBase on demand. Having a policy to put to 'sleep' unused datasets can provide a useful comprimise, avoiding having all the tables live but still providing a useful service. &lt;br/&gt;&lt;br/&gt;It should also be noted that the adoption of popular methods of data access should be part of the responsibility of the data providers - this will change as time goes on, and protocols and methods for access alter with fashion. Currently, Atom/RSS feeds of any part of a table of data (the google spreadsheet model) fits very well with the landscape of applications that can reuse this information.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br/&gt;&lt;ul&gt;&lt;li&gt;Try to record as much information as can be found or derived - from host operating system to column types.&lt;/li&gt;&lt;li&gt;Keep the original dataset byte-for-byte as you recieved it.&lt;/li&gt;&lt;li&gt;Try to maintain a version of the data in a well-understood format&lt;/li&gt;&lt;li&gt;Describe the tables of information in a reusable way, preferably by adopting a machine-readable mechanism&lt;/li&gt;&lt;li&gt;Be prepared to create services that the users want and need, not services that you think they should have.&lt;/li&gt;&lt;/ul&gt;&lt;br/&gt;&lt;br/&gt;&lt;div class='zemanta-pixie'&gt;&lt;img src='http://img.zemanta.com/pixy.gif?x-id=38add0ef-8f30-4993-98ab-7ae1db1f0b20' class='zemanta-pixie-img'/&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-7013951292415915268?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-7013951292415915268</guid>
         <pubDate>Sun, 22 Feb 2009 10:28:00 -0800</pubDate>
      </item>
      <item>
         <title>Some SWORD updates</title>
         <link>http://feedproxy.google.com/~r/JimDowning/~3/BgSiHvHSsms/</link>
         <description>Being an aggregate of bits and pieces associated with the SWORD protocol: - A quick note to clear up a common confusion I encountered last week at the JISC developer days: The JISC-funded project &amp;#8220;SWORD2&amp;#8243; led to the 1.3 version of the spec. There is no &amp;#8220;SWORD 2.0&amp;#8243; spec. I&amp;#8217;ve put my mercurial repository of the SWORD [...]</description>
         <guid isPermaLink="false">http://wwmm.ch.cam.ac.uk/blogs/downing/?p=293</guid>
         <pubDate>Thu, 19 Feb 2009 07:30:46 -0800</pubDate>
         <content:encoded><![CDATA[Being an aggregate of bits and pieces associated with the SWORD protocol: - <ul> <li>A quick note to clear up a common confusion I encountered last week at the JISC developer days: The JISC-funded project &#8220;SWORD2&#8243; led to the 1.3 version of the spec. There is no &#8220;SWORD 2.0&#8243; spec.</li> <li>I&#8217;ve put my mercurial repository of the <a rel="nofollow" target="_blank" href="http://bitbucket.org/jimdowning/sword-spec/">SWORD spec on bitbucket</a>. This is mainly in the interests of transparency, rather than because I think anyone will be interested.</li> <li>Mark Nottingham&#8217;s recent <a rel="nofollow" target="_blank" href="http://www.mnot.net/blog/2009/02/18/x-">blog post on the evil of experimental HTTP headers</a> points out that the headers we used in SWORD should be changed. We hadn&#8217;t read enough RFCs, clearly! I&#8217;m in the process of working out what needs to be done to register headers and change the spec in a back-compatible manner. As a result of this, I&#8217;ll probably create proposal to run the package type registry along the same lines as the IETF&#8217;s provisional header repositories. </li> </ul> <img src="http://feeds.feedburner.com/~r/JimDowning/~4/BgSiHvHSsms" height="1" width="1"/>]]></content:encoded>
         <category>SWORD</category>
      </item>
      <item>
         <title>Tracking conferences (at Dev8D) with python, twitter and tags</title>
         <link>http://oxfordrepo.blogspot.com/2009/02/tracking-conferences-at-dev8d-with.html</link>
         <description>There was so much going on at http://www.dev8d.org (#dev8d) that it might be foolish for me to attempt to write up what happened.&lt;br /&gt;&lt;div&gt;&lt;br /&gt;So, I'll focus on a small, but to my mind, crucial aspect of it - tag tracking with a focus on &lt;a rel="nofollow" target="_blank" href="http://twitter.com/"&gt;Twitter&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;The Importance of Tags&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;First, the tag (#)dev8d was cloudburst over a number of social sites - &lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/tags/dev8d/"&gt;Flickr&lt;/a&gt;(dev8d tagged photos), &lt;a rel="nofollow" target="_blank" href="http://search.twitter.com/search?q=%23dev8d"&gt;Twitter&lt;/a&gt;(dev8d feed), blogs such as the &lt;a rel="nofollow" target="_blank" href="http://dev8d.jiscinvolve.org/"&gt;JISCInvolve Dev8D site&lt;/a&gt;, and so on. This was not just done for publicity, but as a means to track and re-assemble the various inputs to and outputs from the event.&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/tags/dev8d/"&gt;Flickr&lt;/a&gt; has some really nice photos on it, shared by people like &lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/ianibbo/"&gt;Ian Ibbotson&lt;/a&gt; (who caught &lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/ianibbo/3275945388/"&gt;an urban fox&lt;/a&gt; on camera during the event!) While there was an 'official' &lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/dev8d"&gt;dev8d flickr user&lt;/a&gt;, I expect the most unexpected and most interesting photos to be shared by other people who kindly add on the dev8d tag so we can find them. For conference organisers, this means that there is a pool of images that we can choose from, each with their own provenance so we can contact the owner if we wanted to re-use, or re-publish. Of course, if the owner puts a &lt;a rel="nofollow" target="_blank" href="http://creativecommons.org/"&gt;CC licence&lt;/a&gt; on them, it makes things easier :)&lt;br /&gt;&lt;br /&gt;So, asserting a tag or label for an event is a useful thing to do in any case. But, this twinned with using a messaging system like &lt;a rel="nofollow" target="_blank" href="http://twitter.com/"&gt;Twitter&lt;/a&gt; or &lt;a rel="nofollow" target="_blank" href="http://identi.ca/"&gt;Identi.ca&lt;/a&gt;, means that you can coordinate, share, and bring together an event. There was a projector in the Basecamp room, which was either the bar, or one of the large basement rooms at Birkbeck depending on the day. Initially, this was used to run through the basic flow of events, which was primarily organised through the use of a &lt;a rel="nofollow" target="_blank" href="http://code.google.com/p/developerhappinessdays/"&gt;wiki&lt;/a&gt;, to which all of us and the attendees were members.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;Projecting the bird's eye view of the event&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;I am not entirely sure whose idea it was initially to use the projector to follow the dev8d tag on twitter, auto-refreshing itself every minute, but it would be one or more of the following: Dave Flanders(&lt;a rel="nofollow" target="_blank" href="http://twitter.com/dfflanders"&gt;@dfflanders&lt;/a&gt;), Andy McGregor(&lt;a rel="nofollow" target="_blank" href="http://twitter.com/andymcg"&gt;@andymcg&lt;/a&gt;) and Dave Tarrant(&lt;a rel="nofollow" target="_blank" href="http://twitter.com/davetaz"&gt;@davetaz&lt;/a&gt;) who is aka BitTarrant due to his network wizardry keeping the wifi going despite Birkbeck's network's best efforts at stopping any form of useful networking going.&lt;br /&gt;&lt;br /&gt;The funny thing about the feed being there, was that it felt perfectly natural from the start. Almost like a mix of notice board, event liveblog and facebook status updates, but the overall effect was like it was the&lt;i&gt; bird's eye view&lt;/i&gt; of the entire event, which you could dip into and out of at will, follow up on talks you weren't even attending, catch interesting links that people posted, and just follow the whole event while doing your own thing.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;Then things got interesting.&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;From what I heard, a conversation in the bar about developer happiness (involving &lt;a rel="nofollow" target="_blank" href="http://twitter.com/rgardler"&gt;@rgardler&lt;/a&gt;?) lead to &lt;a rel="nofollow" target="_blank" href="http://samscam.co.uk/"&gt;Sam Easterby-Smith&lt;/a&gt; (&lt;a rel="nofollow" target="_blank" href="http://twitter.com/samscam"&gt;@samscam&lt;/a&gt;) to create a script that dug through the dev8d tweets looking for &lt;i&gt;n/m&lt;/i&gt; (like 7/10) and to use that as a mark of happyness e.g.&lt;br /&gt;&lt;blockquote&gt;&lt;a rel="nofollow" target="_blank" href="http://twitter.com/samscam"&gt;" @samscam&lt;/a&gt; #dev8d I am seriously 9/10 happy &lt;a rel="nofollow" target="_blank" href="http://samscam.co.uk/happier"&gt;http://samscam.co.uk/happier&lt;/a&gt; HOW HAPPY ARE YOU? " &lt;a rel="nofollow" target="_blank" href="http://twitter.com/samscam/status/1197185415"&gt; (Tue, 10 Feb 2009 11:17:15)&lt;/a&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;img src="http://lh4.ggpht.com/_KLlGSypGAvw/SZvxf5lx4rI/AAAAAAAAAEA/C9twrbS5xgE/%5BUNSET%5D.png?imgmax=800" style="max-width:800px;"/&gt;&lt;br /&gt;&lt;br /&gt;And computed the average happyness and overall happyness of those who tweeted how they were doing!&lt;br /&gt;&lt;br /&gt;Of course, being friendly, constructive sorts, we knew the best way to help 'improve' his happyometer was to try to break it by sending it bad input... *ahem*.&lt;br /&gt;&lt;blockquote&gt;&lt;a rel="nofollow" target="_blank" href="http://twitter.com/samscam"&gt;" @samscam&lt;/a&gt; #dev8d based on instant discovery of bugs in the Happier Pipe am now only 3/5 happy " (&lt;a rel="nofollow" target="_blank" href="http://twitter.com/samscam/statuses/1197215138"&gt;Tue, 10 Feb 2009 23:05:05&lt;/a&gt;)&lt;br /&gt;&lt;/blockquote&gt;BUT things got fixed, and the community got involved and interested. It caused talk and debate, got people wondering how that it was done, how they could do the same thing and how to take it further.&lt;br /&gt;&lt;br /&gt;At which point, I thought it might be fun to 'retweet' the happyness ratings as they change, to keep a running track of things. And so, a purpose for &lt;a rel="nofollow" target="_blank" href="http://twitter.com/randomdev8d"&gt;@randomdev8d&lt;/a&gt; was born:&lt;br /&gt;&lt;br /&gt;&lt;img src="http://lh4.ggpht.com/_KLlGSypGAvw/SZvxqf_Xz5I/AAAAAAAAAEE/Gr_rAh0ojPs/%5BUNSET%5D.png?imgmax=800" style="max-width:800px;"/&gt;&lt;br /&gt;&lt;br /&gt;How I did this was fairly simple: I grabbed his page every minute or so, used BeautifulSoup to parse the HTML, got the happyness numbers out and compared it to the last ones the script had seen. If there was a change, it tweeted it and seconds later, the projected tweet feed updated to show the new values - a disproportionate feedback loop, the key to involvement in games; you do something small like press a button or add 4/10 to a message, and you can affect the stock-market ticker of happyness :)&lt;br /&gt;&lt;br /&gt;If I had been able to give my talk on the python code day, the code to do this would contain zero surprises, because I covered 99% of this - so here's my &lt;a rel="nofollow" target="_blank" href="http://developerhappinessdays.googlecode.com/files/dev8d-presentation.pdf"&gt;'slides'&lt;/a&gt;[pdf] - basically a snapshot screencast.&lt;br /&gt;&lt;br /&gt;Here's the crufty code though that did this:&lt;br /&gt;&lt;blockquote&gt;import time&lt;br /&gt;import simplejson, httplib2, BeautifulSoup&lt;br /&gt;h = httplib2.Http()&lt;br /&gt;h.add_credentials('randomdev8d','PASSWORD')&lt;br /&gt;happy = httplib2.Http()&lt;br /&gt;o = 130.9&lt;br /&gt;a = 7.7&lt;br /&gt;import urllib&lt;br /&gt;&lt;br /&gt;while True:&lt;br /&gt;print "Checking happiness...."&lt;br /&gt;(resp, content) = happy.request('http://samscam.co.uk/happier/')&lt;br /&gt;soup = BeautifulSoup.BeautifulSoup(content)&lt;br /&gt;overallHappyness = soup.findAll('div')[2].contents&lt;br /&gt;avergeHappyness = soup.findAll('div')[4].contents&lt;br /&gt;over = float(overallHappyness[0])&lt;br /&gt;ave = float(avergeHappyness[0])&lt;br /&gt;print "Overall %s - Average %s" % (over, ave)&lt;br /&gt;omess = "DOWN"&lt;br /&gt;if over &amp;gt; o:&lt;br /&gt;omess = "UP!"&lt;br /&gt;amess = "DOWN"&lt;br /&gt;if ave &amp;gt; a:&lt;br /&gt;amess= "UP!"&lt;br /&gt;if over == o:&lt;br /&gt;omess = "SAME"&lt;br /&gt;if ave == a:&lt;br /&gt;amess = "SAME"&lt;br /&gt;if not (o == over and a == ave):&lt;br /&gt;print "Change!"&lt;br /&gt;o = over&lt;br /&gt;a = ave&lt;br /&gt;tweet = "Overall happiness is now %s(%s), with an average=%s(%s) #dev8d (from http://is.gd/j99q)" % (overallHappyness[0], omess, avergeHappyness[0], amess)&lt;br /&gt;data = {'status':tweet}&lt;br /&gt;body = urllib.urlencode(data)&lt;br /&gt;(rs,cont) = h.request('http://www.twitter.com/statuses/update.json', "POST", body=body)&lt;br /&gt;else:&lt;br /&gt;print "No change"&lt;br /&gt;time.sleep(120)&lt;br /&gt;&lt;/blockquote&gt;(Available from &lt;a rel="nofollow" target="_blank" href="http://pastebin.com/f3d42c348"&gt;http://pastebin.com/f3d42c348&lt;/a&gt; with syntax highlighting - NB this was written beat-poet style, written from A to B with little concern for form. The fact that it works is a miracle, so comment on the code if you must.)&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;The grand, official #Dev8D survey!&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;... which was anything but official, or grand. The happyness-o-meter idea lead BitTarrant and I to think "Wouldn't it be cool to find out what computers people have brought here?" Essentially, finding out what computer environment developers &lt;i&gt;choose&lt;/i&gt; to use is a very valuable thing - developers choose things which make our lives easier, by and large, so finding out which setups they use by preference to develop or work with could guide later choices, such as being able to actually target the majority of environments for wifi, software, or talks.&lt;br /&gt;&lt;br /&gt;So, on the Wednesday morning, Dave put out the call on @dev8d for people to post the operating systems on the hardware they brought to this event, in the form of OS/HW. I then busied myself with writing a script that hit the twitter search api directly, and parsed it itself. As this was a more intended script, I made sure that it kept track of things properly, pickling its per-person tallys. (You could post up multiple configurations in one or more tweets, and it kept track of it per-person.) This script was a little bloated at 86 lines, so I won't post it inline - plus, it also showed that I should've gone to the regexp lesson, as I got stuck trying to do it with regexp, gave up, and then used whitespace-tokenising... but it worked fine ;)&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://pastebin.com/f2c04719b"&gt;Survey code: http://pastebin.com/f2c04719b&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Survey results:&lt;/span&gt; &lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=pDKcyrBE6SJqToHzjCs2jaQ"&gt;http://spreadsheets.google.com/pub?key=pDKcyrBE6SJqToHzjCs2jaQ&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;OS:&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Linux was the majority at 42%&lt;/span&gt; closely followed by Apple at 37% with MS-based OS at 18% with a stellar showing of one user of OpenSolaris (4%)!&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Hardware type:&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;66% were laptops, with 25% of the machines there being classed as netbooks&lt;/span&gt;. 8% of the hardware there were iPhones too, and one person claimed to have brought Amazon EC2 with them ;)&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;The post hoc analysis&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;Now then, having gotten back to normal life, I've spent a little time grabbing stuff from twitter and digging through them. Here is the &lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=pDKcyrBE6SJoJdIcm7mdpBg"&gt;list of the 1300+ tweets with the #dev8d tag in them&lt;/a&gt; published via google docs, and here is some derived things posted by Tony Hirst(&lt;a rel="nofollow" target="_blank" href="http://twitter.com/psychemedia"&gt;@psychemedia&lt;/a&gt;) and Chris Wilper(&lt;a rel="nofollow" target="_blank" href="http://twitter.com/cwilper"&gt;@cwilper&lt;/a&gt;) seconds after I posted this:&lt;br /&gt;&lt;br /&gt;Tagcloud of twitterer's:&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.wordle.net/gallery/wrdl/549364/dev8_twitterers"&gt;http://www.wordle.net/gallery/wrdl/549364/dev8_twitterers&lt;/a&gt; [java needed]&lt;br /&gt;&lt;br /&gt;Tagcloud of tweeted words:&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.wordle.net/gallery/wrdl/549350/dev8d"&gt;http://www.wordle.net/gallery/wrdl/549350/dev8d&lt;/a&gt; [java needed]&lt;br /&gt;&lt;br /&gt;And a column of all the tweeted links:&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=p1rHUqg4g423-wWQn8arcTg"&gt;http://spreadsheets.google.com/pub?key=p1rHUqg4g423-wWQn8arcTg&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;This lead me to dig through them and republish the list of tweets, but try to unminimise the urls and try to grab the &amp;amp;lt;title&amp;gt; tag of the html page it goes to, which you can find here:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=pDKcyrBE6SJpwVmV4_4qOdg"&gt;http://spreadsheets.google.com/pub?key=pDKcyrBE6SJpwVmV4_4qOdg&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;(Which incidently, lead me to spot that there was one link to "YouTube - Rick Astley - Never Gonna Give You Up" which means the hacking was all worthwhile :))&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;Graphing Happyness&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;For one, I've re-analysed the happyness tweets and posted up the following:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;A &lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=pDKcyrBE6SJqHVP8Fb7euEA"&gt;full log of happyness with timeline attached to it&lt;/a&gt;,&lt;/li&gt;&lt;li&gt;&lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=pDKcyrBE6SJoxj8D7_EWscQ"&gt;The running average, with accompanying timeline,&lt;/a&gt;&lt;/li&gt;&lt;li&gt;and the &lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=pDKcyrBE6SJp6acAAn77SZQ"&gt;average of the last 10 tweets&lt;/a&gt; in much the same way as before.&lt;/li&gt;&lt;/ul&gt;It is easier to understand the averages as graphs over time of course! You could also use Tony Hirst's excellent write up here about &lt;a rel="nofollow" target="_blank" href="http://ouseful.wordpress.com/2009/02/17/creating-your-own-results-charts-for-surveys-created-with-google-forms/"&gt;creating graphs from google forms and spreadsheets.&lt;/a&gt; I'm having issues embedding the google timeline widget here, so you'll have to make do with static graphs.&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://lh6.ggpht.com/_KLlGSypGAvw/SZr0KFxfnRI/AAAAAAAAADk/AJQI307X1As/s800/dev8d_running_total_average.png"&gt;&lt;img style="margin:0px auto 10px;display:block;text-align:center;cursor:pointer;width:800px;height:468px;" src="http://lh6.ggpht.com/_KLlGSypGAvw/SZr0KFxfnRI/AAAAAAAAADk/AJQI307X1As/s800/dev8d_running_total_average.png" alt="" border="0"/&gt;&lt;/a&gt;&lt;span style="font-weight:bold;"&gt;Average happyness over the course of the event - all tweets counted towards the average.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://lh4.ggpht.com/_KLlGSypGAvw/SZr0KnArJ0I/AAAAAAAAADs/N5OdzUBDefQ/s912/dev8d_last10HappynessTweetsCount.png"&gt;&lt;img style="margin:0px auto 10px;display:block;text-align:center;cursor:pointer;width:912px;height:510px;" src="http://lh4.ggpht.com/_KLlGSypGAvw/SZr0KnArJ0I/AAAAAAAAADs/N5OdzUBDefQ/s912/dev8d_last10HappynessTweetsCount.png" alt="" border="0"/&gt;&lt;/a&gt;&lt;span style="font-weight:bold;"&gt;Average happyness, but only the previous 10 tweets counted towards the average making it more reflective of the happyness at that time.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;If you are wondering about the first dip, that was when we all tried to break Sam's tracker by sending it bad data, a lot of 0 happyness's were recorded therefore :) As for the second dip, well, you can see that from the &lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=pDKcyrBE6SJqHVP8Fb7euEA"&gt;log of happyness&lt;/a&gt;, yourselves :)&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-3073595490420683868?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-3073595490420683868</guid>
         <pubDate>Tue, 17 Feb 2009 20:18:00 -0800</pubDate>
      </item>
      <item>
         <title>Making developers happy</title>
         <link>http://blog.paulwalk.net/2009/02/13/making-developers-happy/</link>
         <description>Since I joined UKOLN two years ago, I have frequently claimed that we (JISC, the sector, our community) don&amp;#8217;t do enough to support and listen to developers. Well, I&amp;#8217;m just back from The Developers Happiness Days (dev8D) in London and I can certainly no longer say this. A solid week of developer happiness! A week [...]</description>
         <guid isPermaLink="false">http://blog.paulwalk.net/2009/02/13/making-developers-happy/</guid>
         <pubDate>Fri, 13 Feb 2009 12:39:29 -0800</pubDate>
         <content:encoded><![CDATA[<p>Since I joined <a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk" title="UKOLN">UKOLN</a> two years ago, I have frequently claimed that we (<a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk" title="JISC">JISC</a>, the sector, our community) don&#8217;t do enough to support and listen to developers. Well, I&#8217;m just back from <a rel="nofollow" target="_blank" href="http://www.dev8d.org/">The Developers Happiness Days (dev8D)</a> in London and I can certainly no longer say this. A solid week of developer happiness! A week of ideas generated, geeks networking with users, competitive and yet collaborative development, knowledge being exchanged&#8230;. followed by fun and, yes, a bit of drinking.</p>
<p>
<img src="http://blog.paulwalk.net/wp-content/uploads/2009/02/dev8d-developers.jpg" width="480" height="204" alt="dev8d-developers.jpg" name="dev8d-developers.jpg" style="padding-right:5px;float:left;" id="dev8d-developers.jpg"/></p>
<p>The brain-child of <a rel="nofollow" target="_blank" href="http://dfflanders.wordpress.com/">David Flanders</a> and <a rel="nofollow" target="_blank" href="http://www.oxfordrepo.blogspot.com/">Ben O&#8217;Steen</a>, with support and ideas from several others and funding from the JISC, dev8D has been a fantastic success, and has managed the difficult task of appealing to, and being successful for, a range of people with varying levels of experience and technical chops. The inexperienced developer looking to be exposed to new ideas and to the wisdom of more experienced folk was well served. Julian Cheal of UKOLN fitted this description and he embraced the opportunities dev8D presented to him, engaging at all levels with the event to the extent that he was rewarded with both a prize for his helpfulness and a special mention at the awards dinner for &#8216;best newcomer&#8217;. But the older hands were fully engaged nonetheless &#8211; presenting on their areas of expertise in &#8216;lightning talks&#8217; in the true barcamp style which geeks have embraced as their own way of conducting conference sessions. It was great to see so many familiar faces together at one event, being unashamedly techie, exchanging ideas and help.</p>
<p>Although, like some others, I was forced to miss some of the event due to a deadline for bids to a JISC call falling on the Wednesday, I still managed to sit in on some sessions, and I learned plenty, especially in a talk on agile development about which I&#8217;ll blog more, separately.</p>
<p>One of the things which stood out at dev8D was the way in which users (or <a rel="nofollow" target="_blank" href="http://code.google.com/p/developerhappinessdays/wiki/UberUsers"><em>UberUsers</em></a>) were invited to engage with developers. There&#8217;s an <strong>important, non-obvious</strong> distinction here. Users were invited to come into the developers&#8217; environment. Brave users, you might say! Normally, developers are invited into the users&#8217; environment&#8230;. for just long enough to explain to them what the users require. Users would often rather not have to deal with developers all that much. To step into an environment of happy, busy developers must have been an eye-opening experience for those users who were brave enough, and open-minded enough to try it. Although I wasn&#8217;t on the &#8216;dragons den&#8217; panel looking at the prototypes being developed in the <em><a rel="nofollow" target="_blank" href="http://code.google.com/p/developerhappinessdays/wiki/DeveloperDecathlon">Developer Decathlon</a></em>, it was remarked to me several times that the quality of submissions was better than in previous events &#8211; and that this was attributed to the fact that users had been involved in the prototyping process. I&#8217;m one of the judges who&#8217;ll be marking these submissions and I&#8217;m really looking forward to seeing what was produced.</p>
<p>With these events, there are little things which can make a difference. The use of <a rel="nofollow" target="_blank" href="http://www.wordle.net/">Wordle</a> to produce personalised name badges for each delegate was inspired, as was the use of happiness tokens to reward help or ideas. The Twitter back-channel was used to tremendous effect &#8211; the &#8216;#dev8D&#8217; tag made the top ten Twitter &#8216;trends&#8217; worldwide. Sam Easterby Smith even built a <a rel="nofollow" target="_blank" href="http://samscam.co.uk/happier/">Twitter-powered developer-happiness meter</a>!</p>
<p>I&#8217;d like to go on record thanking David Flanders in particular for driving this event &#8211; the guy must be utterly exhausted after working 18 hour days for a week. I think we should also recognise the vision of those in the JISC (and especially <a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/contactus/staff/rachelbruce">Rachel Bruce</a>) who were prepared to back what must have looked like a risky proposition. There was value in the event itself &#8211; the networking, and the capacity building which went with this and I have good reason to believe there will be value in the prototypes and ideas generated as a result. But, perhaps most importantly, the sector has just shown the world that it values its developers, and is prepared to invest in them, and even spend a little to make them happy. I believe this will have been a wise investment. As I said on Twitter, <em><a rel="nofollow" target="_blank" href="http://twitter.com/paulwalk/status/1204506122">there&#8217;s a community developing which I&#8217;m proud to be associated with.</a></em></p>
<p>If you want to know more, the tag &#8216;dev8D&#8217; has been used extensively in various systems. Some examples:</p>
<ul>
<li><a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/tags/dev8d">Photos on Flickr</a></li>
<li><a rel="nofollow" target="_blank" href="http://blogsearch.google.com/blogsearch?hl=en&amp;ie=UTF-8&amp;q=dev8d&amp;btnG=Search+Blogs">Google blog search</a></li>
</ul>
<p><em><br /></em></p>
<p style="text-align:right;"><em>Image by Dave Pattern (<a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/davepattern/3274205523/sizes/m/">http://www.flickr.com/photos/davepattern/3274205523/sizes/m/</a>)</em></p>]]></content:encoded>
      </item>
      <item>
         <title>briankelly: Blog post on possible synergies between UK's #IWMW event &amp;amp; US's #eduWeb http://is.gd/jgkK (with refs to #Dev8D / #CRIG folk</title>
         <link>http://twitter.com/briankelly/statuses/1202055528</link>
         <description>briankelly: Blog post on possible synergies between UK's #IWMW event &amp;amp; US's #eduWeb http://is.gd/jgkK (with refs to #Dev8D / #CRIG folk</description>
         <guid isPermaLink="false">http://twitter.com/briankelly/statuses/1202055528</guid>
         <pubDate>Thu, 12 Feb 2009 00:48:36 -0800</pubDate>
      </item>
      <item>
         <title>Repository Road</title>
         <link>http://feedproxy.google.com/~r/DFFlanders/~3/YtxdgvYx9-o/</link>
         <description>Repository Road Originally uploaded by davidflanders Came accross this the other day, I&amp;#8217;m sure it will show up in many
presentations from hence forward. &lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&amp;blog=532341&amp;post=169&amp;subd=dfflanders&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://dfflanders.wordpress.com/2009/01/23/repository-road/</guid>
         <pubDate>Fri, 23 Jan 2009 08:02:43 -0800</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><div style="float:right;margin-left:10px;margin-bottom:10px;"><a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/dff1978/3219753829/" title="photo sharing"><img src="http://farm4.static.flickr.com/3126/3219753829_a2257ba853.jpg" alt="" style="border:solid 2px #000000;"/> </a><br /><br /><span style="font-size:.9em;margin-top:0;"><br /><a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/dff1978/3219753829/">Repository Road</a><br /> <br /> Originally uploaded by <a rel="nofollow" target="_blank" href="http://www.flickr.com/people/dff1978/">davidflanders</a><br /> </span>
</div>
<p>Came accross this the other day, I&#8217;m sure it will show up in many<br />
presentations from hence forward.<br /></p> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/dfflanders.wordpress.com/169/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dfflanders.wordpress.com/169/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/dfflanders.wordpress.com/169/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dfflanders.wordpress.com/169/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/dfflanders.wordpress.com/169/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dfflanders.wordpress.com/169/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/dfflanders.wordpress.com/169/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dfflanders.wordpress.com/169/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/dfflanders.wordpress.com/169/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dfflanders.wordpress.com/169/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&blog=532341&post=169&subd=dfflanders&ref=&feed=1"/></div>]]></content:encoded>
         <media:content url="http://0.gravatar.com/avatar/8cdc26501af3b7e571a4d23f0afbbf08?s=96&amp;amp;d=identicon&amp;amp;r=G" medium="image">
            <media:title>dfflanders</media:title>
         </media:content>
         <media:content url="http://farm4.static.flickr.com/3126/3219753829_a2257ba853.jpg" medium="image" />
         <category>Uncategorized</category>
      </item>
      <item>
         <title>EPrints 3.2 - Amazon S3/Cloudfront Plug-in</title>
         <link>http://davetaz-blog.blogspot.com/2009/01/eprints-32-amazon-s3cloudfront-plug-in.html</link>
         <description>A quick post to say that we have just successfully tested an EPrints 3.2 (svn) install with the new Storage Controller plugged into Amazon S3! &lt;br /&gt;&lt;br /&gt;This has quiet a lot of implications for both EPrints and other projects wanting to provide external services which operate on objects in a repository. We hope to bring people more news on this at the upcoming Open Repositories 2009 conference in Atlanta. &lt;br /&gt;&lt;br /&gt;For more information on this all check out storage section on the Preserv2 website @ &lt;a rel="nofollow" target="_blank" href="http://www.preserv.org.uk"&gt;www.preserv.org.uk&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-7166652851684473013?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-7166652851684473013</guid>
         <pubDate>Tue, 20 Jan 2009 21:45:00 -0800</pubDate>
      </item>
      <item>
         <title>CRIG Awards Dinner</title>
         <link>http://feedproxy.google.com/~r/DFFlanders/~3/ExdNAMGbsbQ/</link>
         <description>Dear CRIG,
We would like to cordially invite each of you to a 5* Awards Dinner on Thursday night February 12th at the stunning Court Restaurant atop the British Museum Reading Rooms.
If we could ask you to please RSVP at the below link. Also prior to RSVPing could you consider whom to nominate for the award [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&amp;blog=532341&amp;post=167&amp;subd=dfflanders&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://dfflanders.wordpress.com/?p=167</guid>
         <pubDate>Mon, 19 Jan 2009 01:17:31 -0800</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><p>Dear CRIG,</p>
<p>We would like to cordially invite each of you to a 5* Awards Dinner on Thursday night February 12th at the stunning Court Restaurant atop the British Museum Reading Rooms.</p>
<p>If we could ask you to please RSVP at the below link. Also prior to RSVPing could you consider whom to nominate for the award of “exemplary innovation in the community” (the nomination is part of the RSVP form).</p>
<p>http://is.gd/ey8P</p>
<p>Also as part of the developerHappinessDays event (Feb 9-13th), there will be a final CRIG barcamp on Friday (13th) at Birkbeck college where lead developers for EPrints, DSPace, FedoraCommons and Microsoft will be present to discuss the latest advancement in repository innovation. If you would like to attend this day please sign up for dev8D below and select the “roll your own day” and put down “repository summit”:</p>
<p>http://www.dev8d.org/book.html</p>
<p>Hope to see you all there.</p>
<p>Kind Regards,</p>
<p>CRIG Support Team</p> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/dfflanders.wordpress.com/167/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dfflanders.wordpress.com/167/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/dfflanders.wordpress.com/167/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dfflanders.wordpress.com/167/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/dfflanders.wordpress.com/167/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dfflanders.wordpress.com/167/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/dfflanders.wordpress.com/167/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dfflanders.wordpress.com/167/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/dfflanders.wordpress.com/167/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dfflanders.wordpress.com/167/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&blog=532341&post=167&subd=dfflanders&ref=&feed=1"/></div>]]></content:encoded>
         <media:content url="http://0.gravatar.com/avatar/8cdc26501af3b7e571a4d23f0afbbf08?s=96&amp;amp;d=identicon&amp;amp;r=G" medium="image">
            <media:title>dfflanders</media:title>
         </media:content>
      </item>
      <item>
         <title>Developer happiness</title>
         <link>http://blog.paulwalk.net/2009/01/13/developer-happiness/</link>
         <description>Are you a developer of software? Could you be happier? If so, come along to the JISC Developer Happiness Days event! From the website: Over four intensive days we&amp;#8217;re bringing together the cream of the crop of educational software developers along with coders from other sectors, users, and technological tinkerers in an exciting new forum.
Share your skills [...]</description>
         <guid isPermaLink="false">http://blog.paulwalk.net/2009/01/13/developer-happiness/</guid>
         <pubDate>Tue, 13 Jan 2009 08:14:05 -0800</pubDate>
         <content:encoded><![CDATA[<p>Are you a developer of software? Could you be happier? If so, come along to the <a rel="nofollow" target="_blank" href="http://www.dev8d.org/">JISC Developer Happiness Days event</a>!</p>
<p>
<img src="http://blog.paulwalk.net/wp-content/uploads/2009/01/dev8d.jpg" width="458" height="76" alt="dev8d.jpg"/></p>
<p>From the website:</p>
<blockquote>
<p>Over four intensive days we&#8217;re bringing together the cream of the crop of educational software developers along with coders from other sectors, users, and technological tinkerers in an exciting new forum.</p>
<p>Share your skills and knowledge with the coding community in a stimulating and fun environment and come away with new skills, fresh contacts – and you might even win a prize.</p>
<p>The top ideas generated at the event will be documented, publicised and made available to the community.</p>
<p>This innovative event includes:</p>
<ul>
<li>Pre-event workshops with code labs to share knowledge and develop skills</li>
<li>The &#8216;developer decathlon&#8217;: a two-day team coding session with prizes for the best code</li>
<li>A community-building day bringing together developers based around software platforms</li>
<li>Roll-your-own-event: space and time for your own community-specific event or workshop</li>
</ul>
</blockquote>
<p>I think this will be an excellent opportunity for developers from the education sector especially, but also from beyond this, to get together and share ideas and experience. And drink beer.</p>
<p>It&#8217;s all happening in London, in and around the Birkbeck College, between the 9th-13th February.</p>
<p>The tag for this event is &#8216;dev8d&#8217;.</p>]]></content:encoded>
      </item>
      <item>
         <title>Push or pull?</title>
         <link>http://blog.paulwalk.net/2009/01/08/push-or-pull/</link>
         <description>A brief comment, as I hop across the North Sea back to Bristol.
With the news that arXiv will now accept deposits from institutional repositories, Dorothea Salo continues her theme about a deposit flow which goes from author, to institutional repository, to subject/discipline repository. Dorothea offers some scenarios, including: Achaea University adopts a Harvard-style open-access mandate. If [...]</description>
         <guid isPermaLink="false">http://blog.paulwalk.net/?p=154</guid>
         <pubDate>Thu, 08 Jan 2009 11:11:19 -0800</pubDate>
         <content:encoded><![CDATA[<p>A brief comment, as I hop across the North Sea back to Bristol.</p>
<p>With the <a rel="nofollow" target="_blank" href="http://www.earlham.edu/~peters/fos/2009/01/arxiv-has-implemented-sword.html">news that arXiv will now accept deposits from institutional repositories</a>, <a rel="nofollow" target="_blank" href="http://cavlec.yarinareth.net/2009/01/07/its-a-poor-sword-that-doesnt-point-both-ways/">Dorothea Salo continues her theme</a> about a deposit flow which goes from author, to institutional repository, to subject/discipline repository. Dorothea offers some scenarios, including:</p>
<blockquote>
<p>Achaea University adopts a Harvard-style open-access mandate. If she wants her articles in arXiv as well, Dr. Troia must rather annoyingly dual-deposit… unless Achaea’s IR implements a deposit pipeline to arXiv, in which case the most she has to do is tick a ticky-box (and I can imagine ways to abstract away the ticky-box).</p>
</blockquote>
<p>In an abstract sense I appreciate the notion of the &#8216;deposit pipeline&#8217;. I also agree with the main point which is about the <em>direction</em> of the flow. Indeed, I have <a rel="nofollow" target="_blank" href="http://blog.paulwalk.net/2008/07/07/repository-architecture-83/">previously characterized</a> the institutional repository as being, or more usually containing, the <em>source</em> repository. However, I remain slightly doubtful about the need for the flow to be initiated by the source. If there were some mechanism by which the subject/discipline repository could be <em>alerted</em> to the appearance of relevant materials in the institutional repository, then doesn&#8217;t it make sense for the subject repository to <em>fetch</em> the record/artefact, rather than wait to have it sent. Well, we already have the mechanism, it&#8217;s called RSS (or Atom) and it&#8217;s already supported by some of our most popular repository software.</p>
<p>Come to think of it, an even better approach might be for the subject repository, having been alerted to a new &amp; relevant deposit in the institutional repository, to simply maintain a pointer to the original (optionally creating new and related resources)</p>
<p>In other words, as a certain generation of programmers would put it, <em>pass by reference</em>, not by <em>copy</em>.</p>]]></content:encoded>
      </item>
      <item>
         <title>eFoundations: The apples and oranges of Shibboleth and OpenID</title>
         <link>http://efoundations.typepad.com/efoundations/2008/12/the-apples-and-oranges-of-shibboleth-and-openid.html</link>
         <description>I&amp;#039;ve found the Computing Services at Cambridge to be very positive about OpenID. Reports like that aren&amp;#039;t likely to move it any closer at a strategic level though.</description>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_33b132a13fa53a9fa20e5bf44605c7f4</guid>
         <pubDate>Wed, 24 Dec 2008 00:10:57 -0800</pubDate>
      </item>
      <item>
         <title>Web feeds and repositories</title>
         <link>http://feedproxy.google.com/~r/JimDowning/~3/Pg3zo6KqhAw/</link>
         <description>I was invited to give a presentation on RSS and Atom as part of a SUETr workshop on interoperability yesterday. Of course I didn&amp;#8217;t even scratch the surface of what can be achieved with feeds in terms of mash-ups, 3rd party sites and visualisations - but I did try to get across the breadth of [...]</description>
         <guid isPermaLink="false">http://wwmm.ch.cam.ac.uk/blogs/downing/?p=279</guid>
         <pubDate>Wed, 10 Dec 2008 04:07:27 -0800</pubDate>
         <content:encoded><![CDATA[I was invited to give a presentation on RSS and Atom as part of a <a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/events/suetr-2008/">SUETr workshop on interoperability</a> yesterday. Of course I didn&#8217;t even scratch the surface of what can be achieved with feeds in terms of mash-ups, 3rd party sites and visualisations - but I did try to get across the breadth of &#8216;repository&#8217; problems feeds can address, and the importance of feeds in easy wins to add value to your repository efforts (this theme courtesy of <a rel="nofollow" target="_blank" href="http://repositoryman.blogspot.com/">Les Carr on his blog</a>). The slides can be downloaded from either of these places: -
<ul> <li><a rel="nofollow" target="_blank" href="http://www.slideshare.net/jimdowning/web-feeds-and-repositories-presentation">http://www.slideshare.net/jimdowning/web-feeds-and-repositories-presentation</a></li> <li><a rel="nofollow" target="_blank" href="http://www.dspace.cam.ac.uk/handle/1810/206424">http://www.dspace.cam.ac.uk/handle/1810/206424</a></li>
</ul><img src="http://feeds.feedburner.com/~r/JimDowning/~4/Pg3zo6KqhAw" height="1" width="1"/>]]></content:encoded>
      </item>
      <item>
         <title>How can institutional processes better support flexible learning?</title>
         <link>http://www.cetis.ac.uk/members/scott/blogview?entry=20081202163149</link>
         <description>&lt;p&gt;This was the topic of a session I ran at the JISC CETIS conference; focussing on the agendas of work-based learning and other policy initiatives, the question for institutions is what would need to change, and what areas are actually ready. We had groups develop ideas and pitch them to video.&lt;/p&gt; &lt;p&gt;I had participants consider the drivers and influences on the process, the potential impact, the readiness for change, and the types of interventions that would be useful.&lt;/p&gt; &lt;p&gt;Briefly, the ideas presented were:&lt;/p&gt; &lt;p&gt;&lt;b&gt;Validation processes that are agile and proportionate&lt;/b&gt;, enabling smaller courses and courses on demand. The recommendation is that pilots are developed with the regulatory agencies involved so that institutions can try out more flexible approaches to designing, validating and offering courses.&lt;/p&gt; &lt;p&gt;&lt;b&gt;Enabling the use of net resources in education&lt;/b&gt;, supporting teachers and students in making effective use of resources and exercising appropriate discrimination. Recommendation is for materials supporting teacher education and student skills.&lt;/p&gt; &lt;p&gt;&lt;b&gt;Marking processes that supports personalised coursework&lt;/b&gt;, where the submissions are less media-specific, enabling students to submit work in media they are confident in (e.g. video, text, audio) without causing problems for markers and institutions. The recommendation is to support a toolkit for "social marking" that involves students as well as staff in holistic rather than atomic assessment of student work.&lt;/p&gt; &lt;p&gt;&lt;b&gt;Recognising prior experience in formal education&lt;/b&gt;, developing support for Accreditation of Prior Experiential Learning, and associated information, advice and guidance, particularly to support workforce development and linking education with employment. The recommendation is to support process modelling to better understand how APEL and similar processes fit today, and can be enhanced in the future.&lt;/p&gt; &lt;p&gt;&lt;b&gt;Making the VLE flexible to handle new ways of learning&lt;/b&gt;, decoupling the processes of planning, engagement, and assessment in the VLE and reconnecting them more flexibly through a coordination mechanism, supporting, for example, engaging in academic planning and assessment in one organisation, but engagement in another - such as in a work-based system.&lt;/p&gt; &lt;p&gt;In their own words, here they are:&lt;/p&gt; &lt;iframe class="embeddedvideo" src="http://www.youtube.com/v/XzDtgtIcTS0&amp;hl=en&amp;fs=1" type="application/x-shockwave-flash" width="425" height="344"&gt;&lt;/iframe&gt; &lt;p&gt;More information &lt;a rel="nofollow" target="_blank" href="http://wiki.cetis.ac.uk/Re-inventing_institutional_Processes_to_Support_flexible_Learning"&gt;on the wiki&lt;/a&gt;&lt;/p&gt;</description>
         <author>Scott Wilson</author>
         <guid isPermaLink="false">tag:cetis.ac.uk,2008-12-02:20081202163149</guid>
         <pubDate>Tue, 02 Dec 2008 08:31:49 -0800</pubDate>
      </item>
      <item>
         <title>Caveat Lector » Blog Archive » Home-grown versus outsourced repository software</title>
         <link>http://cavlec.yarinareth.net/2008/11/24/home-grown-versus-outsourced-repository-software/</link>
         <description>Interesting commentary on development communities around open source repositories.</description>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_84eb55f2d9a173f8fd6f3c125d6a18df</guid>
         <pubDate>Tue, 25 Nov 2008 01:57:48 -0800</pubDate>
      </item>
      <item>
         <title>Big Blue Brainstorm</title>
         <link>http://www.businessweek.com/magazine/content/06_32/b3996062.htm</link>
         <description>The big blue jumps in with the rest of barcamping fools!</description>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_6e4540b5fb634bb0ed3877e63960a9ec</guid>
         <pubDate>Sun, 23 Nov 2008 06:34:28 -0800</pubDate>
      </item>
      <item>
         <title>Quotes from Panlibus JP Rangaswami Podcast</title>
         <link>http://feedproxy.google.com/~r/JimDowning/~3/2m3NV8UiXZQ/</link>
         <description>[I've only just found a good mechanism and time to listen to podcasts, so this is a little after the event, still worthwhile I hope.] Earlier this month Richard Wallis of Talis interviewed JP Rangaswami at BT, and posted a podcast of the conversation. Sterling stuff - I thoroughly recommend listening to it in full. I&amp;#8217;ve [...]</description>
         <guid isPermaLink="false">http://wwmm.ch.cam.ac.uk/blogs/downing/?p=267</guid>
         <pubDate>Fri, 21 Nov 2008 03:00:25 -0800</pubDate>
         <content:encoded><![CDATA[<p><small>[I've only just found a good mechanism and time to listen to podcasts, so this is a little after the event, still worthwhile I hope.]</small></p> <p>Earlier this month Richard Wallis of Talis interviewed JP Rangaswami at BT, and posted a <a rel="nofollow" target="_blank" href="http://blogs.talis.com/panlibus/archives/2008/11/the-economics-of-scarcity-jp-rangaswami-talks-ahead-of-online-information-2008.php">podcast of the conversation</a>. Sterling stuff - I thoroughly recommend listening to it in full. I&#8217;ve pulled out some of the bits as quotes here. </p> <blockquote>If you work in a very vendor dominated world you can abdicate responsibility for
a lot of what you do by transferring not just the risk, but the <del datetime="2008-11-30T22:04:53+00:00">world</del>reward to the
vendor. That doesn&#8217;t scale any more.</blockquote> <blockquote>If a problem is generic, look to the open source community to solve it. If it&#8217;s
a narrow market for the problem &#8230; then look to the commercial environment to
solve it. If it is unique to your enterprise, you&#8217;d better solve it yourself,
because no-one else is going to solve it for you.</blockquote> <blockquote>We&#8217;ve lived through a whole generation of mistakes when we had proprietary
architectures for the way we had information in enterprises. First you paid
money to completely drown the information in concrete, then you paid money to
dig it out to move it somewhere else. That&#8217;s what enterprise application
integration looked like, spending money to sticking it into
somebody&#8217;s silo then spending even more money taking it out of silos. Instead of
exposing data you were excavating data, and paying for the privilege of your own
data. That is the danger we face if we don&#8217;t get issues to do with identity,
with authentication and permissioning, with intellectual property rights correct
in this generation. Because we will end up repeatedly wasting money digging out
stuff that should have been made available much more cheaply because the costs
of reproduction and transmission are going down.</blockquote> <p>Wishing I could self-replicate and get to Online Information (as well as going to the DCC Conference) to hear JP speak there!</p><img src="http://feeds.feedburner.com/~r/JimDowning/~4/2m3NV8UiXZQ" height="1" width="1"/>]]></content:encoded>
      </item>
      <item>
         <title>Panlibus » Blog Archive » The economics of scarcity – JP Rangaswami talks ahead of Online Information 2008</title>
         <link>http://blogs.talis.com/panlibus/archives/2008/11/the-economics-of-scarcity-jp-rangaswami-talks-ahead-of-online-information-2008.php</link>
         <description>Finally got around to listening to this today. Fantastic stuff with insight on topics from Open Source to scholarly publishing to librarians and digital freedoms. Nearly shouted hallelujah on the train at the bit about enterprise integration being like sticking your data in a hole and filling it with concrete!</description>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_edf0c3440ce4e66d0da50714fabf1f89</guid>
         <pubDate>Thu, 20 Nov 2008 04:31:03 -0800</pubDate>
      </item>
      <item>
         <title>The Future of Repositories? Patterns for (Cross-)Repository Architectures</title>
         <link>http://dlib.org/dlib/november08/aschenbrenner/11aschenbrenner.html</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_f58eae525e5d7bb29011ae9d0e8c4ce1</guid>
         <pubDate>Wed, 19 Nov 2008 02:40:55 -0800</pubDate>
      </item>
      <item>
         <title>The Future of Repositories? Patterns for (Cross-)Repository Architectures</title>
         <link>http://dlib.org/dlib/november08/aschenbrenner/11aschenbrenner.html</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_f58eae525e5d7bb29011ae9d0e8c4ce1</guid>
         <pubDate>Tue, 18 Nov 2008 07:34:36 -0800</pubDate>
      </item>
      <item>
         <title>Twitter / Richard Akerman: @dfflanders idea (not mine ...</title>
         <link>http://twitter.com/scilib/status/1008777800</link>
         <description>tweets from SPARC-DR</description>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_fea585e820379c8ddb30a6b818d24cfb</guid>
         <pubDate>Mon, 17 Nov 2008 00:44:00 -0800</pubDate>
      </item>
      <item>
         <title>1,000 Web APIs</title>
         <link>http://blog.programmableweb.com/2008/11/03/1000-web-apis/</link>
         <description>Some nice statistics in regards to API trends.</description>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_1e22f3334b3e6b31e905bd236a6e5456</guid>
         <pubDate>Fri, 14 Nov 2008 13:01:56 -0800</pubDate>
      </item>
      <item>
         <title>Beginning with RDF triplestores - a 'survey'</title>
         <link>http://oxfordrepo.blogspot.com/2008/11/beginning-with-rdf-triplestores.html</link>
         <description>&lt;div&gt;Like last time, this was prompted by an email that eventually was passed to me. It was a call for opinion - "&lt;tt&gt;&lt;font color='#737373'&gt;we thought we'd check first to see what software&lt;/font&gt;&lt;/tt&gt;&lt;tt&gt;&lt;font color='#737373'&gt; either of you recommend or use for an RDF database.&lt;/font&gt;&lt;/tt&gt;"&lt;br/&gt;&lt;br/&gt;It's a good question.&lt;br/&gt;&lt;br/&gt;In fact, it's a really great question, as searching for similar advice online results in very few opinions on the subject.&lt;br/&gt;&lt;br/&gt;But which one's are the best for novices? Which have the best learning curves? which has the easiest install or the shortest time between starting out and being able to query things?&lt;br/&gt;&lt;br/&gt;I'll try to pose as much as I can as a newcomer which won't be too hard :) Some of the comments will be my own, and some will be comments from others, but I'll try to be as honest as I can be to reflect new user expectation and experience and most importantly, developer-attention span. (See the end for some of my reasons for this approach.)&lt;br/&gt;&lt;br/&gt;&lt;em&gt;(Puts on newbie hat and enables PEBKAC mode.)&lt;/em&gt;&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Installable (local) triplestores&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Sesame&lt;/b&gt; - &lt;a rel="nofollow" target="_blank" href='http://www.openrdf.org/'&gt;http://www.openrdf.org/&lt;/a&gt;&lt;br/&gt;&lt;br/&gt;Simple menu on the left of the website, one called downloads. Great, I'll give that a whirl. "Download &lt;a rel="nofollow" target="_blank" href='http://sourceforge.net/project/showfiles.php?group_id=46509&amp;amp;package_id=168413'&gt;the latest Sesame 2.x release&lt;/a&gt;" looks good to me. Hmm 5 differently named files... I'll grab the 'onejar' file and try to run it. "Failed to load Main-Class manifest attribute from openrdf-sesame-2.2.1-onejar.jar", okay... so back to the site to find out how to install this thing.&lt;br/&gt;&lt;br/&gt;No links for installation guide... on the &lt;a rel="nofollow" target="_blank" href='http://www.openrdf.org/documentation.jsp'&gt;Documentation&lt;/a&gt; page, no link for installation instructions for the sesame 2.2.1 I downloaded, but there is &lt;a rel="nofollow" target="_blank" href='http://www.openrdf.org/doc/sesame2/users/'&gt;Sesame 2 user documentation&lt;/a&gt; and &lt;a rel="nofollow" target="_blank" href='http://www.openrdf.org/doc/sesame2/system/'&gt;Sesame 2 system documentation&lt;/a&gt;. Phew, after guessing that the user documentation might have the guide, I finally found the &lt;a rel="nofollow" target="_blank" href='http://www.openrdf.org/doc/sesame2/users/ch06.html'&gt;installation guide&lt;/a&gt; (system documentation was about the architecture, not how to administer the system as you might expect.)&lt;br/&gt;&lt;br/&gt;(Developer losing interest...)&lt;br/&gt;&lt;br/&gt;Ah, I see, I need the SDK. I wonder what that 'onejar' was then... "The deployment process is container-specific, please consult the&lt;br/&gt; documentation for your container on how to deploy a web application. " - right, okay... let's assume that I have a Java background and am not just a user wanting to hook into it from my language of choice, such as php, ruby, python, or dare I say it, javascript.&lt;br/&gt;&lt;br/&gt;(Only Java-friendly developers continue on)&lt;br/&gt;&lt;br/&gt;Right, got Tomcat, and put in the war file... right so, now I need to work out how to use a &lt;a rel="nofollow" target="_blank" href='http://www.openrdf.org/doc/sesame2/users/ch07.html#d0e354'&gt;commandline&lt;/a&gt; console tool to set up a 'repository'... does this use SVN or CVS then? Oh, it doesn't do anything unless I end the line with a period. I thought it had hung trying to connect! "Triple indexes [spoc,posc]" Wha? Well, whatever that was, the test repository is created. Let's see what's at http://localhost:8080/openrdf-sesame then. &lt;br/&gt;&lt;br/&gt;"You are currently accessing an OpenRDF Sesame server. This server is&lt;br/&gt;intended to be accessed by dedicated clients, using a specialized&lt;br/&gt;protocol. To access the information on this server through a browser,&lt;br/&gt;we recommend using the OpenRDF Workbench software."&lt;br/&gt;&lt;br/&gt;Bugger. Google for "sesame clients" then.&lt;br/&gt;&lt;ul&gt;&lt;li&gt;There is a Java client it seems, but it seems to need a lot to get going. Oh, and useful if my application is in Java or in a JVM (jRuby, jython)&lt;br/&gt;&lt;/li&gt;&lt;li&gt;&lt;a rel="nofollow" target="_blank" href='http://jeenbroekstra.blogspot.com/2008/09/sesame-2-desktop-client.html'&gt;http://jeenbroekstra.blogspot.com/2008/09/sesame-2-desktop-client.html&lt;/a&gt; .Net GUI... not so useful for programmatic stuff&lt;/li&gt;&lt;li&gt;...&lt;/li&gt;&lt;/ul&gt;I've pretty much given up at this point. If I knew I needed to use a triplestore then I might have persisted, but if I was just investigating it? I would've probably given up earlier.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Mulgara&lt;/b&gt; - &lt;a rel="nofollow" target="_blank" href='http://www.mulgara.org/'&gt;http://www.mulgara.org/&lt;/a&gt;&lt;br/&gt;&lt;br/&gt;Nice, they've given the frontpage some style, not too keen on orange, but the effort makes it look professional. "&lt;em&gt;Mulgara&lt;/em&gt; is a scalable RDF database written entirely in &lt;strong&gt;&lt;a rel="nofollow" class='styleBlack' target="_blank" href='http://java.com/'&gt;Java&lt;/a&gt;&lt;/strong&gt;." -&amp;gt; Great, I found what I am looking for, and it warns me it needs Java. "DOWNLOAD NOW" - that's pretty clear. *click*&lt;br/&gt;&lt;br/&gt;Hmm, where's the style gone? Lots of download options, but thankfully one is marked by "These released binaries are all that are required for most applications." so I'll grab &lt;a rel="nofollow" target="_blank" href='http://www.mulgara.org/files/v2.0.6/mulgara-2.0.6-bin.tar.gz'&gt;those&lt;/a&gt;. 25Mb? Wow...&lt;br/&gt;&lt;br/&gt;Okay, it's downloaded and unpacked now. Let's see what we've got - a 'dist/' directory and two jars. Well, I guess I should try to run one (wonder what the licence is, where's the README?)&lt;br/&gt;&lt;blockquote&gt;&lt;em&gt;Mulgara Semantic Store Version 2.0.6 (Build 2.0.6.local) INFO [main] (EmbeddedMulgaraServer.java:715) - RMI Registry started automatically on port 10990 [main] INFO org.mulgara.server.EmbeddedMulgaraServer - RMI Registry started automatically on port 1099 INFO [main] (EmbeddedMulgaraServer.java:738) - java.security.policy set to jar:file:/home/ben/Desktop/apache-tomcat-6.0.18/mulgara-2.0.6/dist/mulgara-2.0.6.jar!/conf/mulgara-rmi.policy3 [main] INFO org.mulgara.server.EmbeddedMulgaraServer - java.security.policy set to jar:file:/home/ben/Desktop/apache-tomcat-6.0.18/mulgara-2.0.6/dist/mulgara-2.0.6.jar!/conf/mulgara-rmi.policy2008-11-14 14:06:39,899 INFO Database - Host name aliases for this server are: [billpardy, localhost, 127.0.0.1]&lt;/em&gt;&lt;br/&gt;&lt;/blockquote&gt;Well, I guess something has started... back to the site, there is a documentation page and a wiki. A quick view of the official documentation has just confused me, is &lt;a rel="nofollow" target="_blank" href='http://docs.mulgara.org/'&gt;this an external site&lt;/a&gt;? No easy link to something like 'getting started' or tutorials. I've heard of SPARQL, what's iTQL? nevermind, let's see if the &lt;a rel="nofollow" target="_blank" href='http://www.mulgara.org/trac/wiki'&gt;wiki&lt;/a&gt; is more helpful.&lt;br/&gt;&lt;br/&gt;Let's try '&lt;a rel="nofollow" target="_blank" href='http://www.mulgara.org/trac/wiki/Docs'&gt;Documentation&lt;/a&gt;' - sweet, first link looks like what I want - &lt;a rel="nofollow" target="_blank" href='http://www.mulgara.org/trac/wiki/WebUI' class='wiki'&gt;Web User Interface&lt;/a&gt;.&lt;br/&gt;&lt;blockquote&gt;A default configuration for a standalone Mulgara server runs a set of&lt;br/&gt;web services, including the Web User Interface. The standard&lt;br/&gt;configuration puts uses port 8080, so the web services can be seen by&lt;br/&gt;pointing a browser on the server running Mulgara to &lt;a rel="nofollow" target="_blank" href='http://localhost:8080/' class='ext-link'&gt;&lt;span class='icon'&gt;http://localhost:8080/&lt;/span&gt;&lt;/a&gt;.&lt;br/&gt;&lt;/blockquote&gt;Ooo cool. *click* &lt;br/&gt;&lt;blockquote&gt;&lt;h2&gt;Available Services&lt;/h2&gt;&lt;ul&gt;&lt;li&gt;&lt;a rel="nofollow" target="_blank" href='http://localhost:8080/sparql'&gt;SPARQL HTTP Service&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a rel="nofollow" target="_blank" href='http://localhost:8080/webui'&gt;User Interface&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a rel="nofollow" target="_blank" href='http://localhost:8080/webservices'&gt;Web Services&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a rel="nofollow" target="_blank" href='http://localhost:8080/tql'&gt;TQL HTTP Service&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/blockquote&gt;&lt;br/&gt;SPARQL, I've heard of that. *click* &lt;br/&gt;&lt;blockquote&gt;&lt;h2&gt;HTTP ERROR: 400&lt;/h2&gt;&lt;pre&gt;Query must be supplied&lt;/pre&gt;&lt;p&gt;RequestURI=/sparql/&lt;/p&gt;&lt;p&gt;&lt;i&gt;&lt;small&gt;&lt;a rel="nofollow" target="_blank" href='http://jetty.mortbay.org/'&gt;Powered by Jetty://&lt;/a&gt;&lt;/small&gt;&lt;/i&gt;&lt;/p&gt;&lt;/blockquote&gt;I guess that's the SPARQL api, good to know, but the frontpage could've warned me a little. Ah, second link is to the User Interface.&lt;br/&gt;&lt;br/&gt;Good, I can use a drop down to look at lots of example queries, nice. Don't understand most of them at the moment, but it's definitely comforting to have examples. They look nothing like SPARQL though... wonder what it is? I'm sure it does SPARQL... was I wrong?&lt;br/&gt;&lt;br/&gt;Quick poke at the HTML shows that it is just POSTing the query text to webui/ExecuteQuery. Looks straightforward to start hacking against too, but probably should password protect this somehow! I wonder how that is done... documentation mentions a '&lt;tt&gt;java.security.policy'&lt;/tt&gt; field:&lt;tt&gt;&lt;br/&gt;&lt;br/&gt;java.security.policy&lt;/tt&gt;&lt;i&gt;&lt;br/&gt;string: URL&lt;/i&gt;: The URL for the security policy file to use.&lt;br/&gt;Default: jar:file:/jar_path!/conf/mulgara-rmi.policy &lt;br/&gt;&lt;blockquote&gt;&lt;p/&gt;&lt;/blockquote&gt;Kinda stumped... will investigate that later, but at least there's hope. Just be firing off the example queries though shows me stuff, so I've got something to work with at least.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Jena&lt;/b&gt; - &lt;a rel="nofollow" target="_blank" href='http://jena.sourceforge.net/'&gt;http://jena.sourceforge.net/&lt;/a&gt;&lt;br/&gt;&lt;br/&gt;Front page is pretty clear, even if I don't understand what all those acronyms are. &lt;a rel="nofollow" target="_blank" href='http://jena.sourceforge.net/downloads.html'&gt;downloads&lt;/a&gt; link takes me to a page with an obvious download link, good. (Oh, and sourceforge, you suck. How many frikkin mirrors do I have to try to get this file?)&lt;br/&gt;&lt;br/&gt;Have to put Jena on pause while Sourceforge sorts its life out.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;ARC2&lt;/b&gt; - http://arc.semsol.org/&lt;br/&gt;&lt;br/&gt;Frontpage: "Easy RDF and SPARQL for LAMP systems" Nice, I know of LAMP and I particularly like the word Easy. Let's see... &lt;a rel="nofollow" target="_blank" href='http://arc.semsol.org/download'&gt;Download&lt;/a&gt; is easy to find, and tells me straight away I need PHP 4.3+ and MySQL 4.0.4+ *check* Right, now how do I enable PHP for apache again?... Ah, it helps if I install it first... Okay, done. Dropping the folder into my web space... Hmm nothing does anything. From the documentation, it does look like it is geared to providing a PHP library framework for working with its triplestore and RDF. Hang on, &lt;a rel="nofollow" target="_blank" href='http://arc.semsol.org/docs/v2/endpoint'&gt;SPARQL Endpoint Setup&lt;/a&gt; looks like what I want. It wants a database, okay... done, bit of a hassle though.&lt;br/&gt;&lt;br/&gt;Hmm, all I get is "&lt;b&gt;Fatal error&lt;/b&gt;: Call to undefined function mysql_connect() in &lt;b&gt;/********/arc2/store/ARC2_Store.php&lt;/b&gt; on line &lt;b&gt;53"&lt;br/&gt;&lt;/b&gt;&lt;br/&gt;Of course, install php libraries to access mysql (PEBKAC)... done and I also realise I need to set up the store, like the example in "&lt;a rel="nofollow" target="_blank" href='http://arc.semsol.org/docs/v2/getting_started'&gt;Getting Started&lt;/a&gt;"... done (with &lt;a rel="nofollow" target="_blank" href='http://pastebin.com/f2ca379e7'&gt;this&lt;/a&gt;) and what does the index page now look like?&lt;br/&gt;&lt;br/&gt;&lt;img src='http://lh4.ggpht.com/_KLlGSypGAvw/SR2Xk92vjbI/AAAAAAAAACo/RhWSkZvbYCM/%5BUNSET%5D.png?imgmax=800' style='max-width:800px;'/&gt;&lt;br/&gt;&lt;br/&gt;Yay! there's like SPARQL and stuff... I guess 'load' and 'insert' will help me stick stuff in, and 'select' looks familiar... Well, it seems to be working at least.&lt;br/&gt;&lt;br/&gt;Unfortunately, it looks like the Jena download from sourceforge is in a world of FAIL for now. Maybe I'll look at it next time?&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Triplestores in the cloud&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;Talis Platform - &lt;a rel="nofollow" target="_blank" href='http://www.talis.com/platform/'&gt;http://www.talis.com/platform/&lt;/a&gt;&lt;br/&gt;&lt;br/&gt;From the frontpage - "&lt;i&gt;Developers using the Platform can spend more of their time building&lt;br /&gt;extraordinary applications and less of their time worrying about how&lt;br /&gt;they will scale their data storage.&lt;/i&gt;" - pretty much want I wanted to hear, so how do I get to play with it?&lt;br/&gt;&lt;br/&gt;There is a &lt;a rel="nofollow" title='Get involved' target="_blank" href='http://www.talis.com/platform/get_involved/index.shtml'&gt;Get involved&lt;/a&gt; link on the left, which rapidly leads me to see the section: "Develop, play and try out" - &lt;a rel="nofollow" target="_blank" href='http://n2.talis.com/wiki/Main_Page'&gt;n&lt;sup&gt;2&lt;/sup&gt; developer community &lt;/a&gt; seems to be where it wants me to go. &lt;br/&gt;&lt;br/&gt;Lots of links on the frontpage, takes a few seconds to spot: "&lt;a rel="nofollow" title='Join' target="_blank" href='http://n2.talis.com/wiki/Join'&gt;Join&lt;/a&gt; - join the n² community to get free developer stores and online support" - free, nice word that. So, I just have to email someone? Okay, I can live with that.&lt;br/&gt;&lt;br/&gt;Documentation seems good, lots of choices though, a little hard to spot a single thread to follow to get up to speed, but &lt;a rel="nofollow" title='Guides and Tutorials' target="_blank" href='http://n2.talis.com/wiki/Guides_and_Tutorials'&gt;Guides and Tutorials&lt;/a&gt; looks right to get going with. The &lt;a rel="nofollow" target="_blank" href='http://n2.talis.com/wiki/Kniblet_Tutorial'&gt;Kniblet tutorial&lt;/a&gt; (whatever a kniblet is) looks the most beginnerish, and it's also very PHP focussed, which is either a good thing or a bad thing depending on the user :)&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Commercial triplestores&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Openlink Virtuoso&lt;/b&gt; - &lt;a rel="nofollow" target="_blank" href='http://virtuoso.openlinksw.com/'&gt;http://virtuoso.openlinksw.com/&lt;/a&gt;&lt;br/&gt;&lt;br/&gt;Okay, I tried the &lt;a rel="nofollow" target="_blank" href='http://download.openlinksw.com/download/'&gt;Download&lt;/a&gt; link, but I am pretty confused by what I'm greeted with: &lt;br/&gt;&lt;br/&gt;&lt;img src='http://lh3.ggpht.com/_KLlGSypGAvw/SR2rWiIXUBI/AAAAAAAAACs/qOB0ORoEOI4/%5BUNSET%5D.png?imgmax=800' style='max-width:800px;'/&gt;&lt;br/&gt;&lt;br/&gt;Not sure what one to pick just to try it out, it's late in the day, and my tolerance for all things installable has ended.&lt;br/&gt;&lt;br/&gt;-----------------------------------------&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Why take the http/web-centric, newbie approach to looking at these?&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;&lt;i&gt;Answer: &lt;/i&gt;In part, I am taking this approach because I have a deep belief that it&lt;br/&gt;was only after relational DBs became commoditised - "You want fries&lt;br/&gt;with you MySQL database?" - that the dynamic web kicked off. If we want&lt;br/&gt;the semantic web to kick off, we need to commoditise it or at least, make&lt;br/&gt;it very easy for developers to get started. And I mean &lt;b&gt;&lt;i&gt;EASY&lt;/i&gt;&lt;/b&gt;. A query that I want answered is: "Is there something that fits: 'apt-get install&lt;br/&gt;triplestore; r = store('localhost'), r.add(rdf), r.query(blah)'? " &lt;br/&gt;&lt;br/&gt;(I am particularly interested to see what happens when &lt;a rel="nofollow" target="_blank" href='http://tom.opiumfield.com/'&gt;Tom Morris&lt;/a&gt;'s work on &lt;a rel="nofollow" target="_blank" href='http://github.com/tommorris/reddy/tree/master'&gt;Reddy&lt;/a&gt; collides with ActiveRecord or activerdf...)&lt;br/&gt;&lt;br/&gt;&lt;b&gt;NB &lt;/b&gt;I've short circuited the discovery of software homepages - Imagine&lt;br/&gt;I've seen projects stating that they use "XXXXX as a triplestore". I know&lt;br/&gt;this will likely mean I've compared apples to oranges, but as a newbie, how&lt;br/&gt;would I be expected to know this? "Powered by the Talis Platform" and&lt;br/&gt;"Powered by Jena" seem pretty similar on the surface.)&lt;br/&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-280227876627147956?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-280227876627147956</guid>
         <pubDate>Fri, 14 Nov 2008 00:50:00 -0800</pubDate>
      <enclosure url="http://www.mulgara.org/files/v2.0.6/mulgara-2.0.6-bin.tar.gz" length="26470059" type="application/x-gzip" /><media:content url="http://www.mulgara.org/files/v2.0.6/mulgara-2.0.6-bin.tar.gz" fileSize="26470059" type="application/x-gzip" /><itunes:subtitle>Like last time, this was prompted by an email that eventually was passed to me. It was a call for opinion - "we thought we'd check first to see what software either of you recommend or use for an RDF database." It's a good question. In fact, it's a really</itunes:subtitle><itunes:author>Ben O'Steen</itunes:author><itunes:summary>Like last time, this was prompted by an email that eventually was passed to me. It was a call for opinion - "we thought we'd check first to see what software either of you recommend or use for an RDF database." It's a good question. In fact, it's a really great question, as searching for similar advice online results in very few opinions on the subject. But which one's are the best for novices? Which have the best learning curves? which has the easiest install or the shortest time between starting out and being able to query things? I'll try to pose as much as I can as a newcomer which won't be too hard :) Some of the comments will be my own, and some will be comments from others, but I'll try to be as honest as I can be to reflect new user expectation and experience and most importantly, developer-attention span. (See the end for some of my reasons for this approach.) (Puts on newbie hat and enables PEBKAC mode.) Installable (local) triplestores Sesame - http://www.openrdf.org/ Simple menu on the left of the website, one called downloads. Great, I'll give that a whirl. "Download the latest Sesame 2.x release" looks good to me. Hmm 5 differently named files... I'll grab the 'onejar' file and try to run it. "Failed to load Main-Class manifest attribute from openrdf-sesame-2.2.1-onejar.jar", okay... so back to the site to find out how to install this thing. No links for installation guide... on the Documentation page, no link for installation instructions for the sesame 2.2.1 I downloaded, but there is Sesame 2 user documentation and Sesame 2 system documentation. Phew, after guessing that the user documentation might have the guide, I finally found the installation guide (system documentation was about the architecture, not how to administer the system as you might expect.) (Developer losing interest...) Ah, I see, I need the SDK. I wonder what that 'onejar' was then... "The deployment process is container-specific, please consult the documentation for your container on how to deploy a web application. " - right, okay... let's assume that I have a Java background and am not just a user wanting to hook into it from my language of choice, such as php, ruby, python, or dare I say it, javascript. (Only Java-friendly developers continue on) Right, got Tomcat, and put in the war file... right so, now I need to work out how to use a commandline console tool to set up a 'repository'... does this use SVN or CVS then? Oh, it doesn't do anything unless I end the line with a period. I thought it had hung trying to connect! "Triple indexes [spoc,posc]" Wha? Well, whatever that was, the test repository is created. Let's see what's at http://localhost:8080/openrdf-sesame then. "You are currently accessing an OpenRDF Sesame server. This server is intended to be accessed by dedicated clients, using a specialized protocol. To access the information on this server through a browser, we recommend using the OpenRDF Workbench software." Bugger. Google for "sesame clients" then. There is a Java client it seems, but it seems to need a lot to get going. Oh, and useful if my application is in Java or in a JVM (jRuby, jython) http://jeenbroekstra.blogspot.com/2008/09/sesame-2-desktop-client.html .Net GUI... not so useful for programmatic stuff...I've pretty much given up at this point. If I knew I needed to use a triplestore then I might have persisted, but if I was just investigating it? I would've probably given up earlier. Mulgara - http://www.mulgara.org/ Nice, they've given the frontpage some style, not too keen on orange, but the effort makes it look professional. "Mulgara is a scalable RDF database written entirely in Java." -&amp;gt; Great, I found what I am looking for, and it warns me it needs Java. "DOWNLOAD NOW" - that's pretty clear. *click* Hmm, where's the style gone? Lots of download options, but thankfully one is marked by "These released binaries are all that are required for most applications." so I'll grab those. 25Mb? Wow... Okay, it's d</itunes:summary></item>
      <item>
         <title>A Fedora/Solr Digital Library for Oxford's 'Forced Migration Online'</title>
         <link>http://oxfordrepo.blogspot.com/2008/11/fedorasolr-digital-library-for-oxford.html</link>
         <description>&lt;div&gt;(mods:subtitle - Slightly more technical follow-up to the &lt;a rel="nofollow" target="_blank" href='http://expertvoices.nsdl.org/hatcheck/2008/11/06/a-fedorasolr-digital-library-for-oxfords-forced-migration-online/'&gt;Fedora Hatcheck piece&lt;/a&gt;.)&lt;br/&gt;&lt;br/&gt;As I have been prompted via email by Phil Cryer (of the &lt;a rel="nofollow" target="_blank" href='http://mobot.org/'&gt;Missouri Botanical Garden&lt;/a&gt;) to talk more about how this technically works, I thought it would be best to make it a written post, rather than the more limited email response.&lt;br/&gt;&lt;br/&gt;&lt;big&gt;Background&lt;/big&gt;&lt;br/&gt;&lt;br/&gt;Forced Migration Online (FMO) had a proprietary system, supporting their document needs. It was originally designed for newpaper holdings and applied that model to encoding the mostly paginated documents that FMO held - such that each part was broken up into paragraphs of text, images and the location of all these parts on a page. It even encoded (in its own format) the location of the words on the page when it OCR'd the documents, making per-word higlighting possible. Which is nice.&lt;br/&gt;&lt;br/&gt;However, the backend that powered this was over-priced, and FMO wanted to move to a more open, sustainable platform.&lt;br/&gt;&lt;br/&gt;&lt;big&gt;Enter the DAMS&lt;/big&gt;&lt;br/&gt;&lt;br/&gt;(DAMS = Digital Asset Management System)&lt;br/&gt;&lt;br/&gt;I have been doing work on trying to make a service out of a base of fedora-commons and additional 'plugin' services, such as the wonderful Apache Solr and the useful eXist XML db. The end aim is for departments/users/whoever to requisition a 'store' with a certain quality of service (solr attached, 50Gb+ etc) but this is not yet an automated process.&lt;br/&gt;&lt;br/&gt;The focus for the store is a very clear separation between storage, management, indexing services and distribution - Normal filesystems, or Sun Honeycomb are the storage, Fedora-commons provides the management + CRUD, solr, eXist, mulgara, sesame, and couchDB can provide potential index and query services, and distribution is handed pragmatically, caching outgoing and mirroring where necessary.&lt;br/&gt;&lt;br/&gt;&lt;big&gt;The FMO 'store'&lt;/big&gt;&lt;br/&gt;&lt;br/&gt;From discussions with FMO, and examining the information they held and the way they wished to make use of it, a simple Fedora/Solr store seemed to fufill what they wanted: a persistant store of items with attachments and the ability to search the metadata and retrieve results.&lt;br/&gt;&lt;br/&gt;&lt;big&gt;Bring in the consultants&lt;/big&gt;&lt;br/&gt;&lt;br/&gt;FMO hired Aptivate to do the migration of their data from the proprietary system, in its custom format, to a Fedora/Solr store and trying as much as possible to retain the functionality they had.&lt;br/&gt;&lt;br/&gt;Some points that I think it is important to impress on people here:&lt;br/&gt;&lt;ul&gt;&lt;li&gt;In general, software engineer consultants don't understand METS or FOXML.&lt;/li&gt;&lt;li&gt;They *really* don't understand the point of disseminators.&lt;br/&gt;&lt;/li&gt;&lt;li&gt;Having to teach software engineer consultants to do METS/FOXML/bDef's etc is likely an arduous and costly task.&lt;/li&gt;&lt;li&gt;Consultants add lots of money to do things their team don't already have the experience to do.&lt;/li&gt;&lt;/ul&gt;So, my conclusion was to not make these things part of the development at all to the extent that I might even have forgotten to mention these things to them except in passing. I helped them install their own local store and helped them with the various interfaces and gotchas of the two software packages. By showing them how I use Fedora and Solr in ora.ouls.ox.ac.uk, they were able to hit the ground running.&lt;br/&gt;&lt;br/&gt;They began by using the REST interface to Fedora and the RESTful interface to Solr. By having them begin by using the simple put/get REST interface to Fedora, they could concentrate on getting used to the nature of Fedora as an objectstore. I think they moved to use the SOAP interface as it better suited their Java background, although I cannot be certain as it wasn't an issue that came up.&lt;br/&gt;&lt;br/&gt;Once they had developed the migration scripts to their satisfaction, they asked me to give them a store, which I did (but due to hardware and stupid support issues here I am afraid to say I held them up on this.) They fired off their scripts, moved all the content into the fedora with a straightforward layout per object (pdf, metadata, fulltext and thumbnail) The metadata is - from what I can see - the same XML metadata as before - very MARCXML in nature, with 'Application_Info' elements having types like 'bl:DC.Title'. If necessary, we will strip out the dublin core metadata and put what we can into the DC datastream, but that's not of particular interest to FMO right now. &lt;br/&gt;&lt;br/&gt;&lt;big&gt;Fedora/Solr notes&lt;/big&gt;&lt;br/&gt;&lt;br/&gt;As for the link between Solr and Fedora? This is very loosely coupled, such that they are running in the same Tomcat container for convenience, but aren't linked in a hard way. &lt;br/&gt;&lt;br/&gt;I've looked at GSearch, which is great for a homogenous collection of items, such that they can be acted on by the same XSLT to produce a suitable record for Solr, but as the metadata was a complete unknown for this project, it wasn't too suitable.&lt;br/&gt;&lt;br/&gt;Currently, they have one main route into the fedora store, and so, it isn't hard to simply reindex an item after a change is made, especially for services such as Solr or eXist, which expect to have things change incrementally. I am looking at services such as ActiveMQ for scheduling these index tasks, but more and more I am starting to favour RabbitMQ which seems to be more useful, while retaining the scalability and very robust nature.&lt;br/&gt;&lt;br/&gt;Sending an update to Solr is as simple as an HTTP POST to its /update service, consisting of a XML or JSON packet like " changeme:1 John Smith .... " - it uses a transactional model, such that you can push all the changes and additions into the live index via a commit call, without taking the index offline. To query Solr, all manner of clients exist, and it is built to be very simple to interact with, handling facet queries, filtering, ordering and can deliver the results in XML, JSON, PHP or Python directly. It can even do a XSLT transform of the results on the way out, leading to a trivial way to support OpenSearch, Atom feeds and even HTML blocks for embedding in other sites.&lt;br/&gt;&lt;br/&gt;Likewise, to change a PDF in Fedora can be done by a HTTP POST as well. Does it need to be more complicated?&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Last, but not least, a project to watch closely:&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;The &lt;a rel="nofollow" target="_blank" href='http://ice.usq.edu.au/projects/fascinator/trac'&gt;Fascinator project&lt;/a&gt;, funded by &lt;a rel="nofollow" class='ext-link' target="_blank" href='http://www.arrow.edu.au/'&gt;&lt;span class='icon'&gt;ARROW&lt;/span&gt;&lt;/a&gt;, as part of their mini project scheme, is an Apache &lt;a rel="nofollow" class='ext-link' target="_blank" href='http://lucene.apache.org/solr/'&gt;&lt;span class='icon'&gt;Solr&lt;/span&gt;&lt;/a&gt; front end to the &lt;a rel="nofollow" class='ext-link' target="_blank" href='http://www.fedora-commons.org/'&gt;&lt;span class='icon'&gt;Fedora commons&lt;/span&gt;&lt;/a&gt; repository. The goal of the project is to create a simple interface to Fedora that uses a single technology – that’s Solr – to handle all browsing, searching and security. Well worth a look, as it seeks to turn this Fedora/Solr pairing truly into an appliance, with a simple installer and handling the linkage between the two.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-7367532608884117842?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-7367532608884117842</guid>
         <pubDate>Wed, 12 Nov 2008 22:30:00 -0800</pubDate>
      </item>
      <item>
         <title>The World's Best Photos of crig. Flickr Hive Mind</title>
         <link>http://fiveprime.org/hivemind/Tags/crig</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_1a20f44ae51cd3b5aba2889983406caa</guid>
         <pubDate>Fri, 07 Nov 2008 01:21:04 -0800</pubDate>
      </item>
      <item>
         <title>Codependency Checklist</title>
         <link>http://shalomplace.com/res/codepcheck.html</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_5a2f4b78f2f26e2da042b827091cb035</guid>
         <pubDate>Mon, 03 Nov 2008 22:42:49 -0800</pubDate>
      </item>
      <item>
         <title>OfficeSWORD - Home</title>
         <link>http://www.codeplex.com/OfficeSWORD</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_da68e69f17678efa3260da5b85c1ac0c</guid>
         <pubDate>Fri, 31 Oct 2008 02:52:41 -0700</pubDate>
      </item>
      <item>
         <title>News and updates Oct 2008</title>
         <link>http://oxfordrepo.blogspot.com/2008/10/news-and-updates-oct-2008.html</link>
         <description>&lt;div&gt;Right, I haven't forgotten about this blog, just getting all my ducks in a line as it were. Some updates:&lt;br/&gt;&lt;br/&gt;&lt;ul&gt;&lt;li&gt;The JISC bid for eAdministration was successful, titled "Building the Research Information Infrastructure (BRII)". The project will categorise the research information structure, build vocabularies if necessary, and populate it with information. It will link research outputs (text and data), people, projects, groups, departments, grant/funding information and funding bodys together, using RDF and as many pre-existing vocabularies as is suitable. The first vocab gap we've hit is one for funding, and I've made a draft RDF schema for this which will be openly published once we've worked out a way to make it persistent here at Oxford (trying to get a vocab.ox.ac.uk address)&lt;/li&gt;&lt;ul&gt;&lt;li&gt;One of the final outputs will be a 'foafbook' which will re-use data in the BRII store - it will act as a blue book of researchers. Think Cornell's Vivo, but with the idea of Linked Data firmly in mind.&lt;/li&gt;&lt;li&gt;We are just sorting out a home for this project, and I'll post up an update as soon as it is there.&lt;br/&gt;&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;Forced Migration Online (FMO) have completed their archived document migration from a crufty, proprietary store to a ORA-style store (Fedora/Solr) - you can see their preliminary frontend at &lt;a rel="nofollow" target="_blank" href='http://fmo.qeh.ox.ac.uk'&gt;http://fmo.qeh.ox.ac.uk.&lt;/a&gt; Be aware that this is a work in progress. We provide the store as a service to them, giving them a Fedora and a Solr to use. They contracted a company called Aptivate to migrate their content, and I believe also to create their frontend. This is a pilot project to show that repositories can be treated in a distributed way, given out like very smart, shared drive space.&lt;/li&gt;&lt;li&gt;We are working to archive and migrate a number of library and historical catalogs. A few projects have a similar aim to provide an architecture and software system to hold medieval catalog research - a record of what libraries existed, and what books and works they held. This is much more complex that a normal catalog, as each assertion is backed by a type of evidence, ranging from the solid (first-hand catalog evidence), to the more loose (handwriting on the front page looks like a certain person who worked at a certain library.) So modelling this informational structure is looking to be very exciting, and we will have to try a number of ways to represent this, starting with RDF due to the interlinked nature of the data. This is related to the kinds of evidence that genealogy uses, and so related ontologies may be of use.&lt;/li&gt;&lt;li&gt;The work on storing and presenting scanned imagery is gearing up. We are investigating storing the sequence of images and associated metadata/ocr text/etc as a single tar file as part of a Fedora object (i.e. a book object will have a catalog record, technical/provenance information and an attached tar file and and a list of file to offset information.)&lt;/li&gt;&lt;ul&gt;&lt;li&gt;This is due to us trying to hit the 'sweet spot' for most file systems. A very large number of highly compressed images and little pieces of text does not fit well with most FS internals. We estimate that for a book there will be around [4+PDFs+2xPages] files, or 500+ typically. Just counting up the various sources of scanned media we already have, we are pressing for about 1/2 million books from one source, 200,000 images from another, 54,000 from yet another... it's adding up real fast.&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;We are starting to deal with archiving/curating the 'long-tail' of data - small, bespoke datasets that are useful to many, but don't fall into the realm of Big Data, or Web data. I don't plan on touching Access/FoxPro databases any time soon though! I am building a Fedora/Solr/eXist box to hold and disseminate these, which should live at databank.ouls.ox.ac.uk very, very shortly. (Just waiting on a new VMware host to grow into, our current one is at capacity.)&lt;br/&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt;To give a better idea of the structure, etc, I am writing it up in a second blog post to follow shortly - currently named "Modelling and storing a phonetics database inside a store"&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;I am in the process of integrating the Google-analytics-style statistics package at http://piwik.org with the ORA interface, to give relatively live hit counts on a per-item and to build per-collection reports.&lt;/li&gt;&lt;ul&gt;&lt;li&gt;Right now, piwik is capturing the hits and downloads from ORA, but I have yet to add in the count display on each item page, so halfway there :)&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;We are just waiting on a number of departments here to upgrade the version of EPrints they are using for their internal, disciplinary repositories, so that we can begin archiving surrogate copies of the work they wish to put up for this service. (Using ORE descriptions of their items) By doing so, their content becomes exposed in ORA, mirror copies are made (working on a good way to maintain these as content evolves), but they retain the content control, ORA will also act as a registry for their content. It's only when their service drops do the users get redirected to the mirror copies that ORA holds (think google cache, but a 100% copy).&lt;/li&gt;&lt;li&gt;In the process of battle-testing the Fedora-Honeycomb connection, but as mentioned above, just waiting for a little more hardware before I set to it. Also, we are examining a number of other storage boxes that should plug in under Fedora, using the Honeycomb software, such as the new and shiny Thumper box, "Thor" Sun Fire Xsomething-or-other. Also, getting pretty interested at the idea of MAID storage - massive array of idle disks. Hopefully, this will act like tape, but have a sustainable access speed of disk. Also, a little more green than a tower of spinning hardware.&lt;/li&gt;&lt;li&gt;Planning out the indexer service at the moment. It will use the Solr 1.3 multicore functionality, with a little parsing magic at the ingest side of things to make a generic indexer-as-a-service type system. One use-case is to be able to bring up VM machines with multicore solr on to act as indexers/search engines as needed. An example aim? "Economics want an index that facets on their JEL codes." POST a schema and ingest indexer to the nearest free indexer, and point the search interface at it once an XMPP message comes back that it is finished.&lt;/li&gt;&lt;li&gt;URI resolvers - still investigating what can be put in place for this, as I strongly wish to avoid coding this myself. Looking at OCLC's OpenURL and how I can hack it to feed it info:fedora uris and link them to their disseminated location. Also, using a tinyurl type library + simple interface might not be a bad idea for a quick PoC.&lt;br/&gt;&lt;/li&gt;&lt;li&gt;Just to let you all know that we are building up the digital team here, most recently held interviews for the futureArch project but we are looking for about 3 others to hire, due to all the projects we are doing. We will be putting out job adverts as and when we feel up to battling with HR :)&lt;/li&gt;&lt;/ul&gt;That's most of the more interesting hot topics and projects I am doing at the moment.... phew :)&lt;br/&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-2317123182133225178?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-2317123182133225178</guid>
         <pubDate>Wed, 15 Oct 2008 20:39:00 -0700</pubDate>
      </item>
      <item>
         <title>ptsefton » ARROW week</title>
         <link>http://ptsefton.com/2008/10/16/arrow-week.htm</link>
         <description>This week I’m in Brisbane for an ARROW week. About to start on VALET hacking.</description>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_8e6b3af116df756c686b1b7c12e3f678</guid>
         <pubDate>Wed, 15 Oct 2008 18:41:37 -0700</pubDate>
      </item>
      <item>
         <title>Research-Output Repository Platform (Beta 1)</title>
         <link>http://research.microsoft.com/research/downloads/Details/48e60ac1-a95a-4163-a23d-28a914007743/Details.aspx</link>
         <description>Microsoft releases repository platform, download now. Is it OS yet?</description>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_3400301ed16bd83848943766eb3b29e8</guid>
         <pubDate>Wed, 15 Oct 2008 04:47:29 -0700</pubDate>
      </item>
      <item>
         <title>davetaz: google, fedora, microsoft, eprints, OAI ORE and SWORD. Player rich workshop. Repository Futures? #dorsdl2, #crig</title>
         <link>http://twitter.com/davetaz/statuses/925910531</link>
         <description>davetaz: google, fedora, microsoft, eprints, OAI ORE and SWORD. Player rich workshop. Repository Futures? #dorsdl2, #crig</description>
         <guid isPermaLink="false">http://twitter.com/davetaz/statuses/925910531</guid>
         <pubDate>Thu, 18 Sep 2008 06:54:01 -0700</pubDate>
      </item>
      <item>
         <title>Institutions hate repositories... one simple reason.</title>
         <link>http://davetaz-blog.blogspot.com/2008/09/institutions-hate-repositories-one.html</link>
         <description>Open access is not enough!&lt;br /&gt;&lt;br /&gt;People want to give Open Access to some of their materials at their institution however the IR software is seen as a means to manage all Institutional content and not just that which is Open Access and part of the external image of the Institution.&lt;br /&gt;The problem exists in the other direction as well where repository software is trying to solve these problems, thus people are not likely to use this software until it is included.&lt;br /&gt;&lt;br /&gt;So what do we end up with...&lt;br /&gt;&lt;br /&gt;Lots of Repository Islands which aren't interoperable with each other!&lt;br /&gt;&lt;br /&gt;So if we solve the access and copyright issue will people use the software? errrr No. At this point the software is an all in solution and not a service which can be utilised by current institutional practise ... Give up...?&lt;br /&gt;&lt;br /&gt;No!&lt;br /&gt;&lt;br /&gt;Focus on providing a service, e.g. something which can manage your Digital Resources and enable this to plug to existing institutional services. Some softwares would argue they support this already. OK good, so don't try and solve the problem if it is just an integration issue.&lt;br /&gt;&lt;br /&gt;To the repositories: Decouple! Build a set of services, build ways of plugging services together and allow the community to pic 'n' mix.&lt;br /&gt;&lt;br /&gt;To the institution: You already have access control systems ask your Information/Computer Systems department. You probably already have a Content Management System for educational resources for students (Blackboard? - Integrates with an LDAP server), these use external services to manage access and authentication! Here's a few services for you... LDAP, Radius, Eduroam, Domain Controller. &lt;br /&gt; &lt;br /&gt; Till next time!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-3694637630389146162?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-3694637630389146162</guid>
         <pubDate>Wed, 17 Sep 2008 19:14:00 -0700</pubDate>
      </item>
      <item>
         <title>DSpace and Fedora *need* opinionated installers.</title>
         <link>http://oxfordrepo.blogspot.com/2008/08/dspace-and-fedora-need-opinionated.html</link>
         <description>Just to say that both Fedora-Commons and DSpace really, really need opinionated installers that make choices for the user. Getting either installed is a real struggle - which we demonstrated during the Crigshow, so please don't write in the comments that it is easy, it just isn't.&lt;br /&gt;&lt;br /&gt;Something that is relatively straightforward to install, is a debian package.&lt;br /&gt;&lt;br /&gt;So, just a plea in the dark, can we set up a race? Who can make their repository software installable as a .deb first? will it be DSpace or Fedora? Who am I going to send a box of cookies to and a thank you note from the entire developer community?&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://wiki.eprints.org/w/Installing_EPrints_3_via_apt_%28Debian/Ubuntu%29"&gt;(EPrints doesnt count in this race; they've already done it)&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-1371143081899015772?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-1371143081899015772</guid>
         <pubDate>Sun, 17 Aug 2008 22:16:00 -0700</pubDate>
      </item>
      <item>
         <title>Re-using video compression code to aid document quality checking</title>
         <link>http://oxfordrepo.blogspot.com/2008/08/re-using-video-compression-code-to-aid.html</link>
         <description>&lt;a rel="nofollow" target="_blank" href="http://crigshow.blogspot.com/2008/07/prototype-motion-analysis-to-detect.html"&gt;(Expanding on this video post from the Crigshow)&lt;/a&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;&lt;br /&gt;Problem:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The volume of pages from a large digitisation project can be overwhelming. Add into that the simple fact that all (UK) institutional projects are woefully underfunded and underresourced, it's surprising that we can cope with them really.&lt;br /&gt;&lt;br /&gt;One issue that repeatedly comes up is the idea of quality assurance; How can we know that a given book has been scanned well? How can we spot images easily? Can we detect if foreign bodies were present in the scan, such as thumbs, fingers or bookmarks?&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;A quick solution:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Inspired by a talk at one of the conference strands at WorldComp, where the author talked about the use of a component of a commonly used video compression standard (MPEG2) to detect degrees of motion and change in a video, without having to analyse the image sequences using a novel, or smart algorithm.&lt;br /&gt;&lt;br /&gt;He talked about using the motion vector stream to be a good rough guide to the amount of change between frames of video.&lt;br /&gt;&lt;br /&gt;So, why did this inspire me?&lt;br /&gt;&lt;ul&gt;&lt;li&gt;MPEG-2 compression is a pretty much a solved problem; there are some very fast and scalable solutions out there today - direct benefit: &lt;span style="font-weight:bold;"&gt;No new code needs to be written and maintained&lt;/span&gt;&lt;/li&gt;&lt;li&gt;The format is very well understood and stripping out the motion vector stream wouldn't be tricky. Code exists for this too.&lt;/li&gt;&lt;li&gt;Pages of text in printed documents tend towards being justified so that the two edges of the text columns are straight lines. There is also (typically) a fixed number of lines on a page.&lt;/li&gt;&lt;li&gt;A (comparatively rapid) MPEG2 compression of the scans of a book would have the following qualities:&lt;/li&gt;&lt;ul&gt;&lt;li&gt;The motion vectors between pages of text would either shown little overall change (as differing letters are actually quite similar) or a small, global shift if the page was printed on a slight offset.&lt;/li&gt;&lt;li&gt;The motion vectors between a page of text and a page with an image embedded in text on the next, or a thumb on the edge, would show localised and distinct changes that differ greatly from the overall perspective.&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;In fact, a real crude solution could be, just using the vector stream to create a bookmark list for all the suspect changes. This might bring the number of pages to check down to a level that a human mediator could handle.&lt;/li&gt;&lt;/ul&gt;How much needs to be checked?&lt;br /&gt;&lt;br /&gt;Via basic sample survey statistics: to be sure to 95% (±5%) that the scanned images of 300 million pages are okay, just 387 totally random pages need to be checked. However, to be sure that each individual book is okay to the same degree, a book being ~300 pages, 169 pages need to be checked &lt;span style="font-weight:bold;"&gt;in each book&lt;/span&gt;. I would suggest that the above technique would significantly lower this threshold, but it would be by an empirically found amount.&lt;br /&gt;&lt;br /&gt;Also note that the above figures carry the assumption that the scanning process doesn't change over time, which of course it does!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-7963164596904275017?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-7963164596904275017</guid>
         <pubDate>Sun, 17 Aug 2008 21:47:00 -0700</pubDate>
      </item>
      <item>
         <title>The four rules of the web and compound documents</title>
         <link>http://oxfordrepo.blogspot.com/2008/08/four-rules-of-web-and-compound.html</link>
         <description>A real quirk that truly interests me is the difference in aims between the way documents are typically published and the way that the information within them is reused.&lt;br /&gt;&lt;br /&gt;A published document is normally in a single 'format' - a paginated layout, and this may comprise text, numerical charts, diagrams, tables of data and so on.&lt;br /&gt;&lt;br /&gt;My assumption is that, to support a given view or argument, a reference to the entirety of an article is not necessary; The full paper gives the context to the information, but it is much more likely that a small part of this paper contains the novel insight being referenced.&lt;br /&gt;&lt;br /&gt;In the paper-based method, it is difficult to uniquely identify parts of an article as items in their own right. You could reference a page number, give line numbers, or quote a table number, but this doesn't solve this issue that the author hadn't put time to considering that a chart, a table or a section of text would be reused.&lt;br /&gt;&lt;br /&gt;So, on the web, where multiple representations of the same information is getting to be commonplace (mashups, rss, microblogs, etc), what can we do to help better fulfill both aims, to show a paginated final version of a document, and also to allow each of the components to exist as items in their own right, with their own URIs (or better, URLs containing some notion of the context e.g. if /store/article-id gets to the splash page of the article, /store/article-id/paragraph-id will resolve to the text for that paragraph in the article.)&lt;br /&gt;&lt;br /&gt;Note that the four rules of the web (well, of &lt;a rel="nofollow" target="_blank" href="http://linkeddata.org/"&gt;Linked Data&lt;/a&gt;) are in essence:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;give everything a name,&lt;br /&gt;&lt;/li&gt;&lt;li&gt;make that name a URL ...&lt;/li&gt;&lt;li&gt;...which results in data about that thing,&lt;br /&gt;&lt;/li&gt;&lt;li&gt;and have it link to other related things. &lt;/li&gt;&lt;/ul&gt;[From &lt;a rel="nofollow" target="_blank" href="http://www.w3.org/DesignIssues/LinkedData.html"&gt;TimBL's originating article&lt;/a&gt;. Also, see this &lt;a rel="nofollow" target="_blank" href="http://virtuoso.openlinksw.com/presentations/Creating_Deploying_Exploiting_Linked_Data2/Creating_Deploying_Exploiting_Linked_Data2_TimBL_v3.html#%281%29"&gt;presentation &lt;/a&gt;- a remix of presentations from TimBL and the speaker, &lt;span style="font-style:italic;font-size:100%;"&gt;&lt;a rel="nofollow" target="_blank" href="http://myopenlink.net/dataspace/person/kidehen#this"&gt;&lt;span style="color:rgb(0, 147, 182);text-decoration:underline;"&gt;Kingsley Idehen&lt;/span&gt;&lt;/a&gt;&lt;/span&gt;&lt;span style="font-size:100%;"&gt; - given at the recent Linked Data Planet conference&lt;/span&gt;]&lt;br /&gt;&lt;br /&gt;I strongly believe that applying this to the individual components of a document is a very good and useful thing.&lt;br /&gt;&lt;br /&gt;One thing first, we have to get over the legal issue of just storing and presenting a bitwise perfect copy of what an author gives us. We need to let author's know that we may present alternate versions, based on a user's demands. This actually needs to be the case for preservation and the repository needs to make it part of their submission policy to allow for format migrations, accessibility requirements and so on.&lt;br /&gt;&lt;br /&gt;The system holding the articles needs to be able to clearly indicate versions and show multiple versions for a single record.&lt;br /&gt;&lt;br /&gt;When a compound document is submitted to the archive, a second parallel version should be made by fragmenting the document into paragraphs of text, individual diagrams, tables of data, and other natural elements. One issue that has already come up in testing, is that documents tend to clump multiple, separate diagrams together into a single physical image. It is likely that the only solution to breaking these up to this is going to be a human one, either author/publisher education(unlikely) or by breaking them up by hand.&lt;br /&gt;&lt;br /&gt;I would suggest using a very lightweight, hierarchical structure to record the document's logical structure. I have yet to settle on basing it on the content XML format inside the OpenDocument format, or on something very lightweight, using HTML elements, which would have a double benefit of being able to be sent directly to a browser to 'recreate' the document roughly.&lt;br /&gt;&lt;br /&gt;Summary:&lt;br /&gt;&lt;br /&gt;1) Break apart any compound document into its constituent elements (paragraph level is suggested for text)&lt;br /&gt;2) Make sure that each one of these parts are clearly expressed in the context they are in, using hierarchical URLs, /article/paragraph or even better, /article/page/chart&lt;br /&gt;3) On the article's splashpage, make a clear distinction between the real article and the broken up version. I would suggest a scheme like Google search's 'View [PDF, PPT, etc] as HTML'. I would assert that many people intuitively understand that this view is not like the original and will look or act differently.&lt;br /&gt;&lt;br /&gt;Some related video blogs from the &lt;a rel="nofollow" target="_blank" href="http://crigshow.blogspot.com/"&gt;Crigshow&lt;/a&gt; trip&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://crigshow.blogspot.com/2008/07/prototype-extracting-and-finding.html"&gt;Finding and reusing algorithms from published articles&lt;/a&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://crigshow.blogspot.com/2008/07/real-documents-are-complex-objects.html"&gt;OCR'ing documents; Real documents are always complex&lt;/a&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://crigshow.blogspot.com/2008/07/protoype-providing-overviews-of.html"&gt;Providing a systematic overview of how a Research paper is written&lt;/a&gt; - giving each component and each version of a component would have major benefits here&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-2742105134406656447?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-2742105134406656447</guid>
         <pubDate>Sun, 17 Aug 2008 20:40:00 -0700</pubDate>
      </item>
      <item>
         <title>Trackbacks, and spammers, and DDoS, oh my!</title>
         <link>http://oxfordrepo.blogspot.com/2008/08/trackbacks-and-spammers-and-ddos-oh-my.html</link>
         <description>&lt;span style="font-size:130%;"&gt;The Idea&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Before I give you all the dark news about this, let me set out my position: I really, really think that repositories communicating the papers that are cited and referenced to each other is a really good thing. If a paper was deposited in the Oxford archive, and it referenced a paper held in a different repository, say in Southampton's EPrints archive, I think that it is a really fantastic idea to let the Oxford archive tell the Southampton one about it.&lt;br /&gt;&lt;br /&gt;And I decided to do something about it - I added two linkback facilities to the archive's user interface, allowing both &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;trackbacks&lt;/a&gt; and &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Pingback"&gt;pingbacks&lt;/a&gt; to be archived by the system. I adopted the pre-existing "standards" - really, they are just rough api's - because I think we have all learned our lessons about making up new APIs for basic tasks.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:130%;"&gt;What is &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;Trackback&lt;/a&gt;?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;Trackback&lt;/a&gt; is an agreed technique from the blogging world. Many blogging systems have it built in, and it enables one blog post to explicitly reference and talk about another post, made on a remote blog somewhere. It does this by &lt;a rel="nofollow" target="_blank" href="http://www.ietf.org/rfc/rfc2616.txt"&gt;POST&lt;/a&gt;ing a number of form-encoded parameters to a specific URL, specific to the item that is being referenced. The parameters include things like title, abstract and URL of the item making the reference.&lt;br /&gt;&lt;br /&gt;So on the surface, it appears that this &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;trackback&lt;/a&gt; idea performs exactly what I was looking for.&lt;br /&gt;&lt;br /&gt;BUT! &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;Trackback&lt;/a&gt; has massive, gaping flaws, akin to the flaws in the email system which is full of spam. For one, all trackbacks are trusted in the basic specifications. No checking that the URL exists, no checking of the text for relevance, etc.&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Pingback"&gt;Pingback&lt;/a&gt; is a slightly different system, in that all that is passed, is the URL of the referencing item. It is then up to the remote server to go and get the requested page and parse that out to find the reference. (The next version of the specification is crying out to recommend microformats et al, in my opinion)&lt;br /&gt;&lt;br /&gt;So, these systems, &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;trackback&lt;/a&gt; and &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Pingback"&gt;pingback&lt;/a&gt;, have been on trial in the live system for about 4 or 5 months, and I am sure you all want to hear my conclusions:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-weight:bold;"&gt;Don't implement &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;Trackback&lt;/a&gt;&lt;/span&gt; as it is defined in its &lt;a rel="nofollow" target="_blank" href="http://www.sixapart.com/pronet/docs/trackback_spec"&gt;specifications...&lt;/a&gt; seriously. It is a poorly designed method, with so much slack that it is a spammers goldmine.&lt;/li&gt;&lt;li&gt;Even after adding in some safeguards to the &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;Trackback&lt;/a&gt; method, such as parsing the supposed referencing page and checking for HTML and the presence of the supposed link, it was still possible for spammers to get through.&lt;/li&gt;&lt;li&gt;When I implemented &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;Trackbacks&lt;/a&gt;, I did so with the full knowledge that I might have to stand at a safe distance and nuke the lot. Here is the Trackback model used in the &lt;a rel="nofollow" target="_blank" href="http://ora.ouls.ox.ac.uk/"&gt;Fedora repository&lt;/a&gt; - A DC datastream containing the &lt;a rel="nofollow" target="_blank" href="http://www.ietf.org/rfc/rfc2616.txt"&gt;POST&lt;/a&gt;ed information mapped to simple dublin core and a RELS-EXT RDF entry asserting that this &lt;a rel="nofollow" target="_blank" href="http://www.fedora-commons.org/"&gt;Fedora&lt;/a&gt; Object &amp;lt;info:fedora/trackback-id&amp;gt;, referenced &lt;a rel="nofollow" target="_blank" href="http://dublincore.org/documents/dcmi-terms/#terms-references"&gt;&amp;lt;dcterms:references&amp;gt;&lt;/a&gt; the main item in the archive &amp;lt;info:fedora/item-id&amp;gt;. As the user interface for the archive gets the graph for that object, it was easy to get the trackbacks out as well. Having separate objects for the trackbacks and not changing the referenced item &lt;span style="font-weight:bold;"&gt;at all&lt;/span&gt;, made it very easy to remove the trackbacks at the end.&lt;/li&gt;&lt;li&gt;The &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;Trackback&lt;/a&gt; system did get hit, once the spammers found a way around my safeguards. So, yes, the trackbacks got 'nuked' and the system turned off.&lt;/li&gt;&lt;li&gt;Currently, the system is under a sort of mini-DDoS, from the spammer's botnet trying to make trackbacks and overloading the session tracking system.&lt;/li&gt;&lt;li&gt;The &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Pingback"&gt;Pingback&lt;/a&gt; system, utilising XML-RPC calls, was never hit by spam. I still turned it off, as the safeguards on this system were equivalent to the Trackback system.&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-size:130%;"&gt;So, how do we go on from this quagmire of spam?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Well, for one, if I had time (and resources) to pass all requests through spamassassin or pay for akismet, that would have cut down the number drastically. Also, if I had time to sit and moderate all the linkbacks, again, spam would be nipped in the bud.&lt;br /&gt;&lt;br /&gt;So, while I truly believe that this type of system is the future, it certainly isn't the case that it can be a system that can just be turned on and the responsibility for maintaining it added to an already full workload.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:130%;"&gt;Alternatives?&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;White-listing sites may be one method. To limit the application to sharing references between institutions, you could use the &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Pretty_Good_Privacy#Web_of_trust"&gt;PGP idea - a web of trust&lt;/a&gt;; a technique of encrypting the passed information with a private key that resolves to a public key from a white-listed institution. This would ensure that the passed reference really was from a given institution. This should be more flexible than requiring a single IP address to accept references from.&lt;br /&gt;&lt;br /&gt;(There is always the chance that the private key could be leaked and made not-so-private by the institution, but that would have to be their responsibility. Any spam from a mistake of this sort would be directly attributed to those at fault!)&lt;br /&gt;&lt;br /&gt;A slower, far less accurate but more traditional method, would be for a given institution to harvest references from &lt;span style="font-weight:bold;"&gt;all&lt;/span&gt; the other repositories it knows about. I really don't think this is workable, but has the pro that a harvester can be sure that a reference links to a given URL, (barring the more and more common &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/DNS_cache_poisoning"&gt;DNS poisoning attacks&lt;/a&gt;)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-5023695488040957840?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-5023695488040957840</guid>
         <pubDate>Sun, 17 Aug 2008 19:52:00 -0700</pubDate>
      </item>
      <item>
         <title>#crigshow - Conference 2 - Worldcomp</title>
         <link>http://davetaz-blog.blogspot.com/2008/07/crigshow-conference-2-worldcomp.html</link>
         <description>&lt;span style="font-weight:bold;"&gt;Agents and Web Services... Why no collaboration?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Out of all the presentations at worldcomp this one struck me as one of the most obvious but not covered areas for research in computer science. Probably the most well known agent system is that used by the travel industry where they have standard ways of interfacing with each other to find details of travel and hotels available on a global scale. This is no mean feat with the number of companies there are hooking into this network. &lt;br /&gt;&lt;br /&gt;So why doesn't the same exist for web services or if there is such a system why isn't everyone in the open community using it? &lt;br /&gt;&lt;br /&gt;Surely the point of web services is for people to discover and use them in their own scenarios just like the agents in the travel industry do. OK so maybe the problem lies in the fact that there are so many communities that there will never be a specific use case or framework and thus hosting a generic web service network becomes infinitely hard with the number of different APIs and Implementations. &lt;br /&gt;&lt;br /&gt;OK so if you are going to use Agents in Web Services what issues do you need to consider? Also what do you gain through doing this?&lt;br /&gt;&lt;br /&gt;One of the key ideas which came out of a talk at worldcomp is to use Agents to be the intelligent front to a web service. This enables an agent to track of a set of web services including information about a specific web service such as availability, versions, changing cost and and offline copy if the service allows this. So the agent becomes a Rendezvous Point for a series of web services. &lt;br /&gt;&lt;br /&gt;So why aren't we seeing more collaboration between the Agent community and the Web Services community?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-6139828693528407847?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-6139828693528407847</guid>
         <pubDate>Wed, 16 Jul 2008 06:19:00 -0700</pubDate>
      </item>
      <item>
         <title>jimdowning: Anyone at #jif08 wants me to give their project some publicity in #crig in DC next week, grab me in a break. I'll be near a power socket.</title>
         <link>http://twitter.com/jimdowning/statuses/858844575</link>
         <description>jimdowning: Anyone at #jif08 wants me to give their project some publicity in #crig in DC next week, grab me in a break. I'll be near a power socket.</description>
         <guid isPermaLink="false">http://twitter.com/jimdowning/statuses/858844575</guid>
         <pubDate>Tue, 15 Jul 2008 02:45:03 -0700</pubDate>
      </item>
      <item>
         <title>OSCELOT Open Source Day III - views</title>
         <link>http://oxfordrepo.blogspot.com/2008/07/oscelot-open-source-day-iii-views.html</link>
         <description>The event was held at the Nevada gaming institute, and was overall, a well-structured day. The driving ideology was that of the &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Unconference"&gt;unconference&lt;/a&gt; - "... a facilitated, face-to-face, and participant-driven conference centered around a theme or purpose."&lt;br /&gt;&lt;br /&gt;However, it seemed that the theme or purpose of the event was not about Open Source - it was as if it were a Blackboard self-help group, trying to solve the issues and failings of this proprietary software. Some of the issues were a little shocking - someone proposed that they had "a need to search the content of [Blackboard Vista] repository" - it came as some surprise to me that this wasn't already possible in such a mature product.&lt;br /&gt;&lt;br /&gt;I was pleased that we were able to help and inform the other attendees about more open technology and standards, such as &lt;a rel="nofollow" target="_blank" href="http://oauth.net"&gt;OAuth&lt;/a&gt;, resource-orientated architecture, creative commons licensing and more. &lt;br /&gt;&lt;br /&gt;One session I lead on was titled - controversially - "Why [bother with] Portals?" - in which I wanted to get a discussion on what students actually use. The point I wanted to make was that URLs are the base currency of the internet - search engines produce lists of them, people bookmark them, and URLs are used when sharing information between people. &lt;br /&gt;&lt;br /&gt;This means that there is a very large responsibility on the content providers &lt;span style="font-weight:bold;"&gt;not&lt;/span&gt; to change URLs, or they will devalue the very resources they are trying to get people to use. This is the reason why persistent URLs are a crucial thing to aim for.&lt;br /&gt;&lt;br /&gt;I hope that we were able to bring extra value to the meeting, due to the fact that, unlike the vast majority of attendees, we do not have a Blackboard background.&lt;br /&gt;&lt;br /&gt;However, I do think that the event needed to have more emphasis on real-world open source projects such as Sakai and Moodle, and examine how best to intergrate their systems with external systems.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-6727893367281856190?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-6727893367281856190</guid>
         <pubDate>Mon, 14 Jul 2008 08:24:00 -0700</pubDate>
      </item>
      <item>
         <title>#crigshow - Conference 1 - Oscelot</title>
         <link>http://davetaz-blog.blogspot.com/2008/07/crigshow-conference-1-oscelot.html</link>
         <description>This open source day (#osdiii) hosted by Oscelot was an unconferene which soon became based heavily around the Blackboard platform. This was expected as the majority of people attending it were then going on to attend the BbWorld conference. With the title of the conference being Open Source and yet the main topic being that of a Closed Source product this gave an opening for the CRIG team to promote the wider Open Source community to those who are focused on Blackboard use cases. &lt;br /&gt;&lt;br /&gt;The day was a success for the team as we promoted good practices in web development, standards, resource management and the fact that the people who manage an eLearning platform has a responsibility to the content they hold. &lt;br /&gt;&lt;br /&gt;From our point of view, we discovered: If blackboard is the industry leader in learning management systems then the repository community is big problems when it comes to archiving these resources by the current methodologies each community practices. &lt;br /&gt;&lt;br /&gt;More Collaboration and Awareness please!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-8889383351258675109?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-8889383351258675109</guid>
         <pubDate>Mon, 14 Jul 2008 06:00:00 -0700</pubDate>
      </item>
      <item>
         <title>dfflanders: @paulwalk just met with tiddlywiki team, they are interested to see tw as a scholarly tool / integration into repo? #crig</title>
         <link>http://twitter.com/dfflanders/statuses/850660240</link>
         <description>dfflanders: @paulwalk just met with tiddlywiki team, they are interested to see tw as a scholarly tool / integration into repo? #crig</description>
         <guid isPermaLink="false">http://twitter.com/dfflanders/statuses/850660240</guid>
         <pubDate>Sat, 05 Jul 2008 05:21:23 -0700</pubDate>
      </item>
      <item>
         <title>dfflanders: #crig #sword much easier to use eduroam here at BBK than guest access, see config here: http://tinyurl.com/59wra9</title>
         <link>http://twitter.com/dfflanders/statuses/847477260</link>
         <description>dfflanders: #crig #sword much easier to use eduroam here at BBK than guest access, see config here: http://tinyurl.com/59wra9</description>
         <guid isPermaLink="false">http://twitter.com/dfflanders/statuses/847477260</guid>
         <pubDate>Tue, 01 Jul 2008 00:56:23 -0700</pubDate>
      </item>
      <item>
         <title>dfflanders: #crig #sword room is air conditioned</title>
         <link>http://twitter.com/dfflanders/statuses/847475211</link>
         <description>dfflanders: #crig #sword room is air conditioned</description>
         <guid isPermaLink="false">http://twitter.com/dfflanders/statuses/847475211</guid>
         <pubDate>Tue, 01 Jul 2008 00:51:20 -0700</pubDate>
      </item>
      <item>
         <title>dfflanders: #crig #sword in room 538 setting up for SWORD2 meeting, donuts good (don't think they will last 'til the 10.30 start)</title>
         <link>http://twitter.com/dfflanders/statuses/847465687</link>
         <description>dfflanders: #crig #sword in room 538 setting up for SWORD2 meeting, donuts good (don't think they will last 'til the 10.30 start)</description>
         <guid isPermaLink="false">http://twitter.com/dfflanders/statuses/847465687</guid>
         <pubDate>Tue, 01 Jul 2008 00:28:22 -0700</pubDate>
      </item>
      <item>
         <title>OAI-PMH + OAI-ORE (Atom) + Pronom Droid = Pretty</title>
         <link>http://davetaz-blog.blogspot.com/2008/06/oai-pmh-oai-ore-atom-pronom-droid.html</link>
         <description>I've just finished writing a wrapper (very simple!) which takes a &lt;a rel="nofollow" target="_blank" href="http://www.openarchives.org/ore/"&gt;OAI-ORE&lt;/a&gt; Resource Map in Atom Format and classifies the objects which are listed in the Aggregation using the National Archives (UK) technical registry (&lt;a rel="nofollow" target="_blank" href="http://www.nationalarchives.gov.uk/pronom/"&gt;Pronom&lt;/a&gt;). &lt;br /&gt;&lt;br /&gt;The wrapper provides a simple front end to the &lt;a rel="nofollow" target="_blank" href="http://droid.sourceforge.net/wiki/index.php/Introduction"&gt;DROID tool&lt;/a&gt;, it takes an &lt;a rel="nofollow" target="_blank" href="http://www.openarchives.org/pmh/"&gt;OAI-PHM&lt;/a&gt; URI and requests the latest resource maps in atom format (ore-atom) and creates a list of the resources which are passed to DROID to classify directly. &lt;br /&gt;&lt;br /&gt;The wrapper requires OAI-PMH as it requests all records which have been modified since it last did a parse of the repository. This way the wrapper can be scheduled to run once a day/week/month etc. &lt;br /&gt;&lt;br /&gt;A single DROID xml file comes back as the output. &lt;br /&gt;&lt;br /&gt;This is all working with EPrints repository software currently.&lt;br /&gt;&lt;br /&gt;Next stage is to do something useful with the output xml in terms of providing useful data back to the repository manager.&lt;br /&gt;&lt;br /&gt;Total lines of source code for the wrapper: 302 :)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-6418976742344708314?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-6418976742344708314</guid>
         <pubDate>Thu, 26 Jun 2008 22:56:00 -0700</pubDate>
      </item>
      <item>
         <title>ORE software libraries from Foresite</title>
         <link>http://chronicles-of-richard.blogspot.com/2008/06/foresite-1-project-is-pleased-to.html</link>
         <description>The Foresite [1] project is pleased to announce the initial code of two software libraries for constructing, parsing, manipulating and serialising OAI-ORE [2] Resource Maps. These libraries are being written in Java and Python, and can be used generically to provide advanced functionality to OAI-ORE aware applications, and are compliant with the latest release (0.9) of the specification. The software is open source, released under a BSD licence, and is available from a Google Code repository:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://code.google.com/p/foresite-toolkit/"&gt;http://code.google.com/p/foresite-toolkit/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;You will find that the implementations are not absolutely complete yet, and are lacking good documentation for this early release, but we will be continuing to develop this software throughout the project and hope that it will be of use to the community immediately and beyond the end of the project.&lt;br /&gt;&lt;br /&gt;Both libraries support parsing and serialising in: ATOM, RDF/XML, N3, N-Triples, Turtle and RDFa&lt;br /&gt;&lt;br /&gt;Foresite is a JISC [3] funded project which aims to produce a demonstrator and test of the OAI-ORE standard by creating Resource Maps of journals and their contents held in JSTOR [4], and delivering them as ATOM documents via the SWORD [5] interface to DSpace [6]. DSpace will ingest these resource maps, and convert them into repository items which reference content which continues to reside in JSTOR. The Python library is being used to generate the resource maps from JSTOR and the Java library is being used to provide all the ingest, transformation and dissemination support required in DSpace.&lt;br /&gt;&lt;br /&gt;Please feel free to download and play with the source code, and let us have your feedback via the Google group:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="mailto:foresite@googlegroups.com"&gt;foresite@googlegroups.com&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Richard Jones &amp; Rob Sanderson&lt;br /&gt;&lt;br /&gt;[1] Foresite project page: &lt;a rel="nofollow" target="_blank" href="http://foresite.cheshire3.org/"&gt;http://foresite.cheshire3.org/&lt;/a&gt;&lt;br /&gt;[2] OAI-ORE specification: &lt;a rel="nofollow" target="_blank" href="http://www.openarchives.org/ore/0.9/toc"&gt;http://www.openarchives.org/ore/0.9/toc&lt;/a&gt;&lt;br /&gt;[3] Joint Information Systems Committee (JISC): &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/"&gt;http://www.jisc.ac.uk/&lt;/a&gt;&lt;br /&gt;[4] JSTOR: &lt;a rel="nofollow" target="_blank" href="http://www.jstor.org/"&gt;http://www.jstor.org/&lt;/a&gt;&lt;br /&gt;[5] Simple Web Service Offering Repository Deposit (SWORD):&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/repositories/digirep/index/SWORD"&gt;http://www.ukoln.ac.uk/repositories/digirep/index/SWORD&lt;/a&gt;&lt;br /&gt;[6] DSpace: &lt;a rel="nofollow" target="_blank" href="http://www.dspace.org/"&gt;http://www.dspace.org/&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-4049331535370251135?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/CiZe1c6fjFs" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-4049331535370251135</guid>
         <pubDate>Mon, 09 Jun 2008 07:56:00 -0700</pubDate>
      </item>
      <item>
         <title>Repository Software is Dead</title>
         <link>http://davetaz-blog.blogspot.com/2008/06/repository-software-is-dead.html</link>
         <description>Repository Software for digital collections as we know it supplies the complete solution to the client, thus without the software you cannot access any of the data in your repository. This is a bad thing for object reuse and digital preservation! &lt;br /&gt;&lt;br /&gt;Many people at conferences such as &lt;a rel="nofollow" target="_blank" href="http://or08.ecs.soton.ac.uk"&gt;Open Repositories 2008&lt;/a&gt; and from workgroups like &lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/repositories/digirep/index/CRIG"&gt;CRIG&lt;/a&gt; have been talking for a long while about the importance of Interoperability. However, if you get rid of the need for the interoperability and use a standard specification for accessing simple data objects (pdfs and their metadata), then you don't need interoperability! &lt;br /&gt;&lt;br /&gt;So this leads me to the fact that &lt;a rel="nofollow" target="_blank" href="http://www.eprints.org"&gt;EPrints&lt;/a&gt;, &lt;a rel="nofollow" target="_blank" href="http://www.fedora-commons.org/"&gt;Fedora&lt;/a&gt; and hopefully at some point &lt;a rel="nofollow" target="_blank" href="http://www.dspace.org"&gt;DSpace&lt;/a&gt; are abstracting their database and storage layers to support use of any type of storage platform. Thanks goes &lt;a rel="nofollow" target="_blank" href="http://www.sun.com"&gt;SUN Microsystems&lt;/a&gt; preservation action group and open storage group for pushing this work from a commercial perspective. But we need to go further than this to get rid of the need for interoperability.&lt;br /&gt;&lt;br /&gt;From &lt;a rel="nofollow" target="_blank" href="http://or08.ecs.soton.ac.uk"&gt;Open Repositories 2008&lt;/a&gt;, myself and a college &lt;a rel="nofollow" target="_blank" href="http://oxfordrepo.blogspot.com/"&gt;Ben O'Steen&lt;/a&gt; from Oxford University proved how &lt;a rel="nofollow" target="_blank" href="http://www.ope anarchives.org/ore/"&gt;OAI-ORE (OAI specification for Object Reuse and Exchange)&lt;/a&gt; can be used to enable high level repository interoperability. &lt;a rel="nofollow" target="_blank" href="http://www.preserv.org.uk/?page=oai-ore"&gt;This work won us $5000&lt;/a&gt; but more importantly got the community thinking about the true power of a specification like OAI-ORE. Ben and I are now hoping to push this work down to the low level storage such that the objects within an ORE map (documents and metadata) can be directly referenced without the need for the current repository layer. For this to happen &lt;b&gt;all objects need to be stored in their simplest form - NO WRAPPER FORMATS ALLOWED at the lowest level&lt;/b&gt;. &lt;br /&gt;&lt;br /&gt;From recent talks with Sandy Payette and Les Carr (Fedora and EPrints respectively) I am envisaging that the current repository software becomes classified as repository service software which is able to manage low level objects but is not specifically required to access these objects. So current services which plug into the repository software can act directly on the objects.&lt;br /&gt;&lt;br /&gt;A couple of problems to solve, security and consistency of cached data. All especially applicable if you have more than one piece of repository service software modifying your objects.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-6344307351894496505?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-6344307351894496505</guid>
         <pubDate>Sat, 07 Jun 2008 21:43:00 -0700</pubDate>
      </item>
      <item>
         <title>CRIG / IEDemonstator After Thoughts</title>
         <link>http://davetaz-blog.blogspot.com/2008/06/crig-iedemonstator-after-thoughts.html</link>
         <description>IEDemonstrator is a really bad name for a project as it just says Microsoft to me but I'm fairly it isn't anything to do with that most stable of web browsers.&lt;br /&gt;&lt;br /&gt;From the workshop it has become clear to me that discussing a specification for service interaction globally is going to be impossible. This could be due to the fact that SOAP did such a good job of it and no one wants to use anything else (enough sarcasm??). I think many people left the workshop with a much better idea at how HTTP error codes (which have been around years) already go most of the way to solving a web service model. We also realised quickly that any specification would have to be built specifically for pay services (e.g. make use of the 402 code), this would then encourage companies/institutions to supply reliable services which last more than 4 years (cough AHDS cough).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-8557063529511560836?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-8557063529511560836</guid>
         <pubDate>Sat, 07 Jun 2008 21:35:00 -0700</pubDate>
      </item>
      <item>
         <title>First Post - CRIG DRY Workshop</title>
         <link>http://davetaz-blog.blogspot.com/2008/06/first-post-crig-dry-workshop.html</link>
         <description>Well there's a surprise!&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_DRY_Workshop"&gt;CRIG DRY Workshop&lt;/a&gt; in Bath is where I am now. So what's happening:&lt;br /&gt;&lt;br /&gt;People have been talking about services and proposed projects to provide authoritative and complete services to users/agents/repositories. A couple of themes have come out morning session for me:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.w3.org/2004/02/skos/"&gt;SKOS: &lt;/a&gt; A lot of projects (incl. &lt;a rel="nofollow" target="_blank" href="http://lcsh.info"&gt;Library of Congress&lt;/a&gt;) are using this RDF language to describe subject and properties. Each provides access to this information in so many different ways it is hard to see how to interact in a constant manor.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Service Interaction&lt;/b&gt; (read on as the name is not that descriptive)&lt;br /&gt;&lt;br /&gt;This moves us on from the Open Storage stuff i've been working on (again more later in another blog post) into how we facilitate the use of services and discover how to interact with these services. We are pushing for the use of http codes! CRIG it. &lt;br /&gt;&lt;br /&gt;Tis it for now....&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-1214878339545743666?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-1214878339545743666</guid>
         <pubDate>Thu, 05 Jun 2008 21:43:00 -0700</pubDate>
      </item>
      <item>
         <title>dfflanders: its been officially announced on listservs: DSpace and Fedora will collaborate on development #crig</title>
         <link>http://twitter.com/dfflanders/statuses/825051785</link>
         <description>dfflanders: its been officially announced on listservs: DSpace and Fedora will collaborate on development #crig</description>
         <guid isPermaLink="false">http://twitter.com/dfflanders/statuses/825051785</guid>
         <pubDate>Mon, 02 Jun 2008 01:24:32 -0700</pubDate>
      </item>
      <item>
         <title>dfflanders: testing #crig twemes tag for RSS input into CRIG blog planet (death to listservs!)</title>
         <link>http://twitter.com/dfflanders/statuses/786601816</link>
         <description>dfflanders: testing #crig twemes tag for RSS input into CRIG blog planet (death to listservs!)</description>
         <guid isPermaLink="false">http://twitter.com/dfflanders/statuses/786601816</guid>
         <pubDate>Thu, 10 Apr 2008 10:21:23 -0700</pubDate>
      </item>
      <item>
         <title>More online information on our repository platform</title>
         <link>http://santoshb.spaces.live.com/Blog/cns!4E66AFE14080180C!1109.entry</link>
         <description>&lt;p&gt;&lt;strong&gt;Demo&lt;/strong&gt; - &lt;/p&gt; &lt;div style="padding-right:0px;display:inline;padding-left:0px;padding-bottom:0px;margin:0px;padding-top:0px;"&gt;&lt;div&gt;&lt;/div&gt;&lt;/div&gt; &lt;p&gt;Savas Parastatidis (architect of the Microsoft repository platform) created a demo for Open Repositories Challenge 2008 that shows the ability to pull data from two repositories, create the appropriate relationships and display the relationships in a WPF based network graph. &lt;/p&gt; &lt;p&gt;He pulls the relationships b/w people and resources from &lt;a rel="nofollow" target="_blank" href="http://www.myexperiment.org/"&gt;MyExperiment&lt;/a&gt;, and resources from &lt;a rel="nofollow" target="_blank" href="http://savas.parastatidis.name/2008/03/24/19a7fa4a-560d-47b4-bbea-9a22e9824bb8.aspx"&gt;Microsoft Research Output Repository Platform&lt;/a&gt;. He then creates a network graph displaying relationships b/w people and resources from the two repositories. He also displays a preview when the appropriate resource or person is selected. For ex. in the demo, you will notice upon selecting a video, the video is played. &lt;/p&gt; &lt;p&gt;This kind of visualization of relationships using network graphs on web pages (using technologies like Silverlight) is truly very useful to visualize the social networking relationships. &lt;/p&gt; &lt;p&gt;Additional info - &lt;a rel="nofollow" title="http://savas.parastatidis.name/2008/04/03/2e308e65-a670-40a3-80ae-a283de1ee917.aspx" target="_blank" href="http://savas.parastatidis.name/2008/04/03/2e308e65-a670-40a3-80ae-a283de1ee917.aspx"&gt;http://savas.parastatidis.name/2008/04/03/2e308e65-a670-40a3-80ae-a283de1ee917.aspx&lt;/a&gt;&lt;/p&gt; &lt;p&gt;&lt;strong&gt;Blog&lt;/strong&gt; - &lt;a rel="nofollow" title="http://orweblog.oclc.org/archives/001591.html" target="_blank" href="http://orweblog.oclc.org/archives/001591.html"&gt;http://orweblog.oclc.org/archives/001591.html&lt;/a&gt; &lt;/p&gt; &lt;div style="padding-right:0px;display:inline;padding-left:0px;float:none;padding-bottom:0px;margin:0px;padding-top:0px;"&gt;del.icio.us Tags: &lt;a rel="nofollow" target="_blank" href="http://del.icio.us/popular/Microsoft Research Output Repository Platform"&gt;Microsoft Research Output Repository Platform&lt;/a&gt;,&lt;a rel="nofollow" target="_blank" href="http://del.icio.us/popular/OR08"&gt;OR08&lt;/a&gt;&lt;/div&gt;</description>
         <guid isPermaLink="false">4E66AFE14080180C!1109</guid>
         <pubDate>Fri, 04 Apr 2008 12:22:38 -0700</pubDate>
         <category>Computers and Internet</category>
      </item>
      <item>
         <title>Successful Presentation of the Microsoft Research Output Repository Platform at Open Repositories Conference</title>
         <link>http://santoshb.spaces.live.com/Blog/cns!4E66AFE14080180C!1108.entry</link>
         <description>&lt;p&gt;&lt;p&gt;&amp;lt;EDITED UpdatedDate="Apr 4 08"&amp;gt; &lt;p&gt;Slides - &lt;a rel="nofollow" title="http://pubs.or08.ecs.soton.ac.uk/84/1/Research_Output_Repositories_-_Microsoft_Initiatives.pdf" target="_blank" href="http://pubs.or08.ecs.soton.ac.uk/84/1/Research_Output_Repositories_-_Microsoft_Initiatives.pdf"&gt;http://pubs.or08.ecs.soton.ac.uk/84/1/Research_Output_Repositories_-_Microsoft_Initiatives.pdf&lt;/a&gt; &lt;p&gt;&amp;lt;/EDITED&amp;gt; &lt;p&gt;The Presentation at the Conference today went well (Monday session was held by Microsoft for interop b/w repositories, with a few known members of the community). This session was open to all members of the conference - we had a decent turnout. &lt;p&gt;Lee Dirks went through the charter of TCI team, and Microsoft, in the Scholarly Communications space, followed by my presentation and Savas' demos providing the "Wow!" factor. &lt;p&gt;Overall the response/questions were positive. We had a few folks talk to us after the presentation for additional information and explanation. &lt;p&gt;-santosh.</description>
         <guid isPermaLink="false">4E66AFE14080180C!1108</guid>
         <pubDate>Wed, 02 Apr 2008 15:16:17 -0700</pubDate>
         <category>Computers and Internet</category>
      <enclosure url="http://pubs.or08.ecs.soton.ac.uk/84/1/Research_Output_Repositories_-_Microsoft_Initiatives.pdf" length="1720033" type="application/pdf" /><media:content url="http://pubs.or08.ecs.soton.ac.uk/84/1/Research_Output_Repositories_-_Microsoft_Initiatives.pdf" fileSize="1720033" type="application/pdf" /><itunes:subtitle> &amp;lt;EDITED UpdatedDate="Apr 4 08"&amp;gt; Slides - http://pubs.or08.ecs.soton.ac.uk/84/1/Research_Output_Repositories_-_Microsoft_Initiatives.pdf &amp;lt;/EDITED&amp;gt; The Presentation at the Conference today went well (Monday session was held by Microsoft for int</itunes:subtitle><itunes:summary> &amp;lt;EDITED UpdatedDate="Apr 4 08"&amp;gt; Slides - http://pubs.or08.ecs.soton.ac.uk/84/1/Research_Output_Repositories_-_Microsoft_Initiatives.pdf &amp;lt;/EDITED&amp;gt; The Presentation at the Conference today went well (Monday session was held by Microsoft for interop b/w repositories, with a few known members of the community). This session was open to all members of the conference - we had a decent turnout. Lee Dirks went through the charter of TCI team, and Microsoft, in the Scholarly Communications space, followed by my presentation and Savas' demos providing the "Wow!" factor. Overall the response/questions were positive. We had a few folks talk to us after the presentation for additional information and explanation. -santosh.</itunes:summary><itunes:keywords>Computers and Internet</itunes:keywords></item>
      <item>
         <title>Microsoft Research Output Repository Platform Previewed</title>
         <link>http://santoshb.spaces.live.com/Blog/cns!4E66AFE14080180C!1104.entry</link>
         <description>&lt;div&gt;Project Info - &lt;a rel="nofollow" target="_blank" href="http://savas.parastatidis.name/2008/03/24/19a7fa4a-560d-47b4-bbea-9a22e9824bb8.aspx"&gt;http://savas.parastatidis.name/2008/03/24/19a7fa4a-560d-47b4-bbea-9a22e9824bb8.aspx&lt;/a&gt; &lt;/div&gt; &lt;div&gt; &lt;/div&gt; &lt;div&gt;Microsoft presentation of the Scholarly Communications space and Research Output Repository platform on Monday (3/31) went very well. this was followed by presentation by leaders/members of other repositories, protocols in the Institutional Repository space. There was a rich discussion around repositories, interop between repositories, protocols etc. &lt;/div&gt; &lt;div&gt; &lt;/div&gt; &lt;div&gt;Live blog notes taken by one of the participants - Andy Powell - &lt;a rel="nofollow" target="_blank" href="http://andypowe11.net/coveritlive/"&gt;http://andypowe11.net/coveritlive/&lt;/a&gt; &lt;/div&gt; &lt;div&gt;Blog notes by another participant - &lt;span style="font-size:12pt;font-family:'Calibri', 'sans-serif';"&gt;&lt;font color="#000000"&gt;&lt;font size="2"&gt;Richard Akerman &lt;/font&gt;- &lt;/font&gt;&lt;/span&gt; &lt;a rel="nofollow" target="_blank" href="http://scilib.typepad.com/science_library_pad/2008/04/microsoft-summi.html"&gt;http://scilib.typepad.com/science_library_pad/2008/04/microsoft-summi.html&lt;/a&gt;&lt;/div&gt; &lt;div&gt; &lt;/div&gt; &lt;div&gt;Our team will be posting notes from the meeting on 3/31 tomorrow. &lt;/div&gt; &lt;div&gt; &lt;/div&gt; &lt;div&gt;Getting ready for another presentation in the Open Repositories conference in an hour. :-)&lt;/div&gt; &lt;div&gt; &lt;/div&gt; &lt;div&gt;-santosh.&lt;/div&gt;</description>
         <guid isPermaLink="false">4E66AFE14080180C!1104</guid>
         <pubDate>Wed, 02 Apr 2008 10:59:17 -0700</pubDate>
         <category>Computers and Internet</category>
      </item>
      <item>
         <title>CRIG Flipchart Outputs</title>
         <link>http://chronicles-of-richard.blogspot.com/2008/01/crig-flipchart-outputs.html</link>
         <description>The JISC &lt;a rel="nofollow" target="_blank" href="http://chronicles-of-richard.blogspot.com/search/label/CRIG"&gt;CRIG&lt;/a&gt; meeting which I previously &lt;a rel="nofollow" target="_blank" href="http://chronicles-of-richard.blogspot.com/2007/12/crig-meeting-day-1-1.html"&gt;live-blogged&lt;/a&gt; from has now had its output formulated into a series of slides with annotations on Flickr, which can be found here:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/wocrig/"&gt;http://www.flickr.com/photos/wocrig/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The process by which this was achieved was through an intense round of brain-storming sessions culminating in a room full of topic spaced flip chart sheets. We then performed a &lt;a rel="nofollow" target="_blank" href="http://www.dotmocracy.org/"&gt;Dotmocracy&lt;/a&gt;, and the results that you see on the Flickr page are the ideas which made it through the process as having some interest invested in them.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-4749146878939399170?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/emYCzRUx-Pk" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-4749146878939399170</guid>
         <pubDate>Thu, 24 Jan 2008 06:23:00 -0800</pubDate>
      </item>
      <item>
         <title>European ORE Roll-Out at Open Repositories 2008</title>
         <link>http://chronicles-of-richard.blogspot.com/2008/01/european-ore-roll-out-at-open.html</link>
         <description>The European leg of the ORE roll-out has been announced and will occur on the final day of the &lt;a rel="nofollow" target="_blank" href="http://or08.ecs.soton.ac.uk/"&gt;Open Repositories 2008&lt;/a&gt; conference in Southampton, UK. This is to complement the meeting at Johns Hopkins University in Baltimore on March 3. From the email circular:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;A meeting will be held on April 4, 2008 at the University of Southampton, in conjunction with Open Repositories 2008, to roll-out the beta release of the OAI-ORE specifications. This meeting is the European follow-on to a meeting that will be held in the USA on March 3, 2008 at Johns Hopkins University.&lt;br /&gt;&lt;br /&gt;The OAI-ORE specifications describe a data model to identify and describe aggregations of web resources, and they introduce machine-readable formats to describe these aggregations based on ATOM and RDF/XML. The current, alpha version of the OAI-ORE specifications is at &lt;a rel="nofollow" target="_blank" href="http://www.openarchives.org/ore/0.1/"&gt;http://www.openarchives.org/ore/0.1/&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Additional details for the OAI-ORE European Open Meeting are available at:&lt;br /&gt;&lt;br /&gt;- The full press release for this event:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.openarchives.org/ore/documents/EUKickoffPressrelease.pdf"&gt;http://www.openarchives.org/ore/documents/EUKickoffPressrelease.pdf&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;- The registration site for the event:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://regonline.com/eu-oai-ore"&gt;http://regonline.com/eu-oai-ore&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Note that registration is required and space is limited.&lt;br /&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-8526639596679087835?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/lsmJMN_BEro" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-8526639596679087835</guid>
         <pubDate>Wed, 23 Jan 2008 09:11:00 -0800</pubDate>
      <enclosure url="http://www.openarchives.org/ore/documents/EUKickoffPressrelease.pdf" length="59937" type="application/pdf" /><media:content url="http://www.openarchives.org/ore/documents/EUKickoffPressrelease.pdf" fileSize="59937" type="application/pdf" /><itunes:subtitle>The European leg of the ORE roll-out has been announced and will occur on the final day of the Open Repositories 2008 conference in Southampton, UK. This is to complement the meeting at Johns Hopkins University in Baltimore on March 3. From the email circ</itunes:subtitle><itunes:author>Richard</itunes:author><itunes:summary>The European leg of the ORE roll-out has been announced and will occur on the final day of the Open Repositories 2008 conference in Southampton, UK. This is to complement the meeting at Johns Hopkins University in Baltimore on March 3. From the email circular: A meeting will be held on April 4, 2008 at the University of Southampton, in conjunction with Open Repositories 2008, to roll-out the beta release of the OAI-ORE specifications. This meeting is the European follow-on to a meeting that will be held in the USA on March 3, 2008 at Johns Hopkins University. The OAI-ORE specifications describe a data model to identify and describe aggregations of web resources, and they introduce machine-readable formats to describe these aggregations based on ATOM and RDF/XML. The current, alpha version of the OAI-ORE specifications is at http://www.openarchives.org/ore/0.1/. Additional details for the OAI-ORE European Open Meeting are available at: - The full press release for this event: http://www.openarchives.org/ore/documents/EUKickoffPressrelease.pdf - The registration site for the event: http://regonline.com/eu-oai-ore Note that registration is required and space is limited. </itunes:summary></item>
      <item>
         <title>Fine Grained Repository Interoperability: can't package, won't package</title>
         <link>http://chronicles-of-richard.blogspot.com/2008/01/fine-grained-repository.html</link>
         <description>Sadly (although some of you may not agree!), my paper proposed for this year's Open Repositories conference in Southampton has not made it through the Programme Committee. I include here, therefore, my submission so that it may live on, and you can get an idea of the sorts of things I was thinking about talking about.&lt;br /&gt;&lt;br /&gt;The reasons given for not accepting it are probably valid; mostly concerning a lack of focus. Honestly, I thought it did a pretty good job of saying what I would talk about, but such is life.&lt;br /&gt;&lt;br /&gt;&lt;hr/&gt;&lt;br /&gt;&lt;br /&gt;What is the point of interoperability, what might it allow us to achieve, and why aren't we very good at it yet? &lt;br /&gt;&lt;br /&gt;Interoperability is a loosely defined concept. It can allow systems to talk to each other about the information that they hold, about the information that they can disseminate, and to interchange that information. It can allow us to tie systems together to improve ingest and dissemination of repository holdings, and allows us to distribute repository functions across multiple systems. It ought even to allow us to offer repository services to systems which don't do so natively, improving the richness of the information space; repository interoperability is not just about repository to repository, it is also about cross-system communications. The maturing set of repositories such as DSpace, Fedora and EPrints and other information systems such as publications management tools and research information systems, as well as home-spun solutions are making the task of taking on the interoperability beast both tangible and urgent.&lt;br /&gt;&lt;br /&gt;Traditional approaches to interoperability have often centred around moving packaged information between systems (often other repositories). The effect this has is to introduce a black-box problem concerning the content of the package itself. We are no longer transferring information, we are transferring data! It therefore becomes necessary to introduce package descriptors which allow the endpoint to re-interpret the package correctly, to turn it back into information. But this constrains us very tightly in the form of our packages, and introduces a great risk of data loss. Furthermore, it means that we cannot perform temporally and spatially disparate interoperability on an object level (that is, assemble an object's content over a period of time, and from a variety of sources). A more general approach to information interchange may be more powerful. &lt;br /&gt;&lt;br /&gt;This paper brings together a number of sources. It discusses some of the work undertaken at Imperial College London to connect a distributed repository system (built on top of DSpace) to an existing information environment. This provides repository services to existing systems, and offers library administrators custom repository management tools in an integrated way. It also considers some of the thoughts arising from the JISC Common Repository Interfaces Group (CRIG) in this area, as well as some speculative proposals for future work and further ideas that may need to be explored.&lt;br /&gt;&lt;br /&gt;Where do we start? The most basic way to address this problem is to break the idea of the package down into its most simple component parts in the context of a repository: the object metadata, the file content, and the use rights metadata. Using this approach, you can go a surprisingly long way down the interoperability route without adding further complexity. At the heart of the Imperial College Digital Repository is a set of web services which deal with exactly this fine structure of the package, because the content for the repository may be fed from a number of sources over a period of time, and thus there never is a definitive package.&lt;br /&gt;&lt;br /&gt;These sorts of operations are not new, though, and there are a variety of approaches to it which have already been undertaken. For example, WebDAV offers extensions to HTTP to deal with objects using operations such as PUT, COPY or MOVE which could be used to achieve the effects that we desire. The real challenge, therefore, is not in the mechanics of the web services which we use to exchange details about this deconstructed package, but is in the additional complexities which we can introduce to enhance the interoperability of our systems and provide the value-added services which repositories wish to offer. &lt;br /&gt;&lt;br /&gt;Consider some other features of interoperability which might be desirable&lt;br /&gt;&lt;br /&gt;- &lt;strong&gt;fine grained or partial metadata records.&lt;/strong&gt; We may wish to ingest partial records from a variety of sources to assemble into a single record, or disseminate only subsets of our stored metadata.&lt;br /&gt;- &lt;strong&gt;file metadata&lt;/strong&gt;, or any other sub-structure element of the object. This may include bibliographic, administrative or technical metadata.&lt;br /&gt;object structural information, to allow complex hierarchies and relationships to be expressed and modified.&lt;br /&gt;- &lt;strong&gt;versioning&lt;/strong&gt;, and other inter-object relationships.&lt;br /&gt;- &lt;strong&gt;workflow status&lt;/strong&gt;, if performing deposit across multiple systems, it may be necessary to be aware of the status of the object in each system to calculate an overall state.&lt;br /&gt;- &lt;strong&gt;state and provenance reporting&lt;/strong&gt;, to offer feedback on the repository state to other information systems, administrators or users.&lt;br /&gt;- &lt;strong&gt;statistics&lt;/strong&gt;, to allow content delivery services to aggregate statistics globally.&lt;br /&gt;- &lt;strong&gt;identifiers&lt;/strong&gt;, to support multiple identification schemes.&lt;br /&gt;&lt;br /&gt;Techniques such as application profiling for metadata allow us to frame entire metadata records in terms of their interpretation (e.g. the Scholarly Works Application Profile (SWAP)), but should also be used to frame individual metadata elements. Object structural data can be encoded using standards such as METS, which can also help us with attaching metadata to sub-structures of the object itself, such as its files. Versioning, and other inter-object relationships could be achieved using an RDF approach, and perhaps the OAI-ORE project will offer some guidance. But other operations such as workflow status, and state and provenance reporting do not have such clear approaches. Meanwhile, the Interoperable Repository Statistics (IRS) project has looked at the statistics problem, and the RIDIR project is looking into interoperable identifiers. In these latter cases, can we ever consider providing access to their outcomes or services through some general fine grained interface?&lt;br /&gt;&lt;br /&gt;The Imperial College Digital Repository offers limited file metadata which is attached during upload and exposed as part of a METS record, detailing the entire digital object, as a descriptive metadata section. It can deal with the idea that some metadata comes from one source, while other metadata comes from another, allowing for a primitive partial metadata interchange process. Conversely, it will also deal with multiple metadata records for the same item. Also introduced are custom workflow metadata fields which allow some basic interaction between different systems to track deposit of objects both from the point of view of the administrator, the depositor and the systems themselves. In addition, there is an extensible notifications engine which is used to produce periodic reports to all depositors whose content has undergone some sort of modification or interesting event in a given time period. This notifications engine is behind a very generic web service which offers extreme flexibility within the context of the College's information environment.&lt;br /&gt;&lt;br /&gt;Important work in the fields that will help achieve this interoperability include the SWORD deposit mechanism which currently deals with packages but may be extensible to include these much needed enhancements. Meanwhile, the OAI-ORE will be able to provide the semantics for complex objects which will no doubt assist in framing the problems that interoperability faces in a manor in which they can be solved.&lt;br /&gt;&lt;br /&gt;Other examples of the spaces in which interoperability needs to work would include the EThOSnet project, the UK national e-theses effort, where it is conceivable that institutions may want to provide their own e-theses submission system with integration into the central hub to offer seamless distributed submission. Or in the relationship between Current Research Information Systems (CRIS) and open access repositories, to offer a full-stack information environment for researchers and administrators alike. The possibilities are extensive and the benefit to the research community would be truly great. HP Labs is actively researching in these and related areas with its continued work on the DSpace platform.&lt;br /&gt;&lt;br /&gt;&lt;hr/&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-2615047246925194437?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/ykFbvAZETZs" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-2615047246925194437</guid>
         <pubDate>Tue, 22 Jan 2008 10:56:00 -0800</pubDate>
      </item>
      <item>
         <title>SWORD/ORE</title>
         <link>http://chronicles-of-richard.blogspot.com/2008/01/swordore.html</link>
         <description>Last week I was at the ORE meeting in Washington DC, and presented some thoughts regarding SWORD and its relationship to ORE. The slides I presented can be found here:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://wiki.dspace.org/static_files/1/1d/Sword-ore.pdf"&gt;http://wiki.dspace.org/static_files/1/1d/Sword-ore.pdf&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;[Be warned that discussion on these slides ensued, and they therefore don't reflect the most recent thinking on the topic]&lt;br /&gt;&lt;br /&gt;The overall approach of using SWORD as the infrastructure to do deposit for ORE seems sound. There are three main approaches identified:&lt;br /&gt;&lt;br /&gt;- &lt;strong&gt;SWORD is used to deposit the URI of a Resource Map onto a repository&lt;/strong&gt;&lt;br /&gt;- &lt;strong&gt;SWORD is used to deposit the Resource Map as XML onto a repository&lt;/strong&gt;&lt;br /&gt;- &lt;strong&gt;SWORD is used to deposit a package containing the digital object and its Resource Map onto a repository&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;In terms of complications there are two primary ones which concern me the most:&lt;br /&gt;&lt;br /&gt;- &lt;strong&gt;Mapping of the SWORD levels to the usage of ORE.&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;The principal issue is that level 1 implies level 0, and therefore level 2 implies level 1 and level 0. The inclusion of semantics to support ORE specifics could invoke a new level, and if this level is (for argument's sake) level 3, it implies all the levels beneath it, whatever they might require. Since the service, by this stage, is becoming complex in itself, such a linear relationship might not follow.&lt;br /&gt;&lt;br /&gt;A brief option discussed at the meeting would be to modularise the SWORD support instead of implementing a level based approach. That is, the service document would describe the actual services offered by the server, such as ORE support, NoOp support, Verbose support and so forth, with no recourse to "bundles" of functionality labelled by linear levelling.&lt;br /&gt;&lt;br /&gt;- &lt;strong&gt;Scalability of the service document&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;The mechanisms imposed by ORE allow for complex objects to be attached to other complex objects as aggregated resources (ORE term). This means that you could have a resource map which you wish to tell a repository describes a new part of an existing complex object. In order to do this, the service document will need to supply the appropriate deposit URI for a segment of an existing repository item. In DSpace semantics, for example, we may be adding a cluster of files to an existing item, and would therefore require the deposit URI of the item itself. To do otherwise would be to limit the applicability of ORE within SWORD and the repository model. Our current service document is a flat document describing what is pragmatically assumed (correctly, in virtually all cases) to be a small selection of deposit URIs. The same will not be true of item level deposit targets, which could be a very large number of possible deposit targets. Furthermore, in repositories which exploit the full descriptive capabilities of ORE, the number of deposit targets could be identical to the number of aggregations described (which can be more than one per resource map), which has the potential to be a very large number.&lt;br /&gt;&lt;br /&gt;The consequences are in scalability of response time, which is a platform specific issue, and the scalability of the document itself and the usefulness of the consequences. It may be more useful to navigate hierarchically through the different levels of the service document in order to identify deposit nodes.&lt;br /&gt;&lt;br /&gt;Any feedback on this topic is probably most useful in the &lt;a rel="nofollow" target="_blank" href="http://groups.google.com/group/oai-ore"&gt;ORE Google Group&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-9149789036743981806?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/DJEvhQnxebE" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-9149789036743981806</guid>
         <pubDate>Mon, 21 Jan 2008 05:05:00 -0800</pubDate>
      <enclosure url="http://wiki.dspace.org/static_files/1/1d/Sword-ore.pdf" length="37773" type="application/pdf" /><media:content url="http://wiki.dspace.org/static_files/1/1d/Sword-ore.pdf" fileSize="37773" type="application/pdf" /><itunes:subtitle>Last week I was at the ORE meeting in Washington DC, and presented some thoughts regarding SWORD and its relationship to ORE. The slides I presented can be found here: http://wiki.dspace.org/static_files/1/1d/Sword-ore.pdf [Be warned that discussion on th</itunes:subtitle><itunes:author>Richard</itunes:author><itunes:summary>Last week I was at the ORE meeting in Washington DC, and presented some thoughts regarding SWORD and its relationship to ORE. The slides I presented can be found here: http://wiki.dspace.org/static_files/1/1d/Sword-ore.pdf [Be warned that discussion on these slides ensued, and they therefore don't reflect the most recent thinking on the topic] The overall approach of using SWORD as the infrastructure to do deposit for ORE seems sound. There are three main approaches identified: - SWORD is used to deposit the URI of a Resource Map onto a repository - SWORD is used to deposit the Resource Map as XML onto a repository - SWORD is used to deposit a package containing the digital object and its Resource Map onto a repository In terms of complications there are two primary ones which concern me the most: - Mapping of the SWORD levels to the usage of ORE. The principal issue is that level 1 implies level 0, and therefore level 2 implies level 1 and level 0. The inclusion of semantics to support ORE specifics could invoke a new level, and if this level is (for argument's sake) level 3, it implies all the levels beneath it, whatever they might require. Since the service, by this stage, is becoming complex in itself, such a linear relationship might not follow. A brief option discussed at the meeting would be to modularise the SWORD support instead of implementing a level based approach. That is, the service document would describe the actual services offered by the server, such as ORE support, NoOp support, Verbose support and so forth, with no recourse to "bundles" of functionality labelled by linear levelling. - Scalability of the service document The mechanisms imposed by ORE allow for complex objects to be attached to other complex objects as aggregated resources (ORE term). This means that you could have a resource map which you wish to tell a repository describes a new part of an existing complex object. In order to do this, the service document will need to supply the appropriate deposit URI for a segment of an existing repository item. In DSpace semantics, for example, we may be adding a cluster of files to an existing item, and would therefore require the deposit URI of the item itself. To do otherwise would be to limit the applicability of ORE within SWORD and the repository model. Our current service document is a flat document describing what is pragmatically assumed (correctly, in virtually all cases) to be a small selection of deposit URIs. The same will not be true of item level deposit targets, which could be a very large number of possible deposit targets. Furthermore, in repositories which exploit the full descriptive capabilities of ORE, the number of deposit targets could be identical to the number of aggregations described (which can be more than one per resource map), which has the potential to be a very large number. The consequences are in scalability of response time, which is a platform specific issue, and the scalability of the document itself and the usefulness of the consequences. It may be more useful to navigate hierarchically through the different levels of the service document in order to identify deposit nodes. Any feedback on this topic is probably most useful in the ORE Google Group</itunes:summary></item>
      <item>
         <title>BMC and the Free Open Repository Trial</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/12/bmc-and-free-open-repository-trial.html</link>
         <description>Our good buddies at BioMedCentral's Open Repository team have released the latest upgrade to their service, and are offering 3 month trial repositories for evaluation. From the &lt;a rel="nofollow" target="_blank" href="http://www.dspace.org/"&gt;DSpace home page&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;BioMed Central announced the latest upgrades to Open Repository, the open access publisher's hosted repository solution. Open Repository offers institutions a cost effective repository solution (setup, hosting and maintenance) which includes new DSpace features, customization options, improved user interface. Along with the annoucement of the upgrades, Open Repository is offereing a free 3-month pilot repository, so institutions can test the suitability of the service without obligation. See the full articles in &lt;a rel="nofollow" target="_blank" href="http://newsbreaks.infotoday.com/wndReader.asp?ArticleId=40331"&gt;Weekly News Digest&lt;/a&gt; and in &lt;a rel="nofollow" target="_blank" href="http://www.alphagalileo.org/index.cfm?fuseaction=readrelease&amp;releaseid=525415"&gt;Alpha Galieo&lt;/a&gt;.&lt;br /&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-5373224028320805461?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/cxCzlAqTC1k" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-5373224028320805461</guid>
         <pubDate>Wed, 12 Dec 2007 01:36:00 -0800</pubDate>
      </item>
      <item>
         <title>CRIG Meeting Day 2 (2)</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/12/crig-meeting-day-2-2.html</link>
         <description>Topics for today:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_Unconference#Friday_December_7th"&gt;http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_Unconference#Friday_December_7th&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The ones that interest me the most are probably these:&lt;br /&gt;&lt;br /&gt;- Death to Packages&lt;br /&gt;&lt;br /&gt;Not really Death to Packages, but lets not forget that packaging sometimes isn't what we want to do or what we can do.&lt;br /&gt;&lt;br /&gt;- Get What?&lt;br /&gt;&lt;br /&gt;This harks to my ORE interest, as to what is available under the URLs, and what that means for something like content negotiation.&lt;br /&gt;&lt;br /&gt;- One Put to Multiple Places&lt;br /&gt;&lt;br /&gt;Really important to distributed information systems (e.g. ethosnet integration into local institutions). Also, this relates, for me, to the unpackaging question, because it introduces differences between what systems might all be expecting.&lt;br /&gt;&lt;br /&gt;- Web 2.0 interfaces (ok, ok)&lt;br /&gt;&lt;br /&gt;I'm interested in web services. Yes it's a bit trendy. But it is useful.&lt;br /&gt;&lt;br /&gt;- Core Servies of a Repository&lt;br /&gt;&lt;br /&gt;For repository core architecture, this is important. With my DSpace hat on I'd like to see what sorts of things an internal service architecture or api ought to be able to support&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-1908914017517604548?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/XguJLOxM_NU" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-1908914017517604548</guid>
         <pubDate>Fri, 07 Dec 2007 02:08:00 -0800</pubDate>
      </item>
      <item>
         <title>CRIG Meeting Day 2 (1)</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/12/crig-meeting-day-2-1.html</link>
         <description>It's first thing on day two. I'm late because I have to get all the way across town, which takes a surprisingly long time in London. I should have just stayed at a nearby hotel. Oh well.&lt;br /&gt;&lt;br /&gt;The remainder of yesterday was interesting. Scope for live blogging is difficult, as the conference is extremely mobile. Today I will have to pick a point and hide in a corner to get you up to date.&lt;br /&gt;&lt;br /&gt;In the afternoon we discussed the CRIG scenarios, and then implemented something called a Dotmocracy, which involves sticking dots (like house points at school) next to topics which appeared which we were interested in. When we start up today, the first order of business will be to see what topics made the cut. From what I saw at the end of the day, this will include Federated Searching, Google Search, and package deconstruction (my personal favourite this week).&lt;br /&gt;&lt;br /&gt;As a brief aside, one running theme has been "no more standards". As it happens, I disagree with this. We're never going to get everything thinking the same and working the same. That's why there are so many standards, and why new ones get made all the time. It's the way of the world. At least, with a standard, though, when you have implemented one, you at least have a way of telling people what you did, over the home grown undocumented solutions which are the alternative.&lt;br /&gt;&lt;br /&gt;Right, I suppose I'd better get my skates on.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-4799712183286978005?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/dPbLzdCq4ok" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-4799712183286978005</guid>
         <pubDate>Fri, 07 Dec 2007 00:42:00 -0800</pubDate>
      </item>
      <item>
         <title>CRIG Meeting Day 1 (2)</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/12/crig-meeting-day-1-2.html</link>
         <description>&lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Unconference"&gt;http://en.wikipedia.org/wiki/Unconference&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;See also &lt;a rel="nofollow" target="_blank" href="http://wwmm.ch.cam.ac.uk/blogs/downing/"&gt;Jim Downing's&lt;/a&gt; live blogging.&lt;br /&gt;&lt;br /&gt;We've just done a round of preliminary unconferencing, where the &lt;a rel="nofollow" target="_blank" href="http://chronicles-of-richard.blogspot.com/2007/11/sword-10-released.html"&gt;CRIG Podcast&lt;/a&gt; topics were brainstormed onto flip charts. Not sure how useful that's going to be, but I'm going to approach the whole thing with an open mind. I've got my marker pen, my baloon, and my three dots.&lt;br /&gt;&lt;br /&gt;wish me luck ...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-1512541800794566898?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/5j8sMCBCo5k" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-1512541800794566898</guid>
         <pubDate>Thu, 06 Dec 2007 07:01:00 -0800</pubDate>
      </item>
      <item>
         <title>CRIG Meeting Day 1 (1)</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/12/crig-meeting-day-1-1.html</link>
         <description>Some live blogging; may be slightly malformed, as this is happening inline, with no post-editing.&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_Unconference"&gt;http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_Unconference&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Les Carr and Jim Downing have introduced us to the CRIG workshop first day. We're unconferencing which means that there's not a programme! We're going to try and stay at the abstract or high level discussion, not try to talk about technology.&lt;br /&gt;&lt;br /&gt;David Flanders outlines the meeting philosophy. The outputs aimed for the meeting include: ideas (bluesky), standards and scenarios and how they can be linked together. The outputs will be taken to OR08. The best way for a group to produce good stuff is for everyone to think about themselves. Makes me think of an article I read recently:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www7.nationalgeographic.com/ngm/0707/feature5/index.html"&gt;http://www7.nationalgeographic.com/ngm/0707/feature5/index.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;We are &lt;em&gt;not&lt;/em&gt; about creating new specs.&lt;br /&gt;&lt;br /&gt;Julie then brings us some stuff about SWORD. See my &lt;a rel="nofollow" target="_blank" href="http://chronicles-of-richard.blogspot.com/2007/11/sword-10-released.html"&gt;previous post&lt;/a&gt; on this. We are going to have implementations for xrXiv, white rose research online and Jorum. A SPECTRa deposit client, and later an article in Ariadne and a presentation at OR08.&lt;br /&gt;&lt;br /&gt;Break time ... tea and coffee!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-6875507276473434800?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/wSKAR7Doxt8" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-6875507276473434800</guid>
         <pubDate>Thu, 06 Dec 2007 05:19:00 -0800</pubDate>
      </item>
      <item>
         <title>CRIG Podcast</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/11/crig-podcast.html</link>
         <description>A couple of weeks ago the JISC CRIG (Common Repository Interfaces Group) organised a series of telephone debates on important areas for it. These have now been edited into short commentaries which might be of interest to you, and are aimed at priming and informing the upcoming "unconference" to be held 6/7 December in London:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_Podcasts"&gt;http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_Podcasts&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The "unconference" will take place at Birkbeck College in Bloomsbury, London. Take a listen, and enjoy. Yours truly appears in the "Get and Put within Repositories" and the "Object Interoperability" discussions.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-2034686157678763588?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/WXv1M_TyW-w" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-2034686157678763588</guid>
         <pubDate>Fri, 30 Nov 2007 01:19:00 -0800</pubDate>
      </item>
      <item>
         <title>SWORD 1.0 Released</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/11/sword-10-released.html</link>
         <description>Just a quick heads up to say that the SWORD 1.0 release is now out and ready for download from SourceForge:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://sourceforge.net/projects/sword-app/"&gt;http://sourceforge.net/projects/sword-app/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Here you will find the common java library which supports repositories wanting to implement SWORD, plus implementations for DSpace and Fedora. There is also a client (with GUI and CLI versions) which you can use to deposit content into the repositories.&lt;br /&gt;&lt;br /&gt;The DSpace implementation is designed only to work with the forthcoming DSpace 1.5 (which is currently in Alpha release). Your feedback and experiences with the code would be much appreciated. We expect to be making refinements to the DSpace implementation up unitl DSpace 1.5 is released as stable.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-8632597815206720838?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/OIEs2RRDcp4" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-8632597815206720838</guid>
         <pubDate>Thu, 08 Nov 2007 06:45:00 -0800</pubDate>
      </item>
      <item>
         <title>Scandinavian Dugnad</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/10/scandinavian-dugnad.html</link>
         <description>I was invited by the Scandinavian DSpace User Group meeting to join them in their first official meeting yesterday in Oslo. It was great to see so many people representing a small-ish geographical area and a reasonably small population all together from 4 nations (Norway, Sweden, Finland and Denmark) to talk about DSpace. Probably 35 people all-in, with plans to extend the group to be the Nordic DSpace User Group to include members from Iceland, and perhaps even the Faroe Islands, and Greenland (if DSpace instances appear there).&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://wiki.dspace.org/index.php/Scandinavia"&gt;http://wiki.dspace.org/index.php/Scandinavia&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;In the grand traditions of Open Source and Open Access, I borrowed presentations given at the recent DSpace User Group Rome, and gave them an update on the state of the DSpace Foundation, DSpace 2.0, and then went on to produce some original slides telling folks how to get involved in DSpace developments. Hopefully all the content will be available on the web soon.&lt;br /&gt;&lt;br /&gt;As your humble chronicaller struggled with his sub-par Norwegian, he picked up some interesting things. There is good user end development going on in Scandinavia which could be harnessed to bring improvements to the DSpace UI. There are also increasingly many requests for "Integration with ...", where the object of integration is one of a variety of library information systems. Statistics are high on the agenda here as they are everywhere else. They are also a base of experts in multi-language problems stemming from being polyglot nations with additional letters in their native alphabets.&lt;br /&gt;&lt;br /&gt;It's clear where the future of repositories lie in Scandinavian nations where the national interest and the community feature prominently in society and culture. Bibsys, a major supplier of library systems and services in Norway (and organisers of the meeting), have 29 DSpace clients on their books already, and are looking at tighter integration between it and their other products, right down to the information model level. National research reporting systems are much desired repository data sources, and internal information systems at each institutions are starting to feed into their public repositories.&lt;br /&gt;&lt;br /&gt;With such a big user group, and such a community focus, there is little doubt in my mind that the Nordic user group will be a great asset to the DSpace users in that region, and probably to the DSpace community as a whole.&lt;br /&gt;&lt;br /&gt;PS Dugnad is a Norwegian word effectively referring to voluntary, communal work which benefits the community to some degree, but is also social and enjoyable for the participants. It also formed the basis of the 2006 DSpace User Group Meeting in Bergen&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://dsug2006.uib.no/"&gt;http://dsug2006.uib.no/&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-7713247949699281106?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/BlPZVoGLi1A" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-7713247949699281106</guid>
         <pubDate>Wed, 31 Oct 2007 05:23:00 -0700</pubDate>
      </item>
      <item>
         <title>DSpace 1.5 Alpha with experimental binary distribution</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/10/dspace-15-alpha-with-experimental.html</link>
         <description>The DSpace 1.5 Alpha has now been released and we encourage you to download this exciting new release of DSpace and try it out.&lt;br /&gt;&lt;br /&gt;There are big changes in this code base, both in terms of functionality and organisation. First, we are now using Maven to manage our build process, and have carved the application into a set of core modules which can be used to assemble your desired DSpace instance. For example, the JSP UI and the Manakin UI are now available as separate UI modules, and you may build either or both of these. We are taking an important step down the road, here, to allowing for community developments to be more easily created, and also more easily shared. You should be able, with a little tinkering, to provide separate code packages which can be dropped in alongside the dspace core modules, and built along with them. There are many stages to go through before this process is complete or perfect, so we encourage you to try out this new mechanism, and to let us know how you get on, or what changes you would make. Oh, and please do share your modules with the community! Props to Mark Diggory and the MIT guys for this restructuring work.&lt;br /&gt;&lt;br /&gt;The second big and most exciting thing is that Manakin is now part of our standard distribution, and we want to see it taking over from the JSP UI over the next few major releases. A big hand for Scott Phillips and the Texas A&amp;M guys for getting this code into the distribution; they have worked really hard.&lt;br /&gt;&lt;br /&gt;In addition to this, we have an Event System which should help us start to decouple tightly integrated parts of the repository, from Richard Rodgers and the guys at MIT. Browsing is now done with a heavily configurable system written initially by myself, but with significant assistance from Graham Triggs at BioMed Central. Tim Donohue's much desired Configurable Submission system is now integrated with both JSP and Manakin interfaces and is part of the release too.&lt;br /&gt;&lt;br /&gt;Further to this we have a bunch of other functionality including: IP Authentication, better metadata and schema registry import, move items from one collection to another, metadata export, configurable multilingualism support, Google and html sitemap generator, Community and Sub-Communities as OAI Sets, and Item metadata in XHTML head &amp;lt;meta&amp;gt; elements.&lt;br /&gt;&lt;br /&gt;All in all, a good looking release. There will be a testathon organised shortly which will be announced on the mailing lists, so that we can run this up to beta and then into final release as soon as possible. There's lots to test, so please lend a hand.&lt;br /&gt;&lt;br /&gt;We are also experimenting with a binary release, which can be downloaded from the same page as the source release. We are interested in how people get on with this, so let us know on the mailing lists.&lt;br /&gt;&lt;br /&gt;Come and get it:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://sourceforge.net/project/showfiles.php?group_id=19984"&gt;http://sourceforge.net/project/showfiles.php?group_id=19984&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-4691783139275190839?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/Su8qnj6lHUM" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-4691783139275190839</guid>
         <pubDate>Thu, 25 Oct 2007 02:48:00 -0700</pubDate>
      </item>
      <item>
         <title>my my where did the summer go</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/10/my-my-where-did-summer-go.html</link>
         <description>OK, ok, it's been a long long time since I updated. Did I say at the beginning that this was an experiment in seeing if I was capable of maintaining a blog? If I didn't I should have done.&lt;br /&gt;&lt;br /&gt;But there's a good reason that I've not updated for a while. That is, that I've been working flat out on the Imperial College Digital Repository: Spir@l, and am pleased to finally announce in a quiet way that we are officially LIVE:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://spiral.imperial.ac.uk/"&gt;http://spiral.imperial.ac.uk/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;On the outside it doesn't look too serious. A standard looking DSpace, I hear you say, with an Imperial College site template on it. And you'd be right. But only about the tip of the ice-berg.&lt;br /&gt;&lt;br /&gt;Without wishing to blow my own trumpet (modesty &lt;em&gt;is&lt;/em&gt; the third or fourth best thing about me), please do check out the article which I co-wrote with my good colleague Fereshteh Afshari:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://hdl.handle.net/10044/1/493"&gt;http://hdl.handle.net/10044/1/493&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;And you may also be interested in my presentation at the recent DSpace User Group Meeting in Rome 2007 (more on that later, maybe):&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.aepic.it/conf/viewabstract.php?id=200&amp;cf=11"&gt;http://www.aepic.it/conf/viewabstract.php?id=200&amp;cf=11&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;I could probably be persueded to write a little here about how it works; maybe you'll even get snippets from the monolithic technical documentation that I'm in the middle of writing.&lt;br /&gt;&lt;br /&gt;Oh, and there's more news, but now I've got your attention again you have to wait for the next installment.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-5921631848918710005?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/nIamUPYRpyU" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-5921631848918710005</guid>
         <pubDate>Wed, 24 Oct 2007 08:27:00 -0700</pubDate>
      </item>
      <item>
         <title>EThOSnet Kick-Off</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/05/ethosnet-kick-off.html</link>
         <description>On Tuesday of this week the EThOSnet Project Board met for the first time to kick off this significant new project. For background, this project is the successor to the &lt;a rel="nofollow" target="_blank" href="http://www.ethos.ac.uk/"&gt;EThOS&lt;/a&gt; project, which in turn grew out of the Scottish projects: &lt;a rel="nofollow" target="_blank" href="http://www.thesesalive.ac.uk/"&gt;Theses Alive&lt;/a&gt; at Edinburgh, &lt;a rel="nofollow" target="_blank" href="http://www.lib.gla.ac.uk/daedalus/"&gt;DAEDALUS&lt;/a&gt; at Glasgow, and &lt;a rel="nofollow" target="_blank" href="http://www2.rgu.ac.uk/library/e-theses.htm"&gt;Electronic Theses&lt;/a&gt; at the Robert Gordon University.&lt;br /&gt;&lt;br /&gt;The aim of EThOSnet is to take the work done under EThOS and bring it up to a point where UK institutions can actually start to become early adopters, to start to digitise the back-catalogue of print theses in the UK, investigate technology for the current and the future incarnations of the system, and to basically kick-start a genuinely viable service for deposit and dissemination of UK theses.&lt;br /&gt;&lt;br /&gt;At this stage, the project does not have a Project Manager, which is causing minor hold-ups initially, but Project Director, and Director of Library Services Clare Jenkins of Imperial College Library has stepped in to hold things together until one is appointed (we are expecting to hear very soon). In the interim, the Project Board has also been put in place to check that all the 7 Work Packages have the things they need to get going.&lt;br /&gt;&lt;br /&gt;Of these 7 workpackages, the first and last are concerned with project management and exit strategy, and the meat of the project will take place in packages 2 - 6. Details of these work packages are available in the project proposal, which will hopefully be available on the JISC website soon.&lt;br /&gt;&lt;br /&gt;A quick summary, then, of some of the changes and more concrete decisions that we made during the meeting:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;br /&gt;&lt;li&gt;We have set a pleasingly high target of 20,000 digitised theses and 3,000 born-digital theses by the end of the project. This will be sourced from the many institutions who have already expressed an interest in adopting the service, before the project is even going!&lt;/li&gt;&lt;br /&gt;&lt;li&gt;The first port of call for the technology is to smooth the process of the existing software tools for repository users. I would hope to have something which works well for DSpace available quickly, and general enough to be part of the main distribution. EPrints is already fully compliant, and Fedora has representitives from the University of Hull looking after it.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;Communications will be done primarily through a soon-to-exist project wiki, and it is hoped that the existing &lt;a rel="nofollow" target="_blank" href="http://www.jiscmail.ac.uk/lists/E-THESES-UK.html"&gt;E-Theses UK list&lt;/a&gt; will be used more heavily than it is already. Imperial College has agreed to host the existing ethos website, the wiki, and potentially the toolkit if necessary (&lt;a rel="nofollow" target="_blank" href="http://ethostoolkit.rgu.ac.uk/"&gt;currently hosted&lt;/a&gt; at RGU).&lt;/li&gt;&lt;br /&gt;&lt;li&gt;Toolkit development will be ongoing, with work being done on it within a wiki, but with the option to move to some XML format for the final product&lt;/li&gt;&lt;br /&gt;&lt;/ul&gt;&lt;br /&gt;&lt;br /&gt;This is a very big project, and I can't possibly represent everything that came out of Tuesday's meeting here. In the near future expect to see links to the project wiki appear and more information to come out.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-1653851498787432588?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/iT023WB9CPI" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-1653851498787432588</guid>
         <pubDate>Thu, 10 May 2007 07:05:00 -0700</pubDate>
      </item>
   <language>en-us</language><media:rating>nonadult</media:rating></channel>
</rss><!-- fe3.pipes.re3.yahoo.com uncompressed Sat Jul 11 13:55:29 PDT 2009 -->
