<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2enclosuresfull.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:media="http://search.yahoo.com/mrss/" xmlns:yt="http://gdata.youtube.com/schemas/2007" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">
   <channel>
      <title>JISC-CRIG Planet</title>
      <description>Selected feeds from around the web searching for a single tag and aggregated into a single feed. The tag (CRIG) is for the Common Repository Interface Group as funded by the Joint Information Systems Committee.  We are improving repositories in the Higher and Further Education Sector over the next eighteen months, til Sept 2009.</description>
      <link>http://pipes.yahoo.com/pipes/pipe.info?_id=IOhBPNqn3BG4kFuCJZhxuA</link>
      <pubDate>Tue, 10 Nov 2009 18:38:50 -0800</pubDate>
      <generator>http://pipes.yahoo.com/pipes/</generator>
      <itunes:explicit>no</itunes:explicit><itunes:subtitle>Selected feeds from around the web searching for a single tag and aggregated into a single feed. The tag (CRIG) is for the Common Repository Interface Group as funded by the Joint Information Systems Committee. We are improving repositories in the Higher </itunes:subtitle><itunes:summary>Selected feeds from around the web searching for a single tag and aggregated into a single feed. The tag (CRIG) is for the Common Repository Interface Group as funded by the Joint Information Systems Committee. We are improving repositories in the Higher and Further Education Sector over the next eighteen months, til Sept 2009.</itunes:summary><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/jisc-crig" type="application/rss+xml" /><feedburner:browserFriendly></feedburner:browserFriendly><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com" /><item>
         <title>Open Science at Web-Scale: Optimising Participation and Predictive Potential : JISC</title>
         <link>http://feedproxy.google.com/~r/typepad/mleggott/loomware/~3/8KFjvZwIRhw/open-science-at-web-scale-optimising-participation-and-predictive-potential-jisc.html</link>
         <description>&lt;div&gt;&lt;blockquote&gt;This Report has attempted to draw together and synthesise evidence and opinion associated with data-intensive open science from a wide range of sources. The potential impact of data-intensive open science on research practice and research outcomes, is both substantive and far-reaching. There are implications for funding organisations, for research and information communities and for higher education institutions.&lt;/blockquote&gt; &lt;p&gt;&lt;small&gt;via &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/publications/documents/opensciencerpt.aspx"&gt;www.jisc.ac.uk&lt;/a&gt;&lt;/small&gt;&lt;/p&gt; &lt;p&gt;A comprehensive report in the emerging open science landscape.&lt;/p&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=8KFjvZwIRhw:OwqnWxBLU9c:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?d=yIl2AUoC8zA" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=8KFjvZwIRhw:OwqnWxBLU9c:F7zBnMyn0Lo"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?i=8KFjvZwIRhw:OwqnWxBLU9c:F7zBnMyn0Lo" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=8KFjvZwIRhw:OwqnWxBLU9c:V_sGLiPBpWU"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?i=8KFjvZwIRhw:OwqnWxBLU9c:V_sGLiPBpWU" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=8KFjvZwIRhw:OwqnWxBLU9c:EpLpB3ZkKWg"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?d=EpLpB3ZkKWg" border="0"&gt;&lt;/a&gt;
&lt;/div&gt;</description>
         <author>mleggott</author>
         <guid isPermaLink="false">tag:typepad.com,2003:post-6a00d83452e76c69e201287570b496970c</guid>
         <pubDate>Tue, 10 Nov 2009 06:59:28 -0800</pubDate>
      </item>
      <item>
         <title>Thoughts on JISCPress</title>
         <link>http://feedproxy.google.com/~r/ouseful/~3/z4OzaMfdenU/</link>
         <description>As we come to the final month of the JISCPress project, we had some great news over on WriteToReply last week where we were able to announce that Eduserv would be covering our hosting costs for the immediate future (Eduserv funds hosting for WriteToReply, eFoundations: Write To Reply).
So what exactly does the platform we&amp;#8217;ve been [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ouseful.wordpress.com&amp;blog=325417&amp;post=2486&amp;subd=ouseful&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://ouseful.wordpress.com/?p=2486</guid>
         <pubDate>Sun, 08 Nov 2009 10:44:12 -0800</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><p>As we come to the final month of the JISCPress project, we had some great news over on WriteToReply last week where we were able to announce that Eduserv would be covering our hosting costs for the immediate future (<a rel="nofollow" target="_blank" href="http://writetoreply.org/actually/2009/11/05/eduserv-funds-hosting-for-writetoreply/">Eduserv funds hosting for WriteToReply</a>, <a rel="nofollow" target="_blank" href="http://efoundations.typepad.com/efoundations/2009/11/write-to-reply.html">eFoundations: Write To Reply</a>).</p>
<p>So what exactly does the platform we&#8217;ve been working on have to offer? Here&#8217;s one of the ways I think of it&#8230;</p>
<p><em>A document publishing platform that automatically atomises documents to the paragraph level, allows aggregated commenting at the paragraph and &#8216;user&#8217; level, and supports the republication and re-presentation of documents in a variety of standard formats at the document level. </em></p>
<p>The first part of the process is the (manual assisted) ingress stage, in which documents are imported into the WordPress environment such that each substantive document section ideally maps onto a single WordPress &#8220;blog post&#8221;:</p>
<p><a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/psychemedia/4086733558/" title="Photo Sharing"><img src="http://farm3.static.flickr.com/2774/4086733558_8182792b82.jpg" width="500" height="338"></a></p>
<p>An RSS for the document as a whole, with one item per section, is generated automatically by the WordPress platform. A single item RSS feed is also generated for each page (so the content of each page can be easily transported around the web).</p>
<p>The second part of the process is the atomisation of each post, carried out automatically by the Digress.It theme, in which each paragraph in the document is given its own unique URI, derived from the URI of the web page (&#8220;blog post&#8221;) the paragraph appears on:</p>
<p><a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/psychemedia/4086741160/" title="Photo Sharing"><img src="http://farm3.static.flickr.com/2568/4086741160_3cf1f5f682.jpg" width="500" height="199"></a></p>
<p>Potentially, an RSS feed can also be produced for each page in which each paragraph is a separate feed item, thus allowing a page/section to be transported around the web via a single feed, but in atomised form.</p>
<p>The paragraph level chunks produced by the atomistation process can be transcluded as independent elements in independent web documents in other documents by a variety of means (as an embeddable object, via XML, txt, JSON, etc):</p>
<p><a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/psychemedia/4086754106/" title="Photo Sharing"><img src="http://farm3.static.flickr.com/2629/4086754106_6e6be55d05.jpg" width="500" height="273"></a></p>
<p>The default nature of the WordPress platform allows comments to be made at the level of each web page, with an RSS feed of comments for each page being published &#8216;for free&#8217;. JISCPress extends this functionality by allowing comments to be associated with discrete paragraphs. Views over the comments are also available at the user level, (that is, grouped according to the user who made the comments, wheresoever they are made in the document). An additional RSS fed of comments by user is also available, which means that a document on the platform can actually be used as a scaffold for a critical response to the document by a particular user.</p>
<p><a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/psychemedia/4086768294/" title="Photo Sharing"><img src="http://farm3.static.flickr.com/2761/4086768294_9f8ed02c78.jpg" width="500" height="346"></a></p>
<p>A further level of innovation is based on the automated generation of &#8217;semantic tags&#8217; at the page level. Once generated, tag based collections of posts can be syndicated in the normal way via WordPress generated tag based RSS feeds:</p>
<p><a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/psychemedia/4086015709/" title="Photo Sharing"><img src="http://farm3.static.flickr.com/2615/4086015709_6aaa67c331.jpg" width="433" height="333"></a></p>
<p>JISCPress also benefits from the Trackback mechanism implemented by WordPress. When a page or paragraph URI is linked to from a third party web page, a trackback to the originating page may be captured, which we interpret as the automated capture of links remote annotations or comments about the document. </p>
<p><a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/psychemedia/4086773460/" title="Photo Sharing"><img src="http://farm3.static.flickr.com/2534/4086773460_3a00055180.jpg" width="500" height="264"></a></p>
<p>When considered in these terms, the JISCPress/WriteToReply platform is seen to provide a powerful means of publishing documents in which individual sections may carry their own unique URI, and individual paragraphs within a section also contain their own unique URI (which in many situations may be rooted on the section URI).</p>
<p>The platform can also be regarded as republishing &#8211; or re-presenting &#8211; each section (i.e. page) <em>and</em> each paragraph as an independent entity. That is, whenever a document is published via the platform, each separate paragraph may also be thought of as being independently published &#8220;for free&#8221;, in the sense that:</p>
<p>- each paragraph is independently addressable,<br />
- each paragraph is independently commentable, and<br />
- each paragraph is independently <em>re</em>publishable/syndicatable.</p>
<p>So, given that, can you think of any ways in which the JISCPress/WriteToReply platform can support <em>your</em> document publishing and comment gathering strategy?</p> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/ouseful.wordpress.com/2486/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ouseful.wordpress.com/2486/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/ouseful.wordpress.com/2486/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ouseful.wordpress.com/2486/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/ouseful.wordpress.com/2486/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ouseful.wordpress.com/2486/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/ouseful.wordpress.com/2486/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ouseful.wordpress.com/2486/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/ouseful.wordpress.com/2486/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ouseful.wordpress.com/2486/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ouseful.wordpress.com&blog=325417&post=2486&subd=ouseful&ref=&feed=1"/></div><div class="feedflare">
<a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ouseful?a=z4OzaMfdenU:bKminmo7xMg:yIl2AUoC8zA"><img src="http://feeds.feedburner.com/~ff/ouseful?d=yIl2AUoC8zA" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ouseful?a=z4OzaMfdenU:bKminmo7xMg:2mJPEYqXBVI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=2mJPEYqXBVI" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ouseful?a=z4OzaMfdenU:bKminmo7xMg:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/ouseful?i=z4OzaMfdenU:bKminmo7xMg:gIN9vFwOqvQ" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ouseful?a=z4OzaMfdenU:bKminmo7xMg:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/ouseful?i=z4OzaMfdenU:bKminmo7xMg:F7zBnMyn0Lo" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ouseful?a=z4OzaMfdenU:bKminmo7xMg:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/ouseful?i=z4OzaMfdenU:bKminmo7xMg:V_sGLiPBpWU" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ouseful?a=z4OzaMfdenU:bKminmo7xMg:cGdyc7Q-1BI"><img src="http://feeds.feedburner.com/~ff/ouseful?d=cGdyc7Q-1BI" border="0"></a>
</div><img src="http://feeds.feedburner.com/~r/ouseful/~4/z4OzaMfdenU" height="1" width="1"/>]]></content:encoded>
         <media:content url="" medium="image">
            <media:title>Tony Hirst</media:title>
         </media:content>
         <media:content url="http://farm3.static.flickr.com/2774/4086733558_8182792b82.jpg" medium="image" />
         <media:content url="http://farm3.static.flickr.com/2568/4086741160_3cf1f5f682.jpg" medium="image" />
         <media:content url="http://farm3.static.flickr.com/2629/4086754106_6e6be55d05.jpg" medium="image" />
         <media:content url="http://farm3.static.flickr.com/2761/4086768294_9f8ed02c78.jpg" medium="image" />
         <media:content url="http://farm3.static.flickr.com/2615/4086015709_6aaa67c331.jpg" medium="image" />
         <media:content url="http://farm3.static.flickr.com/2534/4086773460_3a00055180.jpg" medium="image" />
      </item>
      <item>
         <title>Part 2 of 2: Evaluation of the &amp;#8216;Deposit Tool Show and Tell&amp;#8217; (Features and Flows of Deposit)</title>
         <link>http://infteam.jiscinvolve.org/2009/11/03/part-2-of-2-evaluation-of-the-deposit-tool-show-and-tell-features-and-flows-of-deposit/</link>
         <description>&lt;p&gt;&lt;em&gt;NOTE TO READER: JISC IS CURRENTLY IN THE PROCESS OF DRAFTING A CALL FOR PROPOSALS TO FURTHER EMBED DEPOSIT TOOLS AND SOLUTIONS INTO THE AUTHORS DAY-TO-DAY WORKBENCH. PLEASE &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/fundingopportunities.aspx"&gt;SUBSCRIBE TO ONE OF JISC’S MANY FUNDING ANNOUNCEMENT FEEDS&lt;/a&gt; FOR FURTHER INFORMATION ON THIS CALL.&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;PLEASE ALSO SEE PART 1 OF 2 WHICH PROVIDES &lt;a rel="nofollow" target="_blank" href="http://infteam.jiscinvolve.org/2009/11/03/part-1-of-2-report-on-depost-deposit-tool-show-tell-meeting-2009-12-10/"&gt;A REPORT ON THE ‘DEPOSIT TOOL SHOW &amp;amp; TELL&amp;#8217; MEETING INCLUDING A LIST OF WHAT TOOLS WERE SHOWN ON THE DAY&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Published by: &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/contactus/staff/davidfflanders"&gt;David F. Flanders (JISC Programme Manager)&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;There were three parts to the &amp;#8216;Deposit Tool Show &amp;amp; Tell Meeting which provides the scope for what a deposit tool is:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Types of deposit tools (e.g. drag and drop, email, file/folder, etc). These &lt;a rel="nofollow" target="_blank" href="http://www.slideshare.net/dfflanders/introduction-to-the-day-the-deposit-tool-show-and-tell-meeting"&gt;&amp;#8220;tool types&amp;#8221; are listed in this presentation&lt;/a&gt; by David F. Flanders.&lt;/li&gt;
&lt;li&gt;Types of features present within the deposit tools (e.g. auto name lookup, publications management, recommendations, etc.). These&lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/dff-jisc/tags/depositfeature"&gt; &amp;#8220;tool features&amp;#8221; are listed here in the the following set of pictures&lt;/a&gt; as they were written out on the day (also see the list below).&lt;/li&gt;
&lt;li&gt;Types of (work)flows that the author&amp;#8217;s research content can go through to be published as Open Access (e.g. author deposit to publisher then publisher push to repository, author deposit to personal platform with repository auto-archiving information, etc.). These&lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/dff-jisc/tags/depositflow/"&gt;&amp;#8220;deposit flows&amp;#8221; are listed here in this set of pictures&lt;/a&gt; as they were written out on the day (also see the below list along with images).&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;It is the evaluation of the latter two (&amp;#8217;DEPOSIT TOOL FEATURES&amp;#8217; and &amp;#8216;DEPOSIT FLOWS&amp;#8217;) that provided the most significant implications on the day. What follows is a very brief evaluation of what participants on the day decided were priority areas for deposit features and flows (please keep in mind this is only a &amp;#8217;straw pole&amp;#8217; taken on the day) .&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;DEPOSIT TOOL &lt;em&gt;FEATURES&lt;/em&gt;:&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;During the first half of the day twenty-some deposit tools were shown off, while these tools were being shown off each one of their features was listed on a piece of large piece of paper (i.e. flip-chart), these features were then hung up around the room during lunch time for people to go around and vote on them (by placing a green sticker on their preferred features, known as &amp;#8216;dotmocracy&amp;#8217;). Listed below is that list of deposit features along with the votes that each one recieved (to note: the fifty-some participants had 3 votes each).&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Auto Look-up / Select from Drop Down List: the author when filling in metadata about their publication is able to select basic information from lists of &amp;#8220;matching&amp;#8221; items as they type in fields such as: Publisher, ISSN, Author Name, Institution, Subject Areas, etc. 9 votes.&lt;/li&gt;
&lt;li&gt;Name Authority: Able to provide an authoritative list of author names for both authors trying to combine their various named spelling versions as well as for citation by authors of other authors. 7 votes.&lt;/li&gt;
&lt;li&gt;Manage/Edit Publications List: enabling author to manage their publications list for publishing and presenting to the wider world. 7 votes.&lt;/li&gt;
&lt;li&gt;Expose Granular Content: being able to show the content within a publication as it&amp;#8217;s own individual parts (multiple HTTP link anchors in each item) be that per section, per activity, per media type, etc. 7 votes.&lt;/li&gt;
&lt;li&gt;Recommendation: able to provide recommendation to the author (after deposit) of other like-minded authors and/or other similar content to their own. 6 votes.&lt;/li&gt;
&lt;li&gt;Email Upload (SMTP/POP3/IMAP): ability for author to email their content directly into system without intermediary. 6 votes.&lt;/li&gt;
&lt;li&gt;License: Able to add copyright metadata about the article with as little hassle as possible to the author. 5 votes.&lt;/li&gt;
&lt;li&gt;Broker: ability to pass publication to a broker that will hold paper for an embargo period until it is able to be published. 5 votes.&lt;/li&gt;
&lt;li&gt;Trust: making sure that the trust between author and tool/publisher/library/archive/etc is well established on behalf of the authors. 4 votes&lt;/li&gt;
&lt;li&gt;Catching Documents At Creation Time: As part of the author&amp;#8217;s writing process, enabling the auto saving of documents as they are versioned and created. 4 votes.&lt;/li&gt;
&lt;li&gt;Desktop Hoover: enabling the author&amp;#8217;s computer to auto crawl files and folder to suggest what content might be deposited. 2 votes.&lt;/li&gt;
&lt;li&gt;Return RIP URL: being able to return a link (after deposit) that shows the user where their content has been published. 2 votes&lt;/li&gt;
&lt;li&gt;Bulk Deposit: ability to upload collections of content in one go. 2 votes.&lt;/li&gt;
&lt;li&gt;Versioning: ability to handle multiple versions of same publication as single deposit. 2 votes.&lt;/li&gt;
&lt;li&gt;Distribution Rights: Ability to declare how published content (both technically and legally) can be repurposed and reused. 1 vote&lt;/li&gt;
&lt;li&gt;File Hierarchy: being able to have your list of online Web publications listed as a file hierarchy so that authors can organise their content in specified folders, e.g. as a networked drive. 1 vote.&lt;/li&gt;
&lt;li&gt;Workbench Toolbar: being able to mark up various parts of the publication while you are authoring it with tags and other machine readable text. 1 vote&lt;/li&gt;
&lt;li&gt;Archivist/Librarian/RepoManager/Publisher Proxy Deposit: Enable deposit tools that allow for others to deposit on behalf of the author. 1 vote&lt;/li&gt;
&lt;li&gt;Embargo Edit Area: allowing the user to enter an embargo area where they can access their data prior to it being published/archived. 1 vote.&lt;/li&gt;
&lt;li&gt;Uploading List of Publications by Author: able to upload metadata to system that provides further information about the user and their publications, like person profile metadata. 1 vote.&lt;/li&gt;
&lt;li&gt;Embed in HTML: ability to embed a deposit tool anywhere that allows for html embed script: blogs, facebook, etc. 1 vote.&lt;/li&gt;
&lt;li&gt;Save As: being able to save directly from an authoring tool, e.g. within Word by clicking a &amp;#8220;save as&amp;#8221; or rather &amp;#8220;save to&amp;#8230; repository&amp;#8221;. 0 votes.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;em&gt;Please note&lt;/em&gt;: these votes are only a snapshot of what people were thinking on the day and do not reflect a definitive list of features or concerns. Rather the vote was only intended as a way to engage people in the features listed and to help further specify which were of significant interest on the day.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;DEPOSIT TOOL &lt;em&gt;FLOWS&lt;/em&gt;:&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The list of deposit (work)flows was presented by Jim Downing as part of the ongoing Repository Handshake work which is being lead by Pablo de Castro who was also present on the day.&lt;/p&gt;
&lt;p&gt;The five flows presented were (in order of the vote, also see images below):&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Publish via Personal Publications Management System, e.g. system to sync various versions of publication out on the Web (12 votes);&lt;/li&gt;
&lt;li&gt;Author Self-Publishing, e.g. via website, blog or other personal publishing platform (9 votes);&lt;/li&gt;
&lt;li&gt;Publish via Broker, e.g. a broker service watches with publications appear on a publisher platform and pull content into an embargo area until publisher allows for open publishing of content (5 votes);&lt;/li&gt;
&lt;li&gt;Publish via Repository, e.g. author gives to institutional repository and repository is responsible for passing out to other platforms and publications systems (3 votes);&lt;/li&gt;
&lt;li&gt;Publish via Single Event/Theme, e.g. conference publishing system or other publishing platform provided by an organisation for others to contribute around a common theme/tag/node (0 votes).&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;1.&lt;img src="http://farm3.static.flickr.com/2570/4014093118_950daac333_m.jpg" alt="" width="180" height="240"/&gt; 2.&lt;img src="http://farm4.static.flickr.com/3425/4014093186_abfbced173_m.jpg" alt="" width="180" height="240"/&gt; 3.&lt;img src="http://farm3.static.flickr.com/2470/4014093244_f2fb118141_m.jpg" alt="" width="180" height="240"/&gt; 4.&lt;img src="http://farm3.static.flickr.com/2623/4014093308_06634fafa6_m.jpg" alt="" width="180" height="240"/&gt; 5.&lt;img src="http://farm4.static.flickr.com/3527/4013327541_94f56f2a6b_m.jpg" alt="" width="180" height="240"/&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;Again please note&lt;/em&gt;: these votes only represent a view of the priorities placed on the day and do not represent a comprehensive list.&lt;/p&gt;
&lt;p&gt;PLEASE ALSO SEE PART 1 OF 2 WHICH PROVIDES &lt;a rel="nofollow" target="_blank" href="http://infteam.jiscinvolve.org/2009/11/03/part-1-of-2-report-on-depost-deposit-tool-show-tell-meeting-2009-12-10/"&gt;A REPORT ON THE ‘DEPOSIT TOOL SHOW &amp;amp; TELL&amp;#8217; MEETING, INCLUDING A LIST OF WHAT TOOLS WERE SHOWN ON THE DAY&lt;/a&gt;&lt;/p&gt;</description>
         <author>dfflanders</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/11/03/part-2-of-2-evaluation-of-the-deposit-tool-show-and-tell-features-and-flows-of-deposit/</guid>
         <pubDate>Tue, 03 Nov 2009 10:28:15 -0800</pubDate>
      </item>
      <item>
         <title>Part 1 of 2: Report on #DepoST (Deposit Tool Show &amp;#38; Tell) Meeting 2009-12-10</title>
         <link>http://infteam.jiscinvolve.org/2009/11/03/part-1-of-2-report-on-depost-deposit-tool-show-tell-meeting-2009-12-10/</link>
         <description>&lt;p&gt;&lt;em&gt;NOTE TO READER: JISC IS CURRENTLY IN THE PROCESS OF DRAFTING A CALL FOR PROPOSALS TO FURTHER EMBED DEPOSIT TOOLS AND SOLUTIONS INTO THE AUTHORS DAY-TO-DAY WORKBENCH. PLEASE &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/fundingopportunities.aspx"&gt;SUBSCRIBE TO ONE OF JISC’S MANY FUNDING ANNOUNCEMENT FEEDS&lt;/a&gt; FOR FURTHER INFORMATION ON THIS CALL.&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;&lt;em&gt;&lt;strong&gt;&lt;a rel="nofollow" target="_blank" href="http://infteam.jiscinvolve.org/2009/11/03/part-2-of-2-evaluation-of-the-deposit-tool-show-and-tell-features-and-flows-of-deposit/"&gt;PLEASE SEE PART 2 OF 2 ON THE EVALUATION (FEATURES AND FLOWS) OF THE &amp;#8216;DEPOST TOOL MEETING&amp;#8217;&lt;/a&gt;&lt;/strong&gt;&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;Published by: &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/contactus/staff/davidfflanders"&gt;David F. Flanders (JISC Programme Manager)&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Just before I sat down to write this post, I quickly went back to have a look at the original&lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/reppres/tools/sword.aspx"&gt;SWORD (Deposit API) Project&lt;/a&gt; to look up when the&lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/repositories/digirep/index/SWORD_APP_Profile_1.0#SWORD_Profile_of_the_Atom_Publishing_Protocol"&gt; first draft specification was published&lt;/a&gt;, to my amazement version 1 was published *exactly* two years to the date of the &amp;#8220;Deposit Tool Show &amp;amp; Tell&amp;#8221; event: 12 October 2007. And quite significantly (as you&amp;#8217;ll see below), there are well over twenty different applications and deposit tools built atop the SWORD Deposit API since that first 1.0 publication. So, CONGRATULATIONS ON YOUR TWO YEAR ANNIVERSARY SWORD! A little tip of the hat to Rachael Heery who brought a bunch of us hackers to sit around a table to talk about how deposit could be improved, your focus and drive in this space is missed.&lt;/p&gt;
&lt;p&gt;The show (and tell) -must of course- go on, accordingly &lt;a rel="nofollow" target="_blank" href="http://docs.google.com/Doc?docid=0AbsNASxXZiL6ZGRwZDRxNHFfNDJjY3BrN2s4eg&amp;amp;hl=en"&gt;here is agenda for the day along with the people who attended&lt;/a&gt;. The rest of the story is picked up by our blogger-on-the-day Bashera Kahn:&lt;/p&gt;
&lt;p&gt;&lt;em&gt;12 October 2009, London, UK.&lt;/em&gt; JISC held a one-day Barcamp at the University of London focusing on author deposit tools, ahead of the &lt;a rel="nofollow" target="_blank" href="http://bambuser.com/channel/dsug09/broadcast/290727"&gt;DSpace User Group Meeting&lt;/a&gt; at the University of Gothenburg in Sweden.&lt;/p&gt;
&lt;p&gt;The &lt;a rel="nofollow" target="_blank" href="http://docs.google.com/View?id=ddpd4q4q_42ccpk7k8z"&gt;Deposit Show &amp;amp; Tell&lt;/a&gt; event is one of the first steps in JISC’s plan to invest £300,000 in sustained improvements to author deposit tools. It followed the September 2009 &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/publications/documents/communicatingknowledgereport.aspx"&gt;JISC report into how and why UK researchers publish and disseminate their findings&lt;/a&gt;, which provides an excellent contextual backdrop to the challenges facing the architects and users of repositories and deposit tools.&lt;/p&gt;
&lt;p&gt;‘DepoST’, as it was tagged, brought together developers and stakeholders from across the UK and Europe who have already broken ground on creating and refining author deposit tools and interfaces.&lt;/p&gt;
&lt;p&gt;Several lightning-fast rounds of demonstrations proved that the development space in this area is thriving, with a strong focus on making the deposit process quicker and easier for users authoring research content, from academics to students, librarians to archivists and curators.&lt;/p&gt;
&lt;p&gt;JISC’s David F. Flanders stressed in his welcoming address the importance of adding improved ‘feedback loops’ to the deposit process, to provide authors with more information during and after the process than just ‘Okay’.&lt;/p&gt;
&lt;p&gt;Flanders mentioned a few patterns he’d observed in the showcased tools which adopted workflows and interactions that would be familiar to users from commonplace computing or online experiences, such as:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Drag &amp;amp; Drop&lt;/li&gt;
&lt;li&gt;Upload and add, as popularised by the &lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/tools/uploadr/"&gt;Flickr Uploadr&lt;/a&gt; and other such upload tools&lt;/li&gt;
&lt;li&gt;Machine-assisted, e.g. a deposit tool that crawls the user’s HD for files to deposit&lt;/li&gt;
&lt;li&gt;Network drive e.g. a tool that allows the user to ‘map’ the folder containing papers or accompanying media&lt;/li&gt;
&lt;li&gt;Contextual community dashboard which draws on the ancillary information around other researchers in a particular subject area to create a view of the research community around that subject area&lt;/li&gt;
&lt;li&gt;Tools embedded into existing applications, e.g. Microsoft’s &lt;a rel="nofollow" target="_blank" href="http://research.microsoft.com/en-us/projects/chem4word/"&gt;Chem4Word&lt;/a&gt; project to support the authoring and rendering of semantically-rich chemistry information in Word 2007 documents.&lt;/li&gt;
&lt;/ul&gt;
&lt;div&gt;David&amp;#8217;s Slides on: &lt;a rel="nofollow" target="_blank" href="http://www.slideshare.net/dfflanders/introduction-to-the-day-the-deposit-tool-show-and-tell-meeting"&gt;The &amp;#8216;Deposit Tool Show And Tell&amp;#8217; Event (Introduction to the Day and Overview of Deposit Tool Types for Author Publishing)&lt;/a&gt;&lt;/div&gt;
&lt;p&gt;&amp;lt;&amp;#8211;!DFF: The twenty some, short and fast (&amp;#8221;lightning talk&amp;#8221;) &amp;#8217;show and tell&amp;#8217; presentations followed with five minutes a piece to SHOW their app, with five minutes &amp;#8216;question and TELL&amp;#8217; following:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Shown &amp;amp; Told:&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&lt;strong&gt;(1) &lt;a rel="nofollow" target="_blank" href="http://twitter.com/juliancheal"&gt;Julian Cheal&lt;/a&gt;&lt;/strong&gt;&lt;strong&gt;, SUE/SIS Systems Developer, &lt;/strong&gt;&lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/"&gt;&lt;strong&gt;UKOLN&lt;/strong&gt;&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; DepositAir IE Demonstrator&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; SWORD, DSpace&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt; Adobe AIR, SQLite, Ruby on Rails&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; DepositAir is an Adobe AIR application which borrows its look and feel from the Flickr Uploadr. The user drags and drops the files to deposit from the source folder to the application. DepositAir auto-populates metadata fields such as title, ISSN, publisher, author name, and then sends the files and metadata to dspace.swordapp.org.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt; &lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;(2) &lt;a rel="nofollow" target="_blank" href="http://www.twitter.com/davetaz"&gt;Dave Tarrant&lt;/a&gt;&lt;/strong&gt;&lt;strong&gt;, Postgraduate researcher, &lt;/strong&gt;&lt;a rel="nofollow" target="_blank" href="http://www.ecs.soton.ac.uk/people/dct05r"&gt;&lt;strong&gt;University of Southampton&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool: &lt;/strong&gt;ePrints 3 Upload Handler plugin&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; ePrints, SWORD, Microsoft Word, Microsoft PowerPoint&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt; OpenXML&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; The development roadmap for ePrints 3.2 is focused on a more modular experience with better desktop and cloud integration. The plug-in works with Microsoft Word 2007 and Powerpoint to extract metatdata and media during the deposit process. Although the current extraction process is inline, the plan is to make it an unobtrusive background operation.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt; &lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;(3) Pat McSweeney, ePrints project developer, University of Southampton&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; PDFMetaExtractor&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; ePrints&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt; Java, OO-Perl&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; This tool searches the user’s computer for PDFs and then intelligently extracts metadata as well as keywords specified within the document. A known issue is that non-native PDF documents (e.g. those converted from Microsoft Word documents or scanned from paper) may return incomplete information.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(4) &lt;a rel="nofollow" target="_blank" href="http://ptsefton.com/"&gt;Peter Sefton,&lt;/a&gt;&lt;/strong&gt;&lt;strong&gt; eScholarship Tech Team Manager, University of Southern Queensland, Australia&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; &lt;a rel="nofollow" target="_blank" href="http://ice.usq.edu.au/"&gt;ICE&lt;/a&gt; (Integrated Content Environment)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; Microsoft Word 2007, OpenOffice, Zotero, Wordpress&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages: &lt;/strong&gt;Windows, Mac, Ubuntu&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; ICE lets you create web and print documents from a word processor. You can use Microsoft Word, or the free OpenOffice.org. Peter demonstrated the ICE toolbar in Word, uploading the document as styled HTML to an ICE server and then publishing to a Wordpress blog. The tool is especially useful for thesis supervision, as it allows comments and annotations to be made without changing the content of the document.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(5) Richard Jones, &lt;/strong&gt;&lt;a rel="nofollow" target="_blank" href="http://www.symplectic.co.uk"&gt;&lt;strong&gt;Symplectic Limited&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; Dashboard deposit in ‘&lt;a rel="nofollow" target="_blank" href="http://www.symplectic.co.uk/products/publications.html"&gt;Publications&lt;/a&gt;’ product&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; DSpace, SHERPA/RoMEO, all major digital repository technologies&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; Symplectic’s tools to link the Repository module of the Symplectic Publications Management System to digital repositories using all major digital repository technologies. Users can upload full text documents and supporting information directly from the Symplectic Publications interface. Copyright guidance is collected automatically from SHERPA/RoMEO and made available to users. A stand-out feature is that the author provides distribution rights information only if it’s available and/or necessary; the system doesn’t mandate that this information is present.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(6) &lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/ukoln/staff/temporary-staff/"&gt;Alex Strelnikov&lt;/a&gt;&lt;/strong&gt;&lt;strong&gt;, UKOLN&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; Email-based deposit plugin for SWORD&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; SWORD&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt; Javascript&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; The premise of this deposit tool is to encourage take-up and use of ‘1-click’ deposit tools by embedding them in trusted and frequently used applications, like email, or Facebook. The user can deposit papers by attaching them to an email and sending to a pre-defined email address. The plugin checks for an attachment, and if found, sends it to an analysis server where metadata is automatically extracted. Future development roadmap includes support of email threads.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(7) Jan Reichelt, &lt;/strong&gt;&lt;a rel="nofollow" target="_blank" href="http://www.mendeley.com/"&gt;&lt;strong&gt;Mendeley&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; Mendeley&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; PubMed, CrossRef, Google Scholar, ACM, IEEE and others&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt; Windows, Mac, Linux&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; Described as “Last.fm for research papers”, Mendeley is more a workflow productivity tool rather than repository tool. It is a free research management tool for desktop &amp;amp; web which aggregates metadata from all papers added to the Mendeley research network via the Mendeley Desktop software. This indexes and organizes PDF documents and research papers, creating a personal digital bibliography for users. Mendeley has enjoyed takeup from users in highly respected universities around the world, including Stanford, MIT, Cambridge, Harvard, Aachen, Cornell and others. The company is attempting to redefine the space, time-frame and influences by which the ‘impact factor’ of scientific careers can be determined, by analysing discussions around research findings in social networks such as Twitter, Facebook and FriendFeed.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(8) &lt;a rel="nofollow" target="_blank" href="http://lucas.ucs.ed.ac.uk/"&gt;Ian Stuart&lt;/a&gt;&lt;/strong&gt;&lt;strong&gt;, Software Engineer, &lt;/strong&gt;&lt;a rel="nofollow" target="_blank" href="http://edina.ac.uk/"&gt;&lt;strong&gt;EDINA&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; The &lt;a rel="nofollow" target="_blank" href="http://edina.ac.uk/projects/Open_Access_Repository_Junction_summary.html"&gt;Open Access Repository Junction&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; RoMEO, OpenDOAR, all major repositories&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; Known as OA-RJ, this project’s aim is to build on the existing EDINA&lt;a rel="nofollow" target="_blank" href="http://depot.edina.ac.uk/"&gt;Depot&lt;/a&gt; to create a ‘middleware’ interoperability bridge between existing repositories which will act as a deposit broker system. The tool will help authors who are either not associated with an institution, or collaborative researchers from different institutions, to find the right repositories to deposit their work into. The system will automate RoMEO and OpenDOAR lookups, and provide an author disambiguation feature. Although still in development, the Nature Publishing Group is interested in using this tool.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(9) Joe Lambert, University of Southampton&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; &lt;a rel="nofollow" target="_blank" href="http://blog.mspace.fm/2009/10/13/jisc-depost-meetup/"&gt;Drag&amp;amp;Drop Deposit Tool&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; ePrints&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt; Mac, Cocoa&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; This prototype updater is written with the collaborative author in mind. It tries to address the issue of metadata tools for time-starved academics submitting PDFs to ePrints. The development roadmap suggests an ideal user experience of being able to drag and drop multiple files into the application, which would return a report of all the metadata extracted for the user to check, approve, edit if necessary and then file to the IR.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(10) Viv Cothey, &lt;/strong&gt;&lt;a rel="nofollow" target="_blank" href="http://www.gloucestershire.gov.uk/index.cfm?articleid=15434"&gt;&lt;strong&gt;Gloucestershire Archives&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; GAip desktop curation tool&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with: &lt;/strong&gt;SWORD, DSpace&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt; Perl&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; This tool stood out for being one of the only deposit tools to address archive and repository materials which aren’t academic research papers. The Gloucestershire Archives deals with physical materials as well as digital records, and faces the problem of taking “a 100-year view”. The intended user for GAip is an archivist - not the creator or the author. Viv raised the very pertinent issue of trusted storage. (Aside: anyone interested in the issues around long-term digital storage should read/listen to &lt;a rel="nofollow" target="_blank" href="http://www.longnow.org/seminars/02005/nov/14/making-digital-durable-what-time-does-to-categories/"&gt;Clay Shirky’s Long Now lecture on digital durability&lt;/a&gt;.)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(11) Tim Brody, EPrints WebDav, University of Southampton&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; Map a WebDav or FTP drive directly into ePrints 3.2&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; ePrints&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; The ePrints team presented &lt;a rel="nofollow" target="_blank" href="http://files.eprints.org/451/"&gt;a video walkthrough&lt;/a&gt; of this tool, authored by Tim Brody. This solution seems targeted at a technical IR administrator or author, as the interface design is definitely geared to people very familiar with the command line, rather than your standard non-techie academic user. It provides a browsable and searchable folder structure with ‘dropbox’ like import functionality. At present it lacks any automatic metadata harvesting, and requires the user to complete the deposit via a standard ePrints web interface.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(12) Theo Andrew &amp;amp; Fred Howell, The Open Access Repository, &lt;/strong&gt;&lt;a rel="nofollow" target="_blank" href="http://edina.ac.uk/"&gt;&lt;strong&gt;EDINA&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; EM-Loader (Extracting Metadata to Load for Open Access Deposit)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; SWORD, the Depot, &lt;a rel="nofollow" target="_blank" href="http://publicationslist.org/"&gt;PublicationsList.org&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; This project, still under development, is a proof of concept middleware that links the Depot and PublicationsList.org, a web site for researchers to build a web page listing their publications. EM-Loader’s goal is to make batch deposits easier, by handling multiple queries for metadata from web-based resources like PubMed, Web of Science, and personal databases such as EndNote, Reference Manager, BibTeX etc. Fred’s annotated presentation on the ‘&lt;a rel="nofollow" target="_blank" href="http://a.nnotate.com/php/pdfnotate.php?d=2009-10-11&amp;amp;c=fwHrIkD8#page1"&gt;From Swords to Ploughshares&lt;/a&gt;’ is available on his site.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(13) &lt;a rel="nofollow" target="_blank" href="http://blog.stuartlewis.com"&gt;Stuart Lewis&lt;/a&gt;&lt;/strong&gt;&lt;strong&gt;, IT Innovations Analyst &amp;amp; Developer, &lt;/strong&gt;&lt;a rel="nofollow" target="_blank" href="http://library.auckland.ac.nz/"&gt;&lt;strong&gt;University of Auckland Library&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool: &lt;/strong&gt;EasyDeposit configurable deposit client&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; SWORD&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages: &lt;/strong&gt;PHP&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; The &lt;a rel="nofollow" target="_blank" href="http://wiki.github.com/stuartlewis/EasyDeposit"&gt;EasyDeposit&lt;/a&gt; client is a PHP powered configurable SWORD repository deposit client which can be configured to create a custom deposit interface for your repository. In this case, Stuart demonstrated how it can be configured to accept deposits via email using the standard PHP IMAP library to connect to your inbox. It extracts metadata from the sender of the email, the email subject, and the body of the message, which should contain the abstract. The script also adds each email attachment to the deposited item. When the deposit process is completed, the sender receives an email with a URL linking to that record in the repository. The script can also be configured for deposit via Facebook.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(14) Alex Wade, Director for Scholarly Communication, &lt;/strong&gt;&lt;a rel="nofollow" target="_blank" href="http://www.microsoft.com/mscorp/tc/scholarly_communication.mspx"&gt;&lt;strong&gt;Microsoft External Research&lt;/strong&gt;&lt;/a&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; WordDeposit&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; Microsoft Word 2007, ArXiv, SWORD&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt; Windows&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; Microsoft’s External Research division is working with several leading academic organisations and researchers to produce workflow support tools. Alex discussed two exciting repository developments. First, the fact that &lt;a rel="nofollow" target="_blank" href="http://www.arxiv.org"&gt;arXiv&lt;/a&gt; now accepts submissions of Microsoft Office Word .docx files and other Office Open XML documents. Second, the company’s hosted self-publishing eJournal Service, currently in alpha, which helps conference chairs handle submissions of papers, and subsequently allows them to easily select and share those papers (via SharePoint Server 2007) with one click.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(15) Seb Francois, University of Southampton&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; sWordInbox&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; SWORD, ePrints, Wordpress&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; Seb demod an embeddable remote uploader tool for ePrints, which he developed for the University of Lincoln. It addresses the use case more widely seen as individual researchers maintain their own blogs, i.e. it integrates with Wordpress and allows the user to post their papers to their own blog once the deposit to ePrints is complete. There are still some bugs to work out, not least that embedding a login request into a web page has all the appearance of a phishing attack!&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(16) Julian Tenney and Patrick Lockey, &lt;/strong&gt;&lt;a rel="nofollow" target="_blank" href="http://www.nottingham.ac.uk/xerte/"&gt;&lt;strong&gt;Xerte&lt;/strong&gt;&lt;/a&gt;&lt;strong&gt;, University of Nottingham&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; Xerte online authoring toolkit and &lt;a rel="nofollow" target="_blank" href="http://xpert.nottingham.ac.uk/"&gt;Xpert&lt;/a&gt; deposit tool&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; Any LMS or VLE&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt; Web-based&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; This is another of the tools demo’d with a focus on something other than academic research papers. Xerte is an open source suite of tools to rapidly develop richly interactive learning content. Content created in Xerte can be deposited into Xpert, a searchable distributed repository compiled by harvesting content from the publishing institution via RSS feed. The aim is to make learning content available for re-use, re-purposing and adaptation.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(17) James Ballard &amp;amp; Richard Davis, University of London&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; &lt;a rel="nofollow" target="_blank" href="http://clasm.ulcc.ac.uk/wiki"&gt;Copyright Licensing Applications using SWORD for Moodle&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt; SWORD, Moodle, ePrints, DSpace&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt; PHP&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; Another tool in development with a focus on learning materials, CLASM assists students and academics who deposit through the familiar Moodle interface into a closed repository designed with a librarian’s workflow in mind. CLASM is designed to support better management of CLA licensed materials.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;(18) Dan Needham, University of Manchester &amp;amp; Alan Danskin, British Library&lt;/strong&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Tool:&lt;/strong&gt; &lt;a rel="nofollow" target="_blank" href="http://names.mimas.ac.uk/"&gt;Names Project&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Works with:&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Platforms/Languages:&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Description:&lt;/strong&gt; The last of the tools to focus on something other than deposit workflows, the Names Project is developing a pilot name authority system to address the critical issue of author disambiguation. It uses data from Zetoc, British Library and contextual information from research documents to build a database of all UK research authors which will reliably and uniquely identify individuals and institutions. A &lt;a rel="nofollow" target="_blank" href="http://names.mimas.ac.uk/prototype"&gt;public beta API&lt;/a&gt; is available for testing and no doubt all eyes will be on the British Library and Mimas to produce what most think will be an invaluable system.&lt;/li&gt;
&lt;/ul&gt;
&lt;div&gt;The second half of the day focused in on the &lt;em&gt;FEATURES&lt;/em&gt; that each of the above tool provided to the end-user along with the various work(&lt;em&gt;FLOWS&lt;/em&gt;) that depositing research content could take:&lt;/div&gt;
&lt;p&gt;&lt;em&gt;&lt;strong&gt;&lt;a rel="nofollow" target="_blank" href="http://infteam.jiscinvolve.org/2009/11/03/part-2-of-2-evaluation-of-the-deposit-tool-show-and-tell-features-and-flows-of-deposit/"&gt;PLEASE SEE PART 2 OF 2 ON THE EVALUATION (FEATURES AND FLOWS) OF THE &amp;#8216;DEPOST TOOL MEETING&amp;#8217;&lt;/a&gt;&lt;/strong&gt;&lt;/em&gt;&lt;/p&gt;</description>
         <author>dfflanders</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/11/03/part-1-of-2-report-on-depost-deposit-tool-show-tell-meeting-2009-12-10/</guid>
         <pubDate>Tue, 03 Nov 2009 07:14:25 -0800</pubDate>
      </item>
      <item>
         <title>Talk at Edspace Event, University of Southampton</title>
         <link>http://ukwebfocus.wordpress.com/2009/11/03/talk-at-edspace-event-university-of-southampton/</link>
         <description>I have been invited by the JISC-funded Edspace project, based at the University of Southampton to give a talk at an event on &amp;#8220;Traditional educational repositories v. Web 2.0 resource sharing&amp;#8221; to be held on Wednesday 4 November 2009. I have been asked speak on &amp;#8220;the future for educational resources and services on the Web&amp;#8221; [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ukwebfocus.wordpress.com&amp;blog=497535&amp;post=3532&amp;subd=ukwebfocus&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://ukwebfocus.wordpress.com/?p=3532</guid>
         <pubDate>Tue, 03 Nov 2009 00:00:01 -0800</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><p>I have been invited by the JISC-funded <a rel="nofollow" target="_blank" href="http://www.edspace.ecs.soton.ac.uk/">Edspace project</a>, based at the University of Southampton to give a talk at an event on &#8220;<a rel="nofollow" target="_blank" href="http://www.arts-humanities.net/event/traditional_educational_repositories_v_web_20_resource_sharing">Traditional educational repositories v. Web 2.0 resource sharing</a>&#8221; to be held on Wednesday 4 November 2009. I have been asked speak on &#8220;<em>the future for educational resources and services on the Web</em>&#8221; &#8211; a rather grandiose topic, I think! I&#8217;ve entitled the talk &#8220;<a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/web-focus/events/workshops/edspace-2009/">The Future for Educational Resource Repositories and Services in a Web 2.0 World</a>&#8221; as its the Web 2.0 aspect I feel is important (and reflects my area of expertise &#8211; I don&#8217;t claim to have anything particularly significant to say on the repository side of things).</p>
<p>I&#8217;ll be saying that many of the technical aspects of Web 2.0 are now mainstream &#8211; and indeed the Edspace&#8217;s <a rel="nofollow" target="_blank" href="http://www.edshare.soton.ac.uk/">Edshare service</a> provides RSS feeds, <a rel="nofollow" target="_blank" href="http://www.edshare.soton.ac.uk/cgi/tag_cloud">tag clouds</a>, embed functionality and &#8216;cool URIs&#8217;.</p>
<p>But the term Web 2.0 also covers the network as the platform and a culture of openness. The issue of openness of educational resources is being addressed in, for example, the <a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/oer">JISC OER programme</a> and although I personally seek to ensure that my content (such as blog posts, slides and papers) are available under a Creative Commons licence I know that there are added complexities in the area of educational resources &#8211; so I&#8217;ll not focus on the openness issue.</p>
<p>Instead I&#8217;ll raise the question of the network as the platform in the context of the futures for educational resource repositories. I&#8217;ll suggest that as <a rel="nofollow" target="_blank" href="http://ukwebfocus.wordpress.com/2009/06/26/is-it-really-a-good-time-to-be-asking-for-more-it-money/">experts predict further cuts in the public sector</a>, including higher education, wouldn&#8217;t it be appropriate for our repository services to be hosted in the cloud? And the concerns which tend to be raised (sustainability, reliability, legal issues, etc.) are implementation details which do need to be addressed &#8211; but these aren&#8217;t the important policy issues.</p>
<p>The slides I&#8217;ll be using are <a rel="nofollow" target="_blank" href="http://www.slideshare.net/lisbk/the-future-for-educational-resource-repositories-in-a-web-20-world">available on Slideshare</a> (in the Cloud(!) although a master copy is also <a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/web-focus/events/workshops/edspace-2009/">held locally</a>) and is embedded below.</p>
<p></p> 
Posted in Events, Repositories <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/ukwebfocus.wordpress.com/3532/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ukwebfocus.wordpress.com/3532/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/ukwebfocus.wordpress.com/3532/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/ukwebfocus.wordpress.com/3532/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/ukwebfocus.wordpress.com/3532/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/ukwebfocus.wordpress.com/3532/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/ukwebfocus.wordpress.com/3532/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/ukwebfocus.wordpress.com/3532/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/ukwebfocus.wordpress.com/3532/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/ukwebfocus.wordpress.com/3532/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=ukwebfocus.wordpress.com&blog=497535&post=3532&subd=ukwebfocus&ref=&feed=1"/></div>]]></content:encoded>
         <media:content url="http://0.gravatar.com/avatar/27731abff266f585f006998f65c74be9?s=96&amp;amp;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
            <media:title>Brian Kelly (UK Web Focus)</media:title>
         </media:content>
      </item>
      <item>
         <title>Displaying citation counts in DSpace</title>
         <link>http://feedproxy.google.com/~r/stuartlewis/~3/is3YRA_Xu9E/</link>
         <description>In the repository world we&amp;#8217;ve known for a while now that unless the repository provides value to a researcher, they won&amp;#8217;t use it. Nothing pleases a researcher more than to see nice big citation counts for their papers. Wouldn&amp;#8217;t it be nice if DSpace repositories could display the citation count for archived papers? I received an [...]</description>
         <guid isPermaLink="false">http://blog.stuartlewis.com/?p=503</guid>
         <pubDate>Thu, 29 Oct 2009 19:44:22 -0700</pubDate>
         <content:encoded><![CDATA[<p>In the repository world we&#8217;ve known for a while now that unless the repository provides value to a researcher, they won&#8217;t use it. Nothing pleases a researcher more than to see nice big <a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Citation_impact">citation counts</a> for their papers. Wouldn&#8217;t it be nice if <a rel="nofollow" target="_blank" href="http://www.dspace.org/">DSpace</a> repositories could display the citation count for archived papers?</p>
<p><a rel="nofollow" target="_blank" href="http://blog.stuartlewis.com/wp-content/uploads/2009/10/cite.png"><img class="aligncenter size-medium wp-image-504" title="cite" src="http://blog.stuartlewis.com/wp-content/uploads/2009/10/cite.png" alt="cite" width="600" height="144"/></a></p>
<p>I received an email yesterday about the <a rel="nofollow" target="_blank" href="http://searchapi.scopus.com/">Scopus API</a>, so thought I&#8217;d play with it for a bit of a &#8216;Friday afternoon experiment&#8217;. So here is a quick recipe for adding citation counts to DSpace&#8217;s JSPUI:</p>
<ol>
<li>Register for the Scopus API service: <a rel="nofollow" target="_blank" href="http://searchapi.scopus.com/">http://searchapi.scopus.com/</a></li>
<li>Register your website (e.g. http://dspace.example.com/): <a rel="nofollow" target="_blank" href="https://searchapi.scopus.com/developerProfile.url">https://searchapi.scopus.com/developerProfile.url</a></li>
<li>Download and save <a rel="nofollow" target="_blank" href="http://blog.stuartlewis.com/wp-content/uploads/2009/10/scopus-api.txt">this patch</a> (it only edits two files &#8211; display-item.jsp and header-default.jsp)</li>
<li>Edit the patch and insert your developer ID where you see XXXXXXXXXXX in the Javascript</li>
<li>Apply the patch to your DSpace instance</li>
<li>Re-build, and redeploy your DSpace instance</li>
<li>Visit any item that has a DOI stored in the dc.identifier.doi field</li>
<li>Look out for the citation count appearing at the top (if the item has a count of more than 0!)</li>
</ol>]]></content:encoded>
      </item>
      <item>
         <title>EC Digital Libraries and Digital Preservation Call</title>
         <link>http://infteam.jiscinvolve.org/2009/10/29/ec-digital-libraries-and-digital-preservation-call/</link>
         <description>&lt;p&gt;I went to a meeting in Peter Mandelson&amp;#8217;s basement the other day, otherwise known as the Department for Business Innovation and Skills just next to Westminster Abbey. Lord Mandelson (if you go up about 17 levels of management) is my boss, so it&amp;#8217;s good to know where the orders are coming from.&lt;/p&gt;
&lt;p&gt; Anyway &amp;#8230; the meeting was a briefing day and a chance for the European Commission to explain a bit about the priorities and procedures that people should think about if they want to apply for funding for projects in the Digital Libraries and Preservation area (Formally referred to as FP7 ICT Call 6). The presentations are now available online at &lt;a rel="nofollow" target="_blank" href="http://bit.ly/3oPGFe"&gt;http://bit.ly/3oPGFe&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;The headline issues that I took away from the meeting were &amp;#8230;&lt;/p&gt;
&lt;p&gt;The whole funding decision-making process takes nearly a year and is extremely competitive. If you are a small organisation that is simply looking for money &amp;#8230; it probably isn&amp;#8217;t for you! The commission will be evaluating proposals according to 3 main criteria:&lt;/p&gt;
&lt;p&gt;1. Are they proposing something that is useful and is technically robust?&lt;/p&gt;
&lt;p&gt;2. Will they be able to achieve their objectives?&lt;/p&gt;
&lt;p&gt;3. What impact will the work have?&lt;/p&gt;
&lt;p&gt;They are looking for effective collaborations. Consortiums must have a lead and at least 3 other partners. How many partners and where they come from is - contrary to popular belief - not that important! There have been rumours in the past that people needed to hook up with Eastern European partners, or Southern European partners, in order to get funding. This is a fallacy. You just need to demonstrate that your consortium will be effective. In fact, once you have your core group of at least 4 EU partners, additional partners (with appropriate expertise) can come from anywhere in the world.&lt;/p&gt;
&lt;p&gt;It is not generally the job of a research organisation to know about marketing and exploiting products that are created as part of a research programme. Join up with an organisation who knows about this stuff! It&amp;#8217;s important to get it right and sustainability is EXTREMELY important.&lt;/p&gt;
&lt;p&gt;Think hard about what sort of project suits your proposal &amp;#8230; The models on offer are:&lt;/p&gt;
&lt;p&gt;IP&amp;#8217;s = Integrating Projects. Large scale (Euros 6 - 12m &amp;#8230; sometimes more). R&amp;amp;D work, concepts, methods, tools, systems, often many partners. Advancing the state of the art - producing solutions that are within 3-5 years of being marketable.&lt;/p&gt;
&lt;p&gt;STREP&amp;#8217;s = Small to Medium Targeted Research Projects. (Generally Euros 2-4m). Focusing on more specific research problems with outputs that might be 5-7 years away from being marketable solutions.&lt;/p&gt;
&lt;p&gt;NoE&amp;#8217;s = Networks of Excellence. Advancing knowledge and bridging technological domains&lt;/p&gt;
&lt;p&gt;CA&amp;#8217;s = Co-ordinating Actions. Helping to ensure synergy between EC funded work&lt;/p&gt;
&lt;p&gt;SA&amp;#8217;s = Supporting Actions. Helping to maximise the effectiveness and impact of EC funded work&lt;/p&gt;
&lt;p&gt;Total funding available for this call - Euros 69m&lt;/p&gt;
&lt;p&gt;IP&amp;#8217;s and STREPS = Euros 56m&lt;/p&gt;
&lt;p&gt;NoE&amp;#8217;s and C/SA&amp;#8217;s = Euros 13m&lt;/p&gt;
&lt;p&gt;Strikes me that a lot of people will be thinking hard and talking to other people over the next 12 months to really try and grapple with some of the hard problems in the Digital Preservation area and that is going to have a marvellous impact on the amount and quality of proposals that might end up flowing towards JISC. I&amp;#8217;m not saying we&amp;#8217;ll mop up failed EC proposals!! &amp;#8230; I&amp;#8217;m simply saying this has to be good for generally raising our whole collective game in the relevant areas of research and development.&lt;/p&gt;</description>
         <author>Neil Grindley</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/10/29/ec-digital-libraries-and-digital-preservation-call/</guid>
         <pubDate>Thu, 29 Oct 2009 06:15:28 -0700</pubDate>
      </item>
      <item>
         <title>Unlock - index</title>
         <link>http://www.geoxwalk.ac.uk/index.html</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_cb77dad985e78d76a02d6ab258b9b6ec</guid>
         <pubDate>Thu, 29 Oct 2009 05:13:11 -0700</pubDate>
      </item>
      <item>
         <title>OER Programme Meeting - 20/10/2009</title>
         <link>http://infteam.jiscinvolve.org/2009/10/23/oer-programme-meeting-20102009/</link>
         <description>&lt;p&gt;On Tuesday I attended the &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/elearning/oer/progmtgoct09.aspx"&gt;OER programme meeting&lt;/a&gt; in London as the IE team representative. As it was only my second day in post my responsibilities were limited to a brief spell on the registration desk and sitting at the back of the room trying to take in as much as possible.&lt;/p&gt;
&lt;p&gt;The event consisted of three parallel sessions in the morning; an IPR overview given by Jason Campbell from &lt;a rel="nofollow" target="_blank" href="http://www.jisclegal.ac.uk/"&gt;JISC Legal&lt;/a&gt;, a session on accessibility issues led by Sal Cooke from &lt;a rel="nofollow" target="_blank" href="http://www.techdis.ac.uk/"&gt;JISC TechDis&lt;/a&gt; and the session I attended which was about Internationalization of OER led by &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/contactus/staff/davidkernohan.aspx"&gt;David Kernohan&lt;/a&gt; from the JISC eLearning team and &lt;a rel="nofollow" target="_blank" href="http://iet.open.ac.uk/people/view-profile.cfm?staff_id=p.mcandrew"&gt;Patrick McAndrew&lt;/a&gt; from &lt;a rel="nofollow" target="_blank" href="http://olnet.org/"&gt;OLnet&lt;/a&gt; at the Open University.&lt;/p&gt;
&lt;p&gt;David spoke about the work done by &lt;a rel="nofollow" target="_blank" href="http://www.msu.edu/"&gt;Michigan State University&lt;/a&gt; working in Africa with OERs based around Agricultural studies and how they were initially surprised at the lack of take-up of their resources but after additional research discovered the &amp;#8216;cultural context&amp;#8217; of the content was an issue; without a shared cultural background the resources were difficult to really understand.&lt;/p&gt;
&lt;p&gt;In an effort to improve this &lt;a rel="nofollow" target="_blank" href="http://isp.msu.edu/media/news/article.php?a=164"&gt;MSU applied to the Gates Foundation&lt;/a&gt; for funding to run a project in partnership with the actual end-users of these resources (looking beyond academics and students to the actual farmers etc) to find a way to present the content in a way that is most suitable for them. While they are already having some success with this approach they have already identified that it is not truly a sustainable or transferable approach.&lt;/p&gt;
&lt;p&gt;Patricks&amp;#8217; presentation picked up on the theme of &amp;#8216;cultural context&amp;#8217; and the idea of &amp;#8216;cultural colonisation&amp;#8217;. The OU has identified that even the words used to describe their use of OER can be seen to loaded with significance and can be percieved to be more about spreading the OU brand than any form of altruism.&lt;/p&gt;
&lt;p&gt;There were a couple of particularly interesting almost throw away moments during this talk that were picked up again during the day. The first was the assertion by Patrick that use of Creative Commons had saved OpenLearn £100k (that was the sum budgeted for legal advice that wasn&amp;#8217;t required once they committed to CC) and also a comment from the audience along the lines of &amp;#8220;I wonder why we bother with licenses at all as we know users will take no notice and do what they want with the resources!&amp;#8221;&lt;/p&gt;
&lt;p&gt;Issues around &lt;a rel="nofollow" target="_blank" href="http://creativecommons.org/"&gt;Creative Commons&lt;/a&gt; came up again and again throughout the day with questions around what is the right kind of option to choose the dominant topic.&lt;/p&gt;
&lt;p&gt;The afternoon consisted of an update on some Communication/Evaluation/Synthesis issues and an interesting discussion on how to start the process for release of OERs to be recognised and rewarded by Institutions in a manner similar to the publication of research.&lt;/p&gt;
&lt;p&gt;One of the things I was most interested in was the preview of the new JorumOpen interface built on &lt;a rel="nofollow" target="_blank" href="http://www.dspace.org/"&gt;DSpace&lt;/a&gt; and due for release early in 2010. The presentation consisted of screenshots rather than a live demo or a screencast so it was difficult to get a real feel for the interaction and as usual comments were made comparing it unfavourably to Flickr. This is always going to be an issue for a service like Jorum and it will be interesting to see how the live site compares as a user experience to something like Flickr (though the lack of a bulk upload option on launch for JorumOpen is going to be an issue I think.)&lt;/p&gt;
&lt;p&gt;There was also a quick introduction to &lt;a rel="nofollow" target="_blank" href="http://www.open.ac.uk/blogs/openair/?p=366"&gt;SCORE&lt;/a&gt; (Support Centre for Open Resources in Education) from the OU by Rose Webb which is a HEFCE funded support service for OER activity. This project is lagging behind the rest of the OER activity in this programme but there were a couple of interesting elements to the presentation; new OU short term Fellowships in OER that I am sure will be of interest to alot of people and also the creation of a Community Support Officer (part-funded by JISC) who will act as a broker between the OER projects and the huge amount of experience the OU has in working in this area.&lt;/p&gt;
&lt;p&gt;All-in-all I felt it was a very interesting event which all the attendees seemed to be enjoying and finding valuable, I certainly did. Thanks to David and Heather for making it such an interesting day.&lt;/p&gt;</description>
         <author>Matt Jukes</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/10/23/oer-programme-meeting-20102009/</guid>
         <pubDate>Fri, 23 Oct 2009 03:40:30 -0700</pubDate>
      </item>
      <item>
         <title>Look what you can do with library circulation data!</title>
         <link>http://infteam.jiscinvolve.org/2009/10/21/look-what-you-can-do-with-library-circulation-data/</link>
         <description>&lt;p&gt;The JISC MOSAIC project Developer Competition attracted entries from England, Scotland, Wales and the United States. The winner of the £1000 award was Alex Parker, an undergraduate studying Computer Science at the University of Southampton, who developed a compelling presentation of the library user activity data represented as the ‘&lt;a rel="nofollow" target="_blank" href="http://users.ecs.soton.ac.uk/ajp3g08/mosaicbookgalaxy/" title="Book Galaxy"&gt;&lt;font color="#800080"&gt;Book Galaxy’&lt;/font&gt;&lt;/a&gt;. The runners up were &lt;a rel="nofollow" target="_blank" href="http://voyager.aber.ac.uk/mosaic/" title="Voyager"&gt;Andrew Isherwood &lt;/a&gt;(2&lt;sup&gt;nd&lt;/sup&gt; place – University of Aberystwyth) and &lt;a rel="nofollow" target="_blank" href="http://www.codebrane.com/ilib" title="Codebrane"&gt;&lt;font color="#800080"&gt;Alistair Young &lt;/font&gt;&lt;/a&gt;(3&lt;sup&gt;rd&lt;/sup&gt; – University of Highlands and Islands)&lt;/p&gt;
&lt;p&gt;The MOSAIC project is investigating the possibilities for data covering user activity such as book circulation across UK Higher Education libraries. In summer 2009, the project ran the Developer Competition to see what applications might be imagined and built on such data, looking for innovative approaches in terms of applications, query and display interfaces. Entries were required to use data released under Creative Commons licence by the University of Huddersfield, containing circulation records linked to the course affiliations of the borrowers. &lt;/p&gt;
&lt;p&gt;The judges received 6 working applications, which exceeded expectations in terms in terms of quality and imagination. In addition to the award three winners, the judges commended entries from &lt;a rel="nofollow" target="_blank" href="http://ouseful.open.ac.uk/mosaic.php" title="Ouseful"&gt;Tony Hirst &lt;/a&gt;and &lt;a rel="nofollow" target="_blank" href="http://www.meanboyfriend.com/readtolearn/" title="Meanboyfriend"&gt;Owen Stephens &lt;/a&gt;of the Open University and from &lt;a rel="nofollow" target="_blank" href="http://dysinterested.com/mosaicdashboard/" title="Collection Development Dashboard"&gt;Sean Hannan &lt;/a&gt;of John Hopkins University. Ken Chad (Ken Chad Consulting) summed up the feelings of all four judges in commenting ‘I had an enjoyable couple of hours with these; it warmed my heart to see them - hats off to these guys!’ &lt;/p&gt;
&lt;p&gt;The applications covered three important areas:&lt;font face="Times New Roman"&gt; &lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Improving Resource Discovery:&lt;/strong&gt;&lt;font face="Times New Roman"&gt; &lt;/font&gt; &lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Navigate the ‘Book Galaxy’ through links based on borrowing habits&lt;font face="Times New Roman"&gt;– &lt;/font&gt;&lt;/li&gt;
&lt;li&gt;Users create and share reading lists&lt;font face="Times New Roman"&gt; &lt;/font&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Supporting learning choices:&lt;/strong&gt;&lt;font face="Times New Roman"&gt; &lt;/font&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Get a feel for a course based on the books students actually borrow&lt;font face="Times New Roman"&gt; &lt;/font&gt;&lt;/li&gt;
&lt;li&gt;Possible courses suggested based on books you’ve enjoyed reading&lt;font face="Times New Roman"&gt; &lt;/font&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Supporting decision making:&lt;/strong&gt;&lt;font face="Times New Roman"&gt; &lt;/font&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Assess circulation relating to departments and courses&lt;font face="Times New Roman"&gt;– &lt;/font&gt;&lt;/li&gt;
&lt;li&gt;Value the loans per courses as a collection performance indicator &lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The MOSAIC team will be seeking feedback from Higher Education library and learning practitioners on all six applications at the series of workshops over the next month at the Universities of Edinburgh, Sheffield, Sussex and the Open University. &lt;/p&gt;
&lt;p&gt;The applications will also be featured at the concluding MOSAIC event at the University of Wolverhampton on Wednesday 18 November, which will inform the project recommendations to JISC and to the SCONUL Shared Services project on the opportunities relating to activity data and recommendation services that might be pursued within the sector. All interested parties are invited to sign up for the event by emailing &lt;a rel="nofollow" target="_blank" href="mailto:david.kay@sero.co.uk"&gt;david.kay@sero.co.uk&lt;/a&gt;. &lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://www.sero.co.uk/mosaic/091012-MOSAIC-Demonstration-Links.doc" title="Background info for URLs"&gt;Download this document &lt;/a&gt;for URLs and further details for the six entries plus the MOSAIC project demonstrator.&lt;/p&gt;</description>
         <author>Balviar Notay</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/10/21/look-what-you-can-do-with-library-circulation-data/</guid>
         <pubDate>Wed, 21 Oct 2009 09:24:16 -0700</pubDate>
      <enclosure url="http://www.sero.co.uk/mosaic/091012-MOSAIC-Demonstration-Links.doc" length="46592" type="application/msword" /><media:content url="http://www.sero.co.uk/mosaic/091012-MOSAIC-Demonstration-Links.doc" fileSize="46592" type="application/msword" /><itunes:subtitle> The JISC MOSAIC project Developer Competition attracted entries from England, Scotland, Wales and the United States. The winner of the £1000 award was Alex Parker, an undergraduate studying Computer Science at the University of Southampton, who developed</itunes:subtitle><itunes:author>Balviar Notay</itunes:author><itunes:summary> The JISC MOSAIC project Developer Competition attracted entries from England, Scotland, Wales and the United States. The winner of the £1000 award was Alex Parker, an undergraduate studying Computer Science at the University of Southampton, who developed a compelling presentation of the library user activity data represented as the ‘Book Galaxy’. The runners up were Andrew Isherwood (2nd place – University of Aberystwyth) and Alistair Young (3rd – University of Highlands and Islands) The MOSAIC project is investigating the possibilities for data covering user activity such as book circulation across UK Higher Education libraries. In summer 2009, the project ran the Developer Competition to see what applications might be imagined and built on such data, looking for innovative approaches in terms of applications, query and display interfaces. Entries were required to use data released under Creative Commons licence by the University of Huddersfield, containing circulation records linked to the course affiliations of the borrowers.  The judges received 6 working applications, which exceeded expectations in terms in terms of quality and imagination. In addition to the award three winners, the judges commended entries from Tony Hirst and Owen Stephens of the Open University and from Sean Hannan of John Hopkins University. Ken Chad (Ken Chad Consulting) summed up the feelings of all four judges in commenting ‘I had an enjoyable couple of hours with these; it warmed my heart to see them - hats off to these guys!’ The applications covered three important areas: Improving Resource Discovery:   Navigate the ‘Book Galaxy’ through links based on borrowing habits– Users create and share reading lists Supporting learning choices: Get a feel for a course based on the books students actually borrow Possible courses suggested based on books you’ve enjoyed reading Supporting decision making: Assess circulation relating to departments and courses– Value the loans per courses as a collection performance indicator  The MOSAIC team will be seeking feedback from Higher Education library and learning practitioners on all six applications at the series of workshops over the next month at the Universities of Edinburgh, Sheffield, Sussex and the Open University. The applications will also be featured at the concluding MOSAIC event at the University of Wolverhampton on Wednesday 18 November, which will inform the project recommendations to JISC and to the SCONUL Shared Services project on the opportunities relating to activity data and recommendation services that might be pursued within the sector. All interested parties are invited to sign up for the event by emailing david.kay@sero.co.uk. Download this document for URLs and further details for the six entries plus the MOSAIC project demonstrator.</itunes:summary></item>
      <item>
         <title>&amp;#8220;Students like you also borrowed … Harry Potter” Assessing the possibilities and pitfalls surrounding the exploitation of library user activity data and recommendations</title>
         <link>http://infteam.jiscinvolve.org/2009/10/21/students-like-you-also-borrowed-%e2%80%a6-harry-potter%e2%80%9d-assessing-the-possibilities-and-pitfalls-surrounding-the-exploitation-of-library-user-activity-data-and-recommendations/</link>
         <description>&lt;p&gt;&lt;strong&gt;MOSAIC Project Event &lt;/strong&gt;– Wednesday 18 November 2009 (10.00 to 16.00) at the University of Wolverhampton&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Venue&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This event is kindly hosted by the University of Wolverhampton in the MX Building on the City Campus WV1 1AD. There is mainline rail access (Wolverhampton) and nearby car parking (Wolves football ground).&lt;strong&gt; &lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Attendance &amp;amp; Registration&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This event seeks to involve a cross section of managers and practitioners from university library and wider services. It will also be of interest to systems developers and vendors. It is limited to 30 delegates.Email philippe.ugochukwu@sero.co.uk to reserve your place(s) stating any special dietary or access requirements. Joining instructions will be issued.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Overview&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The JISC MOSAIC project has been investigating the possibilities for exploiting the user activity and usage data that might be available to the Higher Education community – to benefit libraries, national services and their users.During 2009 the project has generated 7 demonstrators, worked with a variety of real library data and run a series of practitioner workshops. In the process it has gathered a great deal of intelligence about the possibilities and pitfalls for individual universities and national services.The potential to add value for students, researchers and lecturers is weighed against challenges in terms technology (scale, distribution, aggregation), data ownership and protection, quality of data, differences between media (books, journals, etc) and value relative to competing services in a Web 2.0+ world.This concluding MOSAIC event focuses the project findings in a series of key debates. We hope to draw on your professional experience to develop a shared understanding of the challenges, the desirable opportunities and potential early developments – informing the project recommendations both to JISC and to the SCONUL Shared Services project.&lt;/p&gt;
&lt;p&gt;For more on the MOSAIC project see&lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/inf11/mosaic"&gt;http://www.jisc.ac.uk/whatwedo/programmes/inf11/mosaic&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://www.sero.co.uk/jisc-mosaic.html"&gt;http://www.sero.co.uk/jisc-mosaic.html&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Event Programme&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;09.30 Refreshments on arrival at the University of Wolverhampton&lt;/p&gt;
&lt;p&gt;10.00 Welcome– Fiona Parsons (University of Wolverhampton &amp;amp; SCONUL)&lt;/p&gt;
&lt;p&gt;10.10 Introduction: the Problem &amp;amp; the Project– David Kay (MOSAIC)&lt;/p&gt;
&lt;p&gt;10.25 Where is the data? Discussion&lt;/p&gt;
&lt;p&gt;10.40 Keynote: ‘Activity Data and the Global Information Economy: The who, what, when, where, how, why of an emerging future’- Paul Miller (Cloud of Data)&lt;/p&gt;
&lt;p&gt;11.10 Library &amp;amp; Service Perspective: The University of Huddersfield experience- Dave Pattern&lt;/p&gt;
&lt;p&gt;11.30 Breakout 1: ‘Being Practical: What real uses are there for activity data?’ Session informed by the MOSAIC developer competition applications, with choice of groups (student, research, library uses).&lt;/p&gt;
&lt;p&gt;12.15 Report back and discussion&lt;/p&gt;
&lt;p&gt;12.40 Lunch &amp;amp; Opportunity to view demonstrators&lt;/p&gt;
&lt;p&gt;13.30 Four Perspectives:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;‘The MOSAIC Demonstrator: Approaches &amp;amp; Architectures’ – Mark van Harmelen (MOSAIC)&lt;/li&gt;
&lt;li&gt;‘The good, the bad and the ugly- Data Protection v. Open Data’ – Ken Chad (MOSAIC)&lt;/li&gt;
&lt;li&gt;‘Data is not the answer: An alternative view of user requirements’ – Paul Walk (UKOLN)&lt;/li&gt;
&lt;li&gt;Key findings from the recent library workshops at Sheffield, Sussex, Edinburgh &amp;amp; the OU- Helen Harrop (MOSAIC)&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;14.30 Breakout 2: ‘Challenges &amp;amp; Solutions’ – Choice between Technical, Data and Service breakout groups&lt;/p&gt;
&lt;p&gt;15.15 Refreshments&lt;/p&gt;
&lt;p&gt;15.30 Report back and panel Q&amp;amp;A – ‘Great ideas to desirable solutions’&lt;/p&gt;
&lt;p&gt;16.00 Close&lt;/p&gt;</description>
         <author>Balviar Notay</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/10/21/students-like-you-also-borrowed-%e2%80%a6-harry-potter%e2%80%9d-assessing-the-possibilities-and-pitfalls-surrounding-the-exploitation-of-library-user-activity-data-and-recommendations/</guid>
         <pubDate>Wed, 21 Oct 2009 06:11:05 -0700</pubDate>
      </item>
      <item>
         <title>If SWORD is the answer, what is the question?</title>
         <link>http://feedproxy.google.com/~r/stuartlewis/~3/Hxz3jRMre_U/</link>
         <description>I&amp;#8217;ve just had a new collaborative paper published: &amp;#8216;If SWORD is the answer, what is the question?&amp;#8217; (DOI: 10.1108/00330330910998057). It covers the most recent iteration of the SWORD repository deposit standard, looks briefly at some issues around the present lack of adoption of SWORD, and most usefully presents seven use cases of SWORD written by [...]</description>
         <guid isPermaLink="false">http://blog.stuartlewis.com/?p=491</guid>
         <pubDate>Mon, 12 Oct 2009 13:40:35 -0700</pubDate>
         <content:encoded><![CDATA[<p>I&#8217;ve just had a new collaborative paper published: &#8216;If SWORD is the answer, what is the question?&#8217; (DOI: <a rel="nofollow" target="_blank" href="http://dx.doi.org/10.1108/00330330910998057">10.1108/00330330910998057</a>). It covers the most recent iteration of the <a rel="nofollow" target="_blank" href="http://swordapp.org/">SWORD</a> repository deposit <a rel="nofollow" target="_blank" href="http://purl.org/net/sword/">standard</a>, looks briefly at some issues around the present lack of adoption of SWORD, and most usefully presents seven use cases of SWORD written by their developers:</p>
<p>Lewis, S., Hayes, L., Newton-Wade, V., Corfield, A., Davis, R., Donohue, T., Wilson, S., If SWORD is the answer, what is the question?: Use of the Simple Web-service Offering Repository Deposit protocol, Program: electronic library and information systems, 2009, Vol 43, Issue 4, pp: 407 &#8211; 418, 10.1108/00330330910998057, Emerald Group Publishing Limited</p>
<p>Of course a copy is available open access in our repository: <a rel="nofollow" target="_blank" href="http://hdl.handle.net/2292/5315">http://hdl.handle.net/2292/5315</a></p>
<blockquote><p>Abstract:</p>
<p>Purpose – The purpose of this paper is to describe the repository deposit protocol, Simple Web-service Offering Repository Deposit (SWORD), its development iteration, and some of its potential use cases. In addition, seven case studies of institutional use of SWORD are provided.</p>
<p>Design/methodology/approach – The paper describes the recent development cycle of the SWORD standard, with issues being identified and overcome with a subsequent version. Use cases and case studies of the new standard in action are included to demonstrate the wide range of practical uses of the SWORD standard.</p>
<p>Findings – SWORD has many potential use cases and has quickly become the de facto standard for depositing items into repositories. By making use of a widely-supported interoperable standard, tools can be created that start to overcome some of the problems of gathering content for deposit into institutional repositories. They can do this by changing the submission process from a “one-size-fits-all” solution, as provided by the repository&#8217;s own user interface, to customised solutions for different users.</p>
<p>Originality/value – Many of the case studies described in this paper are new and unpublished, and describe methods of creating novel interoperable tools for depositing items into repositories. The description of SWORD version 1.3 and its development give an insight into the processes involved with the development of a new standard.</p></blockquote>
<p>The seven case studies include a thesis submission system, a <a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/inf11/jiscri/clasm">SWORD plugin for moodle</a>, a<a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/reppres/tools/road.aspx">n automated laboratory data repository deposit tool</a>, <a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/programmes/reppres/tools/feedforward.aspx">a desktop deposit tool</a>, <a rel="nofollow" target="_blank" href="http://bibapp.org/">the BibApp repository integration module</a>, a custom deposit tool for a technical report series, and <a rel="nofollow" target="_blank" href="http://blog.stuartlewis.com/2008/11/17/launched-today-the-facebook-repository-deposit-application/">the Facebook SWORD deposit tool</a>.</p>]]></content:encoded>
      </item>
      <item>
         <title>Engaging developers in Open Source projects</title>
         <link>http://www.cetis.ac.uk/members/scott/blogview?entry=20091012090842</link>
         <description>&lt;p&gt;Last Friday I attended a workshop hosted by &lt;a rel="nofollow" target="_blank" href="http://www.oss-watch.ac.uk/"&gt;OSS Watch&lt;/a&gt; on &lt;a rel="nofollow" target="_blank" href="http://www.oss-watch.ac.uk/events/2009-10-09/programme.xml"&gt;Engaging Developers with Open Source Projects&lt;/a&gt;. &lt;/p&gt; &lt;p&gt;The workshop offered several different perspectives on Open Source projects - from an OSS project, from a project consuming libraries and other outputs of OSS projects, and from a developer submitting their first patch. This was a nice mix, and it was good to see the process from different viewpoints. &lt;/p&gt; &lt;p&gt;My presentation was primarily from the OSS project end, and focussed on Wookie:&lt;/p&gt; &lt;p&gt;&lt;div style="width:425px;text-align:left;" id="__ss_2174544"&gt;&lt;a rel="nofollow" style="font:14px Helvetica, Arial, Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" target="_blank" href="http://www.slideshare.net/scottw/life-of-a-wookie" title="Life of a Wookie"&gt;Life of a Wookie&lt;/a&gt;&lt;iframe class="embeddedvideo" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=osswatchwookie-091009062430-phpapp02&amp;stripped_title=life-of-a-wookie" type="application/x-shockwave-flash" width="425" height="355"&gt;&lt;/iframe&gt;&lt;div style="font-size:11px;font-family:tahoma, arial;height:26px;padding-top:2px;"&gt;View more &lt;a rel="nofollow" style="text-decoration:underline;" target="_blank" href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a rel="nofollow" style="text-decoration:underline;" target="_blank" href="http://www.slideshare.net/scottw"&gt;scottw&lt;/a&gt;.&lt;/div&gt;&lt;/div&gt; &lt;p&gt;I think the main thing I was trying to convey is how from a project perspective you're (usually) keen to get external contributions, no matter how small, and how willing project team members are to help get people involved. My "being nice is a survival strategy in OSS" seemed to go down well as a takeaway message!&lt;/p&gt; &lt;p&gt;Ian Boston was next up and talked about how &lt;a rel="nofollow" target="_blank" href="http://www.oss-watch.ac.uk/events/2009-10-09/IB_slideware.pdf"&gt;Sakai works with Open Source projects&lt;/a&gt; - I think the point he makes about "good code - bad community vs. bad code - good community" is an excellent one. Perhaps one of the reasons why Moodle is so successful with its community are the wide range of issues that users can readily tackle themselves - whereas something very mature and well designed like Apache Commons makes developers averse to touching anything!&lt;/p&gt; &lt;p&gt;The last talk was by Mark Johnson, and was all about &lt;a rel="nofollow" target="_blank" href="http://www.oss-watch.ac.uk/events/2009-10-09/MJ_slideware.pdf"&gt;submitting a one-line patch to Moodle&lt;/a&gt;. This was a really nice walk-through from identifying a problem, engaging with the community, to getting the fix accepted, and with a clear case as to why the college would want to support this activity.&lt;/p&gt; &lt;p&gt;The discussions around the workshop also threw up some interesting issues. One of the big ones would seem to be that the core processes that developers have to engage in - particularly in Open Source projects, but in commercial development, too - are often also not taught in programming courses, namely source control and issue trackers. This is one of the hurdles for bringing on relatively new developers into a mature project. On the other hand it could be argued that Open Source projects provides a very useful training ground for developing the skills of using these systems, which translates well into other developer roles.&lt;/p&gt; &lt;p&gt;Another issue that comes up a lot is sustainability, especially in relation to funded programmes, I think we're making a lot of progress on this one, and at least now sustainability is something that projects funded by JISC have to consider. However its still not quite right, and there are probably things we can do to try to keep a good balance of innovation and sustainability where there is central funding for software development.&lt;/p&gt; &lt;p&gt;OSS Watch are a JISC innovation support centre; and they are focussed on Open Source in education. Find out more at &lt;a rel="nofollow" target="_blank" href="http://www.oss-watch.ac.uk/"&gt;http://www.oss-watch.ac.uk/&lt;/a&gt;.&lt;/p&gt;</description>
         <author>Scott Wilson</author>
         <guid isPermaLink="false">tag:cetis.ac.uk,2009-10-12:20091012090842</guid>
         <pubDate>Mon, 12 Oct 2009 02:08:42 -0700</pubDate>
      <enclosure url="http://www.oss-watch.ac.uk/events/2009-10-09/IB_slideware.pdf" length="1364543" type="application/pdf" /><media:content url="http://www.oss-watch.ac.uk/events/2009-10-09/IB_slideware.pdf" fileSize="1364543" type="application/pdf" /><itunes:subtitle> Last Friday I attended a workshop hosted by OSS Watch on Engaging Developers with Open Source Projects. The workshop offered several different perspectives on Open Source projects - from an OSS project, from a project consuming libraries and other output</itunes:subtitle><itunes:author>Scott Wilson</itunes:author><itunes:summary> Last Friday I attended a workshop hosted by OSS Watch on Engaging Developers with Open Source Projects. The workshop offered several different perspectives on Open Source projects - from an OSS project, from a project consuming libraries and other outputs of OSS projects, and from a developer submitting their first patch. This was a nice mix, and it was good to see the process from different viewpoints. My presentation was primarily from the OSS project end, and focussed on Wookie: Life of a WookieView more presentations from scottw. I think the main thing I was trying to convey is how from a project perspective you're (usually) keen to get external contributions, no matter how small, and how willing project team members are to help get people involved. My "being nice is a survival strategy in OSS" seemed to go down well as a takeaway message! Ian Boston was next up and talked about how Sakai works with Open Source projects - I think the point he makes about "good code - bad community vs. bad code - good community" is an excellent one. Perhaps one of the reasons why Moodle is so successful with its community are the wide range of issues that users can readily tackle themselves - whereas something very mature and well designed like Apache Commons makes developers averse to touching anything! The last talk was by Mark Johnson, and was all about submitting a one-line patch to Moodle. This was a really nice walk-through from identifying a problem, engaging with the community, to getting the fix accepted, and with a clear case as to why the college would want to support this activity. The discussions around the workshop also threw up some interesting issues. One of the big ones would seem to be that the core processes that developers have to engage in - particularly in Open Source projects, but in commercial development, too - are often also not taught in programming courses, namely source control and issue trackers. This is one of the hurdles for bringing on relatively new developers into a mature project. On the other hand it could be argued that Open Source projects provides a very useful training ground for developing the skills of using these systems, which translates well into other developer roles. Another issue that comes up a lot is sustainability, especially in relation to funded programmes, I think we're making a lot of progress on this one, and at least now sustainability is something that projects funded by JISC have to consider. However its still not quite right, and there are probably things we can do to try to keep a good balance of innovation and sustainability where there is central funding for software development. OSS Watch are a JISC innovation support centre; and they are focussed on Open Source in education. Find out more at http://www.oss-watch.ac.uk/.</itunes:summary></item>
      <item>
         <title>Access Day 2 - Afternoon Sessions #1</title>
         <link>http://feedproxy.google.com/~r/typepad/mleggott/loomware/~3/8UYjqO9NZjc/access-day-2---afternoon-sessions-1.html</link>
         <description>&lt;div&gt;&lt;p&gt;Dorothea Salo absolutely rocked the gym with her presentation - not an easy gig directly after lunch. Dorothea talked about data repositories and curation and gave the best presentation I have heard for a long time. The Cassandra of Repositories (not her association) kicked it up because she knows and reminds us that the current IR landscape is broken and is not afraid to say why (hence the Cassandra reference). She highlighted some of the challenges and issues: &lt;ul&gt;
&lt;li&gt;Data are there to be interacted with&lt;/li&gt;
&lt;li&gt;Data have different affordances&lt;/li&gt;
&lt;li&gt;Data and their technical environments are widely divergent in nature&lt;/li&gt;
&lt;li&gt;Data are part of Projects&lt;/li&gt;
&lt;li&gt;Data are messy and not standardized&lt;/li&gt;
&lt;li&gt;There is no magic pixie dust for digital repositories&lt;/li&gt;
&lt;li&gt;Our data buckets and processes are not Taylorist processes and require complex solutions&lt;/li&gt;
&lt;li&gt;We need to provide the ability to re-contextualize data in repositories, rather than build and encourage project silos.&lt;/li&gt; &lt;/ul&gt; Great session - &lt;a rel="nofollow" target="_blank" href="http://hosting2.epresence.tv/UPEI/1/watch/77.aspx"&gt;watch the stream&lt;/a&gt;, as the experience of listening to Dorothea is the only way to appreciate the full message.&lt;/div&gt; 
&lt;div class="feedflare"&gt;
&lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=8UYjqO9NZjc:5_-3ekvTf_A:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?d=yIl2AUoC8zA" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=8UYjqO9NZjc:5_-3ekvTf_A:F7zBnMyn0Lo"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?i=8UYjqO9NZjc:5_-3ekvTf_A:F7zBnMyn0Lo" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=8UYjqO9NZjc:5_-3ekvTf_A:V_sGLiPBpWU"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?i=8UYjqO9NZjc:5_-3ekvTf_A:V_sGLiPBpWU" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=8UYjqO9NZjc:5_-3ekvTf_A:EpLpB3ZkKWg"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?d=EpLpB3ZkKWg" border="0"&gt;&lt;/a&gt;
&lt;/div&gt;</description>
         <author>mleggott</author>
         <guid isPermaLink="false">tag:typepad.com,2003:post-6a00d83452e76c69e20120a62610f3970c</guid>
         <pubDate>Thu, 08 Oct 2009 12:38:23 -0700</pubDate>
      </item>
      <item>
         <title>iPres 2009 - Preservation Infrastructure Track</title>
         <link>http://infteam.jiscinvolve.org/2009/10/05/ipres-2009-preservation-infrastructure-track/</link>
         <description>&lt;p&gt;In San Francisco at iPres sitting in the preservation infrastructure track.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Stephen Abrams (CDL)&lt;/strong&gt; is telling us about micro-curation services. Lots of clear categorisation of types of services that institutions might require. Currently talking about storage requirements. Provide for safety through redundancy, meaning through context, utility through service. Rattling through too fast to capture detail.&lt;/p&gt;
&lt;p&gt;Q. How do CDL services compare with iRods?&lt;/p&gt;
&lt;p&gt;A. i-Rods are all part of one controlled environment. CDL Micro-services can run as small discreet functions&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pam Armstrong and Johanna Smith from Library and Archives Canada&lt;/strong&gt;.&lt;/p&gt;
&lt;p&gt;They have a trusted Digital repository project that is running from 2008-2010. They are showing a value management framework. The first concern is &amp;#8217;significance&amp;#8217;. They are looking at government records and are trying to determine which records are important even before they arrive at the archive. Talking about a filtering process. Trying to deal with web 2.0 issues and are working on some guidelines.&lt;/p&gt;
&lt;p&gt;They have established a records management task force with a high level of government support. A directive on recordkeeping is linked to a management accountability framework. If departments are found to be wanting with their records management function, they are denied the right to delete records. Good stick. There are functional requirements for EDRMS based on ISO. There is a proposed shared service for EDRMS for government info in Canada. They have built open source software eRTA for records managers. They have been working on metadata core set. They are using MODS and MARC and the info is discoverable by public. They have got to their summary already &amp;#8230; my o my - these talks are quick!&lt;/p&gt;
&lt;p&gt;The lessons learnt include the usefulness of the mandatory instrument that has consequences (see above).&lt;/p&gt;
&lt;p&gt;Q. do you accept all formats?&lt;/p&gt;
&lt;p&gt;A. No, they have acceptable formats. Can&amp;#8217;t do all formats.&lt;/p&gt;
&lt;p&gt;Q. How implemented is all of this?&lt;/p&gt;
&lt;p&gt;A. The implementation is uneven. All the instances across govt are implemented inconsistently. They have got lots of work to do to bring the legacy information into line.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Robert Sharpe - Tessella&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Representing PLANETS consortium. Title is &amp;#8220;Are you Ready? Assessment of readiness of organisations for Digital Preservation&amp;#8221;. (I&amp;#8217;m interested in this talk. Wondering how this matches up with JISC-funded AIDA project). They did a survey. To establish whether people were ready to use Digital Preservation solutions. The target group for PLANETS is national libraries and archives. There are 96 of these in Europe. They also invited any other interested parties to contribute. They got 206 responses. 70% responses from Europe. They were a diverse community representing a range of roles.&lt;/p&gt;
&lt;p&gt;15% digital preservation&lt;/p&gt;
&lt;p&gt;16% in general preservation&lt;/p&gt;
&lt;p&gt;22% curation&lt;/p&gt;
&lt;p&gt;16% IT&lt;/p&gt;
&lt;p&gt;also directors researchers data managers etc &amp;#8230;&lt;/p&gt;
&lt;p&gt;93% aware of DP challenges.&lt;/p&gt;
&lt;p&gt;17% had not considered solutions.&lt;/p&gt;
&lt;p&gt;52% did not have preservation policies.&lt;/p&gt;
&lt;p&gt;They were 3 times more likely to have a DP budget if they had a DP policy in place. The majority had budgets to do capital activities. DP not really embedded in the institutions that responded still. What needs to be preserved? Stuff in file systems = 77% &amp;#8230; many other categories going down to a long tail. National Libraries feel they have almost no control of the formats they have to accept. National Archives however claim high levels of control.&lt;/p&gt;
&lt;p&gt;80% of organisations say they have less than 100TB to store in 2009. They think that by 2019, 70% orgs will have more than 100TB and 42% will have more than 1Pb. 85% have a solution or are working on one. They are generally expecting &amp;#8216;plug and play&amp;#8217; components. That&amp;#8217;s the trend and what people are expecting.&lt;/p&gt;
&lt;p&gt;What functionality is important? Single most important function was that the repository must maintain authenticity, reliability and integrity of records. 17 different functions cited. Least important function is &amp;#8216;checks for duplicate items&amp;#8217;.Very little agreement on which standards should be used! (surprise surprise!) Of 13 standards on Robert&amp;#8217;s chart, PREMIS in the middle in terms of who is using it already.&lt;/p&gt;
&lt;p&gt;Summary &amp;#8230;&lt;/p&gt;
&lt;p&gt;Excellent start on getting DP message out&lt;br /&gt;
More work needed on policies and budgets&lt;br /&gt;
Wide range of types of digital info from range of sources&lt;br /&gt;
Significant quantities of data to preserve&lt;br /&gt;
Component-based solutions required&lt;br /&gt;
Best practice not yet clear&lt;br /&gt;
Early adopters are busy and planning to do more&lt;/p&gt;
&lt;p&gt;Q. We are doing a good job with early adopters but what about the wider community. The success factor will be general users engaging with Digital Preservation&lt;/p&gt;
&lt;p&gt;A. Yes&lt;/p&gt;
&lt;p&gt;Q. The standards you showed, the figures are high for people not even having heard of them!&lt;/p&gt;
&lt;p&gt;A. Yes.&lt;/p&gt;
&lt;p&gt;End of session&lt;/p&gt;</description>
         <author>Neil Grindley</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/10/05/ipres-2009-preservation-infrastructure-track/</guid>
         <pubDate>Mon, 05 Oct 2009 15:05:09 -0700</pubDate>
      </item>
      <item>
         <title>Repositories and Research Management Systems</title>
         <link>http://infteam.jiscinvolve.org/2009/10/05/repositories-and-research-management-systems/</link>
         <description>&lt;p&gt;There have been discussions over the years about the potential of repositories to play a role in research management systems, supporting universities&amp;#8217; ability to report, for example, for the Research Assessment Exercise. A few universities have gone down that road a long way, while others have not. Indeed, various surveys (eg &lt;a rel="nofollow" target="_blank" href="http://www.jiscinfonet.ac.uk/research/ref/index_html"&gt;infoNet work&lt;/a&gt;) suggest that research information management in UK universities is supported by a rather patchy infrastructure.&lt;/p&gt;
&lt;p&gt;A &lt;a rel="nofollow" target="_blank" href="http://ie-repository.jisc.ac.uk/408/"&gt;new JISC report &lt;/a&gt;from Rightscom gives a summary of the extent to which repositories and research management systems are integrated in the UK, and abstracts from this a table of institutional drivers that would support such integration. This will be a useful tool to repository managers and others building a case for a role for institutional repositories in research information management.&lt;/p&gt;
&lt;p&gt;Of course, this needs to be seen in the &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/whatwedo/themes/informationenvironment/researchinfomgt.aspx"&gt;wider context of UK and worldwide developments in research information management&lt;/a&gt;.&lt;/p&gt;</description>
         <author>Neil Jacobs</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/10/05/repositories-and-research-management-systems/</guid>
         <pubDate>Mon, 05 Oct 2009 13:28:02 -0700</pubDate>
      </item>
      <item>
         <title>#iPres09: e-Infrastructure and digital preservation: challenges and outlook</title>
         <link>http://davetaz-blog.blogspot.com/2009/10/ipres09-e-infrastructure-and-digital.html</link>
         <description>e-infrastructure: Starts by defining infrastructure (see wikipedia) and e-infrastructure specific to a collection of European digital repositories. So basically we are looking at opportunities to build and supply services which are applicable to these repositories. &lt;br /&gt;&lt;br /&gt;Background: EU is supplying lots of support for this and in germany they are researching national approaches, identifying activities and assign tasks to "expert" institutions. By introducing the current fields of project he is outlining that there is still a significant mismatch between the scale of the problem and the amount of effort being expended. From this he outlines that there is a significant lack of common approaches to solving problems. [I don't think this will ever go away, unless there is a mandate, and even then not everyone will want to sign up].&lt;br /&gt;&lt;br /&gt;[Lots of argument] Funding is focused on many individual projects and thus doubles up the the argument that there are no commons. This led leads to a slide about interoperability and standards and the lack of them. [Which again, i don't think will ever go away and I think that we should be appreciative that people tend to pick XML to encode their data in, this makes it interoperable right]. &lt;br /&gt;&lt;br /&gt;[This is a start of project presentation, I don't seem to see that much output. They have some simple models as diagrams, again though at this stage it is hard to see how they are not just another project which will come up with (another) set of standards which no one will then want to adopt.] &lt;br /&gt;&lt;br /&gt;Giving a set of examples now where they are going to re-use and extend existing software/projects. The goals are good, in terms of concrete steps for global infrastructure for registries, data formats, software deposits and risk management. [Just not sure how achievable all this is based upon the fact it has been the aim of many projects already]&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-5274314513105060713?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-5274314513105060713</guid>
         <pubDate>Mon, 05 Oct 2009 06:39:00 -0700</pubDate>
      </item>
      <item>
         <title>Access Day 1 - Afternoon Sessions #1</title>
         <link>http://feedproxy.google.com/~r/typepad/mleggott/loomware/~3/9IJNE9g8Gf0/access-day-1---afternoon-sessions-1.html</link>
         <description>&lt;div&gt;&lt;p&gt;Dan Chudnov, Donald Moses, Paul Pound and Stevan Harnad do the early afternoon at Access 2009&lt;/p&gt; &lt;p&gt;I unfortunately missed a lot of Dan's talk getting another speaker setup. I came in when Dan was talking about the LOCs new linked data initiative with LC Subject Headings, which are accessible in a smart and flexible framework. The library community should be eating this up - LC Subject Headings available as linked data, and free. LC is also making their newspaper data available as linked data, which is very exciting for us at UPEI with the work we are doing on newspapers. Very exciting stuff - thanks for going to Library of Congress Dan!&lt;/p&gt; &lt;p&gt;Donald and Paul gave an overview of the IslandLives project at UPEI, including a summary of the the new &lt;a rel="nofollow" target="_blank" href="http://islandora.ca/"&gt;Islandora&lt;/a&gt; software and the basic architecture. IslandLives is UPEIs project to digitize the Island's local histories and provide a reach discovery and collaborative layer around these resources. Some of the innovative features of the project include the use of Islandora (a Drupal-Fedora collaborative repository system), automated workflows, Drupal-based TEI editor and more.&lt;/p&gt; &lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=9IJNE9g8Gf0:2c4-J19I9WY:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?d=yIl2AUoC8zA" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=9IJNE9g8Gf0:2c4-J19I9WY:F7zBnMyn0Lo"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?i=9IJNE9g8Gf0:2c4-J19I9WY:F7zBnMyn0Lo" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=9IJNE9g8Gf0:2c4-J19I9WY:V_sGLiPBpWU"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?i=9IJNE9g8Gf0:2c4-J19I9WY:V_sGLiPBpWU" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?a=9IJNE9g8Gf0:2c4-J19I9WY:EpLpB3ZkKWg"&gt;&lt;img src="http://feeds.feedburner.com/~ff/typepad/mleggott/loomware?d=EpLpB3ZkKWg" border="0"&gt;&lt;/a&gt;
&lt;/div&gt;</description>
         <author>mleggott</author>
         <guid isPermaLink="false">tag:typepad.com,2003:post-6a00d83452e76c69e20120a5b419d0970b</guid>
         <pubDate>Thu, 01 Oct 2009 17:31:38 -0700</pubDate>
      </item>
      <item>
         <title>Guide to using some web 2.0 services in JISC projects - part 3 of 3</title>
         <link>http://infteam.jiscinvolve.org/2009/10/01/guide-to-using-some-web-20-services-in-jisc-projects-part-3-of-3/</link>
         <description>&lt;p&gt;This is the final post in a 3 part series about using web 2 services in JISC projects. This final part briefly discusses using skype and dealing with meetings before moving on to general advice about IPR, publicising details and further information.&lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://infteam.jiscinvolve.org/2009/09/23/guide-to-using-some-web-20-services-in-jisc-projects-part-1-of-3/"&gt;Part 1&lt;/a&gt; of the series discusses using tags for projects, twitter and blogs.&lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://infteam.jiscinvolve.org/2009/09/28/guide-to-using-some-web-20-services-in-jisc-projects-part-2-of-3/"&gt;Part 2&lt;/a&gt; discusses social bookmarking, aggregation and note-taking.&lt;/p&gt;
&lt;p&gt;The full document of all 3 parts can be downloaded from the &lt;a rel="nofollow" target="_blank" href="http://ie-repository.jisc.ac.uk/399/"&gt;JISC Information Environment repository&lt;/a&gt;.&lt;/p&gt;
&lt;h3&gt;Skype&lt;/h3&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://skype.com"&gt;Skype&lt;/a&gt; started off as a telephone service which ran over the network. It has now been extended to provide other services including instant messaging, video calls, file-sharing, screen-sharing. If you have a headset and a microphone it is a useful way to hold teleconferences with multiple participants.&lt;/p&gt;
&lt;p&gt;It is also a useful way to get instant answers from contacts as you can see when people are online and available to be contacted to ask questions using the telephone service or the instant messaging service.&lt;/p&gt;
&lt;h4&gt;Skype FAQs&lt;/h4&gt;
&lt;p&gt;Q: Do you have to pay for Skype?&lt;br /&gt;A: No – not for basic services. There are some functions that require payment. Details are on the skype site.&lt;/p&gt;
&lt;p&gt;Q: How do I find out who is available using Skype?&lt;br /&gt;A: Skype has a directory look-up service built in. It also has services whereby it uses your email list of contacts to see if any of your friends are Skype users.&lt;/p&gt;
&lt;h3&gt;Getting consensus – scheduling meetings, doing surveys&lt;/h3&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://www.doodle.com"&gt;Doodle&lt;/a&gt; is a simple but very effective free website that you can use to do things such as agree a meeting date or do a simple survey. It is easy to set up a list of possible dates or options, and you then can email a URL to everyone, so that they can specify their preferences.&lt;/p&gt;
&lt;h3&gt;Publicising your project’s contact details&lt;/h3&gt;
&lt;p&gt;All publicity is good. Make it as easy as possible for people to discover your project and keep up to date with developments by making it easy for people to see what sources are available. List them on your project website and any publicity materials and put them in your email signature. Details that are useful to share include:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Your project’s name (and acronym expansion)&lt;/li&gt;
&lt;li&gt;Your project tag and any other relevant tag&lt;/li&gt;
&lt;li&gt;Relevant twitter ids&lt;/li&gt;
&lt;li&gt;Website and blog site&lt;/li&gt;
&lt;li&gt;Links to relevant resources or public aggregations of resources that provide further information about the area you are working in.&lt;/li&gt;
&lt;li&gt;Your contact details – including Skype if you use it&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Further information&lt;/h3&gt;
&lt;p&gt;Of course it is good practice with using all of these services to read the terms and conditions first to make sure they align with how you want to use the service. There is a useful toolkit called &lt;a rel="nofollow" target="_blank" href="http://www.web2rights.org.uk/index.html"&gt; web2rights&lt;/a&gt; which can help you navigate IPR in a web 2.0 world. &lt;/p&gt;
&lt;p&gt;This is only a limited overview of some of the services available. A more comprehensive list can be found in a &lt;a rel="nofollow" target="_blank" href="http://infteam.jiscinvolve.org/2008/11/06/web-tools-for-programme-management/"&gt;previous post on this blog&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;There is a JISC project called web2practice being run by Netskills to produce guides to using web 2.0. Guides on:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Social Media;&lt;/li&gt;
&lt;li&gt;RSS;&lt;/li&gt;
&lt;li&gt;Collaborative Writing;&lt;/li&gt;
&lt;li&gt;Podcasting;&lt;/li&gt;
&lt;li&gt;Microblogging;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;can be found on the &lt;a rel="nofollow" target="_blank" href="http://web2practice.jiscinvolve.org/"&gt;web2practice blog&lt;/a&gt;.&lt;/p&gt;
&lt;h3&gt;Feedback and the future of this guide&lt;/h3&gt;
&lt;p&gt;So that is a brief overview of some of the most useful services. Obviously there are glaring omissions from this guide and we would really appreciate hearing from you which web services you find useful in managing your projects and how you use them. We will gather feedback into an updated and more thorough guide in the future.&lt;/p&gt;</description>
         <author>Andy McGregor</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/10/01/guide-to-using-some-web-20-services-in-jisc-projects-part-3-of-3/</guid>
         <pubDate>Thu, 01 Oct 2009 03:28:18 -0700</pubDate>
      </item>
      <item>
         <title>Guide to using some web 2.0 services in JISC projects - part 2 of 3</title>
         <link>http://infteam.jiscinvolve.org/2009/09/28/guide-to-using-some-web-20-services-in-jisc-projects-part-2-of-3/</link>
         <description>&lt;p&gt;This post is part 2 of a series of 3 posts about making use of web 2.0 services to help manage JISC projects.&lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://infteam.jiscinvolve.org/2009/09/23/guide-to-using-some-web-20-services-in-jisc-projects-part-1-of-3/" title="Part 1 in this series"&gt;Part 1&lt;/a&gt; discusses using tags for projects, twitter and blogs.&lt;/p&gt;
&lt;p&gt;This part of the series covers: social bookmarking, aggregation and note-taking.&lt;/p&gt;
&lt;h3&gt;Social bookmarking – Delicious&lt;/h3&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://www.delicious.com/"&gt;Delicious&lt;/a&gt; or a similar social bookmarking site is an online store of URL bookmarks that can be shared by many people. It can be used to aggregate content that is of interest to project staff. Delicious allows you to keep URLs, to group them together, and to add a description. When you add a new link to Delicious, tag it with your project’s tag so that other people in your project can find the link. They can also add bookmarks themselves, and hence a collective store of bookmarks is built up by using the tag.&lt;/p&gt;
&lt;p&gt;We will be bookmarking things that we think are useful to projects in the Information Environment programme 2009-11 using the tag inf11. There is a &lt;a rel="nofollow" target="_blank" href="http://delicious.com/search?context=all&amp;amp;p=inf11&amp;amp;lc=1" title="inf11 bookmarks"&gt;page for inf11 bookmarks&lt;/a&gt;and there is also an &lt;a rel="nofollow" target="_blank" href="http://feeds.delicious.com/v2/rss/tag/inf11?count=15" title="RSS for inf11 bookmarks"&gt;RSS feed for this page&lt;/a&gt; that can be used in aggregation pages and feed readers.&lt;/p&gt;
&lt;h4&gt;Delicious FAQs&lt;/h4&gt;
&lt;p&gt;Q: How do we see what tags are available to follow within Delicious?&lt;br /&gt;A: This isn’t necessarily the best way to do things as there is an enormous amount of content on delicious, but you can browse by tags. You could also see the tags people you know on delicious are using. Mine can be seen on &lt;a rel="nofollow" target="_blank" href="http://delicious.com/tags/amcgregor" title="Amcgregor's delicious tags"&gt;my delicious tags page&lt;/a&gt;.&lt;/p&gt;
&lt;h3&gt;Other sites&lt;/h3&gt;
&lt;p&gt;Some other sites that can be useful for projects once you have a project tag to aggregate the content are:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a rel="nofollow" target="_blank" href="http://www.flickr.com" title="Flickr"&gt;Flickr&lt;/a&gt; for photos that may be taken of the project (this could be of whiteboards or brainstorming sessions or presentations as well as records of events etc)&lt;/li&gt;
&lt;li&gt;&lt;a rel="nofollow" target="_blank" href="http://www.youtube.com" title="Youtube"&gt;Youtube&lt;/a&gt; or &lt;a rel="nofollow" target="_blank" href="http://www.vimeo.com" title="Vimeo"&gt;Vimeo&lt;/a&gt; videos are a great way to explain the project and its aims and objectives or to show off something you have developed.&lt;/li&gt;
&lt;/ul&gt;
&lt;h3&gt;Aggregation – RSS feeds &amp;amp; readers&lt;/h3&gt;
&lt;p&gt;There are many information sources on the web. How can you easily keep updated with them all; do you have to studiously visit each one on a regular basis? Well yes, that would work; but an alternative is to use an aggregator tool which pulls the latest content from each source that you’re interested in, and organises it in one place for you to peruse.&lt;/p&gt;
&lt;p&gt;The most commonly used aggregator technology is called RSS (the acronym means Really Simple Syndication). Many websites now publish their updates as “RSS feeds”. The easiest way to use these updates is to use an RSS aggregator website or tool that collects the content from the various feeds, collates it, and presents it to you as a webpage that you access using your web browser or as a piece of software on your desktop.&lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://www.netvibes.com" title="Netvibes"&gt;Netvibes&lt;/a&gt; is an example of an aggregation webpage. Netvibes also offers an option to make the a page of content that you have created available as a public page. This could be useful for providing people with information about your project.&lt;/p&gt;
&lt;p&gt;An example is the &lt;a rel="nofollow" target="_blank" href="http://www.netvibes.com/amcgregor#dev8D" title="dev8D netvibes"&gt;page we put together for an event called dev8D&lt;/a&gt;. There is also a &lt;a rel="nofollow" target="_blank" href="http://www.netvibes.com/amcgregor#inf11" title="inf11 netvibes"&gt;page for the inf11 programme&lt;/a&gt; where you can see the delicious feed and the twitter feed for inf11 on the page. There is a separate &lt;a rel="nofollow" target="_blank" href="http://www.netvibes.com/amcgregor#inf11_blogs"&gt;page that aggregates the inf11 blogs&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://igoogle.com" title="iGoogle"&gt;iGoogle&lt;/a&gt; is a good alternative to Netvibes.&lt;/p&gt;
&lt;p&gt;There are lots of tools to help manage RSS feeds and you may find that a feed reader tool is better than an aggregation web page. &lt;a rel="nofollow" target="_blank" href="http://reader.google.com" title="Google reader"&gt;Google reader&lt;/a&gt; is a popular example of a feed reader and lots of email clients also include a feed reader function.&lt;/p&gt;
&lt;h4&gt;RSS FAQs&lt;/h4&gt;
&lt;p&gt;Q: How can you tell if an information source will work with an RSS feed?&lt;br /&gt;A: Look for the &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/File:Feed-icon.svg"&gt;RSS icon&lt;/a&gt; or a link saying feed.&lt;/p&gt;
&lt;p&gt;Q: How do I add an information source to my RSS aggregator?&lt;br /&gt;A: This varies by the different types of feed reader, but if you copy the URL for the RSS page then it should be straightforward to add it into your reader – most RSS readers have a button with a name such as “Add a subscription” where you can specify the URL for the RSS feed.&lt;/p&gt;
&lt;h3&gt;Note taking&lt;/h3&gt;
&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://evernote.com" title="Evernote"&gt;Evernote&lt;/a&gt; is a fantastic application for note taking, storing photos, web clippings, screenshots and email clippings. I use it as a memory backup. An interesting recent development with Evernote is the ability to share notebooks with a group of people and this could be useful for a project knowledgebase.&lt;/p&gt;
&lt;p&gt;Evernote can be used via a desktop application, a mobile application and a web interface.You can also share evernote notebooks with people via the web. This has the advantage of enabling people to monitor the notebook using RSS. This could be used to share a notebook across a project.&lt;/p&gt;
&lt;h4&gt;Evernote FAQs&lt;/h4&gt;
&lt;p&gt;Q: Is Evernote free?&lt;br /&gt;A: Yes but there is a premium subscription model with extra storage and added features.&lt;/p&gt;
&lt;p&gt;Part 3 of this guide will cover skype, publicising contact details and further information&lt;/p&gt;</description>
         <author>Andy McGregor</author>
         <guid isPermaLink="false">http://infteam.jiscinvolve.org/2009/09/28/guide-to-using-some-web-20-services-in-jisc-projects-part-2-of-3/</guid>
         <pubDate>Mon, 28 Sep 2009 06:18:50 -0700</pubDate>
      <enclosure url="http://feeds.delicious.com/v2/rss/tag/inf11?count=15" length="-1" type="application/rss+xml; charset=utf-8" /><media:content url="http://feeds.delicious.com/v2/rss/tag/inf11?count=15" type="application/rss+xml; charset=utf-8" /><itunes:subtitle> This post is part 2 of a series of 3 posts about making use of web 2.0 services to help manage JISC projects. Part 1 discusses using tags for projects, twitter and blogs. This part of the series covers: social bookmarking, aggregation and note-taking. So</itunes:subtitle><itunes:author>Andy McGregor</itunes:author><itunes:summary> This post is part 2 of a series of 3 posts about making use of web 2.0 services to help manage JISC projects. Part 1 discusses using tags for projects, twitter and blogs. This part of the series covers: social bookmarking, aggregation and note-taking. Social bookmarking – Delicious Delicious or a similar social bookmarking site is an online store of URL bookmarks that can be shared by many people. It can be used to aggregate content that is of interest to project staff. Delicious allows you to keep URLs, to group them together, and to add a description. When you add a new link to Delicious, tag it with your project’s tag so that other people in your project can find the link. They can also add bookmarks themselves, and hence a collective store of bookmarks is built up by using the tag. We will be bookmarking things that we think are useful to projects in the Information Environment programme 2009-11 using the tag inf11. There is a page for inf11 bookmarksand there is also an RSS feed for this page that can be used in aggregation pages and feed readers. Delicious FAQs Q: How do we see what tags are available to follow within Delicious? A: This isn’t necessarily the best way to do things as there is an enormous amount of content on delicious, but you can browse by tags. You could also see the tags people you know on delicious are using. Mine can be seen on my delicious tags page. Other sites Some other sites that can be useful for projects once you have a project tag to aggregate the content are: Flickr for photos that may be taken of the project (this could be of whiteboards or brainstorming sessions or presentations as well as records of events etc) Youtube or Vimeo videos are a great way to explain the project and its aims and objectives or to show off something you have developed. Aggregation – RSS feeds &amp;amp; readers There are many information sources on the web. How can you easily keep updated with them all; do you have to studiously visit each one on a regular basis? Well yes, that would work; but an alternative is to use an aggregator tool which pulls the latest content from each source that you’re interested in, and organises it in one place for you to peruse. The most commonly used aggregator technology is called RSS (the acronym means Really Simple Syndication). Many websites now publish their updates as “RSS feeds”. The easiest way to use these updates is to use an RSS aggregator website or tool that collects the content from the various feeds, collates it, and presents it to you as a webpage that you access using your web browser or as a piece of software on your desktop. Netvibes is an example of an aggregation webpage. Netvibes also offers an option to make the a page of content that you have created available as a public page. This could be useful for providing people with information about your project. An example is the page we put together for an event called dev8D. There is also a page for the inf11 programme where you can see the delicious feed and the twitter feed for inf11 on the page. There is a separate page that aggregates the inf11 blogs. iGoogle is a good alternative to Netvibes. There are lots of tools to help manage RSS feeds and you may find that a feed reader tool is better than an aggregation web page. Google reader is a popular example of a feed reader and lots of email clients also include a feed reader function. RSS FAQs Q: How can you tell if an information source will work with an RSS feed? A: Look for the RSS icon or a link saying feed. Q: How do I add an information source to my RSS aggregator? A: This varies by the different types of feed reader, but if you copy the URL for the RSS page then it should be straightforward to add it into your reader – most RSS readers have a button with a name such as “Add a subscription” where you can specify the URL for the RSS feed. Note taking Evernote is a fantastic application for note taking, storing photos, web clippings, screenshots and email clipp</itunes:summary></item>
      <item>
         <title>Exploring curation micro-services</title>
         <link>http://feedproxy.google.com/~r/technosophia/~3/zPtUh6TmyRo/</link>
         <description>As far as I&amp;#039;m concerned, the most exciting developments this year in repositories and digital curation have come out of the California Digital Library. It has been impossible not to notice their papers and presentations. Put simply, their idea is that digital curation is enabled by &amp;#034;micro-services&amp;#034; built upon well-known abstractions such [...]</description>
         <guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=504</guid>
         <pubDate>Sat, 26 Sep 2009 21:00:29 -0700</pubDate>
         <content:encoded><![CDATA[
<p><img src="http://lackoftalent.org/images/micro_repo_thumb.png" alt="thumbnail of micro-repo tree" style="float:left;"/>As far as I&#039;m concerned, the most exciting developments this year in repositories and digital curation have come out of the California Digital Library. It has been impossible <a rel="nofollow" target="_blank" href="http://conferences.library.gatech.edu/or/or09/paper/view/95">not</a> <a rel="nofollow" target="_blank" href="http://uccsc2009.ucdavis.edu/preso/UCCSC-2009-CDL-PODS-v05.ppt">to</a> <a rel="nofollow" target="_blank" href="http://www.ijdc.net/index.php/ijdc/article/view/98">notice</a> <a rel="nofollow" target="_blank" href="https://meeting-reg.com/sunpasig/abstracts.php">their</a> <a rel="nofollow" target="_blank" href="http://www.digitalpreservation.gov/news/events/ndiipp_meetings/ndiipp09/docs/NDIIPP%20Partner%20Meeting%202009_Breakout%20Session%20Schedule.pdf">papers</a> <a rel="nofollow" target="_blank" href="http://www.ijdc.net/index.php/ijdc/article/view/108/84">and</a> <a rel="nofollow" target="_blank" href="http://www.cdlib.org/iPres/confsched.html">presentations</a>. Put simply, their idea is that digital curation is enabled by &#034;micro-services&#034; built upon well-known abstractions such as the filesystem. The benefits are obvious: filesystem tools are ubiquitous and cross-platform, and there are strong market forces to ensure the filesystem persists. The idea is radically simple and straightforward, though many questions remain about such a paradigm. I&#039;ll return to those later. </p>
<p>If you have not yet taken a look at CDL&#039;s curation micro-service specifications, most of which may be printed on as few as one or two sheets of paper, see the <a rel="nofollow" target="_blank" href="http://www.cdlib.org/inside/diglib/">Digital Library Building Blocks</a>.</p>
<p>My co-workers in the LC Repository Development Center have been chatting about these specs on and off throughout the year. After months of procrastinating, I finally read all of the specs on Thursday; it&#039;s wonderful that you can do so in the course of one reading session, I might add. Yesterday a bunch of us RDCers got together to chat (informally) about the specs: what they&#039;re for, how they work, and how they interact with one another. I learn by doing, by examples, so I combed through each of the specs in advance of our meeting and tried to <a rel="nofollow" target="_blank" href="http://twitter.com/mjgiarlo/statuses/4371794936">construct</a> a minimal repository[1] based on micro-services.<br />
<span id="more-504"></span><br />
Here is a tree visualization of the final product, inevitable warts and all: <a rel="nofollow" target="_blank" href="http://lackoftalent.org/images/micro_repo.png"><img src="http://lackoftalent.org/images/micro_repo.png" alt="sample micro-services repo tree"/></a> The services I used were <a rel="nofollow" target="_blank" href="http://www.cdlib.org/inside/diglib/namaste/namastespec.html">Namaste</a>, <a rel="nofollow" target="_blank" href="http://www.cdlib.org/inside/diglib/can/canspec.pdf">Content Access Node (CAN)</a>, <a rel="nofollow" target="_blank" href="http://www.cdlib.org/inside/diglib/pairtree/pairtreespec.html">Pairtree</a>, <a rel="nofollow" target="_blank" href="http://www.cdlib.org/inside/diglib/dflat/dflatspec.pdf">Dflat</a>, <a rel="nofollow" target="_blank" href="http://www.cdlib.org/inside/diglib/redd/reddspec.html">Reverse Directory Deltas (ReDD)</a>, <a rel="nofollow" target="_blank" href="http://www.cdlib.org/inside/diglib/clop/clopspec.pdf">Class-based System for Managing Object Properties (CLOP)</a>, and <a rel="nofollow" target="_blank" href="http://www.digitalpreservation.gov/library/resources/tools/docs/bagitspec.pdf">BagIt</a> (co-developed by LC and CDL).</p>
<p>As I mentioned in our Friday meeting, recounting my experience exploring the specs: the bad thing is that I spent an hour building a repository with rudimentary tools such as mkdir, touch, cp, ln, and emacs; but the good thing is that I built a <em>repository</em> in <em>one hour</em> using <em>common, rudimentary tools</em>. It&#039;s a very compelling paradigm. <a rel="nofollow" target="_blank" href="http://inkdroid.org/ehs">Ed</a>&#039;s already built a <a rel="nofollow" target="_blank" href="http://github.com/edsu/dflat">tool</a> implementing some of Dflat, further demonstrating how lightweight these micro-services are. (<strong>UPDATE</strong>: Ed notes that this code is a work in progress and is &#034;barely functional.&#034;) (<strong>UPDATE 2</strong>: The dflat library has come a long way. Check it out if you&#039;re interested. Also, I just committed a pretty basic Namaste library: <a rel="nofollow" target="_blank" href="http://github.com/mjgiarlo/namaste">http://github.com/mjgiarlo/namaste</a>. Only took about an hour, which is a testament to the power of lightweight specs.)</p>
<p>I am certain this will be a running thread at work as the specifications evolve and our understanding of them grows. Some questions and comments that occurred to me while exploring the micro-service specs and building the minimal repo:</p>
<ul>
<li>CAN was a bit puzzling. The spec is simple enough, but I found some of the conventions confusing, and I was left wondering what CAN provides other than a container. What I would like to see is a simple use case and perhaps more examples. Thus, the CAN stuff in my sample repo doesn&#039;t feel very useful only because I had a hard time working with the spec.</li>
<li>CLOP feels like the least mature of the specifications. It seems generally useful to be able to put digital objects, however you define that, into classes and define properties on those classes. The spec did not clearly convey to me just how it accomplishes that aim. A few examples would go a very long way. I&#039;ve got some CLOP stuff in the sample repo but I have no idea how close my implementation matches the spec.</li>
<li>Is Dflat dependent on ReDD? One would assume not since there&#039;s an optional property in the dflat-info.txt file for specifying a delta scheme. But, say, could you stub out the v001 directory (reserved to hold the initial version of a digital object) and use a system such as <a rel="nofollow" target="_blank" href="http://git-scm.com/">git</a> or <a rel="nofollow" target="_blank" href="http://bazaar-vcs.org/">bazaar</a>? <br/><br/>One might argue that these established delta schemes, if you want to call them that, have many more developers and users than a system such as ReDD and thus should persist longer and have more tools built around them. I imagine the micro-service viewpoint would acknowledge that point, but counter that the spirit of these specs is to avoid dependencies from outside the filesystem?</li>
<li>Is the ReDD specification meaningful outside of a Dflat given that any one ReDD directory knows nothing of its successors and predecessors, or is it dependent upon Dflat?</li>
<li>Could a BagIt bag live inside of the ReDD reserved &#034;full&#034; directory? That is, could the &#034;full&#034; directory be marked up appropriately to <em>be</em> a BagIt bag?</li>
<li>How many tools exist for these specs? I notice there&#039;s code in CPAN for Pairtree and Namaste, which is a fabulous start. Tools are the difference between YAMF (Yet Another Messy Filesystem) and reliably managed curation services. Granted, tools such as cp and emacs already exist and are part of the appeal of these micro-services, but there&#039;s also tremendous room for error if operations are all done &#034;by hand.&#034;</li>
<li>To what extent has CDL transitioned to using these specs/tools?</li>
<li>Are other institutions using these specs/tools? I have heard tell that digital library folks from the University of Michigan and the University of North Texas may be involved.</li>
</ul>
<p>I hope I don&#039;t sound overly critical. I&#039;m really glad our colleagues at the California Digital Library have written these specifications and applied their deep experience to what could be a transformative paradigm[2] in the digital curation world. Kudos to them!</p>
<h5>Notes</h5><ol class="footnotes"><li id="footnote_0_504" class="footnote">Perhaps it&#039;s more in line with the specs to refer to this space as &#034;a managed filesystem that drives repository and curation services,&#034; given the CDL philosophy that preservation is not a place/repository. But it&#039;s easier to say &#034;repository,&#034; so there you go.</li><li id="footnote_1_504" class="footnote">Please excuse the fanboyishness; this filesystem fetishism is exciting stuff!</li></ol><br/>
<hr/><div class="feedflare">
<a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=zPtUh6TmyRo:Uy68kBvfdAI:bcOpcFrp8Mo"><img src="http://feeds.feedburner.com/~ff/technosophia?d=bcOpcFrp8Mo" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=zPtUh6TmyRo:Uy68kBvfdAI:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/technosophia?d=dnMXMwOfBR0" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=zPtUh6TmyRo:Uy68kBvfdAI:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/technosophia?i=zPtUh6TmyRo:Uy68kBvfdAI:D7DqB2pKExk" border="0"></a>
</div><img src="http://feeds.feedburner.com/~r/technosophia/~4/zPtUh6TmyRo" height="1" width="1"/>]]></content:encoded>
      <enclosure url="http://uccsc2009.ucdavis.edu/preso/UCCSC-2009-CDL-PODS-v05.ppt" length="1569792" type="application/vnd.ms-powerpoint" /><media:content url="http://uccsc2009.ucdavis.edu/preso/UCCSC-2009-CDL-PODS-v05.ppt" fileSize="1569792" type="application/vnd.ms-powerpoint" /><itunes:subtitle>As far as I&amp;#039;m concerned, the most exciting developments this year in repositories and digital curation have come out of the California Digital Library. It has been impossible not to notice their papers and presentations. Put simply, their idea is tha</itunes:subtitle><itunes:summary>As far as I&amp;#039;m concerned, the most exciting developments this year in repositories and digital curation have come out of the California Digital Library. It has been impossible not to notice their papers and presentations. Put simply, their idea is that digital curation is enabled by &amp;#034;micro-services&amp;#034; built upon well-known abstractions such [...]</itunes:summary></item>
      <item>
         <title>I2: Survey results</title>
         <link>http://feedproxy.google.com/~r/technosophia/~3/4SISswBO5mE/</link>
         <description>I wrote in June that the I2 subgroup surveyed &amp;#034;repository managers to determine the current practices and needs of the repository community regarding institutional identifiers. Results from the survey will inform a set of use cases that will be shared with the community, and that are expected to drive the development of a new standard [...]</description>
         <guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=497</guid>
         <pubDate>Tue, 15 Sep 2009 05:29:55 -0700</pubDate>
         <content:encoded><![CDATA[
<p>I <a rel="nofollow" target="_blank" href="http://lackoftalent.org/michael/blog/2009/06/20/i2-survey/">wrote</a> in June that the I2 subgroup surveyed &#034;repository managers to determine the current practices and needs of the repository community regarding institutional identifiers. Results from the survey will inform a set of use cases that will be shared with the community, and that are expected to drive the development of a new standard for institutional identifiers.&#034;</p>
<p>The survey closed in July, and the subgroup spent August writing a report on the survey results. That report is now <a rel="nofollow" target="_blank" href="http://www.niso.org/apps/group_public/document.php?document_id=2773">final</a> and it&#039;s available to the public. Feedback may be sent to our (woefully underutilized) public <a rel="nofollow" target="_blank" href="http://www.niso.org/lists/i2info/">i2info</a> mailing list, left as a comment on this post, or e-mailed to me privately which I can forward to our internal list.</p>
<p>The next step is to build upon the report to draw yet more conclusions from the data &#8212; there&#039;s an awful lot there &#8212; and flesh out some repository use cases for institutional identifiers. The I2 core group is moving quickly towards finalizing identifier metadata elements so that a standard may be drafted, and I think having some use cases documented will help drive the standard in a direction the community can get behind.</p>
<p>Onward and upward.</p>
<div class="feedflare">
<a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=4SISswBO5mE:r7DDup2itSc:bcOpcFrp8Mo"><img src="http://feeds.feedburner.com/~ff/technosophia?d=bcOpcFrp8Mo" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=4SISswBO5mE:r7DDup2itSc:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/technosophia?d=dnMXMwOfBR0" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=4SISswBO5mE:r7DDup2itSc:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/technosophia?i=4SISswBO5mE:r7DDup2itSc:D7DqB2pKExk" border="0"></a>
</div><img src="http://feeds.feedburner.com/~r/technosophia/~4/4SISswBO5mE" height="1" width="1"/>]]></content:encoded>
      </item>
      <item>
         <title>Thoughts on digitization, data deluge and linking</title>
         <link>http://davetaz-blog.blogspot.com/2009/09/thoughts-on-digitization-data-deluge.html</link>
         <description>It's been a while since I've put a post up and this is probably due to being busy and also trying to tidy up a lot of stuff before starting on new projects.&lt;br /&gt;&lt;br /&gt;In this post then: &lt;b&gt;Digitisation&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;I never really gathered how big the area of digitisation is and how many non repository people are actively involved in digitisation. There are a great many projects &amp;gt;50 who are digitising resources and these include national libraries. Items being digitised include everything from postcards and newspapers to full books and old journals.&lt;br /&gt;&lt;br /&gt;So what's the problem here ... simple ... how many people are digitising the same things? Yes I know that there is so much out there that this is unlikely to be the case however it brings me nicely to the problem of information overload. There is already more valuable information on the internet than we can possibly handle effectively, so how do you ensure that any resources you digitize for open access usage on the web can be found and used? &lt;br /&gt;&lt;br /&gt;I don't normally say this but perhaps we should look at physical libraries for the answer. Libraries are a very good central point where you can find publications related to all subject areas, and if your local library does not have a copy then it will try and find a copy somewhere else. &lt;br /&gt;&lt;br /&gt;How then does this map onto the web? Web sites become the library and links become the references to additional items or items this site does not contain, simple right? Unfortunately with 50+ projects I can count already, this leads to 50+ different web sites all with differing information presented in different ways. Due to the presentation of each web site being totally different this means that in fact they are not a library - that pride themselves on the standard way to organise resources -&lt;br /&gt;thus web sites become books. Thus to find resources we have to rely on search engines and federation. Thus we are back to where we started and we have a problem with information overload. &lt;br /&gt;&lt;br /&gt;Unfotunately I don't have an answer to this problem, however I do know that links hold the key to the solution. Each website at the moment is simply an island of infromation, what is desperately required is the authors and community to establish links to these resources. If digitisation houses are curating refereed resources then the simplist way to link to these would be to put information about them on wikipedia.&lt;br /&gt;&lt;br /&gt;This would be my final point then, wikipedia is actually a good thing, simply because of the the community aspect. However it also provides many other huge benefits:&lt;br /&gt;&lt;ul&gt;&lt;br /&gt;&lt;li&gt;External resources such as photoes have to have a licience&lt;/li&gt;&lt;br /&gt;&lt;li&gt;In annotating a page/item you create links and establish facts which are available by semantic wikipedia (dbpedia)&lt;/li&gt;&lt;br /&gt;&lt;li&gt;Wikipedia is an easy way to establish your presence on the link data web (linkeddata.org)&lt;/li&gt;&lt;br /&gt;&lt;/ul&gt;&lt;br /&gt;&lt;br /&gt;So if you are digitising books by an author, add this link to their wikipedia page. If you are digitising a collection of World War images, add links to some of these to wikipedia and flikr.&lt;br /&gt;&lt;br /&gt;Establish links and help yourself to help everyone else.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-1374762836853461443?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-1374762836853461443</guid>
         <pubDate>Tue, 08 Sep 2009 00:01:00 -0700</pubDate>
      </item>
      <item>
         <title>Using GMail with DSpace</title>
         <link>http://feedproxy.google.com/~r/stuartlewis/~3/XdatO1EUtTI/</link>
         <description>From time to time a DSpace repository will send emails. It does this when new users are added, when new items are added, when workflow tasks need to be completed, when exports have completed etc etc. For DSpace production servers this is normally trivial to set up; enter the name of your email server in [...]</description>
         <guid isPermaLink="false">http://blog.stuartlewis.com/?p=452</guid>
         <pubDate>Fri, 04 Sep 2009 15:22:17 -0700</pubDate>
         <content:encoded><![CDATA[<p><a rel="nofollow" target="_blank" href="http://blog.stuartlewis.com/wp-content/uploads/2009/09/gmail.png"><img class="alignleft size-thumbnail wp-image-453" title="gmail" src="http://blog.stuartlewis.com/wp-content/uploads/2009/09/gmail-150x150.png" alt="gmail" width="150" height="150"/></a>From time to time a <a rel="nofollow" target="_blank" href="http://dspace.org/">DSpace</a> repository will send emails. It does this when new users are added, when new items are added, when workflow tasks need to be completed, when exports have completed etc etc. For DSpace production servers this is normally trivial to set up; enter the name of your email server in the configuration file, enter the email address that emails should be sent from, and it usually just works</p>
<p>However for developers working on DSpace, it can be a bit harder. For good reasons (e.g. spam reduction etc), many institutions make their SMTP email servers accessible only from within their network. So if you are working on DSpace from home, a cafe, or on the train, you don&#8217;t have access to your institutional network because you don&#8217;t want to bother running VPN. An ideal solution to this problem would be to use a third party SMTP server outside of your institution which you can use which developing. An ideal candidate for this is <a rel="nofollow" target="_blank" href="http://google.com/">Google&#8217;s</a> <a rel="nofollow" target="_blank" href="http://www.gmail.com/">GMail</a>.</p>
<p>At present though, DSpace does not allow you to configure some of the more complex aspects of email server configuration like enabling SSL connections, or which <a rel="nofollow" target="_blank" href="http://java.sun.com/javase/6/docs/api/javax/net/SocketFactory.html">SocketFactory</a> to use. These need to be set in order to connect securely to the GMail servers. We&#8217;ve had an <a rel="nofollow" target="_blank" href="http://jira.dspace.org/jira/browse/DS-234">open issue</a> in the <a rel="nofollow" target="_blank" href="http://jira.dspace.org/">DSpace JIRA issue tracking system</a> to address this, and it has now been included ready for DSpace version 1.6, and can be easily back ported to earlier versions.</p>
<p>To use the GMail servers, configure dspace.cfg as follows:</p>
<pre># SMTP mail server
mail.server=smtp.gmail.com # SMTP mail server authentication username and password (if required)
mail.server.username = your-user-name@gmail.com
mail.server.password = your-gmail-password # Pass extra settings to the Java mail library. Comma separated, equals sign between
# the key and the value.
mail.extraproperties = mail.smtp.socketFactory.port=465, &#92; mail.smtp.socketFactory.class=javax.net.ssl.SSLSocketFactory, &#92; mail.smtp.socketFactory.fallback=false</pre>
<p>If you are a <a rel="nofollow" target="_blank" href="http://www.google.com/apps/intl/en/group/index.html">Google Apps</a> user this will also work if you substitute the username with your Google Apps email address.</p>]]></content:encoded>
      </item>
      <item>
         <title>JISC Rapid Innovation Event</title>
         <link>http://blog.paulwalk.net/2009/09/04/jisc-rapid-innovation-event/</link>
         <description>I have just spent an interesting and inspiring 24 hours at the JISC Rapid Innovation Programme meeting, which was organised by UKOLN (disclaimer: I work for UKOLN), and funded through the JISC-funded IE Demonstrator project. The venue chosen for the event was certainly an unusual one &amp;#8211; the City of Manchester Stadium, home of Manchester [...]</description>
         <guid isPermaLink="false">http://blog.paulwalk.net/2009/09/04/untitled-3/</guid>
         <pubDate>Fri, 04 Sep 2009 12:50:49 -0700</pubDate>
         <content:encoded><![CDATA[<p><a rel="nofollow" target="_blank" href="http://www.paulwalk.net/images/mcfc.jpg"><img style="padding-right:5px;float:left;" src="http://www.paulwalk.net/images/mcfc.jpg" alt="Manchester City FC Ground" width="240" height="168"/></a>I have just spent an interesting and inspiring 24 hours at the <a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/events/jiscri-2009/">JISC Rapid Innovation Programme meeting</a>, which was organised by <a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk">UKOLN</a> (disclaimer: I work for UKOLN), and funded through the JISC-funded <a rel="nofollow" target="_blank" href="http://blog.iedemonstrator.org/">IE Demonstrator</a> project. The venue chosen for the event was certainly an unusual one &#8211; the <a rel="nofollow" target="_blank" href="http://www.cityofmanchesterstadium.co.uk/index.php">City of Manchester Stadium</a>, home of Manchester City Football Club. I thought the venue worked very well for this event and would recommend it. The event was primarily aimed at developers from the JISC Rapid Innovation projects, but with a significant number of others delegates drawn from JISC programme management as well as the the wider developer community</p>
<p>With this event, we decided to address an issue that has become apparent as JISC has started to engage more directly with developers in the HE sector: developers are often untrained, and sometimes not naturally disposed to explaining their projects to others, <em>especially</em> when those others are not themselves developers. So we hit on the (admittedly somewhat artificial) exercise of requiring a representative of each project to deliver a 45 second ‘pitch’ to the assembled audience, which was recorded to video. The project reps were then given a 15 minute consultation with one of a set of panels of three ‘experts’, led by people from media and communications backgrounds, where they reviewed the video of their pitch and discussed ways of improving upon it. These were held in the Stadium&#8217;s executive boxes overlooking the pitch, which was pretty cool! The reps were warned that they would be required to deliver a new, improved <em>20 second</em> pitch the following day….</p>
<p>This exercise was something of a gamble to be honest. We were confident that a significant number of the project reps would appreciate why it is important that they be able to clearly explain what their project is about in a few sentences, in terms that a wide variety of people might be able to understand. We <em>hoped</em> that the majority would be able to benefit from these exercises to the point where they could deliver a compelling, clear pitch for their project. The results, I’m really happy to say, were outstanding! The improvement over 24 hours was remarkable, and JISC now has a portfolio of clear explanations of the 40+ ‘rapid innovation’ projects, not to mention a group of developers better equipped to explain what it is they are working on.</p>
<p>As well as a training exercise, this event delivered a series of ‘lightening talks’, panel sessions and ‘show &amp; tell’ opportunities &#8211; a set of features which has become a staple of developer-centric events. Twitter was actively used as a <a rel="nofollow" target="_blank" href="http://twitter.com/#search?q=jiscri">back-channel</a> to the event so you can get a small sense of what was going on from that stream.</p>
<p>I also used this event to ‘officially’ launch the <a rel="nofollow" target="_blank" href="http://devcsi.ukoln.ac.uk/">DevCSI</a> project &#8211; but managed to cock up my presentation &#8211; by ‘losing’ my presenter’s display with all my notes. As I’d decided to go for a one word per slide approach for much of the presentation, this was a bit of a disaster for me. Oh well, I gathered some real interest in this nonetheless and some opportunities for events and other engagements. If you’re interested, you can read more about this on the <a rel="nofollow" target="_blank" href="http://devcsi.ukoln.ac.uk/blog/">project blog</a>.</p>
<p>I’d like to extend a big thank-you to everyone who came, many of whom stepped some way out of their ‘comfort zone’ to engage with the ‘pitching’ exercise. I’d especially like to thank Mahendra Mahey (UKOLN) who did most of the organising together with David Flanders (JISC), as well as the ever-professional UKOLN Events Team (Natasha Bishop and Michelle Smith) who seemed to work non-stop for 24 hours. Our army of professional bloggers was fantastic, offering expert advice on the pitches as well as conducting interviews, all of which have already been transcribed to the <a rel="nofollow" target="_blank" href="http://blog.iedemonstrator.org/">IE Demonstrator Blog</a>, with a large number of the projects. David Tarrant (Southampton University) and Julian Cheal (UKOLN) provided excellent technical support, maintaining a networked news service which was displayed all over the venue.</p>
<p>This event was a pleasure to be involved in &#8211; there was a great spirit of cooperation throughout which bodes really well for future events with the developer community.</p>]]></content:encoded>
      </item>
      <item>
         <title>Agile Protoyping in Academia</title>
         <link>http://feedproxy.google.com/~r/DFFlanders/~3/zbHccNFNRAU/</link>
         <description>I&amp;#8217;ve given this talk a couple of times, at the JISC Digtital Content Conference, The JISC INF11 Programme Start-Up Meeting and most recently (where I presented it virtually over twitter) at IWMW2009. It orgins stem back to chats with Ben O&amp;#8217;Steen while on the CRIGshow and most recently in more serious discussions with OSSWatch in [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&amp;blog=532341&amp;post=207&amp;subd=dfflanders&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://dfflanders.wordpress.com/2009/08/16/agile-protoyping-in-academia/</guid>
         <pubDate>Sun, 16 Aug 2009 06:11:48 -0700</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><p>I&#8217;ve given this talk a couple of times, at the JISC Digtital Content Conference, The JISC INF11 Programme Start-Up Meeting and most recently (where I presented it virtually over twitter) at IWMW2009. It orgins stem back to chats with <a rel="nofollow" target="_blank" href="http://oxfordrepo.blogspot.com/">Ben O&#8217;Steen</a> while on the <a rel="nofollow" target="_blank" href="http://crigshow.blogspot.com/">CRIGshow</a> and most recently in more serious discussions with OSSWatch in their &#8220;<a rel="nofollow" target="_blank" href="http://wiki.oss-watch.ac.uk/OpenAndAgileDevelopment">Agile and Open Expert Group</a>&#8220;. I still have a lot of questions on how project jump from being small one or two user acorns into massive oak services with lots of users, if you have solutions please do comment or send me a tweet @dfflanders.</p>
<p><iframe class="embeddedvideo" src='http://widgets.vodpod.com/w/video_embed/ExternalVideo.861308' type='application/x-shockwave-flash' pluginspage='http://www.macromedia.com/go/getflashplayer' width='425' height='350'/></p>
<p><span style="display:block;width:425px;margin:0 auto;"> </span></p>
<div style="font-size:10px;">more about &#8220;<a rel="nofollow" target="_blank" href="http://vodpod.com/watch/2062485-agile-protoyping-in-academia?pod=">Agile Protoyping in Academia</a>&#8220;<a rel="nofollow" target="_blank" href="http://vodpod.com?r=wp"></a></div> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/dfflanders.wordpress.com/207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dfflanders.wordpress.com/207/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/dfflanders.wordpress.com/207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dfflanders.wordpress.com/207/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/dfflanders.wordpress.com/207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dfflanders.wordpress.com/207/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/dfflanders.wordpress.com/207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dfflanders.wordpress.com/207/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/dfflanders.wordpress.com/207/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dfflanders.wordpress.com/207/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&blog=532341&post=207&subd=dfflanders&ref=&feed=1"/></div>]]></content:encoded>
         <media:content url="http://feedproxy.google.com/~r/DFFlanders/~5/W86qUHea5Xg/ExternalVideo.861308" medium="image">
            <media:title>dfflanders</media:title>
         </media:content>
         <category>Uncategorized</category>
         <enclosure length="33004" url="http://feedproxy.google.com/~r/DFFlanders/~5/W86qUHea5Xg/ExternalVideo.861308" type="application/x-shockwave-flash" />
      <itunes:subtitle>I&amp;#8217;ve given this talk a couple of times, at the JISC Digtital Content Conference, The JISC INF11 Programme Start-Up Meeting and most recently (where I presented it virtually over twitter) at IWMW2009. It orgins stem back to chats with Ben O&amp;#8217;Ste</itunes:subtitle><itunes:summary>I&amp;#8217;ve given this talk a couple of times, at the JISC Digtital Content Conference, The JISC INF11 Programme Start-Up Meeting and most recently (where I presented it virtually over twitter) at IWMW2009. It orgins stem back to chats with Ben O&amp;#8217;Steen while on the CRIGshow and most recently in more serious discussions with OSSWatch in [...]</itunes:summary><itunes:keywords>Uncategorized</itunes:keywords></item>
      <item>
         <title>DevCSI</title>
         <link>http://devcsi.ukoln.ac.uk/</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_eecc529d644da37dd9e074222079777b</guid>
         <pubDate>Fri, 31 Jul 2009 02:58:00 -0700</pubDate>
      </item>
      <item>
         <title>Email your repository</title>
         <link>http://feedproxy.google.com/~r/stuartlewis/~3/et9tVvf1_Yc/</link>
         <description>What modern information handling system do we probably interact with most each day? For the majority of us, it is probably our email. We send and recive dozens of emails each day. So how about enabling repository deposit via email?
It has certainly been talked about from time to time, and a plugin to the Thunderbird [...]</description>
         <guid isPermaLink="false">http://blog.stuartlewis.com/?p=419</guid>
         <pubDate>Tue, 28 Jul 2009 01:19:29 -0700</pubDate>
         <content:encoded><![CDATA[<p>What modern information handling system do we probably interact with most each day? For the majority of us, it is probably our email. We send and recive dozens of emails each day. So how about enabling repository deposit via email?</p>
<p>It has certainly been talked about from time to time, and <a rel="nofollow" target="_blank" href="http://www.vimeo.com/4733141">a plugin</a> to the <a rel="nofollow" target="_blank" href="http://www.mozillamessaging.com/">Thunderbird</a> email client has even been written that allows you to deposit attachments into repositories using <a rel="nofollow" target="_blank" href="http://swordapp.org/">SWORD</a> (no mention is made of metadata). This plugin should work with any SWORD enabled repository, but only works with one email client. Wouldn&#8217;t it be great if there was a more general solution that worked with <em>all</em> repositories, and <em>all</em> email clients?</p>
<p>Now there is! The latest version of the <a rel="nofollow" target="_blank" href="http://php.swordapp.org/">SWORD PHP library</a> (version 0.8) contains an example script showing SWORD and the PHP library in use. To make it work, just fill out a configuration file with your email address, password and IMAP mailbox details, and your repository login, password and deposit URL. After the configuration file has been filled in, all you need to do is run the script &#8216;imap-mail.php&#8217; on the command line. The script will connect to your mailbox and look at each unread message. It will package each one up and deposit it into the repository.</p>
<p>How does it work?</p>
<p>It uses the standard <a rel="nofollow" target="_blank" href="http://www.php.net/imap">PHP IMAP library</a> to connect to your inbox. For each unread message it finds, it extracts the name of the sender of the email, the email subject, and the body of the message. It uses these for metadata:</p>
<ul>
<li>From name -&gt; Author</li>
<li>Subject -&gt; Title</li>
<li>Message body -&gt; Abstract</li>
</ul>
<p>Along with the metadata, the script adds each email attachment to the deposited item. An example of an email deposited this way can be seen at: <a rel="nofollow" target="_blank" href="http://dspace.swordapp.org/jspui/handle/123456789/318">http://dspace.swordapp.org/jspui/handle/123456789/318</a></p>
<p>If you want to try it out, but don&#8217;t want to set it all up, for a limited time I&#8217;ll leave it running against the <a rel="nofollow" target="_blank" href="mailto:deposit@swordapp.org">deposit@swordapp.org</a> mailbox. Send an email to <a rel="nofollow" target="_blank" href="mailto:deposit@swordapp.org">deposit@swordapp.org</a> and when I periodically run the script, your email will be deposited into the test <a rel="nofollow" target="_blank" href="http://dspace.org/">DSpace</a>/<a rel="nofollow" target="_blank" href="http://swordapp.org/">SWORD</a> repository at <a rel="nofollow" target="_blank" href="http://dspace.swordapp.org/">http://dspace.swordapp.org/</a>. Please bear in mind that the repository is open access to the world, so anyone can see what your email and optional attachments contain! (Please consider working hours / timezones etc when working out when I am likely to next run the script! You will receive a confirmation email when your deposit has taken place.)</p>
<p>A few further thoughts:</p>
<ul>
<li>What is the use of this script? I can think of a few. If you want to allow faculty members to deposit full texts easily, and you are happy to update the record with better metadata, this script could help you out. Alternatively if you have a system that for example generates weekly reports, and you want to archive these, you may find it easier to get the system to email the reports to the SWORD script than to develop a SWORD deposit interface to the system.</li>
<li><em>n</em> for the price of 1: As far as I know, none of the major SWORD enabled repository platforms has an email deposit facility. This script shows one aspect of the real power of interoperability and SWORD&#8217;s part in creating an interoperable environment. Rather than developing an email deposit facility for just one repository, I have developed one for any SWORD compliant repository.</li>
<li>Extensions to the script: The following idea came up in a conversation with Kim Shepherd, the <a rel="nofollow" target="_blank" href="http://www.lconz.ac.nz/">LCoNZ</a> DSpace programmer who suggested that the &#8216;local part&#8217; of an email address could be used to set further options. Email addresses can have extra tags applied following a &#8216;+&#8217; or &#8216;-&#8217; (for example username-XYZ@example.com). So if for example you wanted your users to be able to choose which collection in the repository they wanted to deposit an item into, they could change the deposit email to something like deposit+datasets@example.com or deposit-chemistrylearningobjects@example.com. These emails would all end up in the same mailbox, but the script could process them differently. Or of course the parameters could be used to set other options.</li>
</ul>
<p>If I get time, my next extension to the PHP SWORD library will be a basic web client (similar to <a rel="nofollow" target="_blank" href="http://client.swordapp.org/">http://client.swordapp.org/</a> except written in <a rel="nofollow" target="_blank" href="http://www.php.net/">PHP</a>, and will create packages from files for you). If you have any other suggestions, please leave a comment!</p>]]></content:encoded>
      </item>
      <item>
         <title>RIRI Year Two Redux</title>
         <link>http://feedproxy.google.com/~r/MediashelfBlog/~3/bBznH1dvtVo/</link>
         <description>This week the University of Prince Edward Island hosted the second installment of Red Island Repository Institute (RIRI). Participants came from North America and Europe to get a week-long intensive immersion in all things Fedora. The institute is organized by Mark Leggott and the team at UPEI who created Islandora, a plugin that integrates Fedora [...]</description>
         <guid isPermaLink="false">http://yourmediashelf.com/blog/?p=174</guid>
         <pubDate>Sat, 25 Jul 2009 09:50:55 -0700</pubDate>
         <content:encoded><![CDATA[<p>This week the University of Prince Edward Island hosted the second installment of <a rel="nofollow" title="Red Island Repository Institute" target="_blank" href="http://vre2.upei.ca/riri">Red Island Repository Institute</a> (RIRI). Participants came from North America and Europe to get a week-long intensive immersion in all things Fedora. The institute is organized by Mark Leggott and the team at UPEI who created <a rel="nofollow" title="Islandora" target="_blank" href="http://vre.upei.ca/dev/islandora">Islandora</a>, a plugin that integrates Fedora into Drupal. Thorny Staples (Product Director for Fedora Commons), Chris Wilper (Technical Lead for Fedora Commons), and I were the primary instructors.</p>
<p>Due to budget cuts across the board, there were less people in attendance at RIRI this year. Over the next few years, broad community collaboration is certain to involve much more remote interaction online and less face-to-face exchange at large conferences. I have a hunch that this will actually be beneficial for the growth of technical communities as long as it doesn&#8217;t last too long or become too restrictive. After all, there is something potent about the ethos of <a rel="nofollow" title="Ben O'Steen's Blog" target="_blank" href="http://oxfordrepo.blogspot.com/">Less Talk, More Code</a>. In the meantime, those of us who were able to make it to PEI had a really productive week.</p>
<p>Though structured as an intensive week-long training exercise, RIRI has taken on an element of information exchange between all of the participants. The predominant flow of information was, as you would expect, from the three instructors to the 15 attendees, but I think that Thorny and Chris will agree with me that we have all picked up some good information and ideas along the way. If anything, the institute has started to migrate in the direction of that quasi-mythical ideal conference where you get the conference-type info about who is doing what in your field while also getting your hands dirty with workshops and in-depth detail and discussion of how they are doing it and how you might build on their efforts.</p>
<p><strong>Learning Curve</strong></p>
<p>Those of us who were attending RIRI for the second time were of the consensus that this year had a much more gratifying learning curve with a sufficient continuity carrying through all five days. This was predominantly thanks to how much Islandora and ActiveFedora have matured in the past twelve months. We had everyone up and running, writing and running code against Fedora, in the multiple hands-on sessions through the week.</p>
<p>In contrast with last year, where we spent quite a bit of time on installing and configuring Fedora, this year we were able to use a <a rel="nofollow" title="VirtualBox" target="_blank" href="http://www.virtualbox.org/">VirtualBox</a> that the Islandora team set up. The VirtualBox has a debian image with Fedora, Islandora and Drupal pre-installed. On Tuesday I installed Ruby, ActiveFedora and Solr on there too. By Wednesday, we were able to pass around a couple of USB sticks and have everyone running identical development environments for the hands-on sessions.</p>
<p>I&#8217;m sure we will soon post the VirtualBox image for anyone to download. Next year, everyone will probably have it running on their laptops before they even arrive on the Island.</p>
<p><strong>Islandora</strong></p>
<p>The <a rel="nofollow" title="Islandora" target="_blank" href="http://vre.upei.ca/dev/islandora">Islandora</a> team at UPEI have been very busy setting up Fedora-based Virtual Research Environments for the variety of scholars at their University. It seems that once they had hammered out the module that integrates Fedora into Drupal, they fearlessly dove into spinning off deployments for numerous disciplines and projects. I was thoroughly impressed by how many VREs they have set up, the diversity of content they are collecting, and the range of tools that they are setting up to manage, display, and operate on that content. Four years ago, I looked at Fedora and saw a tidal wave of potential uses behind it. That wave has finally begun to break and it is as exciting as we had imagined.</p>
<p><strong>ActiveFedora</strong></p>
<p>We used <a rel="nofollow" title="ActiveFedora" target="_blank" href="http://yourmediashelf.com/activefedora">ActiveFedora</a> as the basis for the hands-on sections of the institute. On Wednesday we basically ran through the console tour that you can find on the ActiveFedora project site. In the developer breakout on Thursday, we defined some ActiveFedora models in a rails app, created some Fedora objects based on those models, and explored their RDF relationships.</p>
<p>&#8220;That really makes sense.&#8221; seemed like the main response to ActiveFedora. I can&#8217;t think of a better confirmation that designing the Domain Specific Language (DSL) for ActiveFedora&#8217;s models was worth it.</p>
<p>As part of the workshops, I also ran through the improvements I have planned for <a rel="nofollow" title="ActiveFedora 1.1 Roadmap" target="_blank" href="http://projects.mediashelf.us/versions/show/7">ActiveFedora version 1.1</a>.</p>
<p><strong>Hydra</strong></p>
<p>Thorny gave a really great presentation on the background, goals, and status of the <a rel="nofollow" title="Hydra Project" target="_blank" href="http://www.fedora-commons.org/confluence/display/hydra/The+Hydra+Project">Hydra project</a>. Hydra and Islandora stand as wonderful complements to each other. Islandora, written in PHP, takes Drupal as the starting point from which it reaches into the wide open space of a Fedora repository. Hydra takes nearly the opposite approach. Hydra does not assume an overarching system as its operating context. Instead it builds outwards from ActiveFedora, which in turn builds upon Fedora&#8217;s internal flexibilities and strengths. Where Islandora functions as a component that you plug into Drupal, Hydra apps are free-standing solutions that in turn rely on a core whose functionality can be integrated into potentially any system. One is a <a rel="nofollow" title="wikipedia: stalactite" target="_blank" href="http://en.wikipedia.org/wiki/Stalactite">stalactite</a>, the other is a <a rel="nofollow" title="wikipedia: stalagmite" target="_blank" href="http://en.wikipedia.org/wiki/Stalagmite">stalagmite</a>. Both have grown naturally out of real-world needs.</p>
<p><strong>FeSL</strong></p>
<p>Projects seeking to adopt Fedora right now have great options before them. Different technologies will suit different projects while the overall vision and best practices pollinate across solutions. The best example of this cross pollination is the FeSL Project, which Islandora and Hydra are both participating in along with MediaShelf and DuraSpace. This effort will result in a complete replacement of Fedora&#8217;s security implementation so that it can be used more effectively and more flexibly by any of the client applications we write &#8212; Hydra, ActiveFedora, Islandora, Muradora, or otherwise. We are still seeking additional community funding for FeSL. For more info, see the <a rel="nofollow" title="FeSL on Fedora Wiki" target="_blank" href="https://fedora-commons.org/confluence/display/DEV/Improved+Fedora+Security+Layer">Fedora Enhanced Security Layer pag</a>e on the Fedora wiki.</p>
<p><strong>Workflow</strong></p>
<p>One of the more exciting themes for me this year was workflow for Fedora Repositories. I&#8217;ve been actively interested in the topic since Richard Green and Chris Awre presented RepoMMan at OpenRepositories in 2007. Despite my interest, I have tread softly in that realm because the topic, and particularly technologies like BPEL, have always seemed dangerously askew from the actual problem(s) at hand. I&#8217;ve always felt that we were framing the entire problem incorrectly. In the past nine months, that exact line of thinking has come to a head, leading people like the Hydra project (including the former RepoMMan team) to explore a variety of approaches. This resulted in a well-received presentation at OpenRepositories this year in Atlanta, which Thorny presented again for the RIRI participants this week.</p>
<p>The topic of workflow finally congealed for me in an actionable way here at RIRI when Mark Leggott started talking about supporting <a rel="nofollow" title="Taverna Workflow Project" target="_blank" href="http://taverna.sourceforge.net/">Taverna</a> in the Islandora VREs. Taverna is one of a handful of &#8220;workflow engines&#8221; designed to allow scientists to chain together batches of computational operations that they want to perform on data in their labs. From an architectural perspective, Taverna isn&#8217;t all that different from BPEL tools. The difference is that Taverna and its ilk are designed with the assumption that the scientist who owns the data will create the workflows. Consider this in contrast to the idea of a developer or repository manager creating workflows on behalf of the end users. This simple assumption, that workflows will be constructed and run by the people who own/create the underlying content, leads the technology down a very different path of development. Until now, we have mostly thought of workflows as services that a repository provides for end users. It&#8217;s time for us to flip that around and instead think of ways to expose repositories and their supporting services as nodes that end users can tie into their own workflows as they see fit. This approach has the ring of good engineering. It&#8217;s a simpler, loosely-coupled, user-centric solution to the problem. It has the additional benefit of putting the repository engineer alongside the content owner/creator while they both create and share workflows to perform their corresponding tasks.</p>
<p><strong>Solutions Integration Community</strong></p>
<p>RIRI has also brought focus back onto the utility of the Solutions Integration Community that we have been gradually building this past year. I&#8217;ve now set up a <a rel="nofollow" title="Fedora Solutions Google Group" target="_blank" href="http://groups.google.com/group/fedora-solutions">fedora-solutions google group</a>. Join up if you want to be a part of the stuff described on the <a rel="nofollow" target="_blank" href="http://www.fedora-commons.org/confluence/display/FCCWG/Solutions+Integration">solutions integration community page</a> on the Fedora wiki.</p>
<p><strong>A Great Year to Come</strong></p>
<p>Over the next 12 months we are going to see a brilliant array of advances around Fedora Repositories. MediaShelf will certainly touch many parts of those advances. We&#8217;re here to help you make your mark in this space, and we&#8217;re here to make Fedora work for your users. <a rel="nofollow" title="Contact MediaShelf" target="_blank" href="http://yourmediashelf.com/blog/contact-us/">Contact us</a> if you want to put MediaShelf to work on your repository efforts.</p>
<img src="http://feeds.feedburner.com/~r/MediashelfBlog/~4/bBznH1dvtVo" height="1" width="1"/>]]></content:encoded>
      </item>
      <item>
         <title>Wookie Accepted into Apache Incubator</title>
         <link>http://www.cetis.ac.uk/members/scott/blogview?entry=20090721125354</link>
         <description>&lt;p&gt;Last week the Wookie Widget Server developed by our team based at the University of Bolton was accepted into the Apache Incubator. You can find quite a few posts on this blog about Wookie; but the basic idea is that this is a server that provides support for W3C Widgets and related technologies (e.g. Google Wave Gadgets) in a way that can be easily integrated into other applications, including Moodle, LAMS, Wordpress, and Elgg.
&lt;/p&gt; &lt;p&gt;We're now working on moving all the source code and sorting out the legal and licensing side of things, but this shouldn't take long; the &lt;a rel="nofollow" target="_blank" href="http://incubator.apache.org/projects/wookie.html"&gt;incubator project page&lt;/a&gt; is now up, so you can keep an eye on our progress there.&lt;/p&gt; &lt;p&gt;Once this stage is completed, we'll be working to ensure that Wookie is sustainable with a broad group of committers from different organisations. We've already had quite a few people interested in contributing which is very encouraging. If you're interested, there is a mailing list set up for developers; to subscribe to the list, send a message to wookie-dev-subscribe@incubator.apache.org&lt;/p&gt; &lt;p&gt;So exciting times ahead for Paul, Kris and I working on Wookie in the Apache Incubator!&lt;/p&gt; &lt;p&gt;Thanks to everyone who has supported Wookie - especially the TenCompetence EU Project that funded the original development work, and also everyone who participated in the CETIS Widgets Working Group, which proved very helpful in informing the direction of the project, and to the folks at the Palette project (Alain, Stephane and Jerome) and the W3C Webapps group.&lt;/p&gt; &lt;p&gt;A special thanks to Ross at OSSWatch for helping out in many different ways, from helping us to develop the business case at the university to the actual Apache Incubator submission process. See also the post by Ross on the &lt;a rel="nofollow" target="_blank" href="http://osswatch.jiscinvolve.org/2009/07/17/wookie-accepted-into-apache-incubator/"&gt;OSSWatch blog&lt;/a&gt;&lt;/p&gt;</description>
         <author>Scott Wilson</author>
         <guid isPermaLink="false">tag:cetis.ac.uk,2009-07-21:20090721125354</guid>
         <pubDate>Tue, 21 Jul 2009 05:53:54 -0700</pubDate>
      </item>
      <item>
         <title>Direct from MS Word to DSpace via SWORD</title>
         <link>http://feedproxy.google.com/~r/stuartlewis/~3/mUx976tduIE/</link>
         <description>As a member of the SWORD project, it has been a great seeing Microsoft&amp;#8217;s External Research group integrate SWORD into Word 2007, their Zentity repository, and their online journal hosting system. There is a good overview of this work in a presentation given by Pablo Fernicola at the Open Repositories 2009 conference entitled &amp;#8216;Connecting Authors and Repositories [...]</description>
         <guid isPermaLink="false">http://blog.stuartlewis.com/?p=379</guid>
         <pubDate>Sat, 04 Jul 2009 01:23:12 -0700</pubDate>
         <content:encoded><![CDATA[<p>As a member of the <a rel="nofollow" target="_blank" href="http://www.swordapp.org/">SWORD project</a>, it has been a great seeing <a rel="nofollow" target="_blank" href="http://research.microsoft.com/en-us/collaboration/about/default.aspx">Microsoft&#8217;s External Research group</a> integrate SWORD into <a rel="nofollow" target="_blank" href="http://www.microsoft.com/word/">Word 2007</a>, their <a rel="nofollow" target="_blank" href="http://research.microsoft.com/en-us/projects/zentity/">Zentity</a> repository, and their <a rel="nofollow" target="_blank" href="http://journal.mssandbox.net/">online journal hosting system</a>. There is a good overview of this work in a presentation given by <a rel="nofollow" target="_blank" href="http://research.microsoft.com/en-us/people/pablofe/">Pablo Fernicola</a> at the <a rel="nofollow" target="_blank" href="http://or09.library.gatech.edu/">Open Repositories 2009</a> conference entitled &#8216;<a rel="nofollow" target="_blank" href="http://research.microsoft.com/en-us/um/redmond/projects/authoring/090518-pablofe-connecting%20authors%20and%20repositories%20through%20sword.pptx">Connecting Authors and Repositories Through SWORD</a>&#8216;.</p>
<p>This blog post is about the functionality I have added to <a rel="nofollow" target="_blank" href="http://www.dspace.org/">DSpace</a> to allow it to accept deposits from within Microsoft Word using SWORD.</p>
<p>If you are unaware of the authoring add-in, then before reading the rest of this blog, take a look at Pablo&#8217;s YouTube video &#8216;Integrating with repositories and journal submissions&#8217; at <a rel="nofollow" target="_blank" href="http://www.youtube.com/watch?v=2_M2gfUyVzU">http://www.youtube.com/watch?v=2_M2gfUyVzU</a>. The video explains the authoring add-in, so I&#8217;ll not duplicate that information in this blog post. The rest of this post explains how I extended DSpace to work with the add-in&#8230;</p>
<p>In order for DSpace to be able to ingest a package, it needs an ingester that understands the format and knows how to unpack it and extract the metadata and file(s). In the case of .docx files created by Microsoft Word, it needs to know how to extract the metadata from within the file, and to archive the file as-is. This is a pretty easy task as a .docx file is actually just a zip file (try renaming it from .docx to .zip and then take a peek inside!). So I wrote an ingester than unzips the file, extracts the <a rel="nofollow" target="_blank" href="http://www.nlm.nih.gov/tsd/cataloging/metafilenew.html">NLM metadata</a> that the add-in inserted in the file, and then creates a new DSpace item with that metadata. Finally it adds the complete .docx file as a bitstream for people to download.</p>
<p>Some of the metadata such as the authors identities are held in the .docx file is held in the customXml/item*.xml files, and other parts such as the article title and abstract are held in the actual document contents in word/document.xml. The ingester extracts these values for use in the new DSpace item.</p>
<pre>
&lt;w:t&gt;Add an S to Microsoft Word and you get SWORD&lt;/w:t&gt;
</pre>
<pre>
&lt;my:name.&gt;
&lt;my:name.content-type.datatypeattribute.attribute.&gt;&lt;/my:name.content-type.datatypeattribute.attribute.&gt;
&lt;my:name.name-style.datatypeattribute.attribute.&gt;&lt;/my:name.name-style.datatypeattribute.attribute.&gt;
&lt;my:surname.&gt;Lewis&lt;/my:surname.&gt;
&lt;my:given-names.&gt;Stuart&lt;/my:given-names.&gt;
&lt;/my:name.&gt;
</pre>
<p>I then configured the DSpace ingesters to use the docx ingester when it encountered .docx files:</p>
<blockquote><p>plugin.named.org.dspace.content.packager.PackageIngester = &#92;<br />
org.dspace.content.packager.PDFPackager = Adobe PDF, PDF, &#92;<br />
org.dspace.content.packager.DSpaceMETSIngester = METS, &#92;<br />
org.dspace.content.packager.DSpaceDocxIngester = DOCX</p></blockquote>
<p>I then configured the SWORD package to expose the fact that it supported .docx files in its SWORD service document:</p>
<blockquote><p>sword.accept-packaging.Docx.identifier = application/vnd.openxmlformats-officedocument.wordprocessingml.document<br />
sword.accept-packaging.Docx.q = 1.0</p></blockquote>
<p>Finally the DSpace SWORD interface needed to know which packager to use for .docx files based on their MIME type:</p>
<blockquote><p>plugin.named.org.dspace.sword.SWORDIngester = &#92;<br />
org.dspace.sword.SWORDMETSIngester = http://purl.org/net/sword-types/METSDSpaceSIP &#92;<br />
org.dspace.sword.SimpleFileIngester = SimpleFileIngester &#92;<br />
org.dspace.sword.DocxIngester = application/vnd.openxmlformats-officedocument.wordprocessingml.document</p></blockquote>
<p>All that is needed to use this is a copy of the authoring add-in (<a rel="nofollow" target="_blank" href="http://research.microsoft.com/en-us/projects/authoring/">http://research.microsoft.com/en-us/projects/authoring/</a>), and a suitable formatted template for the repository that you wish to deposit the document into (<a rel="nofollow" target="_blank" href="http://blog.stuartlewis.com/wp-content/uploads/2009/06/dspace-swordapp-org.docx">dspace-swordapp-org.docx</a>). The template is preconfigured to deposit directly into the <a rel="nofollow" target="_blank" href="http://dspace.swordapp.org/">DSpace SWORD demo repository</a> which I have upgraded with the new code to accept .docx deposits. Feel free to create an account in that repository, install the add-in, load the template, and try out a deposit!</p>
<p>This complete end to end process allows you to create Word templates, and to mark them up with required and optional fields. It also allows you to embed details of the SWORD deposit repository URL (so the users do not need to know what it is) within the template for easy deposit. This could be used for example for a journal editor to provide a template <em>and</em> a deposit location for new paper submissions all-in-one. And this use case could be extended: for example if a faculty member wants all their students to submit an assignment with a template, they could do so and use the repository as the end point rather than a traditional VLE. And unlike a VLE, the repository will probably provide search and indexing facilities across the deposited documents. I&#8217;m sure as this tool gets used more, there will be a lot of new ideas for how it can be used.</p>
<p>Comments welcome! <img src='http://blog.stuartlewis.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley'/> </p>]]></content:encoded>
      <enclosure url="http://research.microsoft.com/en-us/um/redmond/projects/authoring/090518-pablofe-connecting%20authors%20and%20repositories%20through%20sword.pptx" length="70525" type="application/vnd.openxmlformats-officedocument.presentationml.presentation" /><media:content url="http://research.microsoft.com/en-us/um/redmond/projects/authoring/090518-pablofe-connecting%20authors%20and%20repositories%20through%20sword.pptx" fileSize="70525" type="application/vnd.openxmlformats-officedocument.presentationml.presentation" /><itunes:subtitle>As a member of the SWORD project, it has been a great seeing Microsoft&amp;#8217;s External Research group integrate SWORD into Word 2007, their Zentity repository, and their online journal hosting system. There is a good overview of this work in a presentati</itunes:subtitle><itunes:summary>As a member of the SWORD project, it has been a great seeing Microsoft&amp;#8217;s External Research group integrate SWORD into Word 2007, their Zentity repository, and their online journal hosting system. There is a good overview of this work in a presentation given by Pablo Fernicola at the Open Repositories 2009 conference entitled &amp;#8216;Connecting Authors and Repositories [...]</itunes:summary></item>
      <item>
         <title>I2: Survey</title>
         <link>http://feedproxy.google.com/~r/technosophia/~3/9AGGicyXiVs/</link>
         <description>[Series]
Near the end of my strawman post, I wrote:
The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week. It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey. We completed [...]</description>
         <guid isPermaLink="false">http://lackoftalent.org/michael/blog/?p=363</guid>
         <pubDate>Sat, 20 Jun 2009 11:30:53 -0700</pubDate>
         <content:encoded><![CDATA[
<p>[<a rel="nofollow" target="_blank" href="http://lackoftalent.org/michael/blog/category/niso-i2/">Series</a>]</p>
<p>Near the end of my <a rel="nofollow" target="_blank" href="http://lackoftalent.org/michael/blog/2009/06/13/i2-strawman/">strawman post</a>, I wrote:<br />
<blockquote>The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week. It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey. </p></blockquote>
<p>We completed the survey late last week and began distributing it. Here&#039;s what we sent out:<br />
<blockquote>
<p>The NISO I2 Working Group is surveying repository managers to determine the current practices and needs of the repository community regarding institutional identifiers. We value your time and your input in the process to create a standard for a new institutional identifier. We hope that you will complete the survey which should take less than 15 minutes. The survey will remain open through Monday, July 6th.</p>
<p>
Here is a link to the survey:<br />
<a rel="nofollow" target="_blank" href="http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d">http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d</a></p>
<p>
Please feel free to share this message with other interested parties.</p>
</blockquote>
<p>First we used <a rel="nofollow" target="_blank" href="http://www.surveymonkey.com/">Survey Monkey</a> to send the survey link to approximately one-hundred repository managers that the subgroup identified. Our process for identifying repository managers involved pulling together a list of prominent repositories from subgroup members, and then gathering more from <a rel="nofollow" target="_blank" href="http://www.opendoar.org/">OpenDOAR</a>, &#034;an authoritative directory of academic open access repositories.&#034; Then subgroup members were encouraged to share the survey link with colleagues, and post it far and wide via blogs, listservs, and <a rel="nofollow" target="_blank" href="http://twitter.com/mjgiarlo/status/2230486784">tweets</a>. The listservs we targeted were: <a rel="nofollow" target="_blank" href="http://www.jiscmail.ac.uk/archives/jisc-repositories.html">JISC-REPOSITORIES</a>, <a rel="nofollow" target="_blank" href="http://metadatalibrarians.monarchos.com/">metadataLibrarians</a>, <a rel="nofollow" target="_blank" href="http://groups.google.com/group/digital-curation">digital-curation</a>, <a rel="nofollow" target="_blank" href="https://arl.org/Lists/SPARC-IR/">SPARC-IR</a>, <a rel="nofollow" target="_blank" href="http://mailman.anu.edu.au/mailman/listinfo/ir-net">ir-net</a>, <a rel="nofollow" target="_blank" href="http://www.lsoft.com/SCRIPTS/WL.EXE?SL1=REPOMAN-L&#038;H=LISTSERV.INDIANA.EDU">REPOMAN-L</a>, <a rel="nofollow" target="_blank" href="http://larch.palinet.org/archives/palinet-ir-l.html">PALINET-IR-L</a>, <a rel="nofollow" target="_blank" href="http://mailman.mit.edu/mailman/listinfo/dspace-general">dspace-general</a>, <a rel="nofollow" target="_blank" href="https://lists.sourceforge.net/lists/listinfo/fedora-commons-users">fedora-commons-users</a>, <a rel="nofollow" target="_blank" href="http://dublincore.org/groups/identifiers/">DC-IDENTIFIERS</a>, and <a rel="nofollow" target="_blank" href="http://dewey.library.nd.edu/mailing-lists/code4lib/">code4lib</a>.</p>
<p>I&#039;ve already received a few responses and have gotten useful feedback. Two of the hardest questions to answer so far have been: &#034;What is an institutional identifier?&#034; and &#034;What is a repository?&#034;</p>
<dl>
<dt><strong>Institutional identifier</strong></dt>
<dd>
<p>An institutional identifier is defined as a symbol or code that uniquely identifies an institution. Domain-specific examples of existing identifiers include SAN, IPEDS, GLN, MARC Org Code, and ISIL. Another example might be a Handle prefix or ARK name authority assigning number.</p>
</dd>
<dt><strong>Repository</strong></dt>
<dd>
<p>Institutional repositories and subject repositories like arxiv.org are clearly &#039;repositories&#039;, but beyond that it is a somewhat ill-defined term. One might look to the <a rel="nofollow" target="_blank" href="http://www.cnri.reston.va.us/k-w.html">Kahn-Wilensky architecture</a>, or the <a rel="nofollow" target="_blank" href="http://public.ccsds.org/publications/archive/650x0b1.pdf">OAIS reference model (PDF)</a>, or even <a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Digital_repository">Wikipedia</a> for definitions, but it&#039;s not clear that even the authorities agree on what constitutes a repository.</p>
<p>It&#039;s a system. It&#039;s network-accessible and typically has a web interface of some sort. Files and groups of files sometimes known as objects tend to be deposited in them, perhaps for some combination of management, access, or preservation. Many run Fedora, DSpace, and ePrints, and factor heavily in scholarly communication. Some are document-centric. Some will accept anything. To some, a learning management system may be a repo. To others, a content management system may fit.</p>
<p>My background is in academia so my own definition is somewhat based in that context, but I wouldn&#039;t say the term is necessarily limited to that context. There are other NISO I2 scenarios for library workflows and electronic resources, so it&#039;s safe to assume that repository does not mean ILS or OPAC or ERP system. My hope is that folks have their own working definitions of the term and can decide for themselves what it means.</p>
</dd>
</dl>
<p>We&#039;ve given folks a little over two weeks to respond to the survey, so the constant I2 drum-beating will quiet down for a while around here. I am very interested in what sorts of responses we get from the survey. Fun times!</p>
<p>Oh, and perhaps it goes without saying, but if you&#039;re a repository owner, manager, expert, developer, or stakeholder with an interest in identifiers, please feel free to take the <a rel="nofollow" target="_blank" href="http://www.surveymonkey.com/s.aspx?sm=RGQgZ3090DVrb3kFzr3P3Q_3d_3d">survey</a>!</p>
<div class="feedflare">
<a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=9AGGicyXiVs:0SR0L0APkaQ:bcOpcFrp8Mo"><img src="http://feeds.feedburner.com/~ff/technosophia?d=bcOpcFrp8Mo" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=9AGGicyXiVs:0SR0L0APkaQ:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/technosophia?d=dnMXMwOfBR0" border="0"></a> <a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/technosophia?a=9AGGicyXiVs:0SR0L0APkaQ:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/technosophia?i=9AGGicyXiVs:0SR0L0APkaQ:D7DqB2pKExk" border="0"></a>
</div><img src="http://feeds.feedburner.com/~r/technosophia/~4/9AGGicyXiVs" height="1" width="1"/>]]></content:encoded>
      <enclosure url="http://public.ccsds.org/publications/archive/650x0b1.pdf" length="654750" type="application/pdf" /><media:content url="http://public.ccsds.org/publications/archive/650x0b1.pdf" fileSize="654750" type="application/pdf" /><itunes:subtitle>[Series] Near the end of my strawman post, I wrote: The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week. It will be interesting to see if the requirements we have thus far identified still obtain in light</itunes:subtitle><itunes:summary>[Series] Near the end of my strawman post, I wrote: The I2 repositories subgroup will be sending out its survey on identifier use cases in the coming week. It will be interesting to see if the requirements we have thus far identified still obtain in light of the data we collect from the survey. We completed [...]</itunes:summary></item>
      <item>
         <title>UK open data, open government</title>
         <link>http://feedproxy.google.com/~r/ScienceLibraryPad/~3/Mcd6S2Ic8BA/uk-open-data-open-government.html</link>
         <description>&lt;div&gt;&lt;p&gt;I was sorely tempted to title this "would uk like some data, guv?"&lt;/p&gt;&lt;p&gt;The UK government is picking up the challenges issued in the excellent &lt;a rel="nofollow" target="_blank" href="http://scilib.typepad.com/science_library_pad/2009/02/uk-power-of-information-taskforce-report-beta.html"&gt;Power of Information Taskforce report&lt;/a&gt;.&lt;/p&gt;&lt;p&gt;Via &lt;a rel="nofollow" target="_blank" href="http://friendfeed.com/andypowe11/8f56a751/open-government-data"&gt;Andy Powell&lt;/a&gt; in my FriendFeed, I find a &lt;cite&gt;Guardian&lt;/cite&gt; article &lt;a rel="nofollow" target="_blank" href="http://www.guardian.co.uk/technology/2009/jun/04/free-our-data"&gt;Free our data: UK set to follow successful US data method&lt;/a&gt; &lt;/p&gt;&lt;blockquote&gt;&lt;p&gt; Now the UK government has picked up on the idea, and in a post on the Cabinet Office blog Richard Stirling is &lt;a rel="nofollow" target="_blank" href="http://blogs.cabinetoffice.gov.uk/digitalengagement/post/2009/05/22/Information-and-how-to-make-it-useful.aspx"&gt;asking the British public how a UK version of the US site should be implemented&lt;/a&gt;. "What characteristics would be most useful to you - feeds (ATOM or RSS) or bulk download by FTP?," he asks. "Should this be an index or a repository? Should this serve particular types of data eg XML, JSON or RDF?" &lt;/p&gt;&lt;p&gt; Although there is a list of dozens of the &lt;a rel="nofollow" target="_blank" href="http://wiki.rewiredstate.org/?page=APIs"&gt;UK government's published data sources&lt;/a&gt; there is no clear pan-governmental approach to making data available. The proposal has been received with pleasure by a number of web developers and would-be data users, although it is not clear how free people would be to use the data commercially. &lt;/p&gt;&lt;/blockquote&gt;&lt;p&gt; Richard Stirling is writing in the UK Cabinet Office Digital Engagement blog&lt;/p&gt;&lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://blogs.cabinetoffice.gov.uk/digitalengagement/"&gt;http://blogs.cabinetoffice.gov.uk/digitalengagement/&lt;/a&gt; &lt;/p&gt;&lt;p&gt;At some point I will no longer be saying things like "yes, that's an official gov.uk blog" but... well, it is. &lt;/p&gt;&lt;p&gt; The four themes they list on their &lt;a rel="nofollow" target="_blank" href="http://blogs.cabinetoffice.gov.uk/digitalengagement/page/About.aspx"&gt;about&lt;/a&gt; page: open information, open feedback, open conversation, open innovation.&lt;/p&gt;&lt;p&gt;A more extensive extract of what Richard Stirling asks in his posting &lt;a rel="nofollow" target="_blank" href="http://blogs.cabinetoffice.gov.uk/digitalengagement/post/2009/05/22/Information-and-how-to-make-it-useful.aspx"&gt;Information and how to make it useful&lt;/a&gt; :&lt;/p&gt;&lt;blockquote&gt;&lt;p&gt; Any solution must support open standards and would ideally be open source, but there are a couple of other questions we are pondering at the moment: &lt;/p&gt;&lt;ul&gt; &lt;li&gt; What characteristics would be most useful to you – feeds (ATOM or RSS) or bulk download by e.g. FTP, etc?&lt;/li&gt; &lt;li&gt; Should this be an index or a repository?&lt;/li&gt; &lt;li&gt; Should this serve particular types of data e.g. XML, JSON or RDF?&lt;/li&gt; &lt;li&gt; What examples should we be looking at (beyond data.gov e.g.http://ideas.welcomebackstage.com/data)?&lt;/li&gt; &lt;li&gt; Does this need its own domain, or should it sit on an existing supersite (e.g. http://direct.gov.uk)&lt;/li&gt; &lt;/ul&gt; &lt;/blockquote&gt; &lt;p&gt;There are already 19 substantive comments, and he indicates they are also monitoring Twitter for the hashtags &lt;a rel="nofollow" target="_blank" href="http://search.twitter.com/search?q=%23poit"&gt;#poit&lt;/a&gt; (Power of Information Taskforce) and &lt;a rel="nofollow" target="_blank" href="http://search.twitter.com/search?q=%23opendata"&gt;#opendata&lt;/a&gt;&lt;br&gt; &lt;/p&gt;&lt;p&gt;There is a new Director of Digital Engagement, Andrew Stott, according to his official Twitter feed, &lt;a rel="nofollow" target="_blank" href="http://twitter.com/dirdigeng"&gt;@DirDigEng&lt;/a&gt; , he was scheduled to start in his position yesterday. &lt;/p&gt;&lt;p&gt; Sometimes I feel like a certain country often considered to be between the UK and the US is missing out on this whole official open data, blogging, twitter thing... &lt;/p&gt;&lt;p&gt; If anyone were to want someone to start blogging officially about government open data in a certain northern neighbour of the US, I am available...&lt;/p&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=yIl2AUoC8zA" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:F7zBnMyn0Lo"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?i=Mcd6S2Ic8BA:_kmCJo89dWU:F7zBnMyn0Lo" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:qj6IDK7rITs"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=qj6IDK7rITs" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:EpLpB3ZkKWg"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=EpLpB3ZkKWg" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:V-t1I-SPZMU"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=V-t1I-SPZMU" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:XAVGb8Xj5zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=XAVGb8Xj5zA" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:dnMXMwOfBR0"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=dnMXMwOfBR0" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=Mcd6S2Ic8BA:_kmCJo89dWU:V_sGLiPBpWU"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?i=Mcd6S2Ic8BA:_kmCJo89dWU:V_sGLiPBpWU" border="0"&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/ScienceLibraryPad/~4/Mcd6S2Ic8BA" height="1" width="1"/&gt;</description>
         <author>Richard Akerman</author>
         <guid isPermaLink="false">tag:typepad.com,2003:post-67650285</guid>
         <pubDate>Thu, 04 Jun 2009 12:47:26 -0700</pubDate>
      </item>
      <item>
         <title>IMS's three-pronged strategy</title>
         <link>http://www.cetis.ac.uk/members/scott/blogview?entry=20090525100440</link>
         <description>&lt;p&gt;Two weeks ago I attended the IMS Learning Impact conference in Barcelona; this was one of the first IMS events I'd attended in some time, and I've tried to put together some thoughts on the IMS technical strategy. &lt;/p&gt; &lt;p&gt;Now a lot of discussion around IMS is about its policies, politics, membership, processes and so on, but I'm not going into any of that here; for now I want to focus on specifications. (Note also this is a personal reflection, and doesn't necessarily represent the view of JISC or CETIS)&lt;/p&gt; &lt;p&gt;IMS currently sets out its stall as offering three key "products" in its specification portfolio; together they make up its "&lt;a rel="nofollow" target="_blank" href="http://www.imsglobal.org/digitallearningservices.html"&gt;Digital Learning Services Standards&lt;/a&gt;" portfolio.&lt;/p&gt; &lt;h4&gt;Common Cartridge&lt;/h4&gt; &lt;p&gt;The first product is the &lt;a rel="nofollow" target="_blank" href="http://www.imsglobal.org/cc/index.html"&gt;IMS Common Cartridge&lt;/a&gt;. I remember this originating in discussions way back as to whether there should be a SCORM-like profile of standards (packaging, metadata, runtime and so on) that was better suited to Higher Education. Its come a long way since, and has gathered a lot of influential support. However, ultimately it is still a means of putting a bunch of web pages into a zip file to import into an LMS, which seems increasingly an odd thing to do. &lt;/p&gt; &lt;p&gt;CC does add some interesting capabilities - it adds the QTI (Question and Test Interoperability) specification to the profile, enabling automated assessments along with content (something ADL was considering for SCORM a few years back but never got around to), and it also adds a capability for adding "tools" using a version of the IMS Learning Tools Interoperability specification (more on which later). However it misses out things like SCORM's tracking functionality and CMI runtime, which have been a major selling point of SCORM.&lt;/p&gt; &lt;p&gt;Overall CC is an odd mix (one of my colleagues referred to it as the "curate's egg" specification) but is quite likely to gain some traction with publishers and commercial LMSs. But what will the impact be? I'm not convinced a market for common cartridges will open up in HE in the same manner as occurred in commercial training with SCORM; they really are quite different. However, IMS is putting a lot of effort into marketing CC to get adoption, so I remain open minded. &lt;/p&gt; &lt;p&gt;For the moment the key question is whether OER and CC are a natural fit - clearly the UK's Open University considers it an option.&lt;/p&gt; &lt;h4&gt;Learner Information Services&lt;/h4&gt; &lt;p&gt;IMS Learner Information Services is the latest revision of the IMS Enterprise specification, which was one of the first IMS specifications and the first I was personally involved in. IMS LIS, like its predecessor, is focussed on the connection between an LMS and a student record system; this means being able to provide groups and users to the LMS and to handle reporting back for things like final grades or tracking data. LIS extends Enterprise with new services for areas like course structures (a bit similar to &lt;a rel="nofollow" target="_blank" href="http://xcri.org"&gt;XCRI&lt;/a&gt;, though not compatible with it) and continues the service-oriented approach that started with IMS Enterprise Services 1.0. &lt;/p&gt; &lt;p&gt;However, IMS LIS also plans to add batch-file bindings that would also enable REST services; this is important, as some of the most successful applications of IMS Enterprise have focussed on simple REST services and batch processing, and I'm glad to see IMS is recognising this and providing official support.&lt;/p&gt; &lt;h4&gt;Learning Tools Interoperability&lt;/h4&gt; &lt;p&gt;IMS LTI v2.0 is a specification for enabling LMS's to include external applications running in iFrames that can communicate with the host LMS for things like user information. (There isn't a page for v2.0 yet, but there is a description of LTI &lt;a rel="nofollow" target="_blank" href="http://www.imsglobal.org/activities.html"&gt;on this page&lt;/a&gt;)&lt;/p&gt; &lt;p&gt;If you've been following this blog you'll know that I've been contributing to the W3C Widgets familiy of specifications, and have been keen to point out that the IMS specification is rather similar, begging the question as to why IMS is bothering to reinvent this particular wheel. Part of the reason is historical - IMS LTI started earlier than W3C's activity (although that itself is based on earlier technologies, such as Apple Dashboard Widgets and Yahoo! Konfabulator) and so has had time to diverge from common practice. Its also harder to backtrack on the legacy of its "version 1.0" which was really an exploration of a possible common extension mechanism for (back then) WebCT and Blackboard (and later Sakai). Another part of the reason is that many IMS members are not really that well connected into web standards generally, and even though I personally prompted the IMS working group to look into Widgets they never really managed to connect it with what they were working on.&lt;/p&gt; &lt;p&gt;I think IMS is putting a lot of pressure on LTI to succeed, although I think its fundamentally misconceived. For example, the current IMS LTI 2.0 document set consists of 20 word documents covering everything from inter-widget messaging to APIs for passing "outcomes" from widgets to the LMS - most of it involving lots of SOAP and WSDL. Using the W3C Widgets specification I reckon this could be whittled down to one or two very basic documents for things like common education vocabularies and how to use Widgets with an open API on the LMS end (or more likely, just IMS LIS with a REST binding). Already I've seen a lot of European projects working with W3C Widgets and Google OpenSocial to deliver IMS-LTI-like functionality; we've also been working with Sakai 3 and Moodle to integrate both Widgets and OpenSocial applications. Perhaps the main value IMS could contribute would be to sort out the REST APIs that Widgets could call to enable tracking.&lt;/p&gt; &lt;p&gt;Its unlikely that IMS would embrace W3C for LTI for a number of reasons, mostly that the whole approach doesn't just invalidate most of the effort on IMS LTI 2.0, it also calls into question the whole "DIgital Learning Services" strategy: if we used web standards, plus a REST API for things like cohorts, would we even need IMS Common Cartridge or IMS LTI &lt;em&gt;at all&lt;/em&gt;? From this perspective, the IMS strategy seems less about opening up education systems so much as supporting an education technology silo into which a few suppliers can offer services with limited external competition. After all, who apart from a few established players in the HE sector is going to implement those 20 IMS LTI documents?&lt;/p&gt; &lt;h4&gt;The Missing Specifications&lt;/h4&gt; &lt;p&gt;As well as what is IN the IMS strategy, its worth considering for a moment what's OUT of the strategy:&lt;/p&gt; &lt;p&gt;&lt;b&gt;IMS Learning Design&lt;/b&gt; is one of the most widely discussed specifications in a European context (or certainly seems to be in the circles I move in) and yet is completely absent. It had some discussion in relation to a K-12 profile of Common Cartridge, but I don't think this progressed anywhere. Its especially unusual that IMS isn't promoting any further development or marketing for LD, given that quite a few of the entrants to its own Learning Impact awards were using LD.&lt;/p&gt; &lt;p&gt;&lt;b&gt;IMS ePortfolio&lt;/b&gt; was a relatively recent specification, but was released with some fundamental problems. So far there has been insufficient interest from members to fix it, and CETIS has switched its efforts to looking into practical interoperability between ePortfolio applications using Atom feeds (see &lt;a rel="nofollow" target="_blank" href="http://wiki.cetis.ac.uk/Portfolio_interoperability_prototyping"&gt;PIOP&lt;/a&gt;).&lt;/p&gt; &lt;p&gt;&lt;b&gt;IMS QTI 2.x&lt;/b&gt; is largely completed, but was recently withdrawn by IMS and then reinstated after a number of complaints. I think its fair to say IMS doesn't really know what to do with it, given that IMS CC uses the older 1.x version of QTI.&lt;/p&gt; &lt;p&gt;Finally, IMS has a number of specifications that really should be retired - IMS LIP, IMS SSP, IMS RLI, IMS VDEX come to mind.&lt;/p&gt; &lt;h4&gt;The Rub?&lt;h4&gt; &lt;p&gt;Overall the IMS specifications seem oddly out of step with the wider web. There is still an adherence to SOAP and WSDL doctrine that has slid rapidly into the Trough of Despond elsewhere. Where REST APIs are considered, they are a bit of an add-on rather than at the core - IMS specifications are not based on web architecture, irrespective of binding. And, most critically, there is a big gap between IMS and web standards, as evidenced by the disconnect between IMS LTI 2.0 and W3C Widgets (or even Google OpenSocial). This was pointed out at the plenary of the event by Mark Stiles (who chairs the JISC-CETIS board as well as being on the IMS board of directors): IMS really needs to work more closely with horizontal standards rather than build a silo.&lt;/p&gt; &lt;p&gt;I think IMS LIS is worth a look when it comes out, as is Common Cartridge (though in the short term we can easily convert Cartridges to the more commonly-supported formats such as SCORM 1.2 and using the &lt;a rel="nofollow" target="_blank" href="http://blogs.cetis.ac.uk/sheilamacneill/2008/10/10/content-transcoder-demonstration/"&gt;Content Transcoder&lt;/a&gt;). However I'd give IMS LTI 2.0 a miss.&lt;/p&gt; &lt;p&gt;But more generally, is IMS too old-fashioned to remain relevant? Or is its focus on rather "unfashionable" areas of education technology a good bet for longer-term survival? A lot will depend on the success - or otherwise - of the current three-spec strategy&lt;/p&gt;&lt;/h4&gt;&lt;/h4&gt;</description>
         <author>Scott Wilson</author>
         <guid isPermaLink="false">tag:cetis.ac.uk,2009-05-25:20090525100440</guid>
         <pubDate>Sun, 24 May 2009 17:00:00 -0700</pubDate>
      </item>
      <item>
         <title>Agile Languages &amp; Fedora — Update from OR09</title>
         <link>http://feedproxy.google.com/~r/MediashelfBlog/~3/T6ptEsFdP3A/</link>
         <description>Leading up to this year's Open Repositories, it became clear that there was demand for a BOF (Birds of Feather) session focused on agile languages and Fedora. I pitched the idea in an email to a couple colleagues beforehand and then announced the BOF at my presentation on Monday morning. Rather than constricting it to Fedora, I billed it as Agile Languages and Repositories. About 30 people showed up. We all exchanged info and I set up some new collaboration email lists.</description>
         <guid isPermaLink="false">http://yourmediashelf.com/blog/?p=147</guid>
         <pubDate>Tue, 19 May 2009 22:41:42 -0700</pubDate>
         <content:encoded><![CDATA[<p>Leading up to this year&#8217;s <a rel="nofollow" title="Open Repositories 2009" target="_blank" href="https://or09.library.gatech.edu/">Open Repositories</a>, it became clear that there was demand for a <a rel="nofollow" title="wikipedia: Birds of a Feather (computing)" target="_blank" href="http://en.wikipedia.org/wiki/Birds_of_a_Feather_(computing)">BOF</a> (Birds of Feather) session focused on agile languages and Fedora. I pitched the idea in an email to a couple colleagues beforehand and then announced the BOF at <a rel="nofollow" title="Many Lightweight Views into Complex Repository Content: Enabling Rapid Application Development for Fedora Repositories" target="_blank" href="https://or09.library.gatech.edu/general114.php">my presentation</a> on Monday morning. Rather than constricting it to Fedora projects, I billed it as Agile Languages and Repositories. About 30 people showed up. The split was pretty even between Ruby, Python, and PHP developers. About a third seemed to be Java developers in the process of defecting. In addition to people doing stuff with Fedora, there were a handful of DSpace developers and possibly a couple who maintain ePrints repositories. </p>
<p>For the first half of the BOF we sat in mixed groups, eating our lunches and each talking about the work we do. We then split up by language (Ruby, Python, PHP) and discussed language-specific topics. For that second half I sat at the Ruby table where we talked about <a rel="nofollow" title="ActiveFedora" target="_blank" href="http://mediashelf.us/activefedora">ActiveFedora</a>, <a rel="nofollow" title="JRuby Home" target="_blank" href="http://jruby.codehaus.org">JRuby</a>, <a rel="nofollow" title="Jena Ruby bindings: accessing Jena&#x002019;s feature rich RDF api from Ruby" target="_blank" href="http://www.thewebsemantic.com/2008/01/05/jena-ruby-bindings-accessing-jenas-feature-rich-rdf-api-from-ruby/">RDF support for Ruby</a>, MODS support for Ruby, Solr (<a rel="nofollow" title="solr-ruby" target="_blank" href="http://wiki.apache.org/solr/solr-ruby">solr-ruby</a> and <a rel="nofollow" title="RSolr" target="_blank" href="http://github.com/mwmitchell/rsolr/tree/master">RSolr</a>), and how <a rel="nofollow" title="Blacklight OPAC Homepage" target="_blank" href="http://blacklightopac.org">Blacklight</a> fits into the mix. </p>
<p>I closed the conversation by asking if we should set up email lists for collaboration. It seemed reasonable to set up a general mailing list for the solutions community as well as a list specifically for people doing stuff with Ruby, Fedora repositories, and (most likely) ActiveFedora. I also resolved to encourage the creation of Python-oriented and PHP-oriented equivalents. For now I have created two lists on Google Groups. The first one, <a rel="nofollow" title="Fedora Commons Create Group" target="_blank" href="http://groups.google.com/group/fedora-commons-create">Fedora Commons Create</a>, is for general discourse about creating client applications for Fedora. The second, <a rel="nofollow" title="ActiveFedora Group" target="_blank" href="http://groups.google.com/group/active-fedora">ActiveFedora / Ruby + Fedora Commons</a>, is for Ruby-specific collaboration.</p>
<p>In the end, I was really pleased to realize that for the first time we had a substantial group of people interested in each of the main interpreted languages (Ruby, Python, PHP) and each group had at least one open source Fedora-based project to use as a starting point for their conversations. The Ruby group had <a rel="nofollow" title="ActiveFedora" target="_blank" href="http://mediashelf.us/activefedora">ActiveFedora</a>, the Python group had <a rel="nofollow" title="Less Talk, More Code" target="_blank" href="http://oxfordrepo.blogspot.com/">Ben O&#8217;Steen&#8217;s work</a> and <a rel="nofollow" title="Fedora and Django for an image repository: a new front-end" target="_blank" href="https://or09.library.gatech.edu/fedora69.php">Peter Herndon&#8217;s Django integration</a>, and the PHP/Drupal people had <a rel="nofollow" title="Islandora" target="_blank" href="http://vre.upei.ca/dev/islandora">Islandora</a> &amp; <a rel="nofollow" title="Fez" target="_blank" href="http://sourceforge.net/projects/fez/">Fez</a> to start from. </p>
<p>This was a comfortable step forward from <a rel="nofollow" title="Inroads to Application Development for Fedora Commons" target="_blank" href="http://yourmediashelf.com/blog/2008/05/14/inroads-to-application-development-for-fedora-commons/">the scenario as it was a year ago</a>.</p>
<table style="background-color:#fff;padding:5px;" border="0" cellspacing="0">
<tbody>
<tr>
<td><img src="http://groups.google.com/groups/img/3nb/groups_bar.gif" alt="Google Groups" width="132" height="26"/></td>
</tr>
<tr>
<td style="padding-left:5px;font-size:125%;"><strong>Fedora Commons Create</strong></td>
</tr>
<tr>
<td style="padding-left:5px;"><a rel="nofollow" target="_blank" href="http://groups.google.com/group/fedora-commons-create">Visit this group</a></td>
</tr>
</tbody>
</table>
<table style="background-color:#fff;padding:5px;" border="0" cellspacing="0">
<tbody>
<tr>
<td><img src="http://groups.google.com/groups/img/3nb/groups_bar.gif" alt="Google Groups" width="132" height="26"/></td>
</tr>
<tr>
<td style="padding-left:5px;font-size:125%;"><strong>ActiveFedora / Ruby + Fedora Commons</strong></td>
</tr>
<tr>
<td style="padding-left:5px;"><a rel="nofollow" target="_blank" href="http://groups.google.com/group/active-fedora">Visit this group</a></td>
</tr>
</tbody>
</table>
<img src="http://feeds.feedburner.com/~r/MediashelfBlog/~4/T6ptEsFdP3A" height="1" width="1"/>]]></content:encoded>
      </item>
      <item>
         <title>Auto paste of JISCBIDS spreadsheet marks.</title>
         <link>http://feedproxy.google.com/~r/DFFlanders/~3/BOSZnsoWIY4/</link>
         <description>So @cardcc suggested at #rpmeet that most everyone was using an offline template spreadsheet to mark #jiscbids and that it would be a real time saver if there was a way to auto copy and paste the offline spreadsheet in one go. Well I don&amp;#8217;t have a one go solution but I do have a [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&amp;blog=532341&amp;post=184&amp;subd=dfflanders&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://dfflanders.wordpress.com/?p=184</guid>
         <pubDate>Mon, 11 May 2009 13:52:22 -0700</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><p>So @cardcc suggested at #rpmeet that most everyone was using an offline template spreadsheet to mark #jiscbids and that it would be a real time saver if there was a way to auto copy and paste the offline spreadsheet in one go. Well I don&#8217;t have a one go solution but I do have a three step solution that I was able to piece together over the weekend (while marking bids).</p>
<p>DISCLAIMER: THIS IS ONLY A PROTOTYPE AND IS NOT THE OFFICIAL WAY TO SUBMIT MARKS FOR JISCBIDS, THIS IS ONLY AN PROTOTYPE TO SHOW HOW THIS MIGHT BE ACHIEVED. YOU STILL MUST SUBMIT BIDS USING THE CURRENT APPROVED FORM.</p>
<p>None the less, if you would like to help me tweak this form so that it could be useable in the future then please follow the below instructions:</p>
<p>1.)<a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/ccc?key=rw_g4TI9EwZ6fjZ00GXh_-A"> Download this Excel Spreadsheet</a>. This spreadsheet is the marking template. Use the template spreadsheet exactly as it is set up. If you change any of the headings rows or columns it will render the spreadsheet useless.</p>
<p>2.) Once you have completed filling out the template spreadsheet<a rel="nofollow" target="_blank" href="http://www.screencast.com/users/DFFlanders/folders/Jing/media/49858083-418a-439b-b468-0300658a35c1"> follow this screencast (w/audio) step by step</a> for uploading to GoogleDocs. The above screencast will also show you how to publish an RSS feed from the document. TO NOTE: this RSS feed must be a &#8220;cell&#8221; rss feed so the data can be sliced and diced in multiple way so that it is exposed on the backend system to multiple outputs. Eventually, I&#8217;ll enable it for just the simple url for the SS, for now please just make sure you are exposing the right RSS feed.</p>
<p>3.) Once you have the RSS feed<a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/viewform?hl=en&amp;formkey=clZMcDQ3eEZxZHlMVEt4NWtiUjVOUXc6MA.."> go to this form and fill it out</a>. Job done, well then there is a lot of back end processing that takes place to parse your spreadsheet into the various forms it needs to go to as part of the next step in the marking workflow.</p>
<p>Resources used in this work:</p>
<ul>
<li>http://docs.google.com/support/bin/answer.py?hl=en&amp;answer=75507y</li>
<li>jQuery and the joy of arrays <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley'/> </li>
</ul> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/dfflanders.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dfflanders.wordpress.com/184/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/dfflanders.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dfflanders.wordpress.com/184/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/dfflanders.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dfflanders.wordpress.com/184/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/dfflanders.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dfflanders.wordpress.com/184/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/dfflanders.wordpress.com/184/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dfflanders.wordpress.com/184/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&blog=532341&post=184&subd=dfflanders&ref=&feed=1"/></div>]]></content:encoded>
         <media:content url="http://0.gravatar.com/avatar/8cdc26501af3b7e571a4d23f0afbbf08?s=96&amp;amp;d=identicon&amp;amp;r=G" medium="image">
            <media:title>dfflanders</media:title>
         </media:content>
      </item>
      <item>
         <title>Government of Canada internal IT Innovation Campaign</title>
         <link>http://feedproxy.google.com/~r/ScienceLibraryPad/~3/iY41WW9r_9g/government-of-canada-internal-it-innovation-campaign.html</link>
         <description>&lt;div&gt;&lt;p&gt;The Government of Canada has opened what I would call an ideas market, a system to submit ideas, vote on them, and comment on them. The Campaign is internal only and for IT staff only. You can see a screenshot (by permission of the GC and the developers, &lt;a rel="nofollow" target="_blank" href="http://www.publivate.com/"&gt;Publivate&lt;/a&gt;) &lt;/p&gt; &lt;p&gt;&lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/rakerman/3505237714/" title="GC Innovation Campaign - Mozilla Firefox 2009-05-05 15824 PM - edit by rakerman, on Flickr"&gt;&lt;img alt="[GC Innovation Campaign - Mozilla Firefox 2009-05-05 15824 PM - edit]" height="416" src="http://farm4.static.flickr.com/3568/3505237714_bdb8a7b0a0.jpg" width="500"&gt;&lt;/a&gt; &lt;/p&gt;&lt;p&gt;Rather than give a URL which most of you can't access anyway, I'll just suggest that if you're in the Government of Canada, you can find more information and the link to the site by searching for "innovation campaign" on GCPEDIA (which is also accessible within GC only). The site opened for submissions of ideas May 4, and will close May 29, as you can see from the countdown bar in the upper right.&lt;/p&gt;&lt;p&gt;Like GCPEDIA itself, I consider this a great development, showing a government willing to embrace risk, try new technologies, and draw upon the expertise of the community (about 18,000 federal IT specialists, in this case). I think this is a measured approach, and I certainly would expect that if successful, it will lead to more consultations more broadly both within the government as well as ones open to all Canadian citizens.&lt;/p&gt;&lt;p&gt;I think this approach is really effective for breaking silos and circulating information - in some of the ideas already, a few things being proposed turn out to already be available, people weren't just aware of them elsewhere in the government.&lt;/p&gt;&lt;p&gt;I'm using the hashtag &lt;a rel="nofollow" target="_blank" href="http://search.twitter.com/search?q=%23gcitic"&gt;#gcitic&lt;/a&gt; to discuss and ask questions about the campaign on Twitter.&lt;/p&gt;&lt;p&gt;UPDATE 2009-05-06: I should mention another innovation aspect of this site, which is that it is the first time I've seen machine translation use in an official way. Currently all Government of Canada websites must have all text in both official languages, which is usually done through manual translation. This makes it basically impossible to have a dynamic site with constant changes. If we're allowed to use machine translation, it will make it much easier to bring up e.g. public blogs. Now that being said, the translation engine they are using is Google Translate, which is ok for a free translator but is by no means perfect. I know there's tons of work being done on machine translation at NRC and elsewhere in the government - it would be nice if there was a standard machine translation service available that we could also use... hmm... I think I'll submit an idea... ENDUPDATE&lt;/p&gt;&lt;p&gt;Google uses this approach internally, their site is called simply Google Ideas.&lt;/p&gt; &lt;p&gt;&lt;img src="http://blogoscoped.com/files/tools-presentation/google-ideas.png"&gt;&lt;br&gt;(Screenshot via blogoscoped.com - &lt;a rel="nofollow" target="_blank" href="http://blogoscoped.com/archive/2008-03-12-n39.html"&gt;The Tools Google Uses Internally&lt;/a&gt;)&lt;/p&gt;&lt;p&gt;I have seen this approach used to some extent in the library community, the example I always point to is JISC &lt;a rel="nofollow" target="_blank" href="http://jiscrepository.ideascale.com/"&gt;Repository Ideas&lt;/a&gt; (which is still up, but no longer active). There are probably many others that I have missed.&lt;/p&gt;&lt;p&gt;The Obama administration has also used this approach a number of times, see e.g. my recent posting about the &lt;a rel="nofollow" target="_blank" href="http://scilib.typepad.com/science_library_pad/2009/04/recoverygov-crowdsources.html"&gt;National Dialogue to gather IT ideas for the Recovery.gov site&lt;/a&gt;.&lt;/p&gt;&lt;p&gt;Previously:&lt;br&gt;November 03, 2008 &lt;a rel="nofollow" target="_blank" href="http://scilib.typepad.com/science_library_pad/2008/11/government-of-c.html"&gt;Government of Canada launches official wiki for federal employees&lt;/a&gt;&lt;/p&gt;&lt;/div&gt;&lt;div class="feedflare"&gt;
&lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=yIl2AUoC8zA" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:F7zBnMyn0Lo"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?i=iY41WW9r_9g:mgGwmHOR4iQ:F7zBnMyn0Lo" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:qj6IDK7rITs"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=qj6IDK7rITs" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:EpLpB3ZkKWg"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=EpLpB3ZkKWg" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:V-t1I-SPZMU"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=V-t1I-SPZMU" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:XAVGb8Xj5zA"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=XAVGb8Xj5zA" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:dnMXMwOfBR0"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?d=dnMXMwOfBR0" border="0"&gt;&lt;/a&gt; &lt;a rel="nofollow" target="_blank" href="http://feeds.feedburner.com/~ff/ScienceLibraryPad?a=iY41WW9r_9g:mgGwmHOR4iQ:V_sGLiPBpWU"&gt;&lt;img src="http://feeds.feedburner.com/~ff/ScienceLibraryPad?i=iY41WW9r_9g:mgGwmHOR4iQ:V_sGLiPBpWU" border="0"&gt;&lt;/a&gt;
&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/ScienceLibraryPad/~4/iY41WW9r_9g" height="1" width="1"/&gt;</description>
         <author>Richard Akerman</author>
         <guid isPermaLink="false">tag:typepad.com,2003:post-66409445</guid>
         <pubDate>Tue, 05 May 2009 09:44:10 -0700</pubDate>
      </item>
      <item>
         <title>cnewmark: Social media trend sweeping Washington?</title>
         <link>http://www.cnewmark.com/2009/04/social-media-trend-sweeping-washington.html</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_bb65750a1d89c821445fd965b1b3b66b</guid>
         <pubDate>Mon, 27 Apr 2009 20:27:06 -0700</pubDate>
      </item>
      <item>
         <title>Podcast notes: Mark Shuttleworth, Canonical, free software and wealth creation</title>
         <link>http://feedproxy.google.com/~r/DFFlanders/~3/Of4dkViCSog/</link>
         <description>Podcast notes: Mark Shuttleworth, Canonical, free software and wealth creation Originally uploaded by dff.jisc itc.conversationsnetwork.org/shows/detail3994.html
* The stimulus of innovation is disclosure: not only saying what but
how we created the innovation
* how can we drive innovation faster?
** architect platform to be extensible and imbeddable, e.g. mozilla addons
** platform tolerance, must embrace our enemies (windows) to embrace
all humans, [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&amp;blog=532341&amp;post=180&amp;subd=dfflanders&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://dfflanders.wordpress.com/2009/04/21/podcast-notes-mark-shuttleworth-canonical-free-software-and-wealth-creation/</guid>
         <pubDate>Tue, 21 Apr 2009 12:57:43 -0700</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><div style="float:right;margin-left:10px;margin-bottom:10px;">
<a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/36299951@N08/3463733906/" title="photo sharing"><img src="http://farm4.static.flickr.com/3594/3463733906_55a2ce54c2.jpg" alt="" style="border:solid 2px #000000;"/></a><br />
<br />
<span style="font-size:.9em;margin-top:0;"><br />
<a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/36299951@N08/3463733906/">Podcast notes: Mark Shuttleworth, Canonical, free software and wealth creation</a><br />
<br />
Originally uploaded by <a rel="nofollow" target="_blank" href="http://www.flickr.com/people/36299951@N08/">dff.jisc</a><br />
</span>
</div>
<p><a rel="nofollow" target="_blank" href="http://itc.conversationsnetwork.org/shows/detail3994.html">itc.conversationsnetwork.org/shows/detail3994.html</a></p>
<p>* The stimulus of innovation is disclosure: not only saying what but<br />
how we created the innovation<br />
* how can we drive innovation faster?<br />
** architect platform to be extensible and imbeddable, e.g. mozilla addons<br />
** platform tolerance, must embrace our enemies (windows) to embrace<br />
all humans, be benevolentt<br />
** bring in fresh blood, it is not neccessarily the core that makes<br />
things happen<br />
* methodology and software development processes<br />
** purpose of methodology is to harness talent<br />
** aggregate development by building interest<br />
** agile for community (not pair prog) by phil oconnor<br />
** architect by community and collaboration<br />
* permission free development to enable web services<br />
** tools must interoperate over the web, eg bugzilla, trac, launchpad<br />
** branching should be enabled to any commmunity<br />
* change in innovation drives economics<br />
** economic models: advertisement, hardware providers pay for<br />
innovation&#8230; not likely<br />
** pay for service provision?<br />
* can a community create the cutting edge experience&#8230; better than Apple?<br />
** "we should be building software that helps it users get laid"<br />
* conical investing in a beautiful experience.<br /></p> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/dfflanders.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dfflanders.wordpress.com/180/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/dfflanders.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dfflanders.wordpress.com/180/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/dfflanders.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dfflanders.wordpress.com/180/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/dfflanders.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dfflanders.wordpress.com/180/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/dfflanders.wordpress.com/180/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dfflanders.wordpress.com/180/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&blog=532341&post=180&subd=dfflanders&ref=&feed=1"/></div>]]></content:encoded>
         <media:content url="http://0.gravatar.com/avatar/8cdc26501af3b7e571a4d23f0afbbf08?s=96&amp;amp;d=identicon&amp;amp;r=G" medium="image">
            <media:title>dfflanders</media:title>
         </media:content>
         <media:content url="http://farm4.static.flickr.com/3594/3463733906_55a2ce54c2.jpg" medium="image" />
         <category>Uncategorized</category>
      </item>
      <item>
         <title>Podcast notes: IT conversations, Anne Thomas Manes, ‘Is SOA Dead’</title>
         <link>http://feedproxy.google.com/~r/DFFlanders/~3/zaWLrtzcyUM/</link>
         <description>Podcast notes: IT conversations, Anne Thomas Manes, &amp;#8216;Is SOA Dead&amp;#8217; Originally uploaded by dff.jisc itc.conversationsnetwork.org/shows/detail4004.html
* We have been doing &amp;#34;Service Oriented Integration&amp;#34; which has been
focused on integration rather than architecture
* Intel paper on IT business valuation for metrics that effect bottom
line: how many people, how many hours, etc
* biggest impediment to SOA is cultural and political, not [...]&lt;img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&amp;blog=532341&amp;post=179&amp;subd=dfflanders&amp;ref=&amp;feed=1"/&gt;</description>
         <guid isPermaLink="false">http://dfflanders.wordpress.com/2009/04/21/podcast-notes-it-conversations-anne-thomas-manes-is-soa-dead/</guid>
         <pubDate>Tue, 21 Apr 2009 12:11:04 -0700</pubDate>
         <content:encoded><![CDATA[<div class='snap_preview'><br /><div style="float:right;margin-left:10px;margin-bottom:10px;">
<a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/36299951@N08/3462795633/" title="photo sharing"><img src="http://farm4.static.flickr.com/3583/3462795633_d2e3d5d228.jpg" alt="" style="border:solid 2px #000000;"/></a><br />
<br />
<span style="font-size:.9em;margin-top:0;"><br />
<a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/36299951@N08/3462795633/">Podcast notes: IT conversations, Anne Thomas Manes, &#8216;Is SOA Dead&#8217;</a><br />
<br />
Originally uploaded by <a rel="nofollow" target="_blank" href="http://www.flickr.com/people/36299951@N08/">dff.jisc</a><br />
</span>
</div>
<p><a rel="nofollow" target="_blank" href="http://itc.conversationsnetwork.org/shows/detail4004.html">itc.conversationsnetwork.org/shows/detail4004.html</a></p>
<p>* We have been doing "Service Oriented Integration" which has been<br />
focused on integration rather than architecture<br />
* Intel paper on IT business valuation for metrics that effect bottom<br />
line: how many people, how many hours, etc<br />
* biggest impediment to SOA is cultural and political, not technological<br />
** the question for those not obsessed with technology is: what is<br />
going to increase bottm line?<br />
* business managers and leaders won&#8217;t understand SOA, developers are<br />
beginning to understand and so are building it from the bottom up.<br />
SOA itself will very seldom ever happen as a top down political<br />
process.<br />
* we *as humans* suck at architecture.<br />
* enterprise architecture is completly different to application architecture.<br />
** enterprise architecture is about reducing cost.<br /></p> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gocomments/dfflanders.wordpress.com/179/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dfflanders.wordpress.com/179/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godelicious/dfflanders.wordpress.com/179/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/dfflanders.wordpress.com/179/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/gostumble/dfflanders.wordpress.com/179/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/dfflanders.wordpress.com/179/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/godigg/dfflanders.wordpress.com/179/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/dfflanders.wordpress.com/179/"/></a> <a rel="nofollow" target="_blank" href="http://feeds.wordpress.com/1.0/goreddit/dfflanders.wordpress.com/179/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/dfflanders.wordpress.com/179/"/></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=dfflanders.wordpress.com&blog=532341&post=179&subd=dfflanders&ref=&feed=1"/></div>]]></content:encoded>
         <media:content url="http://0.gravatar.com/avatar/8cdc26501af3b7e571a4d23f0afbbf08?s=96&amp;amp;d=identicon&amp;amp;r=G" medium="image">
            <media:title>dfflanders</media:title>
         </media:content>
         <media:content url="http://farm4.static.flickr.com/3583/3462795633_d2e3d5d228.jpg" medium="image" />
         <category>Uncategorized</category>
      </item>
      <item>
         <title>Less talk, less code, more data - The Preserv2 Data Registry</title>
         <link>http://davetaz-blog.blogspot.com/2009/04/talk-code-data-preserv2-data-registry.html</link>
         <description>Yes, less talk more code (oxfordrepo.blogspot.com) is a good saying but i'm going to argue in this post that in fact we need more data! Having a ton of available services and a load of highly complex and well considered data models is all well and good but without data all of these services are useless; A repository is not a repository until it has something in it (Harnad). &lt;br /&gt;&lt;br /&gt;If we look outside of the repository community for a minute we find the web community we are accumulating a whole ton of data, wikipedia being the main point of reference here. Yet in the repository community we are not harnessing this open linked data model to enhance our data. &lt;br /&gt;&lt;br /&gt;I have been working in the area of digital preservation for a while now and the PRONOM file format registry (TNA UK) has been my friend for many years now and contains some valuable data. However I am concerned with the way I see it progressing. The main thing I use the PRONOM registry for is as a complement to DROID for file format information, and the data here is not even that complete. I am concerned however at the size of the new data model and the sheer effort which is going to be required to fill it with the data which it specifies. &lt;br /&gt;&lt;br /&gt;Why not looked to the linked data web to see how to tie a series of smaller systems together to make a much more powerful and easier to maintain one! &lt;br /&gt;&lt;br /&gt;This is where I have started with the preserv2 registry available at &lt;a rel="nofollow" target="_blank" href="http://p2-registry.ecs.soton.ac.uk/"&gt;http://p2-registry.ecs.soton.ac.uk/&lt;/a&gt;. &lt;br /&gt;&lt;br /&gt;The preserv2 registry is a semantic knowledge base (RDF triples based) with an SPARQL endpoint, RESTful services and a basic browser. Currently the data is focussed on file formats and is basically made up of the PRONOM database ported from a complex XML schema into simple RDF triples. On top of this i'm beginning to add data from dbpedia (wikipedia RDF'd) and making links between the PRONOM data and the dbpedia data! &lt;br /&gt;&lt;br /&gt;Already this is helping is ascertain a greater knowledge base and the cost of gathering and compiling this data is very low. Other than that the registry took me less than a week to construct! &lt;br /&gt;&lt;br /&gt;So "Go forth and make links" (Wendy Hall) is exactly what I'm now doing. With enough data you will be able to make complex OWL-S rules that can be used to deduce accurately facts such as formats which are at risk.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-3516378215565453655?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-3516378215565453655</guid>
         <pubDate>Wed, 08 Apr 2009 01:21:00 -0700</pubDate>
      </item>
      <item>
         <title>Account from OGF25 Repositories Workshop: Creating a Repository Standard?</title>
         <link>http://feedproxy.google.com/~r/MediashelfBlog/~3/kgenvp2ahUE/</link>
         <description>04 March 2009
Catania, Sicily, Italy
Open Grid Forum 25th Conference (OGF25) It&amp;#8217;s not entirely clear when I figured out that I was sitting on a standards body panel discussing the creation of a digital repository related standard. I&amp;#8217;m pretty sure it finally clicked sometime after the session was over, once I had consumed a couple glasses of [...]</description>
         <guid isPermaLink="false">http://yourmediashelf.com/blog/?p=71</guid>
         <pubDate>Fri, 20 Mar 2009 13:56:09 -0700</pubDate>
         <content:encoded><![CDATA[<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;">04 March 2009</p>
<p style="font:normal normal normal 12px/normal Helvetica;color:#000000;margin:0px;">Catania, Sicily, Italy</p>
<p style="font:normal normal normal 12px/normal Helvetica;color:#000000;margin:0px;">Open Grid Forum 25th Conference (OGF25)</p>
<p style="font:normal normal normal 12px/normal Helvetica;color:#000000;margin:0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">It&#8217;s not entirely clear when I figured out that I was sitting on a standards body panel discussing the creation of a digital repository related standard. I&#8217;m pretty sure it finally clicked sometime after the session was over, once I had consumed a couple glasses of wine.</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">I still don&#8217;t see what I contributed to the conversation, though the other participants assured me that my comments were useful. The experience reminds me of my friend, let&#8217;s call him Josh, a community organizer who was recently pulled onto one of the Obama administration&#8217;s advisory panels. Shortly after joining the advisory panel, Josh confessed that at the end of most calls he has to follow up with a friend and ask &#8220;Ok. So what exactly did we just decide and who is responsible for doing what?&#8221;</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">The panel discussion started by making observations that we&#8217;re all familiar with:</p>
<ul>
<li>the importance, and associated challenges, of unique identifiers and persistent URIs</li>
<li>search, retrieval, and management are separate concerns, each with appropriate standards associated (ie. SWORD, OpenSearch, etc.)</li>
<li>cloud computing is very different from cloud storage</li>
</ul>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">After this, I quickly found myself in completely unfamiliar waters when conversation abruptly turned to the creation of a standard for digital repositories. I thought &#8220;Pshaw. We don&#8217;t need Yet Another Standard. Where did this come from?&#8221; In fact, the whole field of repositories is so new that the prospect of a repository standard seems absurdly premature to me. Discussion on the panel honed in on the two obvious contenders for a standard: 1) metadata requirements, and 2) functionality profiles (a list of features necessary in order for a repository system to be deemed compliant and interoperable). From my perspective, repositories already swim in a glut of metadata standards (as well as non-standard, ad-hoc metadata) and, by nature, must embrace heterogeneous metadata. The second notion, that of functionality profiles, sounds like something that few will read and none will understand. To be honest, the entire discussion confused me. I did my best to contribute to the discussion where I could.</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">After the workshop ended, I had a chance to catch my breath and discuss the panel with a couple of people. Eventually, I came to look at the whole scenario from a different perspective and had a mild change of heart. In a discussion with Neil Chue Hong, a very smart guy from Edinburgh, I started thinking about all the informal conclusions that frame discussions between developers at conferences like Dev8D, Code4Lib and RepoCamp. I then thought about all the little architectural wins and failures that I see in software like Flickr, YouTube, Hulu and (woah) ABC&#8217;s full episode player. After all, these are repositories too. Within a few moments of pondering, an initial list of obvious basic guidelines shone through quite clearly.</p>
<ul>
<li>give permanent, unique URIs to all content you expose, even if you intend to limit access to that content based on geography or time of access</li>
<li>support linking with versioning or datetime info</li>
<li>expose a RESTful API</li>
<li>give preference to AtomPub</li>
<li>consider ORE when you need to express aggregations of data</li>
<li>provide linked data (RDF) endpoints</li>
</ul>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">Some open topics also seem like obvious fodder for discussion:</p>
<ul>
<li>what query language(s) to use in search APIs</li>
<li>navigating the difference between standards and interoperability</li>
<li>leveraging standards where possible (ie SWORD)</li>
</ul>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">These are merely the things that seem obvious to me right away. What would happen if we got the <a rel="nofollow" title="Inkdroid" target="_blank" href="http://inkdroid.org/">really</a> <a rel="nofollow" title="OxfordRepo Blog" target="_blank" href="http://oxfordrepo.blogspot.com/">smart</a> <a rel="nofollow" title="Dave Tarrant" target="_blank" href="http://www.ecs.soton.ac.uk/people/dct05r">people</a> talking in this vein? </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">I think this warrants further exploration and, strange though it is, I expect that the outputs of such exploration might resemble the stuff of standards bodies (be it a recommendation, a community document, or a standard). Possibly I have been infected with that odd standards-wonk bug, or possibly I&#8217;m just catching up with the rest of the world in acknowledging the inevitable.</p>
<img src="http://feeds.feedburner.com/~r/MediashelfBlog/~4/kgenvp2ahUE" height="1" width="1"/>]]></content:encoded>
      </item>
      <item>
         <title>Session Hopping, LinkedData, and Data APIs at OGF25 in Sicily</title>
         <link>http://feedproxy.google.com/~r/MediashelfBlog/~3/iAbuQz22MQQ/</link>
         <description>04 March 2009
Catania, Sicily, Italy
Open Grid Forum 25th Conference (OGF25) [Note: I'm posting my backlog of updates from the past 2 months of travel. An update specifically about the OGF Repositories Workshop will follow shortly] I made it to the conference center in Catania, Sicily a few hours before the OGF Repositories Workshop. Immediately upon arriving I met Nick [...]</description>
         <guid isPermaLink="false">http://yourmediashelf.com/blog/?p=69</guid>
         <pubDate>Fri, 20 Mar 2009 13:43:50 -0700</pubDate>
         <content:encoded><![CDATA[<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">04 March 2009</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">Catania, Sicily, Italy</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">Open Grid Forum 25th Conference (OGF25)</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">[Note: I'm posting my backlog of updates from the past 2 months of travel. An update specifically about the OGF Repositories Workshop will follow shortly]</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">I made it to the conference center in Catania, Sicily a few hours before the OGF Repositories Workshop. Immediately upon arriving I met Nick Ferguson, coordinator of the workshop, and had a nice chat with Neil Chue Hong about repositories, ORE, and grid computing vs. cloud computing. After that I was left to kill time until the workshop by sitting in on one of the OGF sessions. At first, I stepped into what I thought was the Earth Sciences session, but it turned out to be the Computational Chemistry session and went way over my head. I then passed through a handful of other random presentations before settling on a room where about 30 people were having a discussion about XQuery.</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">I soon discerned that this group was hammering out the spec for some sort of standard data systems interface. When I arrived, they had been debating the strengths and demerits of XPath/XQuery vs. SQL as a query language. The converation quickly stumbled into the pit of interoperability hell. Standard interjections abounded: &#8220;Some implementations won&#8217;t have that data to return&#8230;&#8221;, &#8220;you will have to expose user info in order to support that&#8230;&#8221; mumble mumble &#8220;&#8230; we didn&#8217;t do it that way because one unnamed vendor couldn&#8217;t support it&#8230;&#8221; I nearly laughed out loud when an attendee from the back of the room interrupted the discussion declaring &#8220;But in most situations, you should only be returning items owned by the current user.&#8221; </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">I still had no idea what data they were attempting to expose. (I later learned that it was the <a rel="nofollow" title="RUS-WG" target="_blank" href="http://www.to.infn.it/grid/accounting/NA5/rus-wg.html">RUS-WG</a>, who are defining a standard interface for retrieving<span class="Apple-style-span" style="font-style:italic;"> job usage records &#8230; </span>Obscure indeed.) The 90-minute discussion ended up having nearly nothing to do with the actual data these people want to work with. Instead, the conversation was entirely dominated by the travails of navigating the strange space of Data API design.</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">Meanwhile, serendipitously, I was using this downtime (and the conference wifi access) to finally read George Thomas&#8217;s <a rel="nofollow" target="_blank" href="http://george.thomas.name/omb/recovery.gov.pdf">slides about recovery.gov publishing open data</a>. Though I missed the presentation, the slides spell out the project&#8217;s intentions pretty clearly. They&#8217;re full of references to REST, ATOM, RDFa and the <a rel="nofollow" target="_blank" href="http://richard.cyganiak.de/2007/10/lod/">LOD cloud</a>. I experienced such a fascinating contrast between the exposition before my eyes and the discussion filtering in through my ears. In particular, one of Thomas&#8217;s slides jumped out at me. The slide, titled &#8220;Follow the dollar, not the person&#8221;, showed a semantic model for users, user groups, and posts in a bulletin-board style Community Forum system. It was totally readable, totally understandable, precise, flexible, and using an ontology that lends itself to re-use. </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">Over the past year, I have satirically placed a golden halo above &#8220;linked data&#8221; in my mind. As I sat in the RUS-WG session, light fell upon that halo and it glowed.</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">This experience, as well as consequent discussions at OGF, has left me with a distinct sense that there&#8217;s a pattern here. We are all, of our own accord and in our own little techno-fiefdoms, attempting to do the same things and running into the same challenges. I think that the previously obscure field of digital repositories has valuable perspective to provide and many pieces of wisdom to share in this domain. I hope to see more public discourse about these topics, and I know who to start prodding to speak up. Watch this space.</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">Post Script:</p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;min-height:14.0px;"> </p>
<p style="margin:0.0px 0.0px 0.0px 0.0px;font:12.0px Helvetica;">The morning following my OGF session-hopping experience, I realized that the track I had passed over, innocuously titled &#8220;HEP&#8221;, was a meeting of the <a rel="nofollow" title="Wikipedia Entry for HEP" target="_blank" href="http://en.wikipedia.org/wiki/Particle_physics">High Energy Physics</a> community. In particular, it was primarily a discussion about how they are going to handle processing the data outputs from the LHC experiments when they fire up <a rel="nofollow" title="Cern Large Hadron Collider Website" target="_blank" href="http://www.cern.ch/lhc">the collider</a> later this year. /me kicks himself for missing this.</p>
<img src="http://feeds.feedburner.com/~r/MediashelfBlog/~4/iAbuQz22MQQ" height="1" width="1"/>]]></content:encoded>
      <enclosure url="http://george.thomas.name/omb/recovery.gov.pdf" length="904104" type="application/pdf" /><media:content url="http://george.thomas.name/omb/recovery.gov.pdf" fileSize="904104" type="application/pdf" /><itunes:subtitle>04 March 2009 Catania, Sicily, Italy Open Grid Forum 25th Conference (OGF25) [Note: I'm posting my backlog of updates from the past 2 months of travel. An update specifically about the OGF Repositories Workshop will follow shortly] I made it to the confer</itunes:subtitle><itunes:summary>04 March 2009 Catania, Sicily, Italy Open Grid Forum 25th Conference (OGF25) [Note: I'm posting my backlog of updates from the past 2 months of travel. An update specifically about the OGF Repositories Workshop will follow shortly] I made it to the conference center in Catania, Sicily a few hours before the OGF Repositories Workshop. Immediately upon arriving I met Nick [...]</itunes:summary></item>
      <item>
         <title>We need people!</title>
         <link>http://oxfordrepo.blogspot.com/2009/03/we-need-people.html</link>
         <description>(UPDATE - Grrr.... seems that the concept of persistent URLs is lost on the admin - link below has been removed - see google cached copy &lt;a rel="nofollow" target="_blank" href="http://209.85.229.132/search?q=cache:YsdhcWzKWksJ:www.admin.ox.ac.uk/ps/oao/ar/ar3979j.shtml+http://www.admin.ox.ac.uk/ps/oao/ar/ar3979j.shtml&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk"&gt;here&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.admin.ox.ac.uk/ps/oao/ar/ar3979j.shtml"&gt;http://www.admin.ox.ac.uk/ps/oao/ar/ar3979j.shtml&lt;/a&gt; - job description.&lt;br /&gt;&lt;br /&gt;Essentially, we need smart people who are willing to join us to do good, innovative stuff; work that isn't by-the-numbers with room for initiative and ideas.&lt;br /&gt;&lt;br /&gt;Help us turn our digital repository into a digital library, it'll be fun! Well, maybe not fun, but it will be very interesting at least!&lt;br /&gt;&lt;br /&gt;bulletpoints: python/ruby frameworks, REST, a little SemWeb, ajax, jQuery, AMQP, Atom, JSON, RDF+RDFa, Apache WSGI deployment, VMs, linux, NFS, storage, RAID, etc.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-462847752063249912?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-462847752063249912</guid>
         <pubDate>Thu, 19 Mar 2009 00:46:00 -0700</pubDate>
      </item>
      <item>
         <title>Developer Happiness days - why happyness is important</title>
         <link>http://oxfordrepo.blogspot.com/2009/02/developer-happiness-days-why-happyness.html</link>
         <description>&lt;big&gt;&lt;big&gt;&lt;b&gt;Creativity and innovation&lt;/b&gt;&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;One of the defining qualities of a good innovative developer is creativity and a pragmatic attitude; someone with the '&lt;i&gt;rough consensus, running code&lt;/i&gt;' mentality that pervades good software innovation. This can be seen as the drive to experiment, to turn inspiration and ideas into real, running code or to pathfind by trying out different things. Innovation can often happen when talking about quite separate, seemingly unrelated things, even to the point that most of the time, the 'outcomes' of an interaction are impossible to pin down.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;&lt;b&gt;Play, vagueness and communication&lt;/b&gt;&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Creativity, inspiration, innovation, ideas, fun, and curiousity&lt;/b&gt; are all useful and important when developing software. These words convey concepts that do not thrive in situations that are purely scheduled, didactic, and teacher-pupil focussed. There needs to be an amount of '&lt;b&gt;&lt;i&gt;play&lt;/i&gt;&lt;/b&gt;' in the system (&lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Play_%28engineering%29"&gt;see 'Play'.&lt;/a&gt;) While this '&lt;i&gt;&lt;b&gt;play&lt;/b&gt;&lt;/i&gt;' is bad in a tightly regimented system, it is an essential part in a creative system, to allow for new things to develop, new ideas to happen and for 'random' interactions to take place.&lt;br /&gt;&lt;br /&gt;Alongside this notion of &lt;i&gt;&lt;b&gt;play&lt;/b&gt;&lt;/i&gt; in an event, there also needs to be an amount of blank space, a &lt;i&gt;&lt;b&gt;vagueness&lt;/b&gt;&lt;/i&gt; to the event. I think that we can agree that much of the usefulness of normal conferences comes from the 'coffee breaks' and 'lunch breaks', which are blank spaces of a sort. It is the recognition of this that is important and to factor it in more.&lt;br /&gt;&lt;br /&gt;Note that if a single developer could guess at how things should best be developed in the academic space, they would have done so by now. &lt;i&gt;Pre-compartmentalisation of ideas into 'tracks' can kill potential innovation stone-dead.&lt;/i&gt; The distinction between CMSs, repositories and VLE developers is purely semantic and it is detrimental for people involved in one space to not overhear the developments, needs, ideas and issues in another. It is especially counter-productive to further segregate by community, such as having simultaneous Fedora, DSpace and EPrints strands at an event.&lt;br /&gt;&lt;br /&gt;While the inherent and intended &lt;i&gt;&lt;b&gt;vagueness&lt;/b&gt;&lt;/i&gt; provides the potential for cross-fertilisation of ideas, and the room for &lt;i&gt;&lt;b&gt;play&lt;/b&gt;&lt;/i&gt; provides the space, the final ingredient is that of &lt;i&gt;&lt;b&gt;speech, or any communication that takes place with the same ease and at the same speed of speech&lt;/b&gt;&lt;/i&gt;. While some may find the 140 character limit on twitter or identi.ca a strange constraint, this provides a target for people to really think about what they wish to convey and keeps the dialogue from becoming a series of monologues - much like the majority of emails of mailing lists - and keeps it as a dialogue between people.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;&lt;b&gt;Communication and Developers&lt;/b&gt;&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;One of the dichotomies in the necessity of communication to development is that developers can be shy, initially preferring the false anonymity of textual communication to spoken words between real people. There is a need to provide means for people to break the ice, and to strike up conversations with people that they can recognise as being of like minds. Asking that people's public online avatars are changed to be pictures of them can help people at an event find those that they have been talking to online and to start talking, face to face.&lt;br /&gt;&lt;br /&gt;On a personal note, one of the most difficult things I have to do when meeting people out in real life is answer the question 'What do you do?' - it is much easier when I already know that the person asking the question has a technical background.&lt;br /&gt;&lt;br /&gt;And again, going back to the concept of compartmentalisation - &lt;i&gt;developers who only deal with developers and their managers/peers will build systems that work best for their peers and their managers.&lt;/i&gt; If these people are not the only users then they need to widen their communications. It is important for the developers that do not use their own systems to engage with the people who actually do. They should do this directly, without the potential for garbled dialogue via layers of protocol. This part needs managing in whatever space, both to avoid dominance by loud, disgruntled users and to mitigate anti-social behaviour. By and large, I am optimistic of this process, people tend to want to be thanked, and this simple &lt;i&gt;feedback loop&lt;/i&gt; can be used to help motivate. Making this feedback more disproportionate (a small 'thank you' can lead to great effects) and adding in the notion of &lt;i&gt;highscore&lt;/i&gt; can lead to all sorts of interaction and outcomes, most notably being the rapid reinforcement of any behaviour that led to a positive outcome.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;&lt;b&gt;Disproportionate feedback loops and Highscores drive human behaviour&lt;/b&gt;&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;I'll just digress quickly to cover what I mean be a &lt;b&gt;&lt;i&gt;disproportionate feedback loop&lt;/i&gt;&lt;/b&gt;: A disproportionate feedback loop is something that encourages a certain behaviour; the input to which is something small and inexpensive, in either time or effort but the output can be large and very rewarding. This pattern can be seen in very many interactions: playing the lottery, [good] video game controls, twitter and facebook, musical instruments, the 'who wants to be a millionaire' format, mashups, posting to a blog ('free' comments, auto rss updating, a google-able webpage for each post) etc.&lt;br /&gt;&lt;br /&gt;The &lt;i&gt;&lt;b&gt;natural drive for highscores&lt;/b&gt;&lt;/i&gt; is also worth pointing out. At first glance, is it as simple as considering its use in videogames? How about the concept of getting your '5 fruit and veg a day'? &lt;a rel="nofollow" target="_blank" href="http://www.5aday.nhs.uk/topTips/default.html"&gt;http://www.5aday.nhs.uk/topTips/default.html&lt;/a&gt; Running in a marathon against other people? Inbox Zero (&lt;a rel="nofollow" target="_blank" href="http://www.slideshare.net/merlinmann/inbox-zero-actionbased-email"&gt;http://www.slideshare.net/merlinmann/inbox-zero-actionbased-email&lt;/a&gt;), Learning to play different musical scores? Your work being rated highly online? An innovation of yours being commented on by 5 different people in quick succession? Highscores can be very good drivers for human behaviour, addictive to some personalities.&lt;br /&gt;&lt;br /&gt;Why not set up some software highscores? For example, in the world of repositories, how about 'Fastest UI for self-submission' - encouraging automatic metadata/datamining, a monthly prize for 'Most issue tickets handled' - to the satisfaction of those posting the tickets, and so on.&lt;br /&gt;&lt;br /&gt;It is very easy to over-metricise this - some will purposefully abstain from this and some metrics are truely misleading. In the 90s, there was a push to have lines of code added as a metric to productivity. The false assumption is that lines of code have anything to do with producitivity - code should be lean, but not too lean to maintain.&lt;br /&gt;&lt;br /&gt;So be very careful when adding means to record highscores - they should be flexible, and be fun - if they are no fun for the developers and/or the users, they become a pointless metric, more of an obstacle than a motivation.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;&lt;b&gt;The Dev8D event&lt;/b&gt;&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;People were free to roam and interact at the Dev8D event and there was no enforced schedule, but twitter and a loudhailer were used to make people aware of things that were going on. Talks and discussions were lined up prior to the event of course, but the event was organised on a wiki which all were free to edit. As experience has told us, the important and sometimes inspired ideas occur in relaxed and informal surroundings where people just talk and share information, such as in a typical social situation like having food and drink.&lt;br /&gt;&lt;br /&gt;As a specific example, look &lt;a rel="nofollow" target="_blank" href="http://oxfordrepo.blogspot.com/2009/02/tracking-conferences-at-dev8d-with.html"&gt;at the role of twitter at the event&lt;/a&gt;. Sam Easterby-Smith (&lt;a rel="nofollow" target="_blank" href="http://twitter.com/samscam%29"&gt;http://twitter.com/samscam)&lt;/a&gt; created a means to track 'developer happiness' and shared the tracking '&lt;a rel="nofollow" target="_blank" href="http://samscam.co.uk/happier/"&gt;Happyness-o-meter'&lt;/a&gt; site with us all. This unplanned development inspired me to relay the infomation back to twitter and similarly led to me running an operating system/hardware survey in a very similar fashion.&lt;br /&gt;&lt;br /&gt;To help break the ice and to encourage play, we instituted a number of ideas:&lt;br /&gt;&lt;br /&gt;A &lt;b&gt;wordcloud on each attendees badge&lt;/b&gt;, consisting of whatever we could find of their work online, be it their blog or similar so that it might provide a talking point, or allow people to spot people who write about things they might be interested in learning more about.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;The poker chip game&lt;/b&gt; - each attendee was given 5 poker chips at the start of the event, and it was encouraged that chips were to be traded for help, advice or as a way to convey a thank you. The goal was that the top 4 people ranked by amounts of chips at the end of the third day would receive a Dell mini 9 computer. The balance to this was that each chip was also worth a drink at the bar on that day too.&lt;br /&gt;&lt;br /&gt;We were well aware that we'd left a lot of play in this particular system, allowing for lotteries to be set up, people pooling their chips, and so on. As the sole purpose of this was to encourage people to interact, to talk and bargain with each other, and to provide that feedback loop I mentioned earlier, it wasn't too important how people got the chips as long as it wasn't underhanded. It was the interaction and the 'fun' that we were after. Just as an aside, Dave Flanders deserves the credit for this particular scheme.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;&lt;b&gt;Developer Decathlon&lt;/b&gt;&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;The basic concept of the &lt;a rel="nofollow" target="_blank" href="http://code.google.com/p/developerhappinessdays/wiki/DeveloperDecathlon"&gt;Developer Decathlon&lt;/a&gt; was also reusing these ideas of play and feedback: "&lt;a rel="nofollow" name="What_is_the_Developer_Decathlon?"&gt;The Developer Decathlon is a competition at dev8D that enables developers to come together face-to-face to do rapid prototyping of software ideas. [..] &lt;/a&gt; We help facilitate this at dev8D by providing both 'real users' and 'expert advice' on how to run these rapid prototyping sprints. [..] The 'Decathlon' part of the competition represents the '10 users' who will be available on the day to present the biggest issues they have with the apps they use and in turn to help answer developer questions as the prototypes applications are being created. The developers will have two days to work with the users in creating their prototype applications."&lt;br /&gt;&lt;br /&gt;The best two submissions will get cash prizes that go to the individual, not to the company or institution that they are affiliated with. The outcomes of which will be made public shortly, once the judging panel have done their work.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;Summary&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;To foster innovation and to allow for creativity in software development:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Having &lt;b&gt;play&lt;/b&gt; space is &lt;b&gt;important&lt;/b&gt;&lt;/li&gt;&lt;li&gt;Being &lt;b&gt;vague&lt;/b&gt; with aims and &lt;b&gt;flexible&lt;/b&gt; with outcomes is not a bad thing and is &lt;b&gt;vital&lt;/b&gt; for unexpected things to develop - &lt;i&gt;e.g. A project's outcomes should be under continual re-negotiation as a general rule, not as the exception.&lt;/i&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Encouraging&lt;/b&gt; and &lt;b&gt;enabling&lt;/b&gt; free and easy communication is &lt;b&gt;crucial&lt;/b&gt;.&lt;/li&gt;&lt;li&gt;Be aware of what drives people to do what they do. Push all feedback to be &lt;b&gt;as disproportionate as possible&lt;/b&gt;, allowing both developers and users to benefit, with only putting a relatively trivial amount of input in (this pattern affects web UIs, development cycles, team interaction, etc)&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;b&gt;Choose useful highscores&lt;/b&gt; and be prepared to ditch them or change them if they are no longer &lt;b&gt;fun and motivational&lt;/b&gt;.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-7889113205346661214?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-7889113205346661214</guid>
         <pubDate>Tue, 24 Feb 2009 22:09:00 -0800</pubDate>
      </item>
      <item>
         <title>Less Talk, More Code: Tracking conferences (at Dev8D) with python, twitter and tags</title>
         <link>http://oxfordrepo.blogspot.com/2009/02/tracking-conferences-at-dev8d-with.html</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_37a36fe6a393a4ab706eac1eebbaba13</guid>
         <pubDate>Mon, 23 Feb 2009 06:45:38 -0800</pubDate>
      </item>
      <item>
         <title>Handling Tabular data</title>
         <link>http://oxfordrepo.blogspot.com/2009/02/handling-tabular-data.html</link>
         <description>&lt;div&gt;&lt;b&gt;"Storage"&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;I put the s-word in quotes because the storing of the item is actually a very straightforward process - we have been dealing with storing tabular data for computation for a very long time now. Unfortunately, this also means that there are very many ways to capture, edit and present tables of information.&lt;br/&gt;&lt;br/&gt;One realisation to make with regards to preserving access to data coming from research is that there is a huge backlog of data in formats that we shall kindly call 'legacy'. Not only is there this issue, but data is being made with tools and systems that effectively 'trap' or lock-in a lot of this information - case in point being any research being recorded using Microsoft Access. While the tables of data can often be extracted with some effort, it is normally difficult to impossible to extract the implicit information; how tables interlink, how the Access Form adds information to the dataset, etc.&lt;br/&gt;&lt;br/&gt;It is this implicit knowledge that is the elephant in the room. Very many serialisations, such as SQL 'dumps', csv, xsl and so on, rely on implicit knowledge that is either related to the particulars of the application used to open it, or is actually highly domain specific.&lt;br/&gt;&lt;br/&gt;So, it is trivial and easy to specify a model for storing data, but without also encoding the implied information and without making allowances for the myriad of sources, the model is useless; it would be akin to defining the colour of storage boxes holding bric-a-brac. The datasets need to be characterised, and the implied information recorded in as good a way as possible.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Characterisation&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;The first step is to characterise the dataset that has been marked for archival and reuse. (Strictly, the best first step is to consult with the researcher or research team and help and guide them so that as much of the unsaid knowledge is known by all parties.)&lt;br/&gt;&lt;br/&gt;Some serialisations so a good job of this themselves, *SQL-based serialisations include basic data type information inside the table declarations themselves. As a pragmatic measure, it seems sensible to accept SQL-style table descriptions as a reasonable beginning. Later, we'll consider the implicit information that also needs to be recorded alongside such a declaration.&lt;br/&gt;&lt;br/&gt;Some others, such as CSV, leave it up to the parsing agent to guess at the type of information included. In these cases, it is important to find out or even deduce the type of data held in each column. Again, this data can be serialised in a SQL table declaration held alongside the original &lt;i&gt;unmodified&lt;/i&gt; dataset.&lt;br/&gt;&lt;br/&gt;(It is assumed that a basic data review will be carried out; does the csv have a consistent number of columns per row, is the version and operating system known for the MySQL that held the data, is there a PI or responsible party for the data, etc.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Implicit information&lt;br/&gt;&lt;/b&gt;&lt;br/&gt;Good teachers are right to point out this simple truth: "don't forget to write down the obvious!" It may seem obvious that all your data is latin-1 encoded, or that you are using a FAT32 filesystem, or even that you are running in a 32-bit environment, the painful truth is that we can't guarantee that these aspects won't affect how the data is held, accessed or stored. There may be systematic issues that we are not aware of, such as the problems with early versions of ZFS causing [, at the time, detected] data corruption, or MySQL truncating fields when serialised in a way that is not anticipated or discovered until later.&lt;br/&gt;&lt;br/&gt;In characterising the legacy sets of data, it is important to realise that there will be loss, especially with the formats and applications that blend presentation with storage. For example, it will require a major effort to attempt to recover the forms and logic bound into the various versions of MS Access. I am even aware of a major dataset, a highly researched dictionary of old english words and phrases, that the final output of which is a Macromedia Authorware application, and the source files are held by an unknown party (that is if they still exist at all) - the Joy of hiring Contractors. In fact, this warrants a slight digression:&lt;br/&gt;&lt;br/&gt;&lt;b&gt;The gap in IT support for research&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;If an academic researcher wishes to gain an external email account at their institution, there is an established protocol for this. Email is so commonplace, it sounds an easy thing to provide, but you need expertise, server hardware, multiuser configuration, adoption of certain access standards (IMAP, POP3, etc), and generally there are very few types of email (text or text with MIME attachments - NB the IM in MIME stands for Internet Mail)&lt;br/&gt;&lt;br/&gt;If a researcher has a need to store tables of data, where do they turn? They should turn to the same department, who will handle the heavy lifting of guiding standards, recording the implicit information and providing standard access APIs to the data. What the IT departments seem to be doing currently is - to carry on the metaphor - handing the researcher the email server software and telling them to get on with it, to configure it as they want. No wonder the resulting legacy systems are as free-form as they are.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Practical measures - Curation&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;Back to specifics now, consider that a set of data has been found to be important, research has been based on it, and it's been recognised that this dataset needs to be looked after. [This will illustrate the technical measures. Licencing, dialogue with the data owners, and other non-technical analysis and administration is left out, but assumed.]&lt;br/&gt;&lt;br/&gt;First task is to store the incoming data, byte-for-byte, as much as is possible - storing the iso image of the media the data is stored on, storing the SQL dump of a database, etc.&lt;br/&gt;&lt;br/&gt;Analyse the tables of data - record the base types of each column (text, binary, float, decimal, etc) apeing the syntax of a SQL table declaration, as well as trying to identify the key columns.&lt;br/&gt;&lt;br/&gt;Record the inter-table joins between primary and secondary keys, possibly by using a &lt;i&gt;"table.column SAMEAS table.column;"&lt;/i&gt; declaration after the table declarations.&lt;br/&gt;&lt;br/&gt;Likewise, attempt to add information concerning each column, information such as units or any other identifying material.&lt;br/&gt;&lt;br/&gt;Store this table description alongside the recorded tabular data source.&lt;br/&gt;&lt;br/&gt;Form a representation of this data in a well-known, current format such as a MySQL dump. For spreadsheets that are 'frozen', cells that are the results of embedded formula should be calculated and added as fixed values. It is important to record the environment, library and platform that these calculations are made with.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Table description as RDF &lt;/b&gt;(strictly, referencing cols/rows via the URI)&lt;br/&gt;&lt;br/&gt;One syntax I am playing around with is the notion that by appending sensible suffixes to the base URI for a dataset, we can unique specify a row, a column, a region or even a single cell. Simply put:&lt;br/&gt;&lt;br/&gt;http://datasethost/datasets/{data-id}#table/{table-name}/column/{column-id} to reference a whole column&lt;br/&gt;http://datasethost/datasets/{data-id}#table/{table-name}/row/{column-id} to reference a whole row, etc&lt;br/&gt;&lt;br/&gt;[The use of the # in the position it is in will no doubt cause debate. Suffice it to say, this is a pragmatic measure, as I suspect that an intermediary layer will have to take care of dereferencing a GET on these forms in any case.]&lt;br/&gt;&lt;br/&gt;The purpose for this is so that the tabular description can be made using common and established namespaces to describe and characterise the tables of data. Following on from a previous post on extending the BagIt protocol with an RDF manifest, this information can be included in said manifest, alongside the more expected metadata without disrupting or altering how this is handled.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;A possible content type for tabular data&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;By considering the base Fedora repository object model, or the BagIt model, we can apply the above to form a content model for a dataset:&lt;br/&gt;&lt;br/&gt;As a Fedora Object:&lt;br/&gt;&lt;br/&gt;&lt;ul&gt;&lt;li&gt;Original data in whatever forms or formats it arrives in (dsid prefix convention: DATA*)&lt;/li&gt;&lt;li&gt;Binary/textual serialisation in a well-understood format (dsid prefix convention: DERIV*)&lt;/li&gt;&lt;li&gt;'Manifest' of the contents (dsid convention: RELS-INT)&lt;/li&gt;&lt;li&gt;Connections between this dataset and other objects, like articles, etc as well as the RDF description of this item (RELS-EXT)&lt;/li&gt;&lt;li&gt;Basic description of dataset for interoperability (Simple dublin core - DC)&lt;/li&gt;&lt;/ul&gt;&lt;br/&gt;As a BagIt+RDF:&lt;br/&gt;&lt;br/&gt;Zip archive - &lt;br/&gt;&lt;ul&gt;&lt;li&gt;/MANIFEST (list of files and checksums)&lt;/li&gt;&lt;li&gt;/RDFMANIFEST (RELS-INT and RELS-EXT from above)&lt;/li&gt;&lt;li&gt;/data/* (original dataset files/disk images/etc)&lt;/li&gt;&lt;li&gt;/derived/* (normalised/re-rendered datasets in a well known format)&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;Presentation - the important part&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;What is described above is the archival of the data. This is a form suited for discovery, but is not in a form suited for reuse. So, what is the possibility?&lt;br/&gt;&lt;br/&gt;BigTable (Google) or HBase (Hadoop) provides a platform where tabular data can be put in a scalable manner. In fact, I would go on to suggest that HBase should be a basic service offered by the IT department of any institution. By providing this database as a service, it should be easier to normalise, and to educate the academic users in a manner that is useful to them, not just to the archivist. Google spreadsheet is an extremely good example of how such a large, scalable database might be presented to the end-user.&lt;br/&gt;&lt;br/&gt;For archival sets with a good (RDF) description of the table, it should be possible to instantiate working versions of the tabular data on a scalable database platform like HBase on demand. Having a policy to put to 'sleep' unused datasets can provide a useful comprimise, avoiding having all the tables live but still providing a useful service. &lt;br/&gt;&lt;br/&gt;It should also be noted that the adoption of popular methods of data access should be part of the responsibility of the data providers - this will change as time goes on, and protocols and methods for access alter with fashion. Currently, Atom/RSS feeds of any part of a table of data (the google spreadsheet model) fits very well with the landscape of applications that can reuse this information.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Summary&lt;/b&gt;&lt;br/&gt;&lt;ul&gt;&lt;li&gt;Try to record as much information as can be found or derived - from host operating system to column types.&lt;/li&gt;&lt;li&gt;Keep the original dataset byte-for-byte as you recieved it.&lt;/li&gt;&lt;li&gt;Try to maintain a version of the data in a well-understood format&lt;/li&gt;&lt;li&gt;Describe the tables of information in a reusable way, preferably by adopting a machine-readable mechanism&lt;/li&gt;&lt;li&gt;Be prepared to create services that the users want and need, not services that you think they should have.&lt;/li&gt;&lt;/ul&gt;&lt;br/&gt;&lt;br/&gt;&lt;div class='zemanta-pixie'&gt;&lt;img src='http://img.zemanta.com/pixy.gif?x-id=38add0ef-8f30-4993-98ab-7ae1db1f0b20' class='zemanta-pixie-img'/&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-7013951292415915268?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-7013951292415915268</guid>
         <pubDate>Sun, 22 Feb 2009 10:28:00 -0800</pubDate>
      </item>
      <item>
         <title>Tracking conferences (at Dev8D) with python, twitter and tags</title>
         <link>http://oxfordrepo.blogspot.com/2009/02/tracking-conferences-at-dev8d-with.html</link>
         <description>There was so much going on at http://www.dev8d.org (#dev8d) that it might be foolish for me to attempt to write up what happened.&lt;br /&gt;&lt;div&gt;&lt;br /&gt;So, I'll focus on a small, but to my mind, crucial aspect of it - tag tracking with a focus on &lt;a rel="nofollow" target="_blank" href="http://twitter.com/"&gt;Twitter&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;The Importance of Tags&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;First, the tag (#)dev8d was cloudburst over a number of social sites - &lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/tags/dev8d/"&gt;Flickr&lt;/a&gt;(dev8d tagged photos), &lt;a rel="nofollow" target="_blank" href="http://search.twitter.com/search?q=%23dev8d"&gt;Twitter&lt;/a&gt;(dev8d feed), blogs such as the &lt;a rel="nofollow" target="_blank" href="http://dev8d.jiscinvolve.org/"&gt;JISCInvolve Dev8D site&lt;/a&gt;, and so on. This was not just done for publicity, but as a means to track and re-assemble the various inputs to and outputs from the event.&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/tags/dev8d/"&gt;Flickr&lt;/a&gt; has some really nice photos on it, shared by people like &lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/ianibbo/"&gt;Ian Ibbotson&lt;/a&gt; (who caught &lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/ianibbo/3275945388/"&gt;an urban fox&lt;/a&gt; on camera during the event!) While there was an 'official' &lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/dev8d"&gt;dev8d flickr user&lt;/a&gt;, I expect the most unexpected and most interesting photos to be shared by other people who kindly add on the dev8d tag so we can find them. For conference organisers, this means that there is a pool of images that we can choose from, each with their own provenance so we can contact the owner if we wanted to re-use, or re-publish. Of course, if the owner puts a &lt;a rel="nofollow" target="_blank" href="http://creativecommons.org/"&gt;CC licence&lt;/a&gt; on them, it makes things easier :)&lt;br /&gt;&lt;br /&gt;So, asserting a tag or label for an event is a useful thing to do in any case. But, this twinned with using a messaging system like &lt;a rel="nofollow" target="_blank" href="http://twitter.com/"&gt;Twitter&lt;/a&gt; or &lt;a rel="nofollow" target="_blank" href="http://identi.ca/"&gt;Identi.ca&lt;/a&gt;, means that you can coordinate, share, and bring together an event. There was a projector in the Basecamp room, which was either the bar, or one of the large basement rooms at Birkbeck depending on the day. Initially, this was used to run through the basic flow of events, which was primarily organised through the use of a &lt;a rel="nofollow" target="_blank" href="http://code.google.com/p/developerhappinessdays/"&gt;wiki&lt;/a&gt;, to which all of us and the attendees were members.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;Projecting the bird's eye view of the event&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;I am not entirely sure whose idea it was initially to use the projector to follow the dev8d tag on twitter, auto-refreshing itself every minute, but it would be one or more of the following: Dave Flanders(&lt;a rel="nofollow" target="_blank" href="http://twitter.com/dfflanders"&gt;@dfflanders&lt;/a&gt;), Andy McGregor(&lt;a rel="nofollow" target="_blank" href="http://twitter.com/andymcg"&gt;@andymcg&lt;/a&gt;) and Dave Tarrant(&lt;a rel="nofollow" target="_blank" href="http://twitter.com/davetaz"&gt;@davetaz&lt;/a&gt;) who is aka BitTarrant due to his network wizardry keeping the wifi going despite Birkbeck's network's best efforts at stopping any form of useful networking going.&lt;br /&gt;&lt;br /&gt;The funny thing about the feed being there, was that it felt perfectly natural from the start. Almost like a mix of notice board, event liveblog and facebook status updates, but the overall effect was like it was the&lt;i&gt; bird's eye view&lt;/i&gt; of the entire event, which you could dip into and out of at will, follow up on talks you weren't even attending, catch interesting links that people posted, and just follow the whole event while doing your own thing.&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;Then things got interesting.&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;From what I heard, a conversation in the bar about developer happiness (involving &lt;a rel="nofollow" target="_blank" href="http://twitter.com/rgardler"&gt;@rgardler&lt;/a&gt;?) lead to &lt;a rel="nofollow" target="_blank" href="http://samscam.co.uk/"&gt;Sam Easterby-Smith&lt;/a&gt; (&lt;a rel="nofollow" target="_blank" href="http://twitter.com/samscam"&gt;@samscam&lt;/a&gt;) to create a script that dug through the dev8d tweets looking for &lt;i&gt;n/m&lt;/i&gt; (like 7/10) and to use that as a mark of happyness e.g.&lt;br /&gt;&lt;blockquote&gt;&lt;a rel="nofollow" target="_blank" href="http://twitter.com/samscam"&gt;" @samscam&lt;/a&gt; #dev8d I am seriously 9/10 happy &lt;a rel="nofollow" target="_blank" href="http://samscam.co.uk/happier"&gt;http://samscam.co.uk/happier&lt;/a&gt; HOW HAPPY ARE YOU? " &lt;a rel="nofollow" target="_blank" href="http://twitter.com/samscam/status/1197185415"&gt; (Tue, 10 Feb 2009 11:17:15)&lt;/a&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;img src="http://lh4.ggpht.com/_KLlGSypGAvw/SZvxf5lx4rI/AAAAAAAAAEA/C9twrbS5xgE/%5BUNSET%5D.png?imgmax=800" style="max-width:800px;"/&gt;&lt;br /&gt;&lt;br /&gt;And computed the average happyness and overall happyness of those who tweeted how they were doing!&lt;br /&gt;&lt;br /&gt;Of course, being friendly, constructive sorts, we knew the best way to help 'improve' his happyometer was to try to break it by sending it bad input... *ahem*.&lt;br /&gt;&lt;blockquote&gt;&lt;a rel="nofollow" target="_blank" href="http://twitter.com/samscam"&gt;" @samscam&lt;/a&gt; #dev8d based on instant discovery of bugs in the Happier Pipe am now only 3/5 happy " (&lt;a rel="nofollow" target="_blank" href="http://twitter.com/samscam/statuses/1197215138"&gt;Tue, 10 Feb 2009 23:05:05&lt;/a&gt;)&lt;br /&gt;&lt;/blockquote&gt;BUT things got fixed, and the community got involved and interested. It caused talk and debate, got people wondering how that it was done, how they could do the same thing and how to take it further.&lt;br /&gt;&lt;br /&gt;At which point, I thought it might be fun to 'retweet' the happyness ratings as they change, to keep a running track of things. And so, a purpose for &lt;a rel="nofollow" target="_blank" href="http://twitter.com/randomdev8d"&gt;@randomdev8d&lt;/a&gt; was born:&lt;br /&gt;&lt;br /&gt;&lt;img src="http://lh4.ggpht.com/_KLlGSypGAvw/SZvxqf_Xz5I/AAAAAAAAAEE/Gr_rAh0ojPs/%5BUNSET%5D.png?imgmax=800" style="max-width:800px;"/&gt;&lt;br /&gt;&lt;br /&gt;How I did this was fairly simple: I grabbed his page every minute or so, used BeautifulSoup to parse the HTML, got the happyness numbers out and compared it to the last ones the script had seen. If there was a change, it tweeted it and seconds later, the projected tweet feed updated to show the new values - a disproportionate feedback loop, the key to involvement in games; you do something small like press a button or add 4/10 to a message, and you can affect the stock-market ticker of happyness :)&lt;br /&gt;&lt;br /&gt;If I had been able to give my talk on the python code day, the code to do this would contain zero surprises, because I covered 99% of this - so here's my &lt;a rel="nofollow" target="_blank" href="http://developerhappinessdays.googlecode.com/files/dev8d-presentation.pdf"&gt;'slides'&lt;/a&gt;[pdf] - basically a snapshot screencast.&lt;br /&gt;&lt;br /&gt;Here's the crufty code though that did this:&lt;br /&gt;&lt;blockquote&gt;import time&lt;br /&gt;import simplejson, httplib2, BeautifulSoup&lt;br /&gt;h = httplib2.Http()&lt;br /&gt;h.add_credentials('randomdev8d','PASSWORD')&lt;br /&gt;happy = httplib2.Http()&lt;br /&gt;o = 130.9&lt;br /&gt;a = 7.7&lt;br /&gt;import urllib&lt;br /&gt;&lt;br /&gt;while True:&lt;br /&gt;print "Checking happiness...."&lt;br /&gt;(resp, content) = happy.request('http://samscam.co.uk/happier/')&lt;br /&gt;soup = BeautifulSoup.BeautifulSoup(content)&lt;br /&gt;overallHappyness = soup.findAll('div')[2].contents&lt;br /&gt;avergeHappyness = soup.findAll('div')[4].contents&lt;br /&gt;over = float(overallHappyness[0])&lt;br /&gt;ave = float(avergeHappyness[0])&lt;br /&gt;print "Overall %s - Average %s" % (over, ave)&lt;br /&gt;omess = "DOWN"&lt;br /&gt;if over &amp;gt; o:&lt;br /&gt;omess = "UP!"&lt;br /&gt;amess = "DOWN"&lt;br /&gt;if ave &amp;gt; a:&lt;br /&gt;amess= "UP!"&lt;br /&gt;if over == o:&lt;br /&gt;omess = "SAME"&lt;br /&gt;if ave == a:&lt;br /&gt;amess = "SAME"&lt;br /&gt;if not (o == over and a == ave):&lt;br /&gt;print "Change!"&lt;br /&gt;o = over&lt;br /&gt;a = ave&lt;br /&gt;tweet = "Overall happiness is now %s(%s), with an average=%s(%s) #dev8d (from http://is.gd/j99q)" % (overallHappyness[0], omess, avergeHappyness[0], amess)&lt;br /&gt;data = {'status':tweet}&lt;br /&gt;body = urllib.urlencode(data)&lt;br /&gt;(rs,cont) = h.request('http://www.twitter.com/statuses/update.json', "POST", body=body)&lt;br /&gt;else:&lt;br /&gt;print "No change"&lt;br /&gt;time.sleep(120)&lt;br /&gt;&lt;/blockquote&gt;(Available from &lt;a rel="nofollow" target="_blank" href="http://pastebin.com/f3d42c348"&gt;http://pastebin.com/f3d42c348&lt;/a&gt; with syntax highlighting - NB this was written beat-poet style, written from A to B with little concern for form. The fact that it works is a miracle, so comment on the code if you must.)&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;The grand, official #Dev8D survey!&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;... which was anything but official, or grand. The happyness-o-meter idea lead BitTarrant and I to think "Wouldn't it be cool to find out what computers people have brought here?" Essentially, finding out what computer environment developers &lt;i&gt;choose&lt;/i&gt; to use is a very valuable thing - developers choose things which make our lives easier, by and large, so finding out which setups they use by preference to develop or work with could guide later choices, such as being able to actually target the majority of environments for wifi, software, or talks.&lt;br /&gt;&lt;br /&gt;So, on the Wednesday morning, Dave put out the call on @dev8d for people to post the operating systems on the hardware they brought to this event, in the form of OS/HW. I then busied myself with writing a script that hit the twitter search api directly, and parsed it itself. As this was a more intended script, I made sure that it kept track of things properly, pickling its per-person tallys. (You could post up multiple configurations in one or more tweets, and it kept track of it per-person.) This script was a little bloated at 86 lines, so I won't post it inline - plus, it also showed that I should've gone to the regexp lesson, as I got stuck trying to do it with regexp, gave up, and then used whitespace-tokenising... but it worked fine ;)&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://pastebin.com/f2c04719b"&gt;Survey code: http://pastebin.com/f2c04719b&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Survey results:&lt;/span&gt; &lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=pDKcyrBE6SJqToHzjCs2jaQ"&gt;http://spreadsheets.google.com/pub?key=pDKcyrBE6SJqToHzjCs2jaQ&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;OS:&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Linux was the majority at 42%&lt;/span&gt; closely followed by Apple at 37% with MS-based OS at 18% with a stellar showing of one user of OpenSolaris (4%)!&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Hardware type:&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;66% were laptops, with 25% of the machines there being classed as netbooks&lt;/span&gt;. 8% of the hardware there were iPhones too, and one person claimed to have brought Amazon EC2 with them ;)&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;The post hoc analysis&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;Now then, having gotten back to normal life, I've spent a little time grabbing stuff from twitter and digging through them. Here is the &lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=pDKcyrBE6SJoJdIcm7mdpBg"&gt;list of the 1300+ tweets with the #dev8d tag in them&lt;/a&gt; published via google docs, and here is some derived things posted by Tony Hirst(&lt;a rel="nofollow" target="_blank" href="http://twitter.com/psychemedia"&gt;@psychemedia&lt;/a&gt;) and Chris Wilper(&lt;a rel="nofollow" target="_blank" href="http://twitter.com/cwilper"&gt;@cwilper&lt;/a&gt;) seconds after I posted this:&lt;br /&gt;&lt;br /&gt;Tagcloud of twitterer's:&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.wordle.net/gallery/wrdl/549364/dev8_twitterers"&gt;http://www.wordle.net/gallery/wrdl/549364/dev8_twitterers&lt;/a&gt; [java needed]&lt;br /&gt;&lt;br /&gt;Tagcloud of tweeted words:&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.wordle.net/gallery/wrdl/549350/dev8d"&gt;http://www.wordle.net/gallery/wrdl/549350/dev8d&lt;/a&gt; [java needed]&lt;br /&gt;&lt;br /&gt;And a column of all the tweeted links:&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=p1rHUqg4g423-wWQn8arcTg"&gt;http://spreadsheets.google.com/pub?key=p1rHUqg4g423-wWQn8arcTg&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;This lead me to dig through them and republish the list of tweets, but try to unminimise the urls and try to grab the &amp;amp;lt;title&amp;gt; tag of the html page it goes to, which you can find here:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=pDKcyrBE6SJpwVmV4_4qOdg"&gt;http://spreadsheets.google.com/pub?key=pDKcyrBE6SJpwVmV4_4qOdg&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;(Which incidently, lead me to spot that there was one link to "YouTube - Rick Astley - Never Gonna Give You Up" which means the hacking was all worthwhile :))&lt;br /&gt;&lt;br /&gt;&lt;big&gt;&lt;big&gt;Graphing Happyness&lt;/big&gt;&lt;/big&gt;&lt;br /&gt;&lt;br /&gt;For one, I've re-analysed the happyness tweets and posted up the following:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;A &lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=pDKcyrBE6SJqHVP8Fb7euEA"&gt;full log of happyness with timeline attached to it&lt;/a&gt;,&lt;/li&gt;&lt;li&gt;&lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=pDKcyrBE6SJoxj8D7_EWscQ"&gt;The running average, with accompanying timeline,&lt;/a&gt;&lt;/li&gt;&lt;li&gt;and the &lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=pDKcyrBE6SJp6acAAn77SZQ"&gt;average of the last 10 tweets&lt;/a&gt; in much the same way as before.&lt;/li&gt;&lt;/ul&gt;It is easier to understand the averages as graphs over time of course! You could also use Tony Hirst's excellent write up here about &lt;a rel="nofollow" target="_blank" href="http://ouseful.wordpress.com/2009/02/17/creating-your-own-results-charts-for-surveys-created-with-google-forms/"&gt;creating graphs from google forms and spreadsheets.&lt;/a&gt; I'm having issues embedding the google timeline widget here, so you'll have to make do with static graphs.&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://lh6.ggpht.com/_KLlGSypGAvw/SZr0KFxfnRI/AAAAAAAAADk/AJQI307X1As/s800/dev8d_running_total_average.png"&gt;&lt;img style="margin:0px auto 10px;display:block;text-align:center;cursor:pointer;width:800px;height:468px;" src="http://lh6.ggpht.com/_KLlGSypGAvw/SZr0KFxfnRI/AAAAAAAAADk/AJQI307X1As/s800/dev8d_running_total_average.png" alt="" border="0"/&gt;&lt;/a&gt;&lt;span style="font-weight:bold;"&gt;Average happyness over the course of the event - all tweets counted towards the average.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://lh4.ggpht.com/_KLlGSypGAvw/SZr0KnArJ0I/AAAAAAAAADs/N5OdzUBDefQ/s912/dev8d_last10HappynessTweetsCount.png"&gt;&lt;img style="margin:0px auto 10px;display:block;text-align:center;cursor:pointer;width:912px;height:510px;" src="http://lh4.ggpht.com/_KLlGSypGAvw/SZr0KnArJ0I/AAAAAAAAADs/N5OdzUBDefQ/s912/dev8d_last10HappynessTweetsCount.png" alt="" border="0"/&gt;&lt;/a&gt;&lt;span style="font-weight:bold;"&gt;Average happyness, but only the previous 10 tweets counted towards the average making it more reflective of the happyness at that time.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;If you are wondering about the first dip, that was when we all tried to break Sam's tracker by sending it bad data, a lot of 0 happyness's were recorded therefore :) As for the second dip, well, you can see that from the &lt;a rel="nofollow" target="_blank" href="http://spreadsheets.google.com/pub?key=pDKcyrBE6SJqHVP8Fb7euEA"&gt;log of happyness&lt;/a&gt;, yourselves :)&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-3073595490420683868?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-3073595490420683868</guid>
         <pubDate>Tue, 17 Feb 2009 20:18:00 -0800</pubDate>
      </item>
      <item>
         <title>Making developers happy</title>
         <link>http://blog.paulwalk.net/2009/02/13/making-developers-happy/</link>
         <description>Since I joined UKOLN two years ago, I have frequently claimed that we (JISC, the sector, our community) don&amp;#8217;t do enough to support and listen to developers. Well, I&amp;#8217;m just back from The Developers Happiness Days (dev8D) in London and I can certainly no longer say this. A solid week of developer happiness! A week [...]</description>
         <guid isPermaLink="false">http://blog.paulwalk.net/2009/02/13/making-developers-happy/</guid>
         <pubDate>Fri, 13 Feb 2009 12:39:29 -0800</pubDate>
         <content:encoded><![CDATA[<p>Since I joined <a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk" title="UKOLN">UKOLN</a> two years ago, I have frequently claimed that we (<a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk" title="JISC">JISC</a>, the sector, our community) don&#8217;t do enough to support and listen to developers. Well, I&#8217;m just back from <a rel="nofollow" target="_blank" href="http://www.dev8d.org/">The Developers Happiness Days (dev8D)</a> in London and I can certainly no longer say this. A solid week of developer happiness! A week of ideas generated, geeks networking with users, competitive and yet collaborative development, knowledge being exchanged&#8230;. followed by fun and, yes, a bit of drinking.</p>
<p>
<img src="http://blog.paulwalk.net/wp-content/uploads/2009/02/dev8d-developers.jpg" width="480" height="204" alt="dev8d-developers.jpg" name="dev8d-developers.jpg" style="padding-right:5px;float:left;" id="dev8d-developers.jpg"/></p>
<p>The brain-child of <a rel="nofollow" target="_blank" href="http://dfflanders.wordpress.com/">David Flanders</a> and <a rel="nofollow" target="_blank" href="http://www.oxfordrepo.blogspot.com/">Ben O&#8217;Steen</a>, with support and ideas from several others and funding from the JISC, dev8D has been a fantastic success, and has managed the difficult task of appealing to, and being successful for, a range of people with varying levels of experience and technical chops. The inexperienced developer looking to be exposed to new ideas and to the wisdom of more experienced folk was well served. Julian Cheal of UKOLN fitted this description and he embraced the opportunities dev8D presented to him, engaging at all levels with the event to the extent that he was rewarded with both a prize for his helpfulness and a special mention at the awards dinner for &#8216;best newcomer&#8217;. But the older hands were fully engaged nonetheless &#8211; presenting on their areas of expertise in &#8216;lightning talks&#8217; in the true barcamp style which geeks have embraced as their own way of conducting conference sessions. It was great to see so many familiar faces together at one event, being unashamedly techie, exchanging ideas and help.</p>
<p>Although, like some others, I was forced to miss some of the event due to a deadline for bids to a JISC call falling on the Wednesday, I still managed to sit in on some sessions, and I learned plenty, especially in a talk on agile development about which I&#8217;ll blog more, separately.</p>
<p>One of the things which stood out at dev8D was the way in which users (or <a rel="nofollow" target="_blank" href="http://code.google.com/p/developerhappinessdays/wiki/UberUsers"><em>UberUsers</em></a>) were invited to engage with developers. There&#8217;s an <strong>important, non-obvious</strong> distinction here. Users were invited to come into the developers&#8217; environment. Brave users, you might say! Normally, developers are invited into the users&#8217; environment&#8230;. for just long enough to explain to them what the users require. Users would often rather not have to deal with developers all that much. To step into an environment of happy, busy developers must have been an eye-opening experience for those users who were brave enough, and open-minded enough to try it. Although I wasn&#8217;t on the &#8216;dragons den&#8217; panel looking at the prototypes being developed in the <em><a rel="nofollow" target="_blank" href="http://code.google.com/p/developerhappinessdays/wiki/DeveloperDecathlon">Developer Decathlon</a></em>, it was remarked to me several times that the quality of submissions was better than in previous events &#8211; and that this was attributed to the fact that users had been involved in the prototyping process. I&#8217;m one of the judges who&#8217;ll be marking these submissions and I&#8217;m really looking forward to seeing what was produced.</p>
<p>With these events, there are little things which can make a difference. The use of <a rel="nofollow" target="_blank" href="http://www.wordle.net/">Wordle</a> to produce personalised name badges for each delegate was inspired, as was the use of happiness tokens to reward help or ideas. The Twitter back-channel was used to tremendous effect &#8211; the &#8216;#dev8D&#8217; tag made the top ten Twitter &#8216;trends&#8217; worldwide. Sam Easterby Smith even built a <a rel="nofollow" target="_blank" href="http://samscam.co.uk/happier/">Twitter-powered developer-happiness meter</a>!</p>
<p>I&#8217;d like to go on record thanking David Flanders in particular for driving this event &#8211; the guy must be utterly exhausted after working 18 hour days for a week. I think we should also recognise the vision of those in the JISC (and especially <a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/contactus/staff/rachelbruce">Rachel Bruce</a>) who were prepared to back what must have looked like a risky proposition. There was value in the event itself &#8211; the networking, and the capacity building which went with this and I have good reason to believe there will be value in the prototypes and ideas generated as a result. But, perhaps most importantly, the sector has just shown the world that it values its developers, and is prepared to invest in them, and even spend a little to make them happy. I believe this will have been a wise investment. As I said on Twitter, <em><a rel="nofollow" target="_blank" href="http://twitter.com/paulwalk/status/1204506122">there&#8217;s a community developing which I&#8217;m proud to be associated with.</a></em></p>
<p>If you want to know more, the tag &#8216;dev8D&#8217; has been used extensively in various systems. Some examples:</p>
<ul>
<li><a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/tags/dev8d">Photos on Flickr</a></li>
<li><a rel="nofollow" target="_blank" href="http://blogsearch.google.com/blogsearch?hl=en&amp;ie=UTF-8&amp;q=dev8d&amp;btnG=Search+Blogs">Google blog search</a></li>
</ul>
<p><em><br /></em></p>
<p style="text-align:right;"><em>Image by Dave Pattern (<a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/davepattern/3274205523/sizes/m/">http://www.flickr.com/photos/davepattern/3274205523/sizes/m/</a>)</em></p>]]></content:encoded>
      </item>
      <item>
         <title>briankelly: Blog post on possible synergies between UK's #IWMW event &amp; US's #eduWeb http://is.gd/jgkK (with refs to #Dev8D / #CRIG folk</title>
         <link>http://twitter.com/briankelly/statuses/1202055528</link>
         <description>briankelly: Blog post on possible synergies between UK's #IWMW event &amp; US's #eduWeb http://is.gd/jgkK (with refs to #Dev8D / #CRIG folk</description>
         <guid isPermaLink="false">http://twitter.com/briankelly/statuses/1202055528</guid>
         <pubDate>Thu, 12 Feb 2009 00:48:36 -0800</pubDate>
      </item>
      <item>
         <title>EPrints 3.2 - Amazon S3/Cloudfront Plug-in</title>
         <link>http://davetaz-blog.blogspot.com/2009/01/eprints-32-amazon-s3cloudfront-plug-in.html</link>
         <description>A quick post to say that we have just successfully tested an EPrints 3.2 (svn) install with the new Storage Controller plugged into Amazon S3! &lt;br /&gt;&lt;br /&gt;This has quiet a lot of implications for both EPrints and other projects wanting to provide external services which operate on objects in a repository. We hope to bring people more news on this at the upcoming Open Repositories 2009 conference in Atlanta. &lt;br /&gt;&lt;br /&gt;For more information on this all check out storage section on the Preserv2 website @ &lt;a rel="nofollow" target="_blank" href="http://www.preserv.org.uk"&gt;www.preserv.org.uk&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-7166652851684473013?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-7166652851684473013</guid>
         <pubDate>Tue, 20 Jan 2009 21:45:00 -0800</pubDate>
      </item>
      <item>
         <title>eFoundations: The apples and oranges of Shibboleth and OpenID</title>
         <link>http://efoundations.typepad.com/efoundations/2008/12/the-apples-and-oranges-of-shibboleth-and-openid.html</link>
         <description>I&amp;#039;ve found the Computing Services at Cambridge to be very positive about OpenID. Reports like that aren&amp;#039;t likely to move it any closer at a strategic level though.</description>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_33b132a13fa53a9fa20e5bf44605c7f4</guid>
         <pubDate>Wed, 24 Dec 2008 00:10:57 -0800</pubDate>
      </item>
      <item>
         <title>Caveat Lector » Blog Archive » Home-grown versus outsourced repository software</title>
         <link>http://cavlec.yarinareth.net/2008/11/24/home-grown-versus-outsourced-repository-software/</link>
         <description>Interesting commentary on development communities around open source repositories.</description>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_84eb55f2d9a173f8fd6f3c125d6a18df</guid>
         <pubDate>Tue, 25 Nov 2008 01:57:48 -0800</pubDate>
      </item>
      <item>
         <title>Big Blue Brainstorm</title>
         <link>http://www.businessweek.com/magazine/content/06_32/b3996062.htm</link>
         <description>The big blue jumps in with the rest of barcamping fools!</description>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_6e4540b5fb634bb0ed3877e63960a9ec</guid>
         <pubDate>Sun, 23 Nov 2008 06:34:28 -0800</pubDate>
      </item>
      <item>
         <title>Panlibus » Blog Archive » The economics of scarcity – JP Rangaswami talks ahead of Online Information 2008</title>
         <link>http://blogs.talis.com/panlibus/archives/2008/11/the-economics-of-scarcity-jp-rangaswami-talks-ahead-of-online-information-2008.php</link>
         <description>Finally got around to listening to this today. Fantastic stuff with insight on topics from Open Source to scholarly publishing to librarians and digital freedoms. Nearly shouted hallelujah on the train at the bit about enterprise integration being like sticking your data in a hole and filling it with concrete!</description>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_edf0c3440ce4e66d0da50714fabf1f89</guid>
         <pubDate>Thu, 20 Nov 2008 04:31:03 -0800</pubDate>
      </item>
      <item>
         <title>The Future of Repositories? Patterns for (Cross-)Repository Architectures</title>
         <link>http://dlib.org/dlib/november08/aschenbrenner/11aschenbrenner.html</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_f58eae525e5d7bb29011ae9d0e8c4ce1</guid>
         <pubDate>Wed, 19 Nov 2008 02:40:55 -0800</pubDate>
      </item>
      <item>
         <title>The Future of Repositories? Patterns for (Cross-)Repository Architectures</title>
         <link>http://dlib.org/dlib/november08/aschenbrenner/11aschenbrenner.html</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_f58eae525e5d7bb29011ae9d0e8c4ce1</guid>
         <pubDate>Tue, 18 Nov 2008 07:34:36 -0800</pubDate>
      </item>
      <item>
         <title>Twitter / Richard Akerman: @dfflanders idea (not mine ...</title>
         <link>http://twitter.com/scilib/status/1008777800</link>
         <description>tweets from SPARC-DR</description>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_fea585e820379c8ddb30a6b818d24cfb</guid>
         <pubDate>Mon, 17 Nov 2008 00:44:00 -0800</pubDate>
      </item>
      <item>
         <title>1,000 Web APIs</title>
         <link>http://blog.programmableweb.com/2008/11/03/1000-web-apis/</link>
         <description>Some nice statistics in regards to API trends.</description>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_1e22f3334b3e6b31e905bd236a6e5456</guid>
         <pubDate>Fri, 14 Nov 2008 13:01:56 -0800</pubDate>
      </item>
      <item>
         <title>Beginning with RDF triplestores - a 'survey'</title>
         <link>http://oxfordrepo.blogspot.com/2008/11/beginning-with-rdf-triplestores.html</link>
         <description>&lt;div&gt;Like last time, this was prompted by an email that eventually was passed to me. It was a call for opinion - "&lt;tt&gt;&lt;font color='#737373'&gt;we thought we'd check first to see what software&lt;/font&gt;&lt;/tt&gt;&lt;tt&gt;&lt;font color='#737373'&gt; either of you recommend or use for an RDF database.&lt;/font&gt;&lt;/tt&gt;"&lt;br/&gt;&lt;br/&gt;It's a good question.&lt;br/&gt;&lt;br/&gt;In fact, it's a really great question, as searching for similar advice online results in very few opinions on the subject.&lt;br/&gt;&lt;br/&gt;But which one's are the best for novices? Which have the best learning curves? which has the easiest install or the shortest time between starting out and being able to query things?&lt;br/&gt;&lt;br/&gt;I'll try to pose as much as I can as a newcomer which won't be too hard :) Some of the comments will be my own, and some will be comments from others, but I'll try to be as honest as I can be to reflect new user expectation and experience and most importantly, developer-attention span. (See the end for some of my reasons for this approach.)&lt;br/&gt;&lt;br/&gt;&lt;em&gt;(Puts on newbie hat and enables PEBKAC mode.)&lt;/em&gt;&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Installable (local) triplestores&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Sesame&lt;/b&gt; - &lt;a rel="nofollow" target="_blank" href='http://www.openrdf.org/'&gt;http://www.openrdf.org/&lt;/a&gt;&lt;br/&gt;&lt;br/&gt;Simple menu on the left of the website, one called downloads. Great, I'll give that a whirl. "Download &lt;a rel="nofollow" target="_blank" href='http://sourceforge.net/project/showfiles.php?group_id=46509&amp;amp;package_id=168413'&gt;the latest Sesame 2.x release&lt;/a&gt;" looks good to me. Hmm 5 differently named files... I'll grab the 'onejar' file and try to run it. "Failed to load Main-Class manifest attribute from openrdf-sesame-2.2.1-onejar.jar", okay... so back to the site to find out how to install this thing.&lt;br/&gt;&lt;br/&gt;No links for installation guide... on the &lt;a rel="nofollow" target="_blank" href='http://www.openrdf.org/documentation.jsp'&gt;Documentation&lt;/a&gt; page, no link for installation instructions for the sesame 2.2.1 I downloaded, but there is &lt;a rel="nofollow" target="_blank" href='http://www.openrdf.org/doc/sesame2/users/'&gt;Sesame 2 user documentation&lt;/a&gt; and &lt;a rel="nofollow" target="_blank" href='http://www.openrdf.org/doc/sesame2/system/'&gt;Sesame 2 system documentation&lt;/a&gt;. Phew, after guessing that the user documentation might have the guide, I finally found the &lt;a rel="nofollow" target="_blank" href='http://www.openrdf.org/doc/sesame2/users/ch06.html'&gt;installation guide&lt;/a&gt; (system documentation was about the architecture, not how to administer the system as you might expect.)&lt;br/&gt;&lt;br/&gt;(Developer losing interest...)&lt;br/&gt;&lt;br/&gt;Ah, I see, I need the SDK. I wonder what that 'onejar' was then... "The deployment process is container-specific, please consult the&lt;br/&gt; documentation for your container on how to deploy a web application. " - right, okay... let's assume that I have a Java background and am not just a user wanting to hook into it from my language of choice, such as php, ruby, python, or dare I say it, javascript.&lt;br/&gt;&lt;br/&gt;(Only Java-friendly developers continue on)&lt;br/&gt;&lt;br/&gt;Right, got Tomcat, and put in the war file... right so, now I need to work out how to use a &lt;a rel="nofollow" target="_blank" href='http://www.openrdf.org/doc/sesame2/users/ch07.html#d0e354'&gt;commandline&lt;/a&gt; console tool to set up a 'repository'... does this use SVN or CVS then? Oh, it doesn't do anything unless I end the line with a period. I thought it had hung trying to connect! "Triple indexes [spoc,posc]" Wha? Well, whatever that was, the test repository is created. Let's see what's at http://localhost:8080/openrdf-sesame then. &lt;br/&gt;&lt;br/&gt;"You are currently accessing an OpenRDF Sesame server. This server is&lt;br/&gt;intended to be accessed by dedicated clients, using a specialized&lt;br/&gt;protocol. To access the information on this server through a browser,&lt;br/&gt;we recommend using the OpenRDF Workbench software."&lt;br/&gt;&lt;br/&gt;Bugger. Google for "sesame clients" then.&lt;br/&gt;&lt;ul&gt;&lt;li&gt;There is a Java client it seems, but it seems to need a lot to get going. Oh, and useful if my application is in Java or in a JVM (jRuby, jython)&lt;br/&gt;&lt;/li&gt;&lt;li&gt;&lt;a rel="nofollow" target="_blank" href='http://jeenbroekstra.blogspot.com/2008/09/sesame-2-desktop-client.html'&gt;http://jeenbroekstra.blogspot.com/2008/09/sesame-2-desktop-client.html&lt;/a&gt; .Net GUI... not so useful for programmatic stuff&lt;/li&gt;&lt;li&gt;...&lt;/li&gt;&lt;/ul&gt;I've pretty much given up at this point. If I knew I needed to use a triplestore then I might have persisted, but if I was just investigating it? I would've probably given up earlier.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Mulgara&lt;/b&gt; - &lt;a rel="nofollow" target="_blank" href='http://www.mulgara.org/'&gt;http://www.mulgara.org/&lt;/a&gt;&lt;br/&gt;&lt;br/&gt;Nice, they've given the frontpage some style, not too keen on orange, but the effort makes it look professional. "&lt;em&gt;Mulgara&lt;/em&gt; is a scalable RDF database written entirely in &lt;strong&gt;&lt;a rel="nofollow" class='styleBlack' target="_blank" href='http://java.com/'&gt;Java&lt;/a&gt;&lt;/strong&gt;." -&amp;gt; Great, I found what I am looking for, and it warns me it needs Java. "DOWNLOAD NOW" - that's pretty clear. *click*&lt;br/&gt;&lt;br/&gt;Hmm, where's the style gone? Lots of download options, but thankfully one is marked by "These released binaries are all that are required for most applications." so I'll grab &lt;a rel="nofollow" target="_blank" href='http://www.mulgara.org/files/v2.0.6/mulgara-2.0.6-bin.tar.gz'&gt;those&lt;/a&gt;. 25Mb? Wow...&lt;br/&gt;&lt;br/&gt;Okay, it's downloaded and unpacked now. Let's see what we've got - a 'dist/' directory and two jars. Well, I guess I should try to run one (wonder what the licence is, where's the README?)&lt;br/&gt;&lt;blockquote&gt;&lt;em&gt;Mulgara Semantic Store Version 2.0.6 (Build 2.0.6.local) INFO [main] (EmbeddedMulgaraServer.java:715) - RMI Registry started automatically on port 10990 [main] INFO org.mulgara.server.EmbeddedMulgaraServer - RMI Registry started automatically on port 1099 INFO [main] (EmbeddedMulgaraServer.java:738) - java.security.policy set to jar:file:/home/ben/Desktop/apache-tomcat-6.0.18/mulgara-2.0.6/dist/mulgara-2.0.6.jar!/conf/mulgara-rmi.policy3 [main] INFO org.mulgara.server.EmbeddedMulgaraServer - java.security.policy set to jar:file:/home/ben/Desktop/apache-tomcat-6.0.18/mulgara-2.0.6/dist/mulgara-2.0.6.jar!/conf/mulgara-rmi.policy2008-11-14 14:06:39,899 INFO Database - Host name aliases for this server are: [billpardy, localhost, 127.0.0.1]&lt;/em&gt;&lt;br/&gt;&lt;/blockquote&gt;Well, I guess something has started... back to the site, there is a documentation page and a wiki. A quick view of the official documentation has just confused me, is &lt;a rel="nofollow" target="_blank" href='http://docs.mulgara.org/'&gt;this an external site&lt;/a&gt;? No easy link to something like 'getting started' or tutorials. I've heard of SPARQL, what's iTQL? nevermind, let's see if the &lt;a rel="nofollow" target="_blank" href='http://www.mulgara.org/trac/wiki'&gt;wiki&lt;/a&gt; is more helpful.&lt;br/&gt;&lt;br/&gt;Let's try '&lt;a rel="nofollow" target="_blank" href='http://www.mulgara.org/trac/wiki/Docs'&gt;Documentation&lt;/a&gt;' - sweet, first link looks like what I want - &lt;a rel="nofollow" target="_blank" href='http://www.mulgara.org/trac/wiki/WebUI' class='wiki'&gt;Web User Interface&lt;/a&gt;.&lt;br/&gt;&lt;blockquote&gt;A default configuration for a standalone Mulgara server runs a set of&lt;br/&gt;web services, including the Web User Interface. The standard&lt;br/&gt;configuration puts uses port 8080, so the web services can be seen by&lt;br/&gt;pointing a browser on the server running Mulgara to &lt;a rel="nofollow" target="_blank" href='http://localhost:8080/' class='ext-link'&gt;&lt;span class='icon'&gt;http://localhost:8080/&lt;/span&gt;&lt;/a&gt;.&lt;br/&gt;&lt;/blockquote&gt;Ooo cool. *click* &lt;br/&gt;&lt;blockquote&gt;&lt;h2&gt;Available Services&lt;/h2&gt;&lt;ul&gt;&lt;li&gt;&lt;a rel="nofollow" target="_blank" href='http://localhost:8080/sparql'&gt;SPARQL HTTP Service&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a rel="nofollow" target="_blank" href='http://localhost:8080/webui'&gt;User Interface&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a rel="nofollow" target="_blank" href='http://localhost:8080/webservices'&gt;Web Services&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a rel="nofollow" target="_blank" href='http://localhost:8080/tql'&gt;TQL HTTP Service&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/blockquote&gt;&lt;br/&gt;SPARQL, I've heard of that. *click* &lt;br/&gt;&lt;blockquote&gt;&lt;h2&gt;HTTP ERROR: 400&lt;/h2&gt;&lt;pre&gt;Query must be supplied&lt;/pre&gt;&lt;p&gt;RequestURI=/sparql/&lt;/p&gt;&lt;p&gt;&lt;i&gt;&lt;small&gt;&lt;a rel="nofollow" target="_blank" href='http://jetty.mortbay.org/'&gt;Powered by Jetty://&lt;/a&gt;&lt;/small&gt;&lt;/i&gt;&lt;/p&gt;&lt;/blockquote&gt;I guess that's the SPARQL api, good to know, but the frontpage could've warned me a little. Ah, second link is to the User Interface.&lt;br/&gt;&lt;br/&gt;Good, I can use a drop down to look at lots of example queries, nice. Don't understand most of them at the moment, but it's definitely comforting to have examples. They look nothing like SPARQL though... wonder what it is? I'm sure it does SPARQL... was I wrong?&lt;br/&gt;&lt;br/&gt;Quick poke at the HTML shows that it is just POSTing the query text to webui/ExecuteQuery. Looks straightforward to start hacking against too, but probably should password protect this somehow! I wonder how that is done... documentation mentions a '&lt;tt&gt;java.security.policy'&lt;/tt&gt; field:&lt;tt&gt;&lt;br/&gt;&lt;br/&gt;java.security.policy&lt;/tt&gt;&lt;i&gt;&lt;br/&gt;string: URL&lt;/i&gt;: The URL for the security policy file to use.&lt;br/&gt;Default: jar:file:/jar_path!/conf/mulgara-rmi.policy &lt;br/&gt;&lt;blockquote&gt;&lt;p/&gt;&lt;/blockquote&gt;Kinda stumped... will investigate that later, but at least there's hope. Just be firing off the example queries though shows me stuff, so I've got something to work with at least.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Jena&lt;/b&gt; - &lt;a rel="nofollow" target="_blank" href='http://jena.sourceforge.net/'&gt;http://jena.sourceforge.net/&lt;/a&gt;&lt;br/&gt;&lt;br/&gt;Front page is pretty clear, even if I don't understand what all those acronyms are. &lt;a rel="nofollow" target="_blank" href='http://jena.sourceforge.net/downloads.html'&gt;downloads&lt;/a&gt; link takes me to a page with an obvious download link, good. (Oh, and sourceforge, you suck. How many frikkin mirrors do I have to try to get this file?)&lt;br/&gt;&lt;br/&gt;Have to put Jena on pause while Sourceforge sorts its life out.&lt;br/&gt;&lt;br/&gt;&lt;b&gt;ARC2&lt;/b&gt; - http://arc.semsol.org/&lt;br/&gt;&lt;br/&gt;Frontpage: "Easy RDF and SPARQL for LAMP systems" Nice, I know of LAMP and I particularly like the word Easy. Let's see... &lt;a rel="nofollow" target="_blank" href='http://arc.semsol.org/download'&gt;Download&lt;/a&gt; is easy to find, and tells me straight away I need PHP 4.3+ and MySQL 4.0.4+ *check* Right, now how do I enable PHP for apache again?... Ah, it helps if I install it first... Okay, done. Dropping the folder into my web space... Hmm nothing does anything. From the documentation, it does look like it is geared to providing a PHP library framework for working with its triplestore and RDF. Hang on, &lt;a rel="nofollow" target="_blank" href='http://arc.semsol.org/docs/v2/endpoint'&gt;SPARQL Endpoint Setup&lt;/a&gt; looks like what I want. It wants a database, okay... done, bit of a hassle though.&lt;br/&gt;&lt;br/&gt;Hmm, all I get is "&lt;b&gt;Fatal error&lt;/b&gt;: Call to undefined function mysql_connect() in &lt;b&gt;/********/arc2/store/ARC2_Store.php&lt;/b&gt; on line &lt;b&gt;53"&lt;br/&gt;&lt;/b&gt;&lt;br/&gt;Of course, install php libraries to access mysql (PEBKAC)... done and I also realise I need to set up the store, like the example in "&lt;a rel="nofollow" target="_blank" href='http://arc.semsol.org/docs/v2/getting_started'&gt;Getting Started&lt;/a&gt;"... done (with &lt;a rel="nofollow" target="_blank" href='http://pastebin.com/f2ca379e7'&gt;this&lt;/a&gt;) and what does the index page now look like?&lt;br/&gt;&lt;br/&gt;&lt;img src='http://lh4.ggpht.com/_KLlGSypGAvw/SR2Xk92vjbI/AAAAAAAAACo/RhWSkZvbYCM/%5BUNSET%5D.png?imgmax=800' style='max-width:800px;'/&gt;&lt;br/&gt;&lt;br/&gt;Yay! there's like SPARQL and stuff... I guess 'load' and 'insert' will help me stick stuff in, and 'select' looks familiar... Well, it seems to be working at least.&lt;br/&gt;&lt;br/&gt;Unfortunately, it looks like the Jena download from sourceforge is in a world of FAIL for now. Maybe I'll look at it next time?&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Triplestores in the cloud&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;Talis Platform - &lt;a rel="nofollow" target="_blank" href='http://www.talis.com/platform/'&gt;http://www.talis.com/platform/&lt;/a&gt;&lt;br/&gt;&lt;br/&gt;From the frontpage - "&lt;i&gt;Developers using the Platform can spend more of their time building&lt;br /&gt;extraordinary applications and less of their time worrying about how&lt;br /&gt;they will scale their data storage.&lt;/i&gt;" - pretty much want I wanted to hear, so how do I get to play with it?&lt;br/&gt;&lt;br/&gt;There is a &lt;a rel="nofollow" title='Get involved' target="_blank" href='http://www.talis.com/platform/get_involved/index.shtml'&gt;Get involved&lt;/a&gt; link on the left, which rapidly leads me to see the section: "Develop, play and try out" - &lt;a rel="nofollow" target="_blank" href='http://n2.talis.com/wiki/Main_Page'&gt;n&lt;sup&gt;2&lt;/sup&gt; developer community &lt;/a&gt; seems to be where it wants me to go. &lt;br/&gt;&lt;br/&gt;Lots of links on the frontpage, takes a few seconds to spot: "&lt;a rel="nofollow" title='Join' target="_blank" href='http://n2.talis.com/wiki/Join'&gt;Join&lt;/a&gt; - join the n² community to get free developer stores and online support" - free, nice word that. So, I just have to email someone? Okay, I can live with that.&lt;br/&gt;&lt;br/&gt;Documentation seems good, lots of choices though, a little hard to spot a single thread to follow to get up to speed, but &lt;a rel="nofollow" title='Guides and Tutorials' target="_blank" href='http://n2.talis.com/wiki/Guides_and_Tutorials'&gt;Guides and Tutorials&lt;/a&gt; looks right to get going with. The &lt;a rel="nofollow" target="_blank" href='http://n2.talis.com/wiki/Kniblet_Tutorial'&gt;Kniblet tutorial&lt;/a&gt; (whatever a kniblet is) looks the most beginnerish, and it's also very PHP focussed, which is either a good thing or a bad thing depending on the user :)&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Commercial triplestores&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Openlink Virtuoso&lt;/b&gt; - &lt;a rel="nofollow" target="_blank" href='http://virtuoso.openlinksw.com/'&gt;http://virtuoso.openlinksw.com/&lt;/a&gt;&lt;br/&gt;&lt;br/&gt;Okay, I tried the &lt;a rel="nofollow" target="_blank" href='http://download.openlinksw.com/download/'&gt;Download&lt;/a&gt; link, but I am pretty confused by what I'm greeted with: &lt;br/&gt;&lt;br/&gt;&lt;img src='http://lh3.ggpht.com/_KLlGSypGAvw/SR2rWiIXUBI/AAAAAAAAACs/qOB0ORoEOI4/%5BUNSET%5D.png?imgmax=800' style='max-width:800px;'/&gt;&lt;br/&gt;&lt;br/&gt;Not sure what one to pick just to try it out, it's late in the day, and my tolerance for all things installable has ended.&lt;br/&gt;&lt;br/&gt;-----------------------------------------&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Why take the http/web-centric, newbie approach to looking at these?&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;&lt;i&gt;Answer: &lt;/i&gt;In part, I am taking this approach because I have a deep belief that it&lt;br/&gt;was only after relational DBs became commoditised - "You want fries&lt;br/&gt;with you MySQL database?" - that the dynamic web kicked off. If we want&lt;br/&gt;the semantic web to kick off, we need to commoditise it or at least, make&lt;br/&gt;it very easy for developers to get started. And I mean &lt;b&gt;&lt;i&gt;EASY&lt;/i&gt;&lt;/b&gt;. A query that I want answered is: "Is there something that fits: 'apt-get install&lt;br/&gt;triplestore; r = store('localhost'), r.add(rdf), r.query(blah)'? " &lt;br/&gt;&lt;br/&gt;(I am particularly interested to see what happens when &lt;a rel="nofollow" target="_blank" href='http://tom.opiumfield.com/'&gt;Tom Morris&lt;/a&gt;'s work on &lt;a rel="nofollow" target="_blank" href='http://github.com/tommorris/reddy/tree/master'&gt;Reddy&lt;/a&gt; collides with ActiveRecord or activerdf...)&lt;br/&gt;&lt;br/&gt;&lt;b&gt;NB &lt;/b&gt;I've short circuited the discovery of software homepages - Imagine&lt;br/&gt;I've seen projects stating that they use "XXXXX as a triplestore". I know&lt;br/&gt;this will likely mean I've compared apples to oranges, but as a newbie, how&lt;br/&gt;would I be expected to know this? "Powered by the Talis Platform" and&lt;br/&gt;"Powered by Jena" seem pretty similar on the surface.)&lt;br/&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-280227876627147956?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-280227876627147956</guid>
         <pubDate>Fri, 14 Nov 2008 00:50:00 -0800</pubDate>
      <enclosure url="http://www.mulgara.org/files/v2.0.6/mulgara-2.0.6-bin.tar.gz" length="26470059" type="application/x-gzip" /><media:content url="http://www.mulgara.org/files/v2.0.6/mulgara-2.0.6-bin.tar.gz" fileSize="26470059" type="application/x-gzip" /><itunes:subtitle>Like last time, this was prompted by an email that eventually was passed to me. It was a call for opinion - "we thought we'd check first to see what software either of you recommend or use for an RDF database." It's a good question. In fact, it's a really</itunes:subtitle><itunes:author>Ben O'Steen</itunes:author><itunes:summary>Like last time, this was prompted by an email that eventually was passed to me. It was a call for opinion - "we thought we'd check first to see what software either of you recommend or use for an RDF database." It's a good question. In fact, it's a really great question, as searching for similar advice online results in very few opinions on the subject. But which one's are the best for novices? Which have the best learning curves? which has the easiest install or the shortest time between starting out and being able to query things? I'll try to pose as much as I can as a newcomer which won't be too hard :) Some of the comments will be my own, and some will be comments from others, but I'll try to be as honest as I can be to reflect new user expectation and experience and most importantly, developer-attention span. (See the end for some of my reasons for this approach.) (Puts on newbie hat and enables PEBKAC mode.) Installable (local) triplestores Sesame - http://www.openrdf.org/ Simple menu on the left of the website, one called downloads. Great, I'll give that a whirl. "Download the latest Sesame 2.x release" looks good to me. Hmm 5 differently named files... I'll grab the 'onejar' file and try to run it. "Failed to load Main-Class manifest attribute from openrdf-sesame-2.2.1-onejar.jar", okay... so back to the site to find out how to install this thing. No links for installation guide... on the Documentation page, no link for installation instructions for the sesame 2.2.1 I downloaded, but there is Sesame 2 user documentation and Sesame 2 system documentation. Phew, after guessing that the user documentation might have the guide, I finally found the installation guide (system documentation was about the architecture, not how to administer the system as you might expect.) (Developer losing interest...) Ah, I see, I need the SDK. I wonder what that 'onejar' was then... "The deployment process is container-specific, please consult the documentation for your container on how to deploy a web application. " - right, okay... let's assume that I have a Java background and am not just a user wanting to hook into it from my language of choice, such as php, ruby, python, or dare I say it, javascript. (Only Java-friendly developers continue on) Right, got Tomcat, and put in the war file... right so, now I need to work out how to use a commandline console tool to set up a 'repository'... does this use SVN or CVS then? Oh, it doesn't do anything unless I end the line with a period. I thought it had hung trying to connect! "Triple indexes [spoc,posc]" Wha? Well, whatever that was, the test repository is created. Let's see what's at http://localhost:8080/openrdf-sesame then. "You are currently accessing an OpenRDF Sesame server. This server is intended to be accessed by dedicated clients, using a specialized protocol. To access the information on this server through a browser, we recommend using the OpenRDF Workbench software." Bugger. Google for "sesame clients" then. There is a Java client it seems, but it seems to need a lot to get going. Oh, and useful if my application is in Java or in a JVM (jRuby, jython) http://jeenbroekstra.blogspot.com/2008/09/sesame-2-desktop-client.html .Net GUI... not so useful for programmatic stuff...I've pretty much given up at this point. If I knew I needed to use a triplestore then I might have persisted, but if I was just investigating it? I would've probably given up earlier. Mulgara - http://www.mulgara.org/ Nice, they've given the frontpage some style, not too keen on orange, but the effort makes it look professional. "Mulgara is a scalable RDF database written entirely in Java." -&amp;gt; Great, I found what I am looking for, and it warns me it needs Java. "DOWNLOAD NOW" - that's pretty clear. *click* Hmm, where's the style gone? Lots of download options, but thankfully one is marked by "These released binaries are all that are required for most applications." so I'll grab those. 25Mb? Wow... Okay, it's d</itunes:summary></item>
      <item>
         <title>A Fedora/Solr Digital Library for Oxford's 'Forced Migration Online'</title>
         <link>http://oxfordrepo.blogspot.com/2008/11/fedorasolr-digital-library-for-oxford.html</link>
         <description>&lt;div&gt;(mods:subtitle - Slightly more technical follow-up to the &lt;a rel="nofollow" target="_blank" href='http://expertvoices.nsdl.org/hatcheck/2008/11/06/a-fedorasolr-digital-library-for-oxfords-forced-migration-online/'&gt;Fedora Hatcheck piece&lt;/a&gt;.)&lt;br/&gt;&lt;br/&gt;As I have been prompted via email by Phil Cryer (of the &lt;a rel="nofollow" target="_blank" href='http://mobot.org/'&gt;Missouri Botanical Garden&lt;/a&gt;) to talk more about how this technically works, I thought it would be best to make it a written post, rather than the more limited email response.&lt;br/&gt;&lt;br/&gt;&lt;big&gt;Background&lt;/big&gt;&lt;br/&gt;&lt;br/&gt;Forced Migration Online (FMO) had a proprietary system, supporting their document needs. It was originally designed for newpaper holdings and applied that model to encoding the mostly paginated documents that FMO held - such that each part was broken up into paragraphs of text, images and the location of all these parts on a page. It even encoded (in its own format) the location of the words on the page when it OCR'd the documents, making per-word higlighting possible. Which is nice.&lt;br/&gt;&lt;br/&gt;However, the backend that powered this was over-priced, and FMO wanted to move to a more open, sustainable platform.&lt;br/&gt;&lt;br/&gt;&lt;big&gt;Enter the DAMS&lt;/big&gt;&lt;br/&gt;&lt;br/&gt;(DAMS = Digital Asset Management System)&lt;br/&gt;&lt;br/&gt;I have been doing work on trying to make a service out of a base of fedora-commons and additional 'plugin' services, such as the wonderful Apache Solr and the useful eXist XML db. The end aim is for departments/users/whoever to requisition a 'store' with a certain quality of service (solr attached, 50Gb+ etc) but this is not yet an automated process.&lt;br/&gt;&lt;br/&gt;The focus for the store is a very clear separation between storage, management, indexing services and distribution - Normal filesystems, or Sun Honeycomb are the storage, Fedora-commons provides the management + CRUD, solr, eXist, mulgara, sesame, and couchDB can provide potential index and query services, and distribution is handed pragmatically, caching outgoing and mirroring where necessary.&lt;br/&gt;&lt;br/&gt;&lt;big&gt;The FMO 'store'&lt;/big&gt;&lt;br/&gt;&lt;br/&gt;From discussions with FMO, and examining the information they held and the way they wished to make use of it, a simple Fedora/Solr store seemed to fufill what they wanted: a persistant store of items with attachments and the ability to search the metadata and retrieve results.&lt;br/&gt;&lt;br/&gt;&lt;big&gt;Bring in the consultants&lt;/big&gt;&lt;br/&gt;&lt;br/&gt;FMO hired Aptivate to do the migration of their data from the proprietary system, in its custom format, to a Fedora/Solr store and trying as much as possible to retain the functionality they had.&lt;br/&gt;&lt;br/&gt;Some points that I think it is important to impress on people here:&lt;br/&gt;&lt;ul&gt;&lt;li&gt;In general, software engineer consultants don't understand METS or FOXML.&lt;/li&gt;&lt;li&gt;They *really* don't understand the point of disseminators.&lt;br/&gt;&lt;/li&gt;&lt;li&gt;Having to teach software engineer consultants to do METS/FOXML/bDef's etc is likely an arduous and costly task.&lt;/li&gt;&lt;li&gt;Consultants add lots of money to do things their team don't already have the experience to do.&lt;/li&gt;&lt;/ul&gt;So, my conclusion was to not make these things part of the development at all to the extent that I might even have forgotten to mention these things to them except in passing. I helped them install their own local store and helped them with the various interfaces and gotchas of the two software packages. By showing them how I use Fedora and Solr in ora.ouls.ox.ac.uk, they were able to hit the ground running.&lt;br/&gt;&lt;br/&gt;They began by using the REST interface to Fedora and the RESTful interface to Solr. By having them begin by using the simple put/get REST interface to Fedora, they could concentrate on getting used to the nature of Fedora as an objectstore. I think they moved to use the SOAP interface as it better suited their Java background, although I cannot be certain as it wasn't an issue that came up.&lt;br/&gt;&lt;br/&gt;Once they had developed the migration scripts to their satisfaction, they asked me to give them a store, which I did (but due to hardware and stupid support issues here I am afraid to say I held them up on this.) They fired off their scripts, moved all the content into the fedora with a straightforward layout per object (pdf, metadata, fulltext and thumbnail) The metadata is - from what I can see - the same XML metadata as before - very MARCXML in nature, with 'Application_Info' elements having types like 'bl:DC.Title'. If necessary, we will strip out the dublin core metadata and put what we can into the DC datastream, but that's not of particular interest to FMO right now. &lt;br/&gt;&lt;br/&gt;&lt;big&gt;Fedora/Solr notes&lt;/big&gt;&lt;br/&gt;&lt;br/&gt;As for the link between Solr and Fedora? This is very loosely coupled, such that they are running in the same Tomcat container for convenience, but aren't linked in a hard way. &lt;br/&gt;&lt;br/&gt;I've looked at GSearch, which is great for a homogenous collection of items, such that they can be acted on by the same XSLT to produce a suitable record for Solr, but as the metadata was a complete unknown for this project, it wasn't too suitable.&lt;br/&gt;&lt;br/&gt;Currently, they have one main route into the fedora store, and so, it isn't hard to simply reindex an item after a change is made, especially for services such as Solr or eXist, which expect to have things change incrementally. I am looking at services such as ActiveMQ for scheduling these index tasks, but more and more I am starting to favour RabbitMQ which seems to be more useful, while retaining the scalability and very robust nature.&lt;br/&gt;&lt;br/&gt;Sending an update to Solr is as simple as an HTTP POST to its /update service, consisting of a XML or JSON packet like " changeme:1 John Smith .... " - it uses a transactional model, such that you can push all the changes and additions into the live index via a commit call, without taking the index offline. To query Solr, all manner of clients exist, and it is built to be very simple to interact with, handling facet queries, filtering, ordering and can deliver the results in XML, JSON, PHP or Python directly. It can even do a XSLT transform of the results on the way out, leading to a trivial way to support OpenSearch, Atom feeds and even HTML blocks for embedding in other sites.&lt;br/&gt;&lt;br/&gt;Likewise, to change a PDF in Fedora can be done by a HTTP POST as well. Does it need to be more complicated?&lt;br/&gt;&lt;br/&gt;&lt;b&gt;Last, but not least, a project to watch closely:&lt;/b&gt;&lt;br/&gt;&lt;br/&gt;The &lt;a rel="nofollow" target="_blank" href='http://ice.usq.edu.au/projects/fascinator/trac'&gt;Fascinator project&lt;/a&gt;, funded by &lt;a rel="nofollow" class='ext-link' target="_blank" href='http://www.arrow.edu.au/'&gt;&lt;span class='icon'&gt;ARROW&lt;/span&gt;&lt;/a&gt;, as part of their mini project scheme, is an Apache &lt;a rel="nofollow" class='ext-link' target="_blank" href='http://lucene.apache.org/solr/'&gt;&lt;span class='icon'&gt;Solr&lt;/span&gt;&lt;/a&gt; front end to the &lt;a rel="nofollow" class='ext-link' target="_blank" href='http://www.fedora-commons.org/'&gt;&lt;span class='icon'&gt;Fedora commons&lt;/span&gt;&lt;/a&gt; repository. The goal of the project is to create a simple interface to Fedora that uses a single technology – that’s Solr – to handle all browsing, searching and security. Well worth a look, as it seeks to turn this Fedora/Solr pairing truly into an appliance, with a simple installer and handling the linkage between the two.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-7367532608884117842?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-7367532608884117842</guid>
         <pubDate>Wed, 12 Nov 2008 22:30:00 -0800</pubDate>
      </item>
      <item>
         <title>The World's Best Photos of crig. Flickr Hive Mind</title>
         <link>http://fiveprime.org/hivemind/Tags/crig</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_1a20f44ae51cd3b5aba2889983406caa</guid>
         <pubDate>Fri, 07 Nov 2008 01:21:04 -0800</pubDate>
      </item>
      <item>
         <title>Codependency Checklist</title>
         <link>http://shalomplace.com/res/codepcheck.html</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_5a2f4b78f2f26e2da042b827091cb035</guid>
         <pubDate>Mon, 03 Nov 2008 22:42:49 -0800</pubDate>
      </item>
      <item>
         <title>OfficeSWORD - Home</title>
         <link>http://www.codeplex.com/OfficeSWORD</link>
         <guid isPermaLink="false">IOhBPNqn3BG4kFuCJZhxuA_da68e69f17678efa3260da5b85c1ac0c</guid>
         <pubDate>Fri, 31 Oct 2008 02:52:41 -0700</pubDate>
      </item>
      <item>
         <title>News and updates Oct 2008</title>
         <link>http://oxfordrepo.blogspot.com/2008/10/news-and-updates-oct-2008.html</link>
         <description>&lt;div&gt;Right, I haven't forgotten about this blog, just getting all my ducks in a line as it were. Some updates:&lt;br/&gt;&lt;br/&gt;&lt;ul&gt;&lt;li&gt;The JISC bid for eAdministration was successful, titled "Building the Research Information Infrastructure (BRII)". The project will categorise the research information structure, build vocabularies if necessary, and populate it with information. It will link research outputs (text and data), people, projects, groups, departments, grant/funding information and funding bodys together, using RDF and as many pre-existing vocabularies as is suitable. The first vocab gap we've hit is one for funding, and I've made a draft RDF schema for this which will be openly published once we've worked out a way to make it persistent here at Oxford (trying to get a vocab.ox.ac.uk address)&lt;/li&gt;&lt;ul&gt;&lt;li&gt;One of the final outputs will be a 'foafbook' which will re-use data in the BRII store - it will act as a blue book of researchers. Think Cornell's Vivo, but with the idea of Linked Data firmly in mind.&lt;/li&gt;&lt;li&gt;We are just sorting out a home for this project, and I'll post up an update as soon as it is there.&lt;br/&gt;&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;Forced Migration Online (FMO) have completed their archived document migration from a crufty, proprietary store to a ORA-style store (Fedora/Solr) - you can see their preliminary frontend at &lt;a rel="nofollow" target="_blank" href='http://fmo.qeh.ox.ac.uk'&gt;http://fmo.qeh.ox.ac.uk.&lt;/a&gt; Be aware that this is a work in progress. We provide the store as a service to them, giving them a Fedora and a Solr to use. They contracted a company called Aptivate to migrate their content, and I believe also to create their frontend. This is a pilot project to show that repositories can be treated in a distributed way, given out like very smart, shared drive space.&lt;/li&gt;&lt;li&gt;We are working to archive and migrate a number of library and historical catalogs. A few projects have a similar aim to provide an architecture and software system to hold medieval catalog research - a record of what libraries existed, and what books and works they held. This is much more complex that a normal catalog, as each assertion is backed by a type of evidence, ranging from the solid (first-hand catalog evidence), to the more loose (handwriting on the front page looks like a certain person who worked at a certain library.) So modelling this informational structure is looking to be very exciting, and we will have to try a number of ways to represent this, starting with RDF due to the interlinked nature of the data. This is related to the kinds of evidence that genealogy uses, and so related ontologies may be of use.&lt;/li&gt;&lt;li&gt;The work on storing and presenting scanned imagery is gearing up. We are investigating storing the sequence of images and associated metadata/ocr text/etc as a single tar file as part of a Fedora object (i.e. a book object will have a catalog record, technical/provenance information and an attached tar file and and a list of file to offset information.)&lt;/li&gt;&lt;ul&gt;&lt;li&gt;This is due to us trying to hit the 'sweet spot' for most file systems. A very large number of highly compressed images and little pieces of text does not fit well with most FS internals. We estimate that for a book there will be around [4+PDFs+2xPages] files, or 500+ typically. Just counting up the various sources of scanned media we already have, we are pressing for about 1/2 million books from one source, 200,000 images from another, 54,000 from yet another... it's adding up real fast.&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;We are starting to deal with archiving/curating the 'long-tail' of data - small, bespoke datasets that are useful to many, but don't fall into the realm of Big Data, or Web data. I don't plan on touching Access/FoxPro databases any time soon though! I am building a Fedora/Solr/eXist box to hold and disseminate these, which should live at databank.ouls.ox.ac.uk very, very shortly. (Just waiting on a new VMware host to grow into, our current one is at capacity.)&lt;br/&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt;To give a better idea of the structure, etc, I am writing it up in a second blog post to follow shortly - currently named "Modelling and storing a phonetics database inside a store"&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;I am in the process of integrating the Google-analytics-style statistics package at http://piwik.org with the ORA interface, to give relatively live hit counts on a per-item and to build per-collection reports.&lt;/li&gt;&lt;ul&gt;&lt;li&gt;Right now, piwik is capturing the hits and downloads from ORA, but I have yet to add in the count display on each item page, so halfway there :)&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;We are just waiting on a number of departments here to upgrade the version of EPrints they are using for their internal, disciplinary repositories, so that we can begin archiving surrogate copies of the work they wish to put up for this service. (Using ORE descriptions of their items) By doing so, their content becomes exposed in ORA, mirror copies are made (working on a good way to maintain these as content evolves), but they retain the content control, ORA will also act as a registry for their content. It's only when their service drops do the users get redirected to the mirror copies that ORA holds (think google cache, but a 100% copy).&lt;/li&gt;&lt;li&gt;In the process of battle-testing the Fedora-Honeycomb connection, but as mentioned above, just waiting for a little more hardware before I set to it. Also, we are examining a number of other storage boxes that should plug in under Fedora, using the Honeycomb software, such as the new and shiny Thumper box, "Thor" Sun Fire Xsomething-or-other. Also, getting pretty interested at the idea of MAID storage - massive array of idle disks. Hopefully, this will act like tape, but have a sustainable access speed of disk. Also, a little more green than a tower of spinning hardware.&lt;/li&gt;&lt;li&gt;Planning out the indexer service at the moment. It will use the Solr 1.3 multicore functionality, with a little parsing magic at the ingest side of things to make a generic indexer-as-a-service type system. One use-case is to be able to bring up VM machines with multicore solr on to act as indexers/search engines as needed. An example aim? "Economics want an index that facets on their JEL codes." POST a schema and ingest indexer to the nearest free indexer, and point the search interface at it once an XMPP message comes back that it is finished.&lt;/li&gt;&lt;li&gt;URI resolvers - still investigating what can be put in place for this, as I strongly wish to avoid coding this myself. Looking at OCLC's OpenURL and how I can hack it to feed it info:fedora uris and link them to their disseminated location. Also, using a tinyurl type library + simple interface might not be a bad idea for a quick PoC.&lt;br/&gt;&lt;/li&gt;&lt;li&gt;Just to let you all know that we are building up the digital team here, most recently held interviews for the futureArch project but we are looking for about 3 others to hire, due to all the projects we are doing. We will be putting out job adverts as and when we feel up to battling with HR :)&lt;/li&gt;&lt;/ul&gt;That's most of the more interesting hot topics and projects I am doing at the moment.... phew :)&lt;br/&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-2317123182133225178?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-2317123182133225178</guid>
         <pubDate>Wed, 15 Oct 2008 20:39:00 -0700</pubDate>
      </item>
      <item>
         <title>davetaz: google, fedora, microsoft, eprints, OAI ORE and SWORD. Player rich workshop. Repository Futures? #dorsdl2, #crig</title>
         <link>http://twitter.com/davetaz/statuses/925910531</link>
         <description>davetaz: google, fedora, microsoft, eprints, OAI ORE and SWORD. Player rich workshop. Repository Futures? #dorsdl2, #crig</description>
         <guid isPermaLink="false">http://twitter.com/davetaz/statuses/925910531</guid>
         <pubDate>Thu, 18 Sep 2008 06:54:01 -0700</pubDate>
      </item>
      <item>
         <title>Institutions hate repositories... one simple reason.</title>
         <link>http://davetaz-blog.blogspot.com/2008/09/institutions-hate-repositories-one.html</link>
         <description>Open access is not enough!&lt;br /&gt;&lt;br /&gt;People want to give Open Access to some of their materials at their institution however the IR software is seen as a means to manage all Institutional content and not just that which is Open Access and part of the external image of the Institution.&lt;br /&gt;The problem exists in the other direction as well where repository software is trying to solve these problems, thus people are not likely to use this software until it is included.&lt;br /&gt;&lt;br /&gt;So what do we end up with...&lt;br /&gt;&lt;br /&gt;Lots of Repository Islands which aren't interoperable with each other!&lt;br /&gt;&lt;br /&gt;So if we solve the access and copyright issue will people use the software? errrr No. At this point the software is an all in solution and not a service which can be utilised by current institutional practise ... Give up...?&lt;br /&gt;&lt;br /&gt;No!&lt;br /&gt;&lt;br /&gt;Focus on providing a service, e.g. something which can manage your Digital Resources and enable this to plug to existing institutional services. Some softwares would argue they support this already. OK good, so don't try and solve the problem if it is just an integration issue.&lt;br /&gt;&lt;br /&gt;To the repositories: Decouple! Build a set of services, build ways of plugging services together and allow the community to pic 'n' mix.&lt;br /&gt;&lt;br /&gt;To the institution: You already have access control systems ask your Information/Computer Systems department. You probably already have a Content Management System for educational resources for students (Blackboard? - Integrates with an LDAP server), these use external services to manage access and authentication! Here's a few services for you... LDAP, Radius, Eduroam, Domain Controller. &lt;br /&gt; &lt;br /&gt; Till next time!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-3694637630389146162?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-3694637630389146162</guid>
         <pubDate>Wed, 17 Sep 2008 19:14:00 -0700</pubDate>
      </item>
      <item>
         <title>DSpace and Fedora *need* opinionated installers.</title>
         <link>http://oxfordrepo.blogspot.com/2008/08/dspace-and-fedora-need-opinionated.html</link>
         <description>Just to say that both Fedora-Commons and DSpace really, really need opinionated installers that make choices for the user. Getting either installed is a real struggle - which we demonstrated during the Crigshow, so please don't write in the comments that it is easy, it just isn't.&lt;br /&gt;&lt;br /&gt;Something that is relatively straightforward to install, is a debian package.&lt;br /&gt;&lt;br /&gt;So, just a plea in the dark, can we set up a race? Who can make their repository software installable as a .deb first? will it be DSpace or Fedora? Who am I going to send a box of cookies to and a thank you note from the entire developer community?&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://wiki.eprints.org/w/Installing_EPrints_3_via_apt_%28Debian/Ubuntu%29"&gt;(EPrints doesnt count in this race; they've already done it)&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-1371143081899015772?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-1371143081899015772</guid>
         <pubDate>Sun, 17 Aug 2008 22:16:00 -0700</pubDate>
      </item>
      <item>
         <title>Re-using video compression code to aid document quality checking</title>
         <link>http://oxfordrepo.blogspot.com/2008/08/re-using-video-compression-code-to-aid.html</link>
         <description>&lt;a rel="nofollow" target="_blank" href="http://crigshow.blogspot.com/2008/07/prototype-motion-analysis-to-detect.html"&gt;(Expanding on this video post from the Crigshow)&lt;/a&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;&lt;br /&gt;Problem:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The volume of pages from a large digitisation project can be overwhelming. Add into that the simple fact that all (UK) institutional projects are woefully underfunded and underresourced, it's surprising that we can cope with them really.&lt;br /&gt;&lt;br /&gt;One issue that repeatedly comes up is the idea of quality assurance; How can we know that a given book has been scanned well? How can we spot images easily? Can we detect if foreign bodies were present in the scan, such as thumbs, fingers or bookmarks?&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;A quick solution:&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Inspired by a talk at one of the conference strands at WorldComp, where the author talked about the use of a component of a commonly used video compression standard (MPEG2) to detect degrees of motion and change in a video, without having to analyse the image sequences using a novel, or smart algorithm.&lt;br /&gt;&lt;br /&gt;He talked about using the motion vector stream to be a good rough guide to the amount of change between frames of video.&lt;br /&gt;&lt;br /&gt;So, why did this inspire me?&lt;br /&gt;&lt;ul&gt;&lt;li&gt;MPEG-2 compression is a pretty much a solved problem; there are some very fast and scalable solutions out there today - direct benefit: &lt;span style="font-weight:bold;"&gt;No new code needs to be written and maintained&lt;/span&gt;&lt;/li&gt;&lt;li&gt;The format is very well understood and stripping out the motion vector stream wouldn't be tricky. Code exists for this too.&lt;/li&gt;&lt;li&gt;Pages of text in printed documents tend towards being justified so that the two edges of the text columns are straight lines. There is also (typically) a fixed number of lines on a page.&lt;/li&gt;&lt;li&gt;A (comparatively rapid) MPEG2 compression of the scans of a book would have the following qualities:&lt;/li&gt;&lt;ul&gt;&lt;li&gt;The motion vectors between pages of text would either shown little overall change (as differing letters are actually quite similar) or a small, global shift if the page was printed on a slight offset.&lt;/li&gt;&lt;li&gt;The motion vectors between a page of text and a page with an image embedded in text on the next, or a thumb on the edge, would show localised and distinct changes that differ greatly from the overall perspective.&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;In fact, a real crude solution could be, just using the vector stream to create a bookmark list for all the suspect changes. This might bring the number of pages to check down to a level that a human mediator could handle.&lt;/li&gt;&lt;/ul&gt;How much needs to be checked?&lt;br /&gt;&lt;br /&gt;Via basic sample survey statistics: to be sure to 95% (±5%) that the scanned images of 300 million pages are okay, just 387 totally random pages need to be checked. However, to be sure that each individual book is okay to the same degree, a book being ~300 pages, 169 pages need to be checked &lt;span style="font-weight:bold;"&gt;in each book&lt;/span&gt;. I would suggest that the above technique would significantly lower this threshold, but it would be by an empirically found amount.&lt;br /&gt;&lt;br /&gt;Also note that the above figures carry the assumption that the scanning process doesn't change over time, which of course it does!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-7963164596904275017?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-7963164596904275017</guid>
         <pubDate>Sun, 17 Aug 2008 21:47:00 -0700</pubDate>
      </item>
      <item>
         <title>The four rules of the web and compound documents</title>
         <link>http://oxfordrepo.blogspot.com/2008/08/four-rules-of-web-and-compound.html</link>
         <description>A real quirk that truly interests me is the difference in aims between the way documents are typically published and the way that the information within them is reused.&lt;br /&gt;&lt;br /&gt;A published document is normally in a single 'format' - a paginated layout, and this may comprise text, numerical charts, diagrams, tables of data and so on.&lt;br /&gt;&lt;br /&gt;My assumption is that, to support a given view or argument, a reference to the entirety of an article is not necessary; The full paper gives the context to the information, but it is much more likely that a small part of this paper contains the novel insight being referenced.&lt;br /&gt;&lt;br /&gt;In the paper-based method, it is difficult to uniquely identify parts of an article as items in their own right. You could reference a page number, give line numbers, or quote a table number, but this doesn't solve this issue that the author hadn't put time to considering that a chart, a table or a section of text would be reused.&lt;br /&gt;&lt;br /&gt;So, on the web, where multiple representations of the same information is getting to be commonplace (mashups, rss, microblogs, etc), what can we do to help better fulfill both aims, to show a paginated final version of a document, and also to allow each of the components to exist as items in their own right, with their own URIs (or better, URLs containing some notion of the context e.g. if /store/article-id gets to the splash page of the article, /store/article-id/paragraph-id will resolve to the text for that paragraph in the article.)&lt;br /&gt;&lt;br /&gt;Note that the four rules of the web (well, of &lt;a rel="nofollow" target="_blank" href="http://linkeddata.org/"&gt;Linked Data&lt;/a&gt;) are in essence:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;give everything a name,&lt;br /&gt;&lt;/li&gt;&lt;li&gt;make that name a URL ...&lt;/li&gt;&lt;li&gt;...which results in data about that thing,&lt;br /&gt;&lt;/li&gt;&lt;li&gt;and have it link to other related things. &lt;/li&gt;&lt;/ul&gt;[From &lt;a rel="nofollow" target="_blank" href="http://www.w3.org/DesignIssues/LinkedData.html"&gt;TimBL's originating article&lt;/a&gt;. Also, see this &lt;a rel="nofollow" target="_blank" href="http://virtuoso.openlinksw.com/presentations/Creating_Deploying_Exploiting_Linked_Data2/Creating_Deploying_Exploiting_Linked_Data2_TimBL_v3.html#%281%29"&gt;presentation &lt;/a&gt;- a remix of presentations from TimBL and the speaker, &lt;span style="font-style:italic;font-size:100%;"&gt;&lt;a rel="nofollow" target="_blank" href="http://myopenlink.net/dataspace/person/kidehen#this"&gt;&lt;span style="color:rgb(0, 147, 182);text-decoration:underline;"&gt;Kingsley Idehen&lt;/span&gt;&lt;/a&gt;&lt;/span&gt;&lt;span style="font-size:100%;"&gt; - given at the recent Linked Data Planet conference&lt;/span&gt;]&lt;br /&gt;&lt;br /&gt;I strongly believe that applying this to the individual components of a document is a very good and useful thing.&lt;br /&gt;&lt;br /&gt;One thing first, we have to get over the legal issue of just storing and presenting a bitwise perfect copy of what an author gives us. We need to let author's know that we may present alternate versions, based on a user's demands. This actually needs to be the case for preservation and the repository needs to make it part of their submission policy to allow for format migrations, accessibility requirements and so on.&lt;br /&gt;&lt;br /&gt;The system holding the articles needs to be able to clearly indicate versions and show multiple versions for a single record.&lt;br /&gt;&lt;br /&gt;When a compound document is submitted to the archive, a second parallel version should be made by fragmenting the document into paragraphs of text, individual diagrams, tables of data, and other natural elements. One issue that has already come up in testing, is that documents tend to clump multiple, separate diagrams together into a single physical image. It is likely that the only solution to breaking these up to this is going to be a human one, either author/publisher education(unlikely) or by breaking them up by hand.&lt;br /&gt;&lt;br /&gt;I would suggest using a very lightweight, hierarchical structure to record the document's logical structure. I have yet to settle on basing it on the content XML format inside the OpenDocument format, or on something very lightweight, using HTML elements, which would have a double benefit of being able to be sent directly to a browser to 'recreate' the document roughly.&lt;br /&gt;&lt;br /&gt;Summary:&lt;br /&gt;&lt;br /&gt;1) Break apart any compound document into its constituent elements (paragraph level is suggested for text)&lt;br /&gt;2) Make sure that each one of these parts are clearly expressed in the context they are in, using hierarchical URLs, /article/paragraph or even better, /article/page/chart&lt;br /&gt;3) On the article's splashpage, make a clear distinction between the real article and the broken up version. I would suggest a scheme like Google search's 'View [PDF, PPT, etc] as HTML'. I would assert that many people intuitively understand that this view is not like the original and will look or act differently.&lt;br /&gt;&lt;br /&gt;Some related video blogs from the &lt;a rel="nofollow" target="_blank" href="http://crigshow.blogspot.com/"&gt;Crigshow&lt;/a&gt; trip&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://crigshow.blogspot.com/2008/07/prototype-extracting-and-finding.html"&gt;Finding and reusing algorithms from published articles&lt;/a&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://crigshow.blogspot.com/2008/07/real-documents-are-complex-objects.html"&gt;OCR'ing documents; Real documents are always complex&lt;/a&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://crigshow.blogspot.com/2008/07/protoype-providing-overviews-of.html"&gt;Providing a systematic overview of how a Research paper is written&lt;/a&gt; - giving each component and each version of a component would have major benefits here&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-2742105134406656447?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-2742105134406656447</guid>
         <pubDate>Sun, 17 Aug 2008 20:40:00 -0700</pubDate>
      </item>
      <item>
         <title>Trackbacks, and spammers, and DDoS, oh my!</title>
         <link>http://oxfordrepo.blogspot.com/2008/08/trackbacks-and-spammers-and-ddos-oh-my.html</link>
         <description>&lt;span style="font-size:130%;"&gt;The Idea&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Before I give you all the dark news about this, let me set out my position: I really, really think that repositories communicating the papers that are cited and referenced to each other is a really good thing. If a paper was deposited in the Oxford archive, and it referenced a paper held in a different repository, say in Southampton's EPrints archive, I think that it is a really fantastic idea to let the Oxford archive tell the Southampton one about it.&lt;br /&gt;&lt;br /&gt;And I decided to do something about it - I added two linkback facilities to the archive's user interface, allowing both &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;trackbacks&lt;/a&gt; and &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Pingback"&gt;pingbacks&lt;/a&gt; to be archived by the system. I adopted the pre-existing "standards" - really, they are just rough api's - because I think we have all learned our lessons about making up new APIs for basic tasks.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:130%;"&gt;What is &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;Trackback&lt;/a&gt;?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;Trackback&lt;/a&gt; is an agreed technique from the blogging world. Many blogging systems have it built in, and it enables one blog post to explicitly reference and talk about another post, made on a remote blog somewhere. It does this by &lt;a rel="nofollow" target="_blank" href="http://www.ietf.org/rfc/rfc2616.txt"&gt;POST&lt;/a&gt;ing a number of form-encoded parameters to a specific URL, specific to the item that is being referenced. The parameters include things like title, abstract and URL of the item making the reference.&lt;br /&gt;&lt;br /&gt;So on the surface, it appears that this &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;trackback&lt;/a&gt; idea performs exactly what I was looking for.&lt;br /&gt;&lt;br /&gt;BUT! &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;Trackback&lt;/a&gt; has massive, gaping flaws, akin to the flaws in the email system which is full of spam. For one, all trackbacks are trusted in the basic specifications. No checking that the URL exists, no checking of the text for relevance, etc.&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Pingback"&gt;Pingback&lt;/a&gt; is a slightly different system, in that all that is passed, is the URL of the referencing item. It is then up to the remote server to go and get the requested page and parse that out to find the reference. (The next version of the specification is crying out to recommend microformats et al, in my opinion)&lt;br /&gt;&lt;br /&gt;So, these systems, &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;trackback&lt;/a&gt; and &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Pingback"&gt;pingback&lt;/a&gt;, have been on trial in the live system for about 4 or 5 months, and I am sure you all want to hear my conclusions:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-weight:bold;"&gt;Don't implement &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;Trackback&lt;/a&gt;&lt;/span&gt; as it is defined in its &lt;a rel="nofollow" target="_blank" href="http://www.sixapart.com/pronet/docs/trackback_spec"&gt;specifications...&lt;/a&gt; seriously. It is a poorly designed method, with so much slack that it is a spammers goldmine.&lt;/li&gt;&lt;li&gt;Even after adding in some safeguards to the &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;Trackback&lt;/a&gt; method, such as parsing the supposed referencing page and checking for HTML and the presence of the supposed link, it was still possible for spammers to get through.&lt;/li&gt;&lt;li&gt;When I implemented &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;Trackbacks&lt;/a&gt;, I did so with the full knowledge that I might have to stand at a safe distance and nuke the lot. Here is the Trackback model used in the &lt;a rel="nofollow" target="_blank" href="http://ora.ouls.ox.ac.uk/"&gt;Fedora repository&lt;/a&gt; - A DC datastream containing the &lt;a rel="nofollow" target="_blank" href="http://www.ietf.org/rfc/rfc2616.txt"&gt;POST&lt;/a&gt;ed information mapped to simple dublin core and a RELS-EXT RDF entry asserting that this &lt;a rel="nofollow" target="_blank" href="http://www.fedora-commons.org/"&gt;Fedora&lt;/a&gt; Object &amp;lt;info:fedora/trackback-id&amp;gt;, referenced &lt;a rel="nofollow" target="_blank" href="http://dublincore.org/documents/dcmi-terms/#terms-references"&gt;&amp;lt;dcterms:references&amp;gt;&lt;/a&gt; the main item in the archive &amp;lt;info:fedora/item-id&amp;gt;. As the user interface for the archive gets the graph for that object, it was easy to get the trackbacks out as well. Having separate objects for the trackbacks and not changing the referenced item &lt;span style="font-weight:bold;"&gt;at all&lt;/span&gt;, made it very easy to remove the trackbacks at the end.&lt;/li&gt;&lt;li&gt;The &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Trackback"&gt;Trackback&lt;/a&gt; system did get hit, once the spammers found a way around my safeguards. So, yes, the trackbacks got 'nuked' and the system turned off.&lt;/li&gt;&lt;li&gt;Currently, the system is under a sort of mini-DDoS, from the spammer's botnet trying to make trackbacks and overloading the session tracking system.&lt;/li&gt;&lt;li&gt;The &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Pingback"&gt;Pingback&lt;/a&gt; system, utilising XML-RPC calls, was never hit by spam. I still turned it off, as the safeguards on this system were equivalent to the Trackback system.&lt;/li&gt;&lt;/ul&gt;&lt;span style="font-size:130%;"&gt;So, how do we go on from this quagmire of spam?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Well, for one, if I had time (and resources) to pass all requests through spamassassin or pay for akismet, that would have cut down the number drastically. Also, if I had time to sit and moderate all the linkbacks, again, spam would be nipped in the bud.&lt;br /&gt;&lt;br /&gt;So, while I truly believe that this type of system is the future, it certainly isn't the case that it can be a system that can just be turned on and the responsibility for maintaining it added to an already full workload.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:130%;"&gt;Alternatives?&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;White-listing sites may be one method. To limit the application to sharing references between institutions, you could use the &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Pretty_Good_Privacy#Web_of_trust"&gt;PGP idea - a web of trust&lt;/a&gt;; a technique of encrypting the passed information with a private key that resolves to a public key from a white-listed institution. This would ensure that the passed reference really was from a given institution. This should be more flexible than requiring a single IP address to accept references from.&lt;br /&gt;&lt;br /&gt;(There is always the chance that the private key could be leaked and made not-so-private by the institution, but that would have to be their responsibility. Any spam from a mistake of this sort would be directly attributed to those at fault!)&lt;br /&gt;&lt;br /&gt;A slower, far less accurate but more traditional method, would be for a given institution to harvest references from &lt;span style="font-weight:bold;"&gt;all&lt;/span&gt; the other repositories it knows about. I really don't think this is workable, but has the pro that a harvester can be sure that a reference links to a given URL, (barring the more and more common &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/DNS_cache_poisoning"&gt;DNS poisoning attacks&lt;/a&gt;)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-5023695488040957840?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-5023695488040957840</guid>
         <pubDate>Sun, 17 Aug 2008 19:52:00 -0700</pubDate>
      </item>
      <item>
         <title>#crigshow - Conference 2 - Worldcomp</title>
         <link>http://davetaz-blog.blogspot.com/2008/07/crigshow-conference-2-worldcomp.html</link>
         <description>&lt;span style="font-weight:bold;"&gt;Agents and Web Services... Why no collaboration?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Out of all the presentations at worldcomp this one struck me as one of the most obvious but not covered areas for research in computer science. Probably the most well known agent system is that used by the travel industry where they have standard ways of interfacing with each other to find details of travel and hotels available on a global scale. This is no mean feat with the number of companies there are hooking into this network. &lt;br /&gt;&lt;br /&gt;So why doesn't the same exist for web services or if there is such a system why isn't everyone in the open community using it? &lt;br /&gt;&lt;br /&gt;Surely the point of web services is for people to discover and use them in their own scenarios just like the agents in the travel industry do. OK so maybe the problem lies in the fact that there are so many communities that there will never be a specific use case or framework and thus hosting a generic web service network becomes infinitely hard with the number of different APIs and Implementations. &lt;br /&gt;&lt;br /&gt;OK so if you are going to use Agents in Web Services what issues do you need to consider? Also what do you gain through doing this?&lt;br /&gt;&lt;br /&gt;One of the key ideas which came out of a talk at worldcomp is to use Agents to be the intelligent front to a web service. This enables an agent to track of a set of web services including information about a specific web service such as availability, versions, changing cost and and offline copy if the service allows this. So the agent becomes a Rendezvous Point for a series of web services. &lt;br /&gt;&lt;br /&gt;So why aren't we seeing more collaboration between the Agent community and the Web Services community?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-6139828693528407847?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-6139828693528407847</guid>
         <pubDate>Wed, 16 Jul 2008 06:19:00 -0700</pubDate>
      </item>
      <item>
         <title>jimdowning: Anyone at #jif08 wants me to give their project some publicity in #crig in DC next week, grab me in a break. I'll be near a power socket.</title>
         <link>http://twitter.com/jimdowning/statuses/858844575</link>
         <description>jimdowning: Anyone at #jif08 wants me to give their project some publicity in #crig in DC next week, grab me in a break. I'll be near a power socket.</description>
         <guid isPermaLink="false">http://twitter.com/jimdowning/statuses/858844575</guid>
         <pubDate>Tue, 15 Jul 2008 02:45:03 -0700</pubDate>
      </item>
      <item>
         <title>OSCELOT Open Source Day III - views</title>
         <link>http://oxfordrepo.blogspot.com/2008/07/oscelot-open-source-day-iii-views.html</link>
         <description>The event was held at the Nevada gaming institute, and was overall, a well-structured day. The driving ideology was that of the &lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Unconference"&gt;unconference&lt;/a&gt; - "... a facilitated, face-to-face, and participant-driven conference centered around a theme or purpose."&lt;br /&gt;&lt;br /&gt;However, it seemed that the theme or purpose of the event was not about Open Source - it was as if it were a Blackboard self-help group, trying to solve the issues and failings of this proprietary software. Some of the issues were a little shocking - someone proposed that they had "a need to search the content of [Blackboard Vista] repository" - it came as some surprise to me that this wasn't already possible in such a mature product.&lt;br /&gt;&lt;br /&gt;I was pleased that we were able to help and inform the other attendees about more open technology and standards, such as &lt;a rel="nofollow" target="_blank" href="http://oauth.net"&gt;OAuth&lt;/a&gt;, resource-orientated architecture, creative commons licensing and more. &lt;br /&gt;&lt;br /&gt;One session I lead on was titled - controversially - "Why [bother with] Portals?" - in which I wanted to get a discussion on what students actually use. The point I wanted to make was that URLs are the base currency of the internet - search engines produce lists of them, people bookmark them, and URLs are used when sharing information between people. &lt;br /&gt;&lt;br /&gt;This means that there is a very large responsibility on the content providers &lt;span style="font-weight:bold;"&gt;not&lt;/span&gt; to change URLs, or they will devalue the very resources they are trying to get people to use. This is the reason why persistent URLs are a crucial thing to aim for.&lt;br /&gt;&lt;br /&gt;I hope that we were able to bring extra value to the meeting, due to the fact that, unlike the vast majority of attendees, we do not have a Blackboard background.&lt;br /&gt;&lt;br /&gt;However, I do think that the event needed to have more emphasis on real-world open source projects such as Sakai and Moodle, and examine how best to intergrate their systems with external systems.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3090914606822911489-6727893367281856190?l=oxfordrepo.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>Ben O'Steen</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3090914606822911489.post-6727893367281856190</guid>
         <pubDate>Mon, 14 Jul 2008 08:24:00 -0700</pubDate>
      </item>
      <item>
         <title>#crigshow - Conference 1 - Oscelot</title>
         <link>http://davetaz-blog.blogspot.com/2008/07/crigshow-conference-1-oscelot.html</link>
         <description>This open source day (#osdiii) hosted by Oscelot was an unconferene which soon became based heavily around the Blackboard platform. This was expected as the majority of people attending it were then going on to attend the BbWorld conference. With the title of the conference being Open Source and yet the main topic being that of a Closed Source product this gave an opening for the CRIG team to promote the wider Open Source community to those who are focused on Blackboard use cases. &lt;br /&gt;&lt;br /&gt;The day was a success for the team as we promoted good practices in web development, standards, resource management and the fact that the people who manage an eLearning platform has a responsibility to the content they hold. &lt;br /&gt;&lt;br /&gt;From our point of view, we discovered: If blackboard is the industry leader in learning management systems then the repository community is big problems when it comes to archiving these resources by the current methodologies each community practices. &lt;br /&gt;&lt;br /&gt;More Collaboration and Awareness please!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-8889383351258675109?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-8889383351258675109</guid>
         <pubDate>Mon, 14 Jul 2008 06:00:00 -0700</pubDate>
      </item>
      <item>
         <title>dfflanders: @paulwalk just met with tiddlywiki team, they are interested to see tw as a scholarly tool / integration into repo? #crig</title>
         <link>http://twitter.com/dfflanders/statuses/850660240</link>
         <description>dfflanders: @paulwalk just met with tiddlywiki team, they are interested to see tw as a scholarly tool / integration into repo? #crig</description>
         <guid isPermaLink="false">http://twitter.com/dfflanders/statuses/850660240</guid>
         <pubDate>Sat, 05 Jul 2008 05:21:23 -0700</pubDate>
      </item>
      <item>
         <title>dfflanders: #crig #sword much easier to use eduroam here at BBK than guest access, see config here: http://tinyurl.com/59wra9</title>
         <link>http://twitter.com/dfflanders/statuses/847477260</link>
         <description>dfflanders: #crig #sword much easier to use eduroam here at BBK than guest access, see config here: http://tinyurl.com/59wra9</description>
         <guid isPermaLink="false">http://twitter.com/dfflanders/statuses/847477260</guid>
         <pubDate>Tue, 01 Jul 2008 00:56:23 -0700</pubDate>
      </item>
      <item>
         <title>dfflanders: #crig #sword room is air conditioned</title>
         <link>http://twitter.com/dfflanders/statuses/847475211</link>
         <description>dfflanders: #crig #sword room is air conditioned</description>
         <guid isPermaLink="false">http://twitter.com/dfflanders/statuses/847475211</guid>
         <pubDate>Tue, 01 Jul 2008 00:51:20 -0700</pubDate>
      </item>
      <item>
         <title>dfflanders: #crig #sword in room 538 setting up for SWORD2 meeting, donuts good (don't think they will last 'til the 10.30 start)</title>
         <link>http://twitter.com/dfflanders/statuses/847465687</link>
         <description>dfflanders: #crig #sword in room 538 setting up for SWORD2 meeting, donuts good (don't think they will last 'til the 10.30 start)</description>
         <guid isPermaLink="false">http://twitter.com/dfflanders/statuses/847465687</guid>
         <pubDate>Tue, 01 Jul 2008 00:28:22 -0700</pubDate>
      </item>
      <item>
         <title>OAI-PMH + OAI-ORE (Atom) + Pronom Droid = Pretty</title>
         <link>http://davetaz-blog.blogspot.com/2008/06/oai-pmh-oai-ore-atom-pronom-droid.html</link>
         <description>I've just finished writing a wrapper (very simple!) which takes a &lt;a rel="nofollow" target="_blank" href="http://www.openarchives.org/ore/"&gt;OAI-ORE&lt;/a&gt; Resource Map in Atom Format and classifies the objects which are listed in the Aggregation using the National Archives (UK) technical registry (&lt;a rel="nofollow" target="_blank" href="http://www.nationalarchives.gov.uk/pronom/"&gt;Pronom&lt;/a&gt;). &lt;br /&gt;&lt;br /&gt;The wrapper provides a simple front end to the &lt;a rel="nofollow" target="_blank" href="http://droid.sourceforge.net/wiki/index.php/Introduction"&gt;DROID tool&lt;/a&gt;, it takes an &lt;a rel="nofollow" target="_blank" href="http://www.openarchives.org/pmh/"&gt;OAI-PHM&lt;/a&gt; URI and requests the latest resource maps in atom format (ore-atom) and creates a list of the resources which are passed to DROID to classify directly. &lt;br /&gt;&lt;br /&gt;The wrapper requires OAI-PMH as it requests all records which have been modified since it last did a parse of the repository. This way the wrapper can be scheduled to run once a day/week/month etc. &lt;br /&gt;&lt;br /&gt;A single DROID xml file comes back as the output. &lt;br /&gt;&lt;br /&gt;This is all working with EPrints repository software currently.&lt;br /&gt;&lt;br /&gt;Next stage is to do something useful with the output xml in terms of providing useful data back to the repository manager.&lt;br /&gt;&lt;br /&gt;Total lines of source code for the wrapper: 302 :)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-6418976742344708314?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-6418976742344708314</guid>
         <pubDate>Thu, 26 Jun 2008 22:56:00 -0700</pubDate>
      </item>
      <item>
         <title>ORE software libraries from Foresite</title>
         <link>http://chronicles-of-richard.blogspot.com/2008/06/foresite-1-project-is-pleased-to.html</link>
         <description>The Foresite [1] project is pleased to announce the initial code of two software libraries for constructing, parsing, manipulating and serialising OAI-ORE [2] Resource Maps. These libraries are being written in Java and Python, and can be used generically to provide advanced functionality to OAI-ORE aware applications, and are compliant with the latest release (0.9) of the specification. The software is open source, released under a BSD licence, and is available from a Google Code repository:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://code.google.com/p/foresite-toolkit/"&gt;http://code.google.com/p/foresite-toolkit/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;You will find that the implementations are not absolutely complete yet, and are lacking good documentation for this early release, but we will be continuing to develop this software throughout the project and hope that it will be of use to the community immediately and beyond the end of the project.&lt;br /&gt;&lt;br /&gt;Both libraries support parsing and serialising in: ATOM, RDF/XML, N3, N-Triples, Turtle and RDFa&lt;br /&gt;&lt;br /&gt;Foresite is a JISC [3] funded project which aims to produce a demonstrator and test of the OAI-ORE standard by creating Resource Maps of journals and their contents held in JSTOR [4], and delivering them as ATOM documents via the SWORD [5] interface to DSpace [6]. DSpace will ingest these resource maps, and convert them into repository items which reference content which continues to reside in JSTOR. The Python library is being used to generate the resource maps from JSTOR and the Java library is being used to provide all the ingest, transformation and dissemination support required in DSpace.&lt;br /&gt;&lt;br /&gt;Please feel free to download and play with the source code, and let us have your feedback via the Google group:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="mailto:foresite@googlegroups.com"&gt;foresite@googlegroups.com&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Richard Jones &amp; Rob Sanderson&lt;br /&gt;&lt;br /&gt;[1] Foresite project page: &lt;a rel="nofollow" target="_blank" href="http://foresite.cheshire3.org/"&gt;http://foresite.cheshire3.org/&lt;/a&gt;&lt;br /&gt;[2] OAI-ORE specification: &lt;a rel="nofollow" target="_blank" href="http://www.openarchives.org/ore/0.9/toc"&gt;http://www.openarchives.org/ore/0.9/toc&lt;/a&gt;&lt;br /&gt;[3] Joint Information Systems Committee (JISC): &lt;a rel="nofollow" target="_blank" href="http://www.jisc.ac.uk/"&gt;http://www.jisc.ac.uk/&lt;/a&gt;&lt;br /&gt;[4] JSTOR: &lt;a rel="nofollow" target="_blank" href="http://www.jstor.org/"&gt;http://www.jstor.org/&lt;/a&gt;&lt;br /&gt;[5] Simple Web Service Offering Repository Deposit (SWORD):&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/repositories/digirep/index/SWORD"&gt;http://www.ukoln.ac.uk/repositories/digirep/index/SWORD&lt;/a&gt;&lt;br /&gt;[6] DSpace: &lt;a rel="nofollow" target="_blank" href="http://www.dspace.org/"&gt;http://www.dspace.org/&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-4049331535370251135?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/CiZe1c6fjFs" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-4049331535370251135</guid>
         <pubDate>Mon, 09 Jun 2008 07:56:00 -0700</pubDate>
      </item>
      <item>
         <title>Repository Software is Dead</title>
         <link>http://davetaz-blog.blogspot.com/2008/06/repository-software-is-dead.html</link>
         <description>Repository Software for digital collections as we know it supplies the complete solution to the client, thus without the software you cannot access any of the data in your repository. This is a bad thing for object reuse and digital preservation! &lt;br /&gt;&lt;br /&gt;Many people at conferences such as &lt;a rel="nofollow" target="_blank" href="http://or08.ecs.soton.ac.uk"&gt;Open Repositories 2008&lt;/a&gt; and from workgroups like &lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/repositories/digirep/index/CRIG"&gt;CRIG&lt;/a&gt; have been talking for a long while about the importance of Interoperability. However, if you get rid of the need for the interoperability and use a standard specification for accessing simple data objects (pdfs and their metadata), then you don't need interoperability! &lt;br /&gt;&lt;br /&gt;So this leads me to the fact that &lt;a rel="nofollow" target="_blank" href="http://www.eprints.org"&gt;EPrints&lt;/a&gt;, &lt;a rel="nofollow" target="_blank" href="http://www.fedora-commons.org/"&gt;Fedora&lt;/a&gt; and hopefully at some point &lt;a rel="nofollow" target="_blank" href="http://www.dspace.org"&gt;DSpace&lt;/a&gt; are abstracting their database and storage layers to support use of any type of storage platform. Thanks goes &lt;a rel="nofollow" target="_blank" href="http://www.sun.com"&gt;SUN Microsystems&lt;/a&gt; preservation action group and open storage group for pushing this work from a commercial perspective. But we need to go further than this to get rid of the need for interoperability.&lt;br /&gt;&lt;br /&gt;From &lt;a rel="nofollow" target="_blank" href="http://or08.ecs.soton.ac.uk"&gt;Open Repositories 2008&lt;/a&gt;, myself and a college &lt;a rel="nofollow" target="_blank" href="http://oxfordrepo.blogspot.com/"&gt;Ben O'Steen&lt;/a&gt; from Oxford University proved how &lt;a rel="nofollow" target="_blank" href="http://www.ope anarchives.org/ore/"&gt;OAI-ORE (OAI specification for Object Reuse and Exchange)&lt;/a&gt; can be used to enable high level repository interoperability. &lt;a rel="nofollow" target="_blank" href="http://www.preserv.org.uk/?page=oai-ore"&gt;This work won us $5000&lt;/a&gt; but more importantly got the community thinking about the true power of a specification like OAI-ORE. Ben and I are now hoping to push this work down to the low level storage such that the objects within an ORE map (documents and metadata) can be directly referenced without the need for the current repository layer. For this to happen &lt;b&gt;all objects need to be stored in their simplest form - NO WRAPPER FORMATS ALLOWED at the lowest level&lt;/b&gt;. &lt;br /&gt;&lt;br /&gt;From recent talks with Sandy Payette and Les Carr (Fedora and EPrints respectively) I am envisaging that the current repository software becomes classified as repository service software which is able to manage low level objects but is not specifically required to access these objects. So current services which plug into the repository software can act directly on the objects.&lt;br /&gt;&lt;br /&gt;A couple of problems to solve, security and consistency of cached data. All especially applicable if you have more than one piece of repository service software modifying your objects.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-6344307351894496505?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-6344307351894496505</guid>
         <pubDate>Sat, 07 Jun 2008 21:43:00 -0700</pubDate>
      </item>
      <item>
         <title>CRIG / IEDemonstator After Thoughts</title>
         <link>http://davetaz-blog.blogspot.com/2008/06/crig-iedemonstator-after-thoughts.html</link>
         <description>IEDemonstrator is a really bad name for a project as it just says Microsoft to me but I'm fairly it isn't anything to do with that most stable of web browsers.&lt;br /&gt;&lt;br /&gt;From the workshop it has become clear to me that discussing a specification for service interaction globally is going to be impossible. This could be due to the fact that SOAP did such a good job of it and no one wants to use anything else (enough sarcasm??). I think many people left the workshop with a much better idea at how HTTP error codes (which have been around years) already go most of the way to solving a web service model. We also realised quickly that any specification would have to be built specifically for pay services (e.g. make use of the 402 code), this would then encourage companies/institutions to supply reliable services which last more than 4 years (cough AHDS cough).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-8557063529511560836?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-8557063529511560836</guid>
         <pubDate>Sat, 07 Jun 2008 21:35:00 -0700</pubDate>
      </item>
      <item>
         <title>First Post - CRIG DRY Workshop</title>
         <link>http://davetaz-blog.blogspot.com/2008/06/first-post-crig-dry-workshop.html</link>
         <description>Well there's a surprise!&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_DRY_Workshop"&gt;CRIG DRY Workshop&lt;/a&gt; in Bath is where I am now. So what's happening:&lt;br /&gt;&lt;br /&gt;People have been talking about services and proposed projects to provide authoritative and complete services to users/agents/repositories. A couple of themes have come out morning session for me:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.w3.org/2004/02/skos/"&gt;SKOS: &lt;/a&gt; A lot of projects (incl. &lt;a rel="nofollow" target="_blank" href="http://lcsh.info"&gt;Library of Congress&lt;/a&gt;) are using this RDF language to describe subject and properties. Each provides access to this information in so many different ways it is hard to see how to interact in a constant manor.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Service Interaction&lt;/b&gt; (read on as the name is not that descriptive)&lt;br /&gt;&lt;br /&gt;This moves us on from the Open Storage stuff i've been working on (again more later in another blog post) into how we facilitate the use of services and discover how to interact with these services. We are pushing for the use of http codes! CRIG it. &lt;br /&gt;&lt;br /&gt;Tis it for now....&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-1214878339545743666?l=davetaz-blog.blogspot.com'/&gt;&lt;/div&gt;</description>
         <author>davetaz</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-4926451824261299693.post-1214878339545743666</guid>
         <pubDate>Thu, 05 Jun 2008 21:43:00 -0700</pubDate>
      </item>
      <item>
         <title>dfflanders: its been officially announced on listservs: DSpace and Fedora will collaborate on development #crig</title>
         <link>http://twitter.com/dfflanders/statuses/825051785</link>
         <description>dfflanders: its been officially announced on listservs: DSpace and Fedora will collaborate on development #crig</description>
         <guid isPermaLink="false">http://twitter.com/dfflanders/statuses/825051785</guid>
         <pubDate>Mon, 02 Jun 2008 01:24:32 -0700</pubDate>
      </item>
      <item>
         <title>dfflanders: testing #crig twemes tag for RSS input into CRIG blog planet (death to listservs!)</title>
         <link>http://twitter.com/dfflanders/statuses/786601816</link>
         <description>dfflanders: testing #crig twemes tag for RSS input into CRIG blog planet (death to listservs!)</description>
         <guid isPermaLink="false">http://twitter.com/dfflanders/statuses/786601816</guid>
         <pubDate>Thu, 10 Apr 2008 10:21:23 -0700</pubDate>
      </item>
      <item>
         <title>More online information on our repository platform</title>
         <link>http://santoshb.spaces.live.com/Blog/cns!4E66AFE14080180C!1109.entry</link>
         <description>&lt;p&gt;&lt;strong&gt;Demo&lt;/strong&gt; - &lt;/p&gt; &lt;div style="padding-right:0px;display:inline;padding-left:0px;padding-bottom:0px;margin:0px;padding-top:0px;"&gt;&lt;div&gt;&lt;/div&gt;&lt;/div&gt; &lt;p&gt;Savas Parastatidis (architect of the Microsoft repository platform) created a demo for Open Repositories Challenge 2008 that shows the ability to pull data from two repositories, create the appropriate relationships and display the relationships in a WPF based network graph. &lt;/p&gt; &lt;p&gt;He pulls the relationships b/w people and resources from &lt;a rel="nofollow" target="_blank" href="http://www.myexperiment.org/"&gt;MyExperiment&lt;/a&gt;, and resources from &lt;a rel="nofollow" target="_blank" href="http://savas.parastatidis.name/2008/03/24/19a7fa4a-560d-47b4-bbea-9a22e9824bb8.aspx"&gt;Microsoft Research Output Repository Platform&lt;/a&gt;. He then creates a network graph displaying relationships b/w people and resources from the two repositories. He also displays a preview when the appropriate resource or person is selected. For ex. in the demo, you will notice upon selecting a video, the video is played. &lt;/p&gt; &lt;p&gt;This kind of visualization of relationships using network graphs on web pages (using technologies like Silverlight) is truly very useful to visualize the social networking relationships. &lt;/p&gt; &lt;p&gt;Additional info - &lt;a rel="nofollow" title="http://savas.parastatidis.name/2008/04/03/2e308e65-a670-40a3-80ae-a283de1ee917.aspx" target="_blank" href="http://savas.parastatidis.name/2008/04/03/2e308e65-a670-40a3-80ae-a283de1ee917.aspx"&gt;http://savas.parastatidis.name/2008/04/03/2e308e65-a670-40a3-80ae-a283de1ee917.aspx&lt;/a&gt;&lt;/p&gt; &lt;p&gt;&lt;strong&gt;Blog&lt;/strong&gt; - &lt;a rel="nofollow" title="http://orweblog.oclc.org/archives/001591.html" target="_blank" href="http://orweblog.oclc.org/archives/001591.html"&gt;http://orweblog.oclc.org/archives/001591.html&lt;/a&gt; &lt;/p&gt; &lt;div style="padding-right:0px;display:inline;padding-left:0px;float:none;padding-bottom:0px;margin:0px;padding-top:0px;"&gt;del.icio.us Tags: &lt;a rel="nofollow" target="_blank" href="http://del.icio.us/popular/Microsoft Research Output Repository Platform"&gt;Microsoft Research Output Repository Platform&lt;/a&gt;,&lt;a rel="nofollow" target="_blank" href="http://del.icio.us/popular/OR08"&gt;OR08&lt;/a&gt;&lt;/div&gt;</description>
         <guid isPermaLink="false">4E66AFE14080180C!1109</guid>
         <pubDate>Fri, 04 Apr 2008 12:22:38 -0700</pubDate>
         <category>Computers and Internet</category>
      </item>
      <item>
         <title>Successful Presentation of the Microsoft Research Output Repository Platform at Open Repositories Conference</title>
         <link>http://santoshb.spaces.live.com/Blog/cns!4E66AFE14080180C!1108.entry</link>
         <description>&lt;p&gt;&lt;p&gt;&amp;lt;EDITED UpdatedDate="Apr 4 08"&amp;gt; &lt;p&gt;Slides - &lt;a rel="nofollow" title="http://pubs.or08.ecs.soton.ac.uk/84/1/Research_Output_Repositories_-_Microsoft_Initiatives.pdf" target="_blank" href="http://pubs.or08.ecs.soton.ac.uk/84/1/Research_Output_Repositories_-_Microsoft_Initiatives.pdf"&gt;http://pubs.or08.ecs.soton.ac.uk/84/1/Research_Output_Repositories_-_Microsoft_Initiatives.pdf&lt;/a&gt; &lt;p&gt;&amp;lt;/EDITED&amp;gt; &lt;p&gt;The Presentation at the Conference today went well (Monday session was held by Microsoft for interop b/w repositories, with a few known members of the community). This session was open to all members of the conference - we had a decent turnout. &lt;p&gt;Lee Dirks went through the charter of TCI team, and Microsoft, in the Scholarly Communications space, followed by my presentation and Savas' demos providing the "Wow!" factor. &lt;p&gt;Overall the response/questions were positive. We had a few folks talk to us after the presentation for additional information and explanation. &lt;p&gt;-santosh.</description>
         <guid isPermaLink="false">4E66AFE14080180C!1108</guid>
         <pubDate>Wed, 02 Apr 2008 15:16:17 -0700</pubDate>
         <category>Computers and Internet</category>
      <enclosure url="http://pubs.or08.ecs.soton.ac.uk/84/1/Research_Output_Repositories_-_Microsoft_Initiatives.pdf" length="1720033" type="application/pdf" /><media:content url="http://pubs.or08.ecs.soton.ac.uk/84/1/Research_Output_Repositories_-_Microsoft_Initiatives.pdf" fileSize="1720033" type="application/pdf" /><itunes:subtitle> &amp;lt;EDITED UpdatedDate="Apr 4 08"&amp;gt; Slides - http://pubs.or08.ecs.soton.ac.uk/84/1/Research_Output_Repositories_-_Microsoft_Initiatives.pdf &amp;lt;/EDITED&amp;gt; The Presentation at the Conference today went well (Monday session was held by Microsoft for int</itunes:subtitle><itunes:summary> &amp;lt;EDITED UpdatedDate="Apr 4 08"&amp;gt; Slides - http://pubs.or08.ecs.soton.ac.uk/84/1/Research_Output_Repositories_-_Microsoft_Initiatives.pdf &amp;lt;/EDITED&amp;gt; The Presentation at the Conference today went well (Monday session was held by Microsoft for interop b/w repositories, with a few known members of the community). This session was open to all members of the conference - we had a decent turnout. Lee Dirks went through the charter of TCI team, and Microsoft, in the Scholarly Communications space, followed by my presentation and Savas' demos providing the "Wow!" factor. Overall the response/questions were positive. We had a few folks talk to us after the presentation for additional information and explanation. -santosh.</itunes:summary><itunes:keywords>Computers and Internet</itunes:keywords></item>
      <item>
         <title>Microsoft Research Output Repository Platform Previewed</title>
         <link>http://santoshb.spaces.live.com/Blog/cns!4E66AFE14080180C!1104.entry</link>
         <description>&lt;div&gt;Project Info - &lt;a rel="nofollow" target="_blank" href="http://savas.parastatidis.name/2008/03/24/19a7fa4a-560d-47b4-bbea-9a22e9824bb8.aspx"&gt;http://savas.parastatidis.name/2008/03/24/19a7fa4a-560d-47b4-bbea-9a22e9824bb8.aspx&lt;/a&gt; &lt;/div&gt; &lt;div&gt; &lt;/div&gt; &lt;div&gt;Microsoft presentation of the Scholarly Communications space and Research Output Repository platform on Monday (3/31) went very well. this was followed by presentation by leaders/members of other repositories, protocols in the Institutional Repository space. There was a rich discussion around repositories, interop between repositories, protocols etc. &lt;/div&gt; &lt;div&gt; &lt;/div&gt; &lt;div&gt;Live blog notes taken by one of the participants - Andy Powell - &lt;a rel="nofollow" target="_blank" href="http://andypowe11.net/coveritlive/"&gt;http://andypowe11.net/coveritlive/&lt;/a&gt; &lt;/div&gt; &lt;div&gt;Blog notes by another participant - &lt;span style="font-size:12pt;font-family:'Calibri', 'sans-serif';"&gt;&lt;font color="#000000"&gt;&lt;font size="2"&gt;Richard Akerman &lt;/font&gt;- &lt;/font&gt;&lt;/span&gt; &lt;a rel="nofollow" target="_blank" href="http://scilib.typepad.com/science_library_pad/2008/04/microsoft-summi.html"&gt;http://scilib.typepad.com/science_library_pad/2008/04/microsoft-summi.html&lt;/a&gt;&lt;/div&gt; &lt;div&gt; &lt;/div&gt; &lt;div&gt;Our team will be posting notes from the meeting on 3/31 tomorrow. &lt;/div&gt; &lt;div&gt; &lt;/div&gt; &lt;div&gt;Getting ready for another presentation in the Open Repositories conference in an hour. :-)&lt;/div&gt; &lt;div&gt; &lt;/div&gt; &lt;div&gt;-santosh.&lt;/div&gt;</description>
         <guid isPermaLink="false">4E66AFE14080180C!1104</guid>
         <pubDate>Wed, 02 Apr 2008 10:59:17 -0700</pubDate>
         <category>Computers and Internet</category>
      </item>
      <item>
         <title>CRIG Flipchart Outputs</title>
         <link>http://chronicles-of-richard.blogspot.com/2008/01/crig-flipchart-outputs.html</link>
         <description>The JISC &lt;a rel="nofollow" target="_blank" href="http://chronicles-of-richard.blogspot.com/search/label/CRIG"&gt;CRIG&lt;/a&gt; meeting which I previously &lt;a rel="nofollow" target="_blank" href="http://chronicles-of-richard.blogspot.com/2007/12/crig-meeting-day-1-1.html"&gt;live-blogged&lt;/a&gt; from has now had its output formulated into a series of slides with annotations on Flickr, which can be found here:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.flickr.com/photos/wocrig/"&gt;http://www.flickr.com/photos/wocrig/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The process by which this was achieved was through an intense round of brain-storming sessions culminating in a room full of topic spaced flip chart sheets. We then performed a &lt;a rel="nofollow" target="_blank" href="http://www.dotmocracy.org/"&gt;Dotmocracy&lt;/a&gt;, and the results that you see on the Flickr page are the ideas which made it through the process as having some interest invested in them.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-4749146878939399170?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/emYCzRUx-Pk" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-4749146878939399170</guid>
         <pubDate>Thu, 24 Jan 2008 06:23:00 -0800</pubDate>
      </item>
      <item>
         <title>European ORE Roll-Out at Open Repositories 2008</title>
         <link>http://chronicles-of-richard.blogspot.com/2008/01/european-ore-roll-out-at-open.html</link>
         <description>The European leg of the ORE roll-out has been announced and will occur on the final day of the &lt;a rel="nofollow" target="_blank" href="http://or08.ecs.soton.ac.uk/"&gt;Open Repositories 2008&lt;/a&gt; conference in Southampton, UK. This is to complement the meeting at Johns Hopkins University in Baltimore on March 3. From the email circular:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;A meeting will be held on April 4, 2008 at the University of Southampton, in conjunction with Open Repositories 2008, to roll-out the beta release of the OAI-ORE specifications. This meeting is the European follow-on to a meeting that will be held in the USA on March 3, 2008 at Johns Hopkins University.&lt;br /&gt;&lt;br /&gt;The OAI-ORE specifications describe a data model to identify and describe aggregations of web resources, and they introduce machine-readable formats to describe these aggregations based on ATOM and RDF/XML. The current, alpha version of the OAI-ORE specifications is at &lt;a rel="nofollow" target="_blank" href="http://www.openarchives.org/ore/0.1/"&gt;http://www.openarchives.org/ore/0.1/&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Additional details for the OAI-ORE European Open Meeting are available at:&lt;br /&gt;&lt;br /&gt;- The full press release for this event:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.openarchives.org/ore/documents/EUKickoffPressrelease.pdf"&gt;http://www.openarchives.org/ore/documents/EUKickoffPressrelease.pdf&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;- The registration site for the event:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://regonline.com/eu-oai-ore"&gt;http://regonline.com/eu-oai-ore&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Note that registration is required and space is limited.&lt;br /&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-8526639596679087835?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/lsmJMN_BEro" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-8526639596679087835</guid>
         <pubDate>Wed, 23 Jan 2008 09:11:00 -0800</pubDate>
      <enclosure url="http://www.openarchives.org/ore/documents/EUKickoffPressrelease.pdf" length="59937" type="application/pdf" /><media:content url="http://www.openarchives.org/ore/documents/EUKickoffPressrelease.pdf" fileSize="59937" type="application/pdf" /><itunes:subtitle>The European leg of the ORE roll-out has been announced and will occur on the final day of the Open Repositories 2008 conference in Southampton, UK. This is to complement the meeting at Johns Hopkins University in Baltimore on March 3. From the email circ</itunes:subtitle><itunes:author>Richard</itunes:author><itunes:summary>The European leg of the ORE roll-out has been announced and will occur on the final day of the Open Repositories 2008 conference in Southampton, UK. This is to complement the meeting at Johns Hopkins University in Baltimore on March 3. From the email circular: A meeting will be held on April 4, 2008 at the University of Southampton, in conjunction with Open Repositories 2008, to roll-out the beta release of the OAI-ORE specifications. This meeting is the European follow-on to a meeting that will be held in the USA on March 3, 2008 at Johns Hopkins University. The OAI-ORE specifications describe a data model to identify and describe aggregations of web resources, and they introduce machine-readable formats to describe these aggregations based on ATOM and RDF/XML. The current, alpha version of the OAI-ORE specifications is at http://www.openarchives.org/ore/0.1/. Additional details for the OAI-ORE European Open Meeting are available at: - The full press release for this event: http://www.openarchives.org/ore/documents/EUKickoffPressrelease.pdf - The registration site for the event: http://regonline.com/eu-oai-ore Note that registration is required and space is limited. </itunes:summary></item>
      <item>
         <title>Fine Grained Repository Interoperability: can't package, won't package</title>
         <link>http://chronicles-of-richard.blogspot.com/2008/01/fine-grained-repository.html</link>
         <description>Sadly (although some of you may not agree!), my paper proposed for this year's Open Repositories conference in Southampton has not made it through the Programme Committee. I include here, therefore, my submission so that it may live on, and you can get an idea of the sorts of things I was thinking about talking about.&lt;br /&gt;&lt;br /&gt;The reasons given for not accepting it are probably valid; mostly concerning a lack of focus. Honestly, I thought it did a pretty good job of saying what I would talk about, but such is life.&lt;br /&gt;&lt;br /&gt;&lt;hr/&gt;&lt;br /&gt;&lt;br /&gt;What is the point of interoperability, what might it allow us to achieve, and why aren't we very good at it yet? &lt;br /&gt;&lt;br /&gt;Interoperability is a loosely defined concept. It can allow systems to talk to each other about the information that they hold, about the information that they can disseminate, and to interchange that information. It can allow us to tie systems together to improve ingest and dissemination of repository holdings, and allows us to distribute repository functions across multiple systems. It ought even to allow us to offer repository services to systems which don't do so natively, improving the richness of the information space; repository interoperability is not just about repository to repository, it is also about cross-system communications. The maturing set of repositories such as DSpace, Fedora and EPrints and other information systems such as publications management tools and research information systems, as well as home-spun solutions are making the task of taking on the interoperability beast both tangible and urgent.&lt;br /&gt;&lt;br /&gt;Traditional approaches to interoperability have often centred around moving packaged information between systems (often other repositories). The effect this has is to introduce a black-box problem concerning the content of the package itself. We are no longer transferring information, we are transferring data! It therefore becomes necessary to introduce package descriptors which allow the endpoint to re-interpret the package correctly, to turn it back into information. But this constrains us very tightly in the form of our packages, and introduces a great risk of data loss. Furthermore, it means that we cannot perform temporally and spatially disparate interoperability on an object level (that is, assemble an object's content over a period of time, and from a variety of sources). A more general approach to information interchange may be more powerful. &lt;br /&gt;&lt;br /&gt;This paper brings together a number of sources. It discusses some of the work undertaken at Imperial College London to connect a distributed repository system (built on top of DSpace) to an existing information environment. This provides repository services to existing systems, and offers library administrators custom repository management tools in an integrated way. It also considers some of the thoughts arising from the JISC Common Repository Interfaces Group (CRIG) in this area, as well as some speculative proposals for future work and further ideas that may need to be explored.&lt;br /&gt;&lt;br /&gt;Where do we start? The most basic way to address this problem is to break the idea of the package down into its most simple component parts in the context of a repository: the object metadata, the file content, and the use rights metadata. Using this approach, you can go a surprisingly long way down the interoperability route without adding further complexity. At the heart of the Imperial College Digital Repository is a set of web services which deal with exactly this fine structure of the package, because the content for the repository may be fed from a number of sources over a period of time, and thus there never is a definitive package.&lt;br /&gt;&lt;br /&gt;These sorts of operations are not new, though, and there are a variety of approaches to it which have already been undertaken. For example, WebDAV offers extensions to HTTP to deal with objects using operations such as PUT, COPY or MOVE which could be used to achieve the effects that we desire. The real challenge, therefore, is not in the mechanics of the web services which we use to exchange details about this deconstructed package, but is in the additional complexities which we can introduce to enhance the interoperability of our systems and provide the value-added services which repositories wish to offer. &lt;br /&gt;&lt;br /&gt;Consider some other features of interoperability which might be desirable&lt;br /&gt;&lt;br /&gt;- &lt;strong&gt;fine grained or partial metadata records.&lt;/strong&gt; We may wish to ingest partial records from a variety of sources to assemble into a single record, or disseminate only subsets of our stored metadata.&lt;br /&gt;- &lt;strong&gt;file metadata&lt;/strong&gt;, or any other sub-structure element of the object. This may include bibliographic, administrative or technical metadata.&lt;br /&gt;object structural information, to allow complex hierarchies and relationships to be expressed and modified.&lt;br /&gt;- &lt;strong&gt;versioning&lt;/strong&gt;, and other inter-object relationships.&lt;br /&gt;- &lt;strong&gt;workflow status&lt;/strong&gt;, if performing deposit across multiple systems, it may be necessary to be aware of the status of the object in each system to calculate an overall state.&lt;br /&gt;- &lt;strong&gt;state and provenance reporting&lt;/strong&gt;, to offer feedback on the repository state to other information systems, administrators or users.&lt;br /&gt;- &lt;strong&gt;statistics&lt;/strong&gt;, to allow content delivery services to aggregate statistics globally.&lt;br /&gt;- &lt;strong&gt;identifiers&lt;/strong&gt;, to support multiple identification schemes.&lt;br /&gt;&lt;br /&gt;Techniques such as application profiling for metadata allow us to frame entire metadata records in terms of their interpretation (e.g. the Scholarly Works Application Profile (SWAP)), but should also be used to frame individual metadata elements. Object structural data can be encoded using standards such as METS, which can also help us with attaching metadata to sub-structures of the object itself, such as its files. Versioning, and other inter-object relationships could be achieved using an RDF approach, and perhaps the OAI-ORE project will offer some guidance. But other operations such as workflow status, and state and provenance reporting do not have such clear approaches. Meanwhile, the Interoperable Repository Statistics (IRS) project has looked at the statistics problem, and the RIDIR project is looking into interoperable identifiers. In these latter cases, can we ever consider providing access to their outcomes or services through some general fine grained interface?&lt;br /&gt;&lt;br /&gt;The Imperial College Digital Repository offers limited file metadata which is attached during upload and exposed as part of a METS record, detailing the entire digital object, as a descriptive metadata section. It can deal with the idea that some metadata comes from one source, while other metadata comes from another, allowing for a primitive partial metadata interchange process. Conversely, it will also deal with multiple metadata records for the same item. Also introduced are custom workflow metadata fields which allow some basic interaction between different systems to track deposit of objects both from the point of view of the administrator, the depositor and the systems themselves. In addition, there is an extensible notifications engine which is used to produce periodic reports to all depositors whose content has undergone some sort of modification or interesting event in a given time period. This notifications engine is behind a very generic web service which offers extreme flexibility within the context of the College's information environment.&lt;br /&gt;&lt;br /&gt;Important work in the fields that will help achieve this interoperability include the SWORD deposit mechanism which currently deals with packages but may be extensible to include these much needed enhancements. Meanwhile, the OAI-ORE will be able to provide the semantics for complex objects which will no doubt assist in framing the problems that interoperability faces in a manor in which they can be solved.&lt;br /&gt;&lt;br /&gt;Other examples of the spaces in which interoperability needs to work would include the EThOSnet project, the UK national e-theses effort, where it is conceivable that institutions may want to provide their own e-theses submission system with integration into the central hub to offer seamless distributed submission. Or in the relationship between Current Research Information Systems (CRIS) and open access repositories, to offer a full-stack information environment for researchers and administrators alike. The possibilities are extensive and the benefit to the research community would be truly great. HP Labs is actively researching in these and related areas with its continued work on the DSpace platform.&lt;br /&gt;&lt;br /&gt;&lt;hr/&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-2615047246925194437?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/ykFbvAZETZs" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-2615047246925194437</guid>
         <pubDate>Tue, 22 Jan 2008 10:56:00 -0800</pubDate>
      </item>
      <item>
         <title>SWORD/ORE</title>
         <link>http://chronicles-of-richard.blogspot.com/2008/01/swordore.html</link>
         <description>Last week I was at the ORE meeting in Washington DC, and presented some thoughts regarding SWORD and its relationship to ORE. The slides I presented can be found here:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://wiki.dspace.org/static_files/1/1d/Sword-ore.pdf"&gt;http://wiki.dspace.org/static_files/1/1d/Sword-ore.pdf&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;[Be warned that discussion on these slides ensued, and they therefore don't reflect the most recent thinking on the topic]&lt;br /&gt;&lt;br /&gt;The overall approach of using SWORD as the infrastructure to do deposit for ORE seems sound. There are three main approaches identified:&lt;br /&gt;&lt;br /&gt;- &lt;strong&gt;SWORD is used to deposit the URI of a Resource Map onto a repository&lt;/strong&gt;&lt;br /&gt;- &lt;strong&gt;SWORD is used to deposit the Resource Map as XML onto a repository&lt;/strong&gt;&lt;br /&gt;- &lt;strong&gt;SWORD is used to deposit a package containing the digital object and its Resource Map onto a repository&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;In terms of complications there are two primary ones which concern me the most:&lt;br /&gt;&lt;br /&gt;- &lt;strong&gt;Mapping of the SWORD levels to the usage of ORE.&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;The principal issue is that level 1 implies level 0, and therefore level 2 implies level 1 and level 0. The inclusion of semantics to support ORE specifics could invoke a new level, and if this level is (for argument's sake) level 3, it implies all the levels beneath it, whatever they might require. Since the service, by this stage, is becoming complex in itself, such a linear relationship might not follow.&lt;br /&gt;&lt;br /&gt;A brief option discussed at the meeting would be to modularise the SWORD support instead of implementing a level based approach. That is, the service document would describe the actual services offered by the server, such as ORE support, NoOp support, Verbose support and so forth, with no recourse to "bundles" of functionality labelled by linear levelling.&lt;br /&gt;&lt;br /&gt;- &lt;strong&gt;Scalability of the service document&lt;/strong&gt;&lt;br /&gt;&lt;br /&gt;The mechanisms imposed by ORE allow for complex objects to be attached to other complex objects as aggregated resources (ORE term). This means that you could have a resource map which you wish to tell a repository describes a new part of an existing complex object. In order to do this, the service document will need to supply the appropriate deposit URI for a segment of an existing repository item. In DSpace semantics, for example, we may be adding a cluster of files to an existing item, and would therefore require the deposit URI of the item itself. To do otherwise would be to limit the applicability of ORE within SWORD and the repository model. Our current service document is a flat document describing what is pragmatically assumed (correctly, in virtually all cases) to be a small selection of deposit URIs. The same will not be true of item level deposit targets, which could be a very large number of possible deposit targets. Furthermore, in repositories which exploit the full descriptive capabilities of ORE, the number of deposit targets could be identical to the number of aggregations described (which can be more than one per resource map), which has the potential to be a very large number.&lt;br /&gt;&lt;br /&gt;The consequences are in scalability of response time, which is a platform specific issue, and the scalability of the document itself and the usefulness of the consequences. It may be more useful to navigate hierarchically through the different levels of the service document in order to identify deposit nodes.&lt;br /&gt;&lt;br /&gt;Any feedback on this topic is probably most useful in the &lt;a rel="nofollow" target="_blank" href="http://groups.google.com/group/oai-ore"&gt;ORE Google Group&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-9149789036743981806?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/DJEvhQnxebE" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-9149789036743981806</guid>
         <pubDate>Mon, 21 Jan 2008 05:05:00 -0800</pubDate>
      <enclosure url="http://wiki.dspace.org/static_files/1/1d/Sword-ore.pdf" length="37773" type="application/pdf" /><media:content url="http://wiki.dspace.org/static_files/1/1d/Sword-ore.pdf" fileSize="37773" type="application/pdf" /><itunes:subtitle>Last week I was at the ORE meeting in Washington DC, and presented some thoughts regarding SWORD and its relationship to ORE. The slides I presented can be found here: http://wiki.dspace.org/static_files/1/1d/Sword-ore.pdf [Be warned that discussion on th</itunes:subtitle><itunes:author>Richard</itunes:author><itunes:summary>Last week I was at the ORE meeting in Washington DC, and presented some thoughts regarding SWORD and its relationship to ORE. The slides I presented can be found here: http://wiki.dspace.org/static_files/1/1d/Sword-ore.pdf [Be warned that discussion on these slides ensued, and they therefore don't reflect the most recent thinking on the topic] The overall approach of using SWORD as the infrastructure to do deposit for ORE seems sound. There are three main approaches identified: - SWORD is used to deposit the URI of a Resource Map onto a repository - SWORD is used to deposit the Resource Map as XML onto a repository - SWORD is used to deposit a package containing the digital object and its Resource Map onto a repository In terms of complications there are two primary ones which concern me the most: - Mapping of the SWORD levels to the usage of ORE. The principal issue is that level 1 implies level 0, and therefore level 2 implies level 1 and level 0. The inclusion of semantics to support ORE specifics could invoke a new level, and if this level is (for argument's sake) level 3, it implies all the levels beneath it, whatever they might require. Since the service, by this stage, is becoming complex in itself, such a linear relationship might not follow. A brief option discussed at the meeting would be to modularise the SWORD support instead of implementing a level based approach. That is, the service document would describe the actual services offered by the server, such as ORE support, NoOp support, Verbose support and so forth, with no recourse to "bundles" of functionality labelled by linear levelling. - Scalability of the service document The mechanisms imposed by ORE allow for complex objects to be attached to other complex objects as aggregated resources (ORE term). This means that you could have a resource map which you wish to tell a repository describes a new part of an existing complex object. In order to do this, the service document will need to supply the appropriate deposit URI for a segment of an existing repository item. In DSpace semantics, for example, we may be adding a cluster of files to an existing item, and would therefore require the deposit URI of the item itself. To do otherwise would be to limit the applicability of ORE within SWORD and the repository model. Our current service document is a flat document describing what is pragmatically assumed (correctly, in virtually all cases) to be a small selection of deposit URIs. The same will not be true of item level deposit targets, which could be a very large number of possible deposit targets. Furthermore, in repositories which exploit the full descriptive capabilities of ORE, the number of deposit targets could be identical to the number of aggregations described (which can be more than one per resource map), which has the potential to be a very large number. The consequences are in scalability of response time, which is a platform specific issue, and the scalability of the document itself and the usefulness of the consequences. It may be more useful to navigate hierarchically through the different levels of the service document in order to identify deposit nodes. Any feedback on this topic is probably most useful in the ORE Google Group</itunes:summary></item>
      <item>
         <title>BMC and the Free Open Repository Trial</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/12/bmc-and-free-open-repository-trial.html</link>
         <description>Our good buddies at BioMedCentral's Open Repository team have released the latest upgrade to their service, and are offering 3 month trial repositories for evaluation. From the &lt;a rel="nofollow" target="_blank" href="http://www.dspace.org/"&gt;DSpace home page&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;BioMed Central announced the latest upgrades to Open Repository, the open access publisher's hosted repository solution. Open Repository offers institutions a cost effective repository solution (setup, hosting and maintenance) which includes new DSpace features, customization options, improved user interface. Along with the annoucement of the upgrades, Open Repository is offereing a free 3-month pilot repository, so institutions can test the suitability of the service without obligation. See the full articles in &lt;a rel="nofollow" target="_blank" href="http://newsbreaks.infotoday.com/wndReader.asp?ArticleId=40331"&gt;Weekly News Digest&lt;/a&gt; and in &lt;a rel="nofollow" target="_blank" href="http://www.alphagalileo.org/index.cfm?fuseaction=readrelease&amp;releaseid=525415"&gt;Alpha Galieo&lt;/a&gt;.&lt;br /&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-5373224028320805461?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/cxCzlAqTC1k" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-5373224028320805461</guid>
         <pubDate>Wed, 12 Dec 2007 01:36:00 -0800</pubDate>
      </item>
      <item>
         <title>CRIG Meeting Day 2 (2)</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/12/crig-meeting-day-2-2.html</link>
         <description>Topics for today:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_Unconference#Friday_December_7th"&gt;http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_Unconference#Friday_December_7th&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The ones that interest me the most are probably these:&lt;br /&gt;&lt;br /&gt;- Death to Packages&lt;br /&gt;&lt;br /&gt;Not really Death to Packages, but lets not forget that packaging sometimes isn't what we want to do or what we can do.&lt;br /&gt;&lt;br /&gt;- Get What?&lt;br /&gt;&lt;br /&gt;This harks to my ORE interest, as to what is available under the URLs, and what that means for something like content negotiation.&lt;br /&gt;&lt;br /&gt;- One Put to Multiple Places&lt;br /&gt;&lt;br /&gt;Really important to distributed information systems (e.g. ethosnet integration into local institutions). Also, this relates, for me, to the unpackaging question, because it introduces differences between what systems might all be expecting.&lt;br /&gt;&lt;br /&gt;- Web 2.0 interfaces (ok, ok)&lt;br /&gt;&lt;br /&gt;I'm interested in web services. Yes it's a bit trendy. But it is useful.&lt;br /&gt;&lt;br /&gt;- Core Servies of a Repository&lt;br /&gt;&lt;br /&gt;For repository core architecture, this is important. With my DSpace hat on I'd like to see what sorts of things an internal service architecture or api ought to be able to support&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-1908914017517604548?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/XguJLOxM_NU" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-1908914017517604548</guid>
         <pubDate>Fri, 07 Dec 2007 02:08:00 -0800</pubDate>
      </item>
      <item>
         <title>CRIG Meeting Day 2 (1)</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/12/crig-meeting-day-2-1.html</link>
         <description>It's first thing on day two. I'm late because I have to get all the way across town, which takes a surprisingly long time in London. I should have just stayed at a nearby hotel. Oh well.&lt;br /&gt;&lt;br /&gt;The remainder of yesterday was interesting. Scope for live blogging is difficult, as the conference is extremely mobile. Today I will have to pick a point and hide in a corner to get you up to date.&lt;br /&gt;&lt;br /&gt;In the afternoon we discussed the CRIG scenarios, and then implemented something called a Dotmocracy, which involves sticking dots (like house points at school) next to topics which appeared which we were interested in. When we start up today, the first order of business will be to see what topics made the cut. From what I saw at the end of the day, this will include Federated Searching, Google Search, and package deconstruction (my personal favourite this week).&lt;br /&gt;&lt;br /&gt;As a brief aside, one running theme has been "no more standards". As it happens, I disagree with this. We're never going to get everything thinking the same and working the same. That's why there are so many standards, and why new ones get made all the time. It's the way of the world. At least, with a standard, though, when you have implemented one, you at least have a way of telling people what you did, over the home grown undocumented solutions which are the alternative.&lt;br /&gt;&lt;br /&gt;Right, I suppose I'd better get my skates on.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-4799712183286978005?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/dPbLzdCq4ok" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-4799712183286978005</guid>
         <pubDate>Fri, 07 Dec 2007 00:42:00 -0800</pubDate>
      </item>
      <item>
         <title>CRIG Meeting Day 1 (2)</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/12/crig-meeting-day-1-2.html</link>
         <description>&lt;a rel="nofollow" target="_blank" href="http://en.wikipedia.org/wiki/Unconference"&gt;http://en.wikipedia.org/wiki/Unconference&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;See also &lt;a rel="nofollow" target="_blank" href="http://wwmm.ch.cam.ac.uk/blogs/downing/"&gt;Jim Downing's&lt;/a&gt; live blogging.&lt;br /&gt;&lt;br /&gt;We've just done a round of preliminary unconferencing, where the &lt;a rel="nofollow" target="_blank" href="http://chronicles-of-richard.blogspot.com/2007/11/sword-10-released.html"&gt;CRIG Podcast&lt;/a&gt; topics were brainstormed onto flip charts. Not sure how useful that's going to be, but I'm going to approach the whole thing with an open mind. I've got my marker pen, my baloon, and my three dots.&lt;br /&gt;&lt;br /&gt;wish me luck ...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-1512541800794566898?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/5j8sMCBCo5k" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-1512541800794566898</guid>
         <pubDate>Thu, 06 Dec 2007 07:01:00 -0800</pubDate>
      </item>
      <item>
         <title>CRIG Meeting Day 1 (1)</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/12/crig-meeting-day-1-1.html</link>
         <description>Some live blogging; may be slightly malformed, as this is happening inline, with no post-editing.&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_Unconference"&gt;http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_Unconference&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Les Carr and Jim Downing have introduced us to the CRIG workshop first day. We're unconferencing which means that there's not a programme! We're going to try and stay at the abstract or high level discussion, not try to talk about technology.&lt;br /&gt;&lt;br /&gt;David Flanders outlines the meeting philosophy. The outputs aimed for the meeting include: ideas (bluesky), standards and scenarios and how they can be linked together. The outputs will be taken to OR08. The best way for a group to produce good stuff is for everyone to think about themselves. Makes me think of an article I read recently:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www7.nationalgeographic.com/ngm/0707/feature5/index.html"&gt;http://www7.nationalgeographic.com/ngm/0707/feature5/index.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;We are &lt;em&gt;not&lt;/em&gt; about creating new specs.&lt;br /&gt;&lt;br /&gt;Julie then brings us some stuff about SWORD. See my &lt;a rel="nofollow" target="_blank" href="http://chronicles-of-richard.blogspot.com/2007/11/sword-10-released.html"&gt;previous post&lt;/a&gt; on this. We are going to have implementations for xrXiv, white rose research online and Jorum. A SPECTRa deposit client, and later an article in Ariadne and a presentation at OR08.&lt;br /&gt;&lt;br /&gt;Break time ... tea and coffee!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-6875507276473434800?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/wSKAR7Doxt8" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-6875507276473434800</guid>
         <pubDate>Thu, 06 Dec 2007 05:19:00 -0800</pubDate>
      </item>
      <item>
         <title>CRIG Podcast</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/11/crig-podcast.html</link>
         <description>A couple of weeks ago the JISC CRIG (Common Repository Interfaces Group) organised a series of telephone debates on important areas for it. These have now been edited into short commentaries which might be of interest to you, and are aimed at priming and informing the upcoming "unconference" to be held 6/7 December in London:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_Podcasts"&gt;http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_Podcasts&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The "unconference" will take place at Birkbeck College in Bloomsbury, London. Take a listen, and enjoy. Yours truly appears in the "Get and Put within Repositories" and the "Object Interoperability" discussions.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-2034686157678763588?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/WXv1M_TyW-w" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-2034686157678763588</guid>
         <pubDate>Fri, 30 Nov 2007 01:19:00 -0800</pubDate>
      </item>
      <item>
         <title>SWORD 1.0 Released</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/11/sword-10-released.html</link>
         <description>Just a quick heads up to say that the SWORD 1.0 release is now out and ready for download from SourceForge:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://sourceforge.net/projects/sword-app/"&gt;http://sourceforge.net/projects/sword-app/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Here you will find the common java library which supports repositories wanting to implement SWORD, plus implementations for DSpace and Fedora. There is also a client (with GUI and CLI versions) which you can use to deposit content into the repositories.&lt;br /&gt;&lt;br /&gt;The DSpace implementation is designed only to work with the forthcoming DSpace 1.5 (which is currently in Alpha release). Your feedback and experiences with the code would be much appreciated. We expect to be making refinements to the DSpace implementation up unitl DSpace 1.5 is released as stable.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-8632597815206720838?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/OIEs2RRDcp4" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-8632597815206720838</guid>
         <pubDate>Thu, 08 Nov 2007 06:45:00 -0800</pubDate>
      </item>
      <item>
         <title>Scandinavian Dugnad</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/10/scandinavian-dugnad.html</link>
         <description>I was invited by the Scandinavian DSpace User Group meeting to join them in their first official meeting yesterday in Oslo. It was great to see so many people representing a small-ish geographical area and a reasonably small population all together from 4 nations (Norway, Sweden, Finland and Denmark) to talk about DSpace. Probably 35 people all-in, with plans to extend the group to be the Nordic DSpace User Group to include members from Iceland, and perhaps even the Faroe Islands, and Greenland (if DSpace instances appear there).&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://wiki.dspace.org/index.php/Scandinavia"&gt;http://wiki.dspace.org/index.php/Scandinavia&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;In the grand traditions of Open Source and Open Access, I borrowed presentations given at the recent DSpace User Group Rome, and gave them an update on the state of the DSpace Foundation, DSpace 2.0, and then went on to produce some original slides telling folks how to get involved in DSpace developments. Hopefully all the content will be available on the web soon.&lt;br /&gt;&lt;br /&gt;As your humble chronicaller struggled with his sub-par Norwegian, he picked up some interesting things. There is good user end development going on in Scandinavia which could be harnessed to bring improvements to the DSpace UI. There are also increasingly many requests for "Integration with ...", where the object of integration is one of a variety of library information systems. Statistics are high on the agenda here as they are everywhere else. They are also a base of experts in multi-language problems stemming from being polyglot nations with additional letters in their native alphabets.&lt;br /&gt;&lt;br /&gt;It's clear where the future of repositories lie in Scandinavian nations where the national interest and the community feature prominently in society and culture. Bibsys, a major supplier of library systems and services in Norway (and organisers of the meeting), have 29 DSpace clients on their books already, and are looking at tighter integration between it and their other products, right down to the information model level. National research reporting systems are much desired repository data sources, and internal information systems at each institutions are starting to feed into their public repositories.&lt;br /&gt;&lt;br /&gt;With such a big user group, and such a community focus, there is little doubt in my mind that the Nordic user group will be a great asset to the DSpace users in that region, and probably to the DSpace community as a whole.&lt;br /&gt;&lt;br /&gt;PS Dugnad is a Norwegian word effectively referring to voluntary, communal work which benefits the community to some degree, but is also social and enjoyable for the participants. It also formed the basis of the 2006 DSpace User Group Meeting in Bergen&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://dsug2006.uib.no/"&gt;http://dsug2006.uib.no/&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-7713247949699281106?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/BlPZVoGLi1A" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-7713247949699281106</guid>
         <pubDate>Wed, 31 Oct 2007 05:23:00 -0700</pubDate>
      </item>
      <item>
         <title>DSpace 1.5 Alpha with experimental binary distribution</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/10/dspace-15-alpha-with-experimental.html</link>
         <description>The DSpace 1.5 Alpha has now been released and we encourage you to download this exciting new release of DSpace and try it out.&lt;br /&gt;&lt;br /&gt;There are big changes in this code base, both in terms of functionality and organisation. First, we are now using Maven to manage our build process, and have carved the application into a set of core modules which can be used to assemble your desired DSpace instance. For example, the JSP UI and the Manakin UI are now available as separate UI modules, and you may build either or both of these. We are taking an important step down the road, here, to allowing for community developments to be more easily created, and also more easily shared. You should be able, with a little tinkering, to provide separate code packages which can be dropped in alongside the dspace core modules, and built along with them. There are many stages to go through before this process is complete or perfect, so we encourage you to try out this new mechanism, and to let us know how you get on, or what changes you would make. Oh, and please do share your modules with the community! Props to Mark Diggory and the MIT guys for this restructuring work.&lt;br /&gt;&lt;br /&gt;The second big and most exciting thing is that Manakin is now part of our standard distribution, and we want to see it taking over from the JSP UI over the next few major releases. A big hand for Scott Phillips and the Texas A&amp;M guys for getting this code into the distribution; they have worked really hard.&lt;br /&gt;&lt;br /&gt;In addition to this, we have an Event System which should help us start to decouple tightly integrated parts of the repository, from Richard Rodgers and the guys at MIT. Browsing is now done with a heavily configurable system written initially by myself, but with significant assistance from Graham Triggs at BioMed Central. Tim Donohue's much desired Configurable Submission system is now integrated with both JSP and Manakin interfaces and is part of the release too.&lt;br /&gt;&lt;br /&gt;Further to this we have a bunch of other functionality including: IP Authentication, better metadata and schema registry import, move items from one collection to another, metadata export, configurable multilingualism support, Google and html sitemap generator, Community and Sub-Communities as OAI Sets, and Item metadata in XHTML head &amp;lt;meta&amp;gt; elements.&lt;br /&gt;&lt;br /&gt;All in all, a good looking release. There will be a testathon organised shortly which will be announced on the mailing lists, so that we can run this up to beta and then into final release as soon as possible. There's lots to test, so please lend a hand.&lt;br /&gt;&lt;br /&gt;We are also experimenting with a binary release, which can be downloaded from the same page as the source release. We are interested in how people get on with this, so let us know on the mailing lists.&lt;br /&gt;&lt;br /&gt;Come and get it:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://sourceforge.net/project/showfiles.php?group_id=19984"&gt;http://sourceforge.net/project/showfiles.php?group_id=19984&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-4691783139275190839?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/Su8qnj6lHUM" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-4691783139275190839</guid>
         <pubDate>Thu, 25 Oct 2007 02:48:00 -0700</pubDate>
      </item>
      <item>
         <title>my my where did the summer go</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/10/my-my-where-did-summer-go.html</link>
         <description>OK, ok, it's been a long long time since I updated. Did I say at the beginning that this was an experiment in seeing if I was capable of maintaining a blog? If I didn't I should have done.&lt;br /&gt;&lt;br /&gt;But there's a good reason that I've not updated for a while. That is, that I've been working flat out on the Imperial College Digital Repository: Spir@l, and am pleased to finally announce in a quiet way that we are officially LIVE:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://spiral.imperial.ac.uk/"&gt;http://spiral.imperial.ac.uk/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;On the outside it doesn't look too serious. A standard looking DSpace, I hear you say, with an Imperial College site template on it. And you'd be right. But only about the tip of the ice-berg.&lt;br /&gt;&lt;br /&gt;Without wishing to blow my own trumpet (modesty &lt;em&gt;is&lt;/em&gt; the third or fourth best thing about me), please do check out the article which I co-wrote with my good colleague Fereshteh Afshari:&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://hdl.handle.net/10044/1/493"&gt;http://hdl.handle.net/10044/1/493&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;And you may also be interested in my presentation at the recent DSpace User Group Meeting in Rome 2007 (more on that later, maybe):&lt;br /&gt;&lt;br /&gt;&lt;a rel="nofollow" target="_blank" href="http://www.aepic.it/conf/viewabstract.php?id=200&amp;cf=11"&gt;http://www.aepic.it/conf/viewabstract.php?id=200&amp;cf=11&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;I could probably be persueded to write a little here about how it works; maybe you'll even get snippets from the monolithic technical documentation that I'm in the middle of writing.&lt;br /&gt;&lt;br /&gt;Oh, and there's more news, but now I've got your attention again you have to wait for the next installment.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-5921631848918710005?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/nIamUPYRpyU" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-5921631848918710005</guid>
         <pubDate>Wed, 24 Oct 2007 08:27:00 -0700</pubDate>
      </item>
      <item>
         <title>EThOSnet Kick-Off</title>
         <link>http://chronicles-of-richard.blogspot.com/2007/05/ethosnet-kick-off.html</link>
         <description>On Tuesday of this week the EThOSnet Project Board met for the first time to kick off this significant new project. For background, this project is the successor to the &lt;a rel="nofollow" target="_blank" href="http://www.ethos.ac.uk/"&gt;EThOS&lt;/a&gt; project, which in turn grew out of the Scottish projects: &lt;a rel="nofollow" target="_blank" href="http://www.thesesalive.ac.uk/"&gt;Theses Alive&lt;/a&gt; at Edinburgh, &lt;a rel="nofollow" target="_blank" href="http://www.lib.gla.ac.uk/daedalus/"&gt;DAEDALUS&lt;/a&gt; at Glasgow, and &lt;a rel="nofollow" target="_blank" href="http://www2.rgu.ac.uk/library/e-theses.htm"&gt;Electronic Theses&lt;/a&gt; at the Robert Gordon University.&lt;br /&gt;&lt;br /&gt;The aim of EThOSnet is to take the work done under EThOS and bring it up to a point where UK institutions can actually start to become early adopters, to start to digitise the back-catalogue of print theses in the UK, investigate technology for the current and the future incarnations of the system, and to basically kick-start a genuinely viable service for deposit and dissemination of UK theses.&lt;br /&gt;&lt;br /&gt;At this stage, the project does not have a Project Manager, which is causing minor hold-ups initially, but Project Director, and Director of Library Services Clare Jenkins of Imperial College Library has stepped in to hold things together until one is appointed (we are expecting to hear very soon). In the interim, the Project Board has also been put in place to check that all the 7 Work Packages have the things they need to get going.&lt;br /&gt;&lt;br /&gt;Of these 7 workpackages, the first and last are concerned with project management and exit strategy, and the meat of the project will take place in packages 2 - 6. Details of these work packages are available in the project proposal, which will hopefully be available on the JISC website soon.&lt;br /&gt;&lt;br /&gt;A quick summary, then, of some of the changes and more concrete decisions that we made during the meeting:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;br /&gt;&lt;li&gt;We have set a pleasingly high target of 20,000 digitised theses and 3,000 born-digital theses by the end of the project. This will be sourced from the many institutions who have already expressed an interest in adopting the service, before the project is even going!&lt;/li&gt;&lt;br /&gt;&lt;li&gt;The first port of call for the technology is to smooth the process of the existing software tools for repository users. I would hope to have something which works well for DSpace available quickly, and general enough to be part of the main distribution. EPrints is already fully compliant, and Fedora has representitives from the University of Hull looking after it.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;Communications will be done primarily through a soon-to-exist project wiki, and it is hoped that the existing &lt;a rel="nofollow" target="_blank" href="http://www.jiscmail.ac.uk/lists/E-THESES-UK.html"&gt;E-Theses UK list&lt;/a&gt; will be used more heavily than it is already. Imperial College has agreed to host the existing ethos website, the wiki, and potentially the toolkit if necessary (&lt;a rel="nofollow" target="_blank" href="http://ethostoolkit.rgu.ac.uk/"&gt;currently hosted&lt;/a&gt; at RGU).&lt;/li&gt;&lt;br /&gt;&lt;li&gt;Toolkit development will be ongoing, with work being done on it within a wiki, but with the option to move to some XML format for the final product&lt;/li&gt;&lt;br /&gt;&lt;/ul&gt;&lt;br /&gt;&lt;br /&gt;This is a very big project, and I can't possibly represent everything that came out of Tuesday's meeting here. In the near future expect to see links to the project wiki appear and more information to come out.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3741879089300545664-1653851498787432588?l=chronicles-of-richard.blogspot.com'/&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/~r/chronicles-of-richard/~4/iT023WB9CPI" height="1" width="1"/&gt;</description>
         <author>Richard</author>
         <guid isPermaLink="false">tag:blogger.com,1999:blog-3741879089300545664.post-1653851498787432588</guid>
         <pubDate>Thu, 10 May 2007 07:05:00 -0700</pubDate>
      </item>
   <language>en-us</language><media:rating>nonadult</media:rating></channel>
</rss><!-- fe5.pipes.re3.yahoo.com uncompressed/chunked Tue Nov 10 18:38:44 PST 2009 -->
