<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:media="http://search.yahoo.com/mrss/" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>business|bytes|genes|molecules</title>
	
	<link>http://mndoci.com</link>
	<description>ruminations on the social and commercial potential of biotech, nanotech, infotech and computing</description>
	<lastBuildDate>Tue, 27 Jul 2010 22:37:56 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/mndoci" /><feedburner:info uri="mndoci" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><media:thumbnail url="http://mndoci.com/blog/images/hiv.jpg" /><geo:lat>47.618371</geo:lat><geo:long>-122.203083</geo:long><creativeCommons:license>http://creativecommons.org/licenses/by-nc-sa/2.0/</creativeCommons:license><feedburner:emailServiceId>mndoci</feedburner:emailServiceId><feedburner:feedburnerHostname>http://feedburner.google.com</feedburner:feedburnerHostname><feedburner:browserFriendly>This is an XML content feed. It is intended to be viewed in a newsreader or syndicated to another site, subject to copyright and fair use.</feedburner:browserFriendly><item>
		<title>Twenty queries</title>
		<link>http://feedproxy.google.com/~r/mndoci/~3/WYnan9rhGbA/</link>
		<comments>http://mndoci.com/2010/07/25/twenty-queries/#comments</comments>
		<pubDate>Mon, 26 Jul 2010 06:02:10 +0000</pubDate>
		<dc:creator>Deepak Singh</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Bytes]]></category>
		<category><![CDATA[Informatics]]></category>
		<category><![CDATA[Life Science]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Software & Internet]]></category>

		<guid isPermaLink="false">http://mndoci.com/?p=2257</guid>
		<description><![CDATA[Image via Wikipedia I am reading up a lot of Jim Gray these days, so a lot of his ideas are quite fresh in my head. Also had an interesting discussion with Nancy Parmalee on Twitter about software, informatics, bench scientists and small labs. One thing that jumped out, and is hardly a surprise, is [...]]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img zemanta-action-dragged">
<div>
<dl class="wp-caption alignleft">
<dt class="wp-caption-dt"><a href="http://en.wikipedia.org/wiki/File:11-23gray.jpg"><img title="Jim Gray (computer scientist)" src="http://upload.wikimedia.org/wikipedia/en/7/7e/11-23gray.jpg" alt="Jim Gray (computer scientist)" width="75" height="127" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution">Image via <a href="http://en.wikipedia.org/wiki/File:11-23gray.jpg">Wikipedia</a></dd>
</dl>
</div>
</div>
<p>I am reading up a lot of <a class="zem_slink" title="Jim Gray (computer scientist)" rel="wikipedia" href="http://en.wikipedia.org/wiki/Jim_Gray_%28computer_scientist%29">Jim Gray</a> these days, so a lot of  his ideas are quite fresh in my head.  Also had an interesting discussion with <a href="http://twitter.com/nparmalee">Nancy Parmalee</a> on Twitter about software, informatics, bench scientists and small labs.  One thing that jumped out, and is hardly a surprise, is that for the most part, there is a huge disconnect between the data science, and the scientists who need to make use of the work done by data scientists (often bench scientists).  I&#8217;ve long argued that we neglect &#8220;infrastructure&#8221; software like data management systems, tracking systems, query systems, etc which all require well designed, scalable backends and should be treated like products, cause they are, even if they are home grown, or derived from open source software.</p>
<p>This brings me to Jim Gray and his &#8220;rule of 20&#8243; (came in the context of the Sloan Digital Sky Survey and discussed in the Fourth Paradigm).  He argued that for the database builder, the researchers querying the data were asked about the 20 questions they most wanted answers to.  Five would be too few, and a hundred would become too general, and systems are usually better built for a purpose than become too general purpose.  I&#8217;ve seen this process work, and indeed participated in similar processes.  One of the points Nancy brought up was the lack of a common language. This is how you arrive at common semantics.  Also important in this discussion is the data scientist.  While they can maintain more general purpose systems for ad hoc querying and data mining (e.g. just flat namespaces to be queried with mapreduce), they are the providers of much of the information that those twenty questions will address, so it becomes a three stakeholder game and you need to approach this in an iterative manner, constantly refining the process.</p>
<p>Will we ever get there in an academic setting?  I am not sure, but in the places I have seen that process in action, it seems to work.  We live in an era of constant change and by creating artificial barriers and ignoring things we can do at the software layer, we&#8217;re only making things more difficult for ourselves.</p>
<h6 class="zemanta-related-title">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://mndoci.com/2010/07/13/data-science-roles-and-barriers/">Data science, roles, and barriers</a> (mndoci.com)</li>
</ul>
<div class="zemanta-pixie"><img class="zemanta-pixie-img" src="http://img.zemanta.com/pixy.gif?x-id=30a8eb06-37e3-473b-b29a-b668df676f79" alt="" /><span class="zem-script more-related pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/mndoci?a=WYnan9rhGbA:oUAQ8SeRecU:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/mndoci?i=WYnan9rhGbA:oUAQ8SeRecU:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=WYnan9rhGbA:oUAQ8SeRecU:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/mndoci?d=dnMXMwOfBR0" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=WYnan9rhGbA:oUAQ8SeRecU:JEwB19i1-c4"><img src="http://feeds.feedburner.com/~ff/mndoci?i=WYnan9rhGbA:oUAQ8SeRecU:JEwB19i1-c4" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=WYnan9rhGbA:oUAQ8SeRecU:EpLpB3ZkKWg"><img src="http://feeds.feedburner.com/~ff/mndoci?d=EpLpB3ZkKWg" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=WYnan9rhGbA:oUAQ8SeRecU:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/mndoci?i=WYnan9rhGbA:oUAQ8SeRecU:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=WYnan9rhGbA:oUAQ8SeRecU:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/mndoci?i=WYnan9rhGbA:oUAQ8SeRecU:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=WYnan9rhGbA:oUAQ8SeRecU:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/mndoci?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=WYnan9rhGbA:oUAQ8SeRecU:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/mndoci?i=WYnan9rhGbA:oUAQ8SeRecU:gIN9vFwOqvQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/mndoci/~4/WYnan9rhGbA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://mndoci.com/2010/07/25/twenty-queries/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://mndoci.com/2010/07/25/twenty-queries/</feedburner:origLink></item>
		<item>
		<title>The internet, publishing, and data</title>
		<link>http://feedproxy.google.com/~r/mndoci/~3/X0k77sF0qqM/</link>
		<comments>http://mndoci.com/2010/07/25/the-internet-publishing-and-data/#comments</comments>
		<pubDate>Sun, 25 Jul 2010 18:33:06 +0000</pubDate>
		<dc:creator>Deepak Singh</dc:creator>
				<category><![CDATA[Innovation]]></category>
		<category><![CDATA[Open Science]]></category>
		<category><![CDATA[Publishing]]></category>
		<category><![CDATA[Software & Internet]]></category>
		<category><![CDATA[Web as platform]]></category>

		<guid isPermaLink="false">http://mndoci.com/?p=2233</guid>
		<description><![CDATA[A few days before Jim Gray sadly disappeared, I exchanged emails with him on a set of slides which are now a chapter in the Fourth Paradigm. There&#8217;s one slide in there that I wanted to share click for full size In this slide (the narrative can be found in the Fourth Paradigm), Jim states [...]]]></description>
			<content:encoded><![CDATA[<p>A few days before <a href="http://http://en.wikipedia.org/wiki/Jim_Gray_%28computer_scientist%29">Jim Gray</a> sadly disappeared, I exchanged emails with him on a set of slides which are now a chapter in the <a href="http://www.fourthparadigm.org">Fourth Paradigm</a>.  There&#8217;s one slide in there that I wanted to share</p>
<p><img src="http://dsingh-public.s3.amazonaws.com/pyramid.jpg" alt="" width="500" height="334" /></p>
<p><a href="http://dsingh-public.s3.amazonaws.com/pyramid.jpg">click</a> for full size</p>
<p>In this slide (the narrative can be found in the Fourth Paradigm), Jim states a belief I share. That the internet can do a lot more than just make the text of research papers available.  The whole point of the Web as a platform for scientific discovery is lost until we can unify scientific data with literature in the sort of pyramid shown above.  To be able to jump between paper, data, commentary, reviews, further analysis, etc.  The interesting concept Jim brings up is <em>overlay journals</em>.  An overlay journal is one where you have data archives and literature archives with appropriate content going into the appropriate place (whether it&#8217;s actual context or a distributed index is a different story).  One should, in theory, with the raw data, appropriate APIs etc, be able to construct a meta-journal around an interest area (The collections on <a href="http://precedings.nature.com">Nature Precedings</a>, and similar efforts at <a href="http://plos.org">PLoS</a> come to mind).  Perhaps we could do with pre-publication repositories, or create loosely coupled peer-reviewed journals.  I&#8217;d like to see publishers and web science types (Timo?) push the envelope here.</p>
<h6 class="zemanta-related-title">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://mndoci.com/2010/07/06/collections-on-nature-precedings/">Collections on Nature Precedings</a> (mndoci.com)</li>
<li class="zemanta-article-ul-li"><a href="http://scienceblogs.com/christinaslisrant/2010/07/hey_maybe_scientists_should_do.php">Hey maybe scientists should do more than just wait for their journal to issue a press release on their new fabu article</a> (scienceblogs.com)</li>
<li class="zemanta-article-ul-li"><a href="http://mndoci.com/2010/07/11/peer-review-has-a-place/">Peer-review has a place</a> (mndoci.com)</li>
</ul>
<div class="zemanta-pixie"><img class="zemanta-pixie-img" src="http://img.zemanta.com/pixy.gif?x-id=674ea7c1-b0a9-4195-af2b-4a67c215cf26" alt="" /><span class="zem-script more-related pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/mndoci?a=X0k77sF0qqM:A-ppo0_7qN0:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/mndoci?i=X0k77sF0qqM:A-ppo0_7qN0:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=X0k77sF0qqM:A-ppo0_7qN0:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/mndoci?d=dnMXMwOfBR0" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=X0k77sF0qqM:A-ppo0_7qN0:JEwB19i1-c4"><img src="http://feeds.feedburner.com/~ff/mndoci?i=X0k77sF0qqM:A-ppo0_7qN0:JEwB19i1-c4" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=X0k77sF0qqM:A-ppo0_7qN0:EpLpB3ZkKWg"><img src="http://feeds.feedburner.com/~ff/mndoci?d=EpLpB3ZkKWg" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=X0k77sF0qqM:A-ppo0_7qN0:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/mndoci?i=X0k77sF0qqM:A-ppo0_7qN0:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=X0k77sF0qqM:A-ppo0_7qN0:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/mndoci?i=X0k77sF0qqM:A-ppo0_7qN0:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=X0k77sF0qqM:A-ppo0_7qN0:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/mndoci?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=X0k77sF0qqM:A-ppo0_7qN0:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/mndoci?i=X0k77sF0qqM:A-ppo0_7qN0:gIN9vFwOqvQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/mndoci/~4/X0k77sF0qqM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://mndoci.com/2010/07/25/the-internet-publishing-and-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://mndoci.com/2010/07/25/the-internet-publishing-and-data/</feedburner:origLink></item>
		<item>
		<title>Tenure</title>
		<link>http://feedproxy.google.com/~r/mndoci/~3/8RvThb2Rq4g/</link>
		<comments>http://mndoci.com/2010/07/25/tenure/#comments</comments>
		<pubDate>Sun, 25 Jul 2010 15:26:10 +0000</pubDate>
		<dc:creator>Deepak Singh</dc:creator>
				<category><![CDATA[Careers]]></category>
		<category><![CDATA[Education]]></category>
		<category><![CDATA[Publishing]]></category>

		<guid isPermaLink="false">http://mndoci.com/?p=2204</guid>
		<description><![CDATA[Megan McArdle has an interesting article on tenure in her blog at The Atlantic. She believes we should do away with tenure altogether.  She has another post on the costs of tenure Tyler Cowen has an different viewpoint on this subject (he is a tenured professor). I don’t know if that was the intent, but [...]]]></description>
			<content:encoded><![CDATA[<p><a class="zem_slink" title="Megan McArdle" rel="wikipedia" href="http://en.wikipedia.org/wiki/Megan_McArdle">Megan McArdle</a> has an interesting <a href="http://www.theatlantic.com/business/archive/2010/07/tenure-an-idea-whose-time-has-gone/60187/">article on tenure</a> in her blog at <em>The Atlantic</em>.  She believes we should do away with tenure altogether.  She has another post on the <a href="http://www.theatlantic.com/business/archive/2010/07/how-much-does-tenure-cost/60238/">costs of tenure</a></p>
<p><a class="zem_slink" title="Tyler Cowen" rel="wikipedia" href="http://en.wikipedia.org/wiki/Tyler_Cowen">Tyler Cowen</a> has an <a href="http://www.marginalrevolution.com/marginalrevolution/2010/07/what-if-universities-get-rid-of-tenure.html">different viewpoint</a> on this subject (he is a tenured professor). I don’t know if that was the intent, but you get the impression that he doesn’t quite like the idea, but he brings up a great point. No one has suggested a model that would replace it and there is no real debate cause the counterpoint is mostly against tenure without a solid alternative.</p>
<p>Steven Levitt <a href="http://freakonomics.blogs.nytimes.com/2007/03/03/lets-just-get-rid-of-tenure/">doesn&#8217;t like tenure much either</a></p>
<p>I’ve never been comfortable with the idea of tenure, but don’t have enough evidence that it’s a bad thing per se.  Tenure isn’t exactly easy to come by, especially at quality universities, so there is still work required to get there.  However, coupled with the other incentive systems in place for researchers, you end up with a rather vicious publish or perish system.  So you end up in a system which, from a naive point of view, hampers creativity and risk taking and in the end probably limits what problems we try and solve and how.  Of course, you could argue that you get a pulpit from which you can do some really groundbreaking work, but I think that argument breaks down because your funding comes from a different source.</p>
<p>What do you think? Is tenure a good thing?  Is it the best in the absence of a viable alternative? Do you consider it the pinnacle of your career, or just another step?</p>
<h6 class="zemanta-related-title">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.marginalrevolution.com/marginalrevolution/2010/07/what-if-universities-get-rid-of-tenure.html">What if universities get rid of tenure?</a> (marginalrevolution.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.theatlantic.com/business/archive/2010/07/tenure-an-idea-whose-time-has-gone/60187/">Tenure: An Idea Whose Time Has Gone</a> (theatlantic.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.theatlantic.com/business/archive/2010/07/how-much-does-tenure-cost/60238/">How Much Does Tenure Cost?</a> (theatlantic.com)</li>
<li class="zemanta-article-ul-li"><a href="http://camelswithhammers.com/2010/07/06/the-steady-disappearance-of-tenure/">The Steady Disappearance Of Tenure</a> (camelswithhammers.com)</li>
<li class="zemanta-article-ul-li"><a href="http://freakonomics.blogs.nytimes.com/2010/07/07/a-call-to-end-teacher-tenure/">A Call to End Teacher Tenure</a> (freakonomics.blogs.nytimes.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.businessinsider.com/whats-so-horrible-about-university-tenure-2010-7">What&#8217;s So Horrible About University Tenure?</a> (businessinsider.com)</li>
</ul>
<div class="zemanta-pixie"><img class="zemanta-pixie-img" src="http://img.zemanta.com/pixy.gif?x-id=554972c2-2649-4a35-af2a-2f3a17d08085" alt="" /><span class="zem-script more-related pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/mndoci?a=8RvThb2Rq4g:sgXA4eFBUT0:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/mndoci?i=8RvThb2Rq4g:sgXA4eFBUT0:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=8RvThb2Rq4g:sgXA4eFBUT0:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/mndoci?d=dnMXMwOfBR0" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=8RvThb2Rq4g:sgXA4eFBUT0:JEwB19i1-c4"><img src="http://feeds.feedburner.com/~ff/mndoci?i=8RvThb2Rq4g:sgXA4eFBUT0:JEwB19i1-c4" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=8RvThb2Rq4g:sgXA4eFBUT0:EpLpB3ZkKWg"><img src="http://feeds.feedburner.com/~ff/mndoci?d=EpLpB3ZkKWg" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=8RvThb2Rq4g:sgXA4eFBUT0:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/mndoci?i=8RvThb2Rq4g:sgXA4eFBUT0:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=8RvThb2Rq4g:sgXA4eFBUT0:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/mndoci?i=8RvThb2Rq4g:sgXA4eFBUT0:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=8RvThb2Rq4g:sgXA4eFBUT0:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/mndoci?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=8RvThb2Rq4g:sgXA4eFBUT0:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/mndoci?i=8RvThb2Rq4g:sgXA4eFBUT0:gIN9vFwOqvQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/mndoci/~4/8RvThb2Rq4g" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://mndoci.com/2010/07/25/tenure/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://mndoci.com/2010/07/25/tenure/</feedburner:origLink></item>
		<item>
		<title>TunkRank and citation metrics</title>
		<link>http://feedproxy.google.com/~r/mndoci/~3/zZRYwLY4jSA/</link>
		<comments>http://mndoci.com/2010/07/18/tunkrank-and-citation-metrics/#comments</comments>
		<pubDate>Mon, 19 Jul 2010 03:23:02 +0000</pubDate>
		<dc:creator>Deepak Singh</dc:creator>
				<category><![CDATA[Publishing]]></category>
		<category><![CDATA[Search]]></category>

		<guid isPermaLink="false">http://mndoci.com/?p=2197</guid>
		<description><![CDATA[In a set of slides from a presentation on TunkRank, Daniel Tunkelang (if you don&#8217;t read his blog you shoud) quotes Herbert Simon in an information-rich world the wealth of information means &#8230; a scarcity of whatever it is that information consumes &#8230; the attention of the recipients One of the challenges we face in [...]]]></description>
			<content:encoded><![CDATA[<p>In a set of slides from a presentation on <a href="http://tunkrank.com">TunkRank</a>, <a title="The Noisy Channel —" href="http://thenoisychannel.com/">Daniel Tunkelang</a> (if you don&#8217;t read his blog you shoud) quotes <a class="zem_slink" title="Herbert Simon" rel="wikipedia" href="http://en.wikipedia.org/wiki/Herbert_Simon">Herbert Simon</a></p>
<blockquote><p>in an information-rich world the wealth of information means &#8230; a scarcity of whatever it is that information consumes &#8230; the attention of the recipients</p></blockquote>
<p>One of the challenges we face in the sciences is measuring the relevance of a particular piece of research or a particular researcher.  From measures that are just poor representations of individual papers or even people (yes <a title="Impact factor - Wikipedia, the free encyclopedia" href="http://en.wikipedia.org/wiki/Impact_factor">Impact Factor</a> I am looking at you), to naive yet well intentioned steps in the right direction (<a title="PLoS ONE : accelerating the publication of peer-reviewed science" href="http://www.plosone.org/static/almInfo.action">Article Level Metrics</a>) there are a lot of attempts to try and determine relevance and influence in scientific publishing.  Personally I still like <a title="eigenfactor.org - ranking and mapping scientific journals" href="http://www.eigenfactor.org/">Eigenfactor</a> a lot, but there is still room for improvement.</p>
<p>As I read through Daniel&#8217;s slides, it was apparent to me that while his area of research is quite different, the underlying principles of authority and scarcity are universal and apply really well to scientific publishing, and in fact his work shows exactly why Impact Factor is a bad metric, since it only reflects the average of citations to articles published.  Specifically does not do so at a granularity where it is really useful and there is limited context.  Science is not much about gaming the system, but the IF is easy to game.  in fact if an author and his own students who go on to do their research keep citing each other, they are essentially doing that implicitly.  See slide #27 in the slides embedded below.</p>
<p>Anyway, for those of you into citation metrics and interested in objective methods of scientific authority and relevance check out these slides</p>
<p><img style="visibility: hidden; width: 0px; height: 0px;" src="http://counters.gigya.com/wildfire/IMP/CXNID=2000002.0NXC/bT*xJmx*PTEyNzk1MDkzOTE3NzgmcHQ9MTI3OTUwOTM5NzA3MCZwPTEwMTkxJmQ9c3NfZW1iZWQmZz*yJm89ZDg5MGQxNTE2MGI3/NDIwOTk3MWI5NmNhNjU*MGMzNWQmb2Y9MA==.gif" border="0" alt="" width="0" height="0" /></p>
<div id="__ss_4749609" style="width: 425px;"><strong><a title="The War on Attention Poverty: Measuring Twitter Authority" href="http://www.slideshare.net/dtunkelang/the-war-on-attention-poverty-measuring-twitter-authority">The War on Attention Poverty: Measuring Twitter Authority</a></strong><object id="__sse4749609" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="425" height="355" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=waronattentionpoverty-100713213804-phpapp01&amp;stripped_title=the-war-on-attention-poverty-measuring-twitter-authority" /><param name="name" value="__sse4749609" /><param name="allowfullscreen" value="true" /><embed id="__sse4749609" type="application/x-shockwave-flash" width="425" height="355" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=waronattentionpoverty-100713213804-phpapp01&amp;stripped_title=the-war-on-attention-poverty-measuring-twitter-authority" name="__sse4749609" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<div style="padding: 5px 0 12px;">View more <a href="http://www.slideshare.net/">presentations</a> from <a href="http://www.slideshare.net/dtunkelang">Daniel Tunkelang</a>.</div>
</div>
<h6 class="zemanta-related-title">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://blogs.forbes.com/firewall/2010/07/09/a-better-way-to-filter-twitters-spambots-ask-google/">A Better Way To Filter Twitter&#8217;s Spambots? Ask Google</a> (blogs.forbes.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.nature.com/news/2010/100616/full/465860a.html?s=news_rss">Metrics: Do metrics matter?</a> (nature.com)</li>
<li class="zemanta-article-ul-li"><a href="http://scholarlykitchen.sspnet.org/2010/07/14/post-publication-review/">Post-Publication Review: Does It Add Anything New and Useful?</a> (scholarlykitchen.sspnet.org)</li>
<li class="zemanta-article-ul-li"><a href="http://www.nature.com/news/2010/100616/full/465864a.html?s=news_rss">Metrics: A profusion of measures</a> (nature.com)</li>
</ul>
<div class="zemanta-pixie"><img class="zemanta-pixie-img" src="http://img.zemanta.com/pixy.gif?x-id=bd2aabbe-6764-4f2e-8e09-4820eebc1b07" alt="" /><span class="zem-script more-related pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/mndoci?a=zZRYwLY4jSA:DlInt0iLjeo:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/mndoci?i=zZRYwLY4jSA:DlInt0iLjeo:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=zZRYwLY4jSA:DlInt0iLjeo:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/mndoci?d=dnMXMwOfBR0" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=zZRYwLY4jSA:DlInt0iLjeo:JEwB19i1-c4"><img src="http://feeds.feedburner.com/~ff/mndoci?i=zZRYwLY4jSA:DlInt0iLjeo:JEwB19i1-c4" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=zZRYwLY4jSA:DlInt0iLjeo:EpLpB3ZkKWg"><img src="http://feeds.feedburner.com/~ff/mndoci?d=EpLpB3ZkKWg" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=zZRYwLY4jSA:DlInt0iLjeo:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/mndoci?i=zZRYwLY4jSA:DlInt0iLjeo:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=zZRYwLY4jSA:DlInt0iLjeo:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/mndoci?i=zZRYwLY4jSA:DlInt0iLjeo:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=zZRYwLY4jSA:DlInt0iLjeo:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/mndoci?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=zZRYwLY4jSA:DlInt0iLjeo:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/mndoci?i=zZRYwLY4jSA:DlInt0iLjeo:gIN9vFwOqvQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/mndoci/~4/zZRYwLY4jSA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://mndoci.com/2010/07/18/tunkrank-and-citation-metrics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://mndoci.com/2010/07/18/tunkrank-and-citation-metrics/</feedburner:origLink></item>
		<item>
		<title>Google and Freebase.  A pair that could make sparks fly</title>
		<link>http://feedproxy.google.com/~r/mndoci/~3/mt3AycVbbhg/</link>
		<comments>http://mndoci.com/2010/07/16/google-and-freebase-a-pair-that-could-make-sparks-fly/#comments</comments>
		<pubDate>Sat, 17 Jul 2010 04:46:57 +0000</pubDate>
		<dc:creator>Deepak Singh</dc:creator>
				<category><![CDATA[Search]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Software & Internet]]></category>

		<guid isPermaLink="false">http://mndoci.com/?p=2175</guid>
		<description><![CDATA[I&#8217;ve long been an admirer of Freebase. I like their approach to structured data, and context, about linking objects without necessarily putting a formal vocabulary in the way. Today, a flurry of tweets alerted me to the news that Google had bought Metaweb, the parent company for Freebase. This is a big deal. I am [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve long been an <a href="http://www.lijit.com/search?uri=http%3A%2F%2Fwww.lijit.com%2Fusers%2Fmndoci&amp;q=freebase&amp;type=blog">admirer</a> of <a class="zem_slink" title="freebase" rel="homepage" href="http://www.freebase.com/">Freebase</a>.  I like their approach to structured data, and context, about linking objects without necessarily putting a formal vocabulary in the way.  Today, a flurry of tweets alerted me to the news that <a title="The Freebase Blog  » Blog Archive   » Metaweb joins Google" href="http://blog.freebase.com/2010/07/16/metaweb-joins-google/">Google had bought Metaweb</a>, the parent company for Freebase.</p>
<p>This is a big deal.  I am not quite yet sure how the integration will happen and how Freebase data will start surfacing in Google results.  The key for Google is to keep Freebase up and running and figure out how to pull that information into the core search function at Google.  It is also entirely possible that this was a play for the excellent Freebase team, although that wouldn&#8217;t make complete sense, given the powerful platform and tools that the Freebase gang has created.  It&#8217;s going to be fun to see how this evolves.</p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="560" height="340" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.youtube.com/v/TJfrNo3Z-DU&amp;hl=en_US&amp;fs=1?color1=0x2b405b&amp;color2=0x6b8ab6" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="560" height="340" src="http://www.youtube.com/v/TJfrNo3Z-DU&amp;hl=en_US&amp;fs=1?color1=0x2b405b&amp;color2=0x6b8ab6" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<h6 class="zemanta-related-title">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://www.businessinsider.com/google-acquires-semantic-web-startup-metaweb-technologies-2010-7">Google Tries Freebasing (GOOG, GS)</a> (businessinsider.com)</li>
<li class="zemanta-article-ul-li"><a href="http://gigaom.com/2010/07/16/google-gets-semantic-buys-metaweb/">Google Gets Semantic: Buys Metaweb</a> (gigaom.com)</li>
</ul>
<div class="zemanta-pixie"><img class="zemanta-pixie-img" src="http://img.zemanta.com/pixy.gif?x-id=2fc216e6-83f7-4cc4-882b-ae9955d8e1a7" alt="" /><span class="zem-script more-related pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/mndoci?a=mt3AycVbbhg:bN85f1Ilvh4:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/mndoci?i=mt3AycVbbhg:bN85f1Ilvh4:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=mt3AycVbbhg:bN85f1Ilvh4:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/mndoci?d=dnMXMwOfBR0" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=mt3AycVbbhg:bN85f1Ilvh4:JEwB19i1-c4"><img src="http://feeds.feedburner.com/~ff/mndoci?i=mt3AycVbbhg:bN85f1Ilvh4:JEwB19i1-c4" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=mt3AycVbbhg:bN85f1Ilvh4:EpLpB3ZkKWg"><img src="http://feeds.feedburner.com/~ff/mndoci?d=EpLpB3ZkKWg" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=mt3AycVbbhg:bN85f1Ilvh4:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/mndoci?i=mt3AycVbbhg:bN85f1Ilvh4:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=mt3AycVbbhg:bN85f1Ilvh4:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/mndoci?i=mt3AycVbbhg:bN85f1Ilvh4:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=mt3AycVbbhg:bN85f1Ilvh4:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/mndoci?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=mt3AycVbbhg:bN85f1Ilvh4:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/mndoci?i=mt3AycVbbhg:bN85f1Ilvh4:gIN9vFwOqvQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/mndoci/~4/mt3AycVbbhg" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://mndoci.com/2010/07/16/google-and-freebase-a-pair-that-could-make-sparks-fly/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		<feedburner:origLink>http://mndoci.com/2010/07/16/google-and-freebase-a-pair-that-could-make-sparks-fly/</feedburner:origLink></item>
		<item>
		<title>Data science, roles, and barriers</title>
		<link>http://feedproxy.google.com/~r/mndoci/~3/jesvqq0466A/</link>
		<comments>http://mndoci.com/2010/07/13/data-science-roles-and-barriers/#comments</comments>
		<pubDate>Tue, 13 Jul 2010 07:01:54 +0000</pubDate>
		<dc:creator>Deepak Singh</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Informatics]]></category>
		<category><![CDATA[Life Science]]></category>

		<guid isPermaLink="false">http://mndoci.com/?p=2147</guid>
		<description><![CDATA[In keeping with my recent data science theme, Ben Lorica has a nice post up on how to nurture data scientists. While his post focusses on data scientists in commercial organizations, the post has some very relevant points for bioinformaticians. After working in companies both large and small, it&#8217;s clear to me that the strict [...]]]></description>
			<content:encoded><![CDATA[<p>In keeping with my recent data science theme, Ben Lorica has a nice post up on <a title="The Practical Quant: How to nurture Data Scientists" href="http://practicalquant.blogspot.com/2010/07/how-to-nurture-data-scientists.html">how to nurture data scientists</a>.  While his post focusses on data scientists in commercial organizations, the post has some very relevant points for bioinformaticians.</p>
<blockquote><p>After working in companies both large and small, it&#8217;s clear to me that the strict separation of tasks is the major obstacle faced by data scientists. The most common manifestation is the separation between data analysis and data management. In many large companies, most analysts/statisticians have to wait for data from a designated <a class="zem_slink" title="Data warehouse" rel="wikipedia" href="http://en.wikipedia.org/wiki/Data_warehouse">data warehousing</a> team, and in a lot of cases they wait for data from multiple owners of different data warehouses.</p></blockquote>
<p><a href="http://nsaunders.wordpress.com">Neil</a> pointed to another bit</p>
<blockquote><p>To nurture data scientists, companies need to focus more on culture and organizational structure. Many data workers have enough skills and training to quickly become productive in multiple areas of data intelligence. The problem is that most don&#8217;t work in environments that encourage them to become data scientists. They&#8217;re stuck in silos and limited to one or two areas of data intelligence. Often, they&#8217;re restricted to use tools &#8220;approved&#8221; by their managers.</p></blockquote>
<p>When I wrote about the biological data scientists, one of the points I was trying to make was that advanced data analysis requires a set of skills that are not easily transferable to bench scientists or biologists who want to retrieve specific, limited, pieces of informations.  While at a particular point in time different people in a company or lab might focus on different aspects of the task, a data scientist needs to be capable of carrying out different aspects of the task.  Ben covers these different aspects as well.</p>
<ul>
<li><strong>data acquisition</strong>: this might entail writing custom parsers and web crawlers, or scripts that target specific web services or API&#8217;s for non-traditional data sources.</li>
<li><strong>data management</strong>: ETL, manipulate, query, and maintain data in databases, key-value stores, or <a class="zem_slink" title="Hadoop" rel="homepage" href="http://hadoop.apache.org/">Hadoop</a>.</li>
<li><strong><a class="zem_slink" title="Information visualization" rel="wikipedia" href="http://en.wikipedia.org/wiki/Information_visualization">information visualization</a></strong>: uncovering patterns through the use of static visualization toolkits and/or interactive platforms based on Flash or Javascript.</li>
<li><strong>analytics</strong>: this can range from simple to complex techniques in multivariate statistics, <a class="zem_slink" title="Machine learning" rel="wikipedia" href="http://en.wikipedia.org/wiki/Machine_learning">machine-learning</a>, and NLP.</li>
<li><strong>insight</strong>: extract, summarize, and present key findings to a broad audience.</li>
</ul>
<p>We need to rethink how we approach data as bioinformaticians.  It will not be acceptable to be considered a bioinformatician with skills that don&#8217;t cover at least a good chunk of the skills described above.  But to get there, we need to change our idea of the role of bioinformaticians and of the value of the large volumes of data we are generating.</p>
<h6 class="zemanta-related-title">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://mndoci.com/2010/06/22/the-biological-data-scientist/">The Biological Data Scientist</a> (mndoci.com)</li>
<li class="zemanta-article-ul-li"><a href="http://mndoci.com/2010/05/02/data-driven-research-products/">Data-driven research products</a> (mndoci.com)</li>
</ul>
<div class="zemanta-pixie"><img class="zemanta-pixie-img" src="http://img.zemanta.com/pixy.gif?x-id=b2b1db2f-5dce-419e-8f39-87c5b789d975" alt="" /><span class="zem-script more-related pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/mndoci?a=jesvqq0466A:Aq_H2dgow9c:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/mndoci?i=jesvqq0466A:Aq_H2dgow9c:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=jesvqq0466A:Aq_H2dgow9c:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/mndoci?d=dnMXMwOfBR0" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=jesvqq0466A:Aq_H2dgow9c:JEwB19i1-c4"><img src="http://feeds.feedburner.com/~ff/mndoci?i=jesvqq0466A:Aq_H2dgow9c:JEwB19i1-c4" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=jesvqq0466A:Aq_H2dgow9c:EpLpB3ZkKWg"><img src="http://feeds.feedburner.com/~ff/mndoci?d=EpLpB3ZkKWg" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=jesvqq0466A:Aq_H2dgow9c:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/mndoci?i=jesvqq0466A:Aq_H2dgow9c:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=jesvqq0466A:Aq_H2dgow9c:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/mndoci?i=jesvqq0466A:Aq_H2dgow9c:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=jesvqq0466A:Aq_H2dgow9c:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/mndoci?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=jesvqq0466A:Aq_H2dgow9c:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/mndoci?i=jesvqq0466A:Aq_H2dgow9c:gIN9vFwOqvQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/mndoci/~4/jesvqq0466A" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://mndoci.com/2010/07/13/data-science-roles-and-barriers/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://mndoci.com/2010/07/13/data-science-roles-and-barriers/</feedburner:origLink></item>
		<item>
		<title>Asking questions, and getting answers</title>
		<link>http://feedproxy.google.com/~r/mndoci/~3/-fcPzjiP1zk/</link>
		<comments>http://mndoci.com/2010/07/11/asking-questions-and-getting-answers/#comments</comments>
		<pubDate>Mon, 12 Jul 2010 03:29:05 +0000</pubDate>
		<dc:creator>Deepak Singh</dc:creator>
				<category><![CDATA[Infotech]]></category>
		<category><![CDATA[Innovation]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Software & Internet]]></category>
		<category><![CDATA[Web as platform]]></category>

		<guid isPermaLink="false">http://mndoci.com/?p=2131</guid>
		<description><![CDATA[The way I see it, Q&#38;A sites fall into four categories (especially for science and technology related content) Mailing lists and newsgroups: The first form of Q&#38;A that I ever used was a mailing list My PhD thesis calls out the Computational Chemistry List and the CHARMM mailing list for helping me learn a lot [...]]]></description>
			<content:encoded><![CDATA[<p>The way I see it, Q&amp;A sites fall into four categories (especially for science and technology related content)</p>
<ul>
<li><strong>Mailing lists and newsgroups</strong>: The first form of Q&amp;A that I ever used was a mailing list  My PhD thesis calls out the <a title="CCL Home" href="http://www.ccl.net/chemistry/">Computational Chemistry List</a> and the CHARMM mailing list for helping me learn a lot about the tools and science I needed to learn to successfully graduate.  Closely related, since they are often a hybrid, are newsgroups, with <a href="http://groups.google.com">Google Groups</a> probably being the best known today</li>
<li><strong>Forums</strong>: Probably the most common form of Q&amp;A in the technical world today. It&#8217;s how support is provided, and sites like <a href="http://seqanswers.com/">SEQanswers</a> are a treasure trove of information.  I can safely say that a lot of what I&#8217;ve learnt about writing and producing electronic music came from the forums at <a title="KVR:  Virtual Instruments, Virtual Effects, VST Plugins, Audio Units (AU), DirectX (DX), Universal Binary Compatibility - Audio Plugin News, Reviews and Community" href="http://www.kvraudio.com/">KvR</a>.  The CHARMM mailing list has long been a <a title="CHARMM Forums - Forums powered by UBB.threads™" href="http://www.charmm.org/ubbthreads/ubbthreads.php?Cat=">forum</a></li>
<li><strong><a class="zem_slink" title="Stack Overflow" rel="homepage" href="http://stackoverflow.com/">StackOverflow</a></strong>: It&#8217;s safe to say that <a href="http://stackoverflow.com">Stackoverflow</a> has been a resounding success, if for no other reason than the number of sister sites, and Stackoverflow-style sites have come up.  From <a href="http://biostars.org">Biostar</a> to the various metaoptimize sites (<a title="scale.metaoptimize Q+A - scalability, high availability, data stores, NoSQL, distributed computing, parallel computing, cloud computing, elastic computing, HPC, grid computing, AWS, crawling, failover, redundancy, concurrency, shared nothing" href="http://scale.metaoptimize.com/">Scale</a>, <a title="MetaOptimize Q+A - machine learning, natural language processing, artificial intelligence, text analysis, information retrieval, search, data mining, statistical modeling, and data visualization" href="http://metaoptimize.com/qa/">ML/NLP</a>), a plethora of sites have come up which use the reputation model and design style of Stackoverflow.  <a title="A Q&amp;A Site for Sequencing and Bioinformatics" href="http://i.seqanswers.com/">SEQanswers</a> is also experimenting with a similar site, and in general this model seems to do better for specific topics</li>
<li><strong>NLP driven</strong>: Sites like <a href="http://quora.com">Quora</a> (which to me is different from other, more general sites like <a href="http://vark.com">Aarvark</a> and <a title="Yahoo! Answers - Home" href="http://answers.yahoo.com/">Yahoo Answers</a>) approach the technology Q&amp;A problem differently. Quora takesideas from Stackoverflow, others from <a class="zem_slink" title="Aardvark" rel="homepage" href="http://vark.com">Aardvark</a>, and also from sites like <a class="zem_slink" title="Mahalo" rel="homepage" href="http://mahalo.com">Mahalo</a> and Wikipedia, and distills them into a model that has yet to be proven, but shows a lot of promise.  It will be interesting to see how Quora evolves.  Like Stackoverflow it looks designed to Google well.</li>
<li><strong>More general sites</strong>: Here I include sites like <a class="zem_slink" title="Hacker News" rel="homepage" href="http://news.ycombinator.com/">Hacker News</a>, Friendfeed, etc.  On Hacker News, you will often fine hackers asking specific questions, and the rooms on Friendfeed, like <a title="The Life Scientists - FriendFeed" href="http://friendfeed.com/the-life-scientists">The Life Scientists</a>, are a great place to ask questions.  And I haven&#8217;t even talked about IRC.</li>
</ul>
<p>Each approach has pros and cons, and quite honestly, at some level I like all of them, although mailing lists and newsgroups are less usable and useful for me with the exception of a few very specialized, low volume, niche ones.  In the end, we&#8217;ll all use a mix and each community will find one or two mediums that work well, depending on the nature of questions, the people involved and in the end what our goals are.  For examples, on SEQanswers, I don&#8217;t see a reputation really mattering that much, and any advantage to tall the rights and privileges that come with a reputation score, so a forum might prevail.  On the other hand, Biostars probably works best in a Stackoverflow model.</p>
<p>Just don&#8217;t come up with another model.  I am having a tough time keeping up.</p>
<h6 class="zemanta-related-title">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://gigaom.com/2010/05/11/zillow-ceo-pozits-a-social-qa-site/">Zillow CEO Pozits a Social Q&amp;A Site</a> (gigaom.com)</li>
<li class="zemanta-article-ul-li"><a href="http://thenextweb.com/us/2010/07/06/the-future-of-question-and-answer-sites/">The Future of Question and Answer Sites</a> (thenextweb.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.businessinsider.com/facebook-recruiting-questions-2010-6">Facebook Recruits Users To Test Q&amp;A Product</a> (businessinsider.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.r-statistics.com/2010/06/a-new-qa-website-for-data-analysis-based-on-stackoverflow-engine-is-waiting-for-you/">A new Q&amp;A website for Data-Analysis (based on StackOverFlow engine) &#8211; is waiting for you</a> (r-statistics.com)</li>
</ul>
<div class="zemanta-pixie"><img class="zemanta-pixie-img" src="http://img.zemanta.com/pixy.gif?x-id=ebd0b36e-14b1-4d89-b10b-b5c4697f40e8" alt="" /><span class="zem-script more-related pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/mndoci?a=-fcPzjiP1zk:6gtIqiwoWPo:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/mndoci?i=-fcPzjiP1zk:6gtIqiwoWPo:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=-fcPzjiP1zk:6gtIqiwoWPo:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/mndoci?d=dnMXMwOfBR0" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=-fcPzjiP1zk:6gtIqiwoWPo:JEwB19i1-c4"><img src="http://feeds.feedburner.com/~ff/mndoci?i=-fcPzjiP1zk:6gtIqiwoWPo:JEwB19i1-c4" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=-fcPzjiP1zk:6gtIqiwoWPo:EpLpB3ZkKWg"><img src="http://feeds.feedburner.com/~ff/mndoci?d=EpLpB3ZkKWg" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=-fcPzjiP1zk:6gtIqiwoWPo:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/mndoci?i=-fcPzjiP1zk:6gtIqiwoWPo:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=-fcPzjiP1zk:6gtIqiwoWPo:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/mndoci?i=-fcPzjiP1zk:6gtIqiwoWPo:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=-fcPzjiP1zk:6gtIqiwoWPo:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/mndoci?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=-fcPzjiP1zk:6gtIqiwoWPo:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/mndoci?i=-fcPzjiP1zk:6gtIqiwoWPo:gIN9vFwOqvQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/mndoci/~4/-fcPzjiP1zk" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://mndoci.com/2010/07/11/asking-questions-and-getting-answers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://mndoci.com/2010/07/11/asking-questions-and-getting-answers/</feedburner:origLink></item>
		<item>
		<title>Peer-review has a place</title>
		<link>http://feedproxy.google.com/~r/mndoci/~3/EPPMc85BbLU/</link>
		<comments>http://mndoci.com/2010/07/11/peer-review-has-a-place/#comments</comments>
		<pubDate>Sun, 11 Jul 2010 14:57:59 +0000</pubDate>
		<dc:creator>Deepak Singh</dc:creator>
				<category><![CDATA[Open Science]]></category>
		<category><![CDATA[Publishing]]></category>

		<guid isPermaLink="false">http://mndoci.com/?p=2102</guid>
		<description><![CDATA[I usually find Glyn Moody&#8216;s thoughts to be insightful, although we often disagree. His post on doing away with peer-review is definitely in the disagree category. Anyone who reads these pages knows I am not a huge fan of the current peer-review system. IMO it is loaded in favor of incumbents, has tremendous inefficiencies and [...]]]></description>
			<content:encoded><![CDATA[<p>I usually find <a class="zem_slink" title="Glyn Moody" rel="blog" href="http://opendotdotdot.blogspot.com/">Glyn Moody</a>&#8216;s thoughts to be insightful, although we often disagree.  His post on <a title="open...: Abundance Obsoletes Peer Review, so Drop It" href="http://opendotdotdot.blogspot.com/2010/06/abundance-obsoletes-peer-review-so-drop.html?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed%3A+blogspot%2FcBoI+%28open...%29&amp;utm_content=FriendFeed+Bot">doing away with peer-review</a> is definitely in the disagree category.</p>
<p>Anyone who reads these pages knows I am <a href="http://www.lijit.com/search?uri=http%3A%2F%2Fwww.lijit.com%2Fusers%2Fmndoci&amp;q=peer-review&amp;type=blog">not a huge fan</a> of the current peer-review system.  IMO it is loaded in favor of incumbents, has tremendous inefficiencies and there is too much getting reviewed that doesn&#8217;t need to.  However, peer review serves a purpose.  As <a href="http://pbeltrao.blogspot.com">Pedro</a> puts up in a <a title="Abundance Obsoletes Peer Review, so Drop It - Cameron Neylon - FriendFeed" href="http://friendfeed.com/cameronneylon/d78e49d3/abundance-obsoletes-peer-review-so-drop-it">great thread out on Friendfeed</a>, on average peer-review adds value, even subjective opinion.   Many of the commenters derive value from doing peer-review.  Pedro also sees it as part of his duty as a scientist (I agree).</p>
<p>The problem is not with peer-review, but rather with the overall publishing ecosystem.  We publish material in peer-reviewed journals that, today, can be published in a number of alternative avenues.  We don&#8217;t have enough transparency and randomness in our peer review process.  In the long term, to put this in terms of Glyn&#8217;s post, peer review should come in a system that deals with scarcity, as long as there is a well functioning system that deals with more abundance. To put this another way, with peer-review we should be focussing on a subset of what we publish today, while the rest goes into blogs, <a href="http://precedings.nature.com">nature precedings</a>, self-published articles, etc, as discussed in a <a title="Collections on Nature Precedings" href="http://mndoci.com/2010/07/06/collections-on-nature-precedings/">previous post</a>.  The point that Glyn seems to miss is that the role of peer-review is not to just vote something up or down, but rather provide constructive, sometimes qualitative advice or suggestions, to help improve the science.  I do believe that in the long run a more open peer review process might be the way to go, since we can find papers in our area of interest and provide the same suggestions via a different mechanism, but not yet.  What we should rather focus on is changing the publish and reward system, which discourages scarcity, encourages &#8220;me too&#8221; and incremental publishing, and (IMO) discourages creativity.</p>
<h6 class="zemanta-related-title">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://scholarlykitchen.sspnet.org/2010/03/30/improving-peer-review-lets-provide-an-ingredients-list-for-our-readers/">Improving Peer Review: Let&#8217;s Provide an Ingredients List for Our Readers</a> (scholarlykitchen.sspnet.org)</li>
<li class="zemanta-article-ul-li"><a href="http://www.theatlantic.com/business/archive/2010/06/peer-review-is-no-panacea/58842/">Peer Review is No Panacea</a> (theatlantic.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.kevinmd.com/blog/2010/07/mark-midei-failure-peer-review.html">Mark Midei and the failure of peer review</a> (kevinmd.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.prnewswire.com/news-releases/elsevier-pilots-peerchoice-in-chemical-physics-letters-96786359.html">Elsevier Pilots PeerChoice in Chemical Physics Letters</a> (prnewswire.com)</li>
</ul>
<div class="zemanta-pixie"><img class="zemanta-pixie-img" src="http://img.zemanta.com/pixy.gif?x-id=3c8c6bba-824e-4067-a816-62af287d7102" alt="" /><span class="zem-script more-related pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/mndoci?a=EPPMc85BbLU:pyrFyY9Wd6c:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/mndoci?i=EPPMc85BbLU:pyrFyY9Wd6c:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=EPPMc85BbLU:pyrFyY9Wd6c:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/mndoci?d=dnMXMwOfBR0" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=EPPMc85BbLU:pyrFyY9Wd6c:JEwB19i1-c4"><img src="http://feeds.feedburner.com/~ff/mndoci?i=EPPMc85BbLU:pyrFyY9Wd6c:JEwB19i1-c4" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=EPPMc85BbLU:pyrFyY9Wd6c:EpLpB3ZkKWg"><img src="http://feeds.feedburner.com/~ff/mndoci?d=EpLpB3ZkKWg" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=EPPMc85BbLU:pyrFyY9Wd6c:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/mndoci?i=EPPMc85BbLU:pyrFyY9Wd6c:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=EPPMc85BbLU:pyrFyY9Wd6c:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/mndoci?i=EPPMc85BbLU:pyrFyY9Wd6c:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=EPPMc85BbLU:pyrFyY9Wd6c:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/mndoci?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=EPPMc85BbLU:pyrFyY9Wd6c:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/mndoci?i=EPPMc85BbLU:pyrFyY9Wd6c:gIN9vFwOqvQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/mndoci/~4/EPPMc85BbLU" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://mndoci.com/2010/07/11/peer-review-has-a-place/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://mndoci.com/2010/07/11/peer-review-has-a-place/</feedburner:origLink></item>
		<item>
		<title>Collections on Nature Precedings</title>
		<link>http://feedproxy.google.com/~r/mndoci/~3/i-GEK8ypRv8/</link>
		<comments>http://mndoci.com/2010/07/06/collections-on-nature-precedings/#comments</comments>
		<pubDate>Tue, 06 Jul 2010 17:45:33 +0000</pubDate>
		<dc:creator>Deepak Singh</dc:creator>
				<category><![CDATA[Open Science]]></category>
		<category><![CDATA[Publishing]]></category>

		<guid isPermaLink="false">http://mndoci.com/?p=2100</guid>
		<description><![CDATA[OK, so I&#8217;ve been living under a rock, but Nature Precedings has this concept of collections which is pretty neat.  A couple that caught my eye iEvoBio The future of the web for collaborative science This, coupled with some great blog posts on data analysis coming from various, non-hard science sources got me thinking.  Let&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p>OK, so I&#8217;ve been living under a rock, but <a href="http://precedings.nature.com">Nature Precedings</a> has this concept of collections which is pretty neat.  A couple that caught my eye</p>
<ul>
<li><a href="http://precedings.nature.com/collections/ievobio-2010">iEvoBio</a></li>
<li><a href="http://precedings.nature.com/collections/future-of-the-web-2010">The future of the web for collaborative science</a></li>
</ul>
<p>This, coupled with some great blog posts on data analysis coming from various, non-hard science sources got me thinking.  Let&#8217;s say I am an independent researcher, and I have some cool science on hand (doesn&#8217;t have to be computational).  I can write it up and submit it to a journal.  Or I might have an opinion which I could write up and submit.  Today, we have a multitude of options about getting it out there.  What&#8217;s going to be interesting is what options we choose to get our thoughts out there.  Take Carl Boettiger&#8217;s deck on his <a href="http://precedings.nature.com/documents/4602/version/1">experiments with open science</a>.  That was posted to Precedings.  It could also have been cross-posted to <a href="http://slideshare.net">slideshare</a> or <a href="http://scribd.com">scribd</a>.  Carl could have blogged his thoughts (expanding on the slides) and also embedded one of the slide shows.  And, nothing stops you from writing this up as an opinion paper.  The point is that we have avenues that we never had before.  What&#8217;s going to be interesting is how formal we get about all this?  Some material needs to go into Peer Review, but what happens if you blog about some of your initial results first, or maintain an open notebook.  How will it be treated?  What are the implications?</p>
<p>The answers are not black and white, and still in flux, but I&#8217;d encourage both scientists and publishers (and funding agencies) to rethink the current formalisms for making information available.  My hope is we&#8217;ll come up with our own set of guidelines on when, how and where without constraining creativity too much.</p>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/mndoci?a=i-GEK8ypRv8:bIqTsFSnd3w:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/mndoci?i=i-GEK8ypRv8:bIqTsFSnd3w:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=i-GEK8ypRv8:bIqTsFSnd3w:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/mndoci?d=dnMXMwOfBR0" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=i-GEK8ypRv8:bIqTsFSnd3w:JEwB19i1-c4"><img src="http://feeds.feedburner.com/~ff/mndoci?i=i-GEK8ypRv8:bIqTsFSnd3w:JEwB19i1-c4" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=i-GEK8ypRv8:bIqTsFSnd3w:EpLpB3ZkKWg"><img src="http://feeds.feedburner.com/~ff/mndoci?d=EpLpB3ZkKWg" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=i-GEK8ypRv8:bIqTsFSnd3w:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/mndoci?i=i-GEK8ypRv8:bIqTsFSnd3w:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=i-GEK8ypRv8:bIqTsFSnd3w:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/mndoci?i=i-GEK8ypRv8:bIqTsFSnd3w:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=i-GEK8ypRv8:bIqTsFSnd3w:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/mndoci?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=i-GEK8ypRv8:bIqTsFSnd3w:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/mndoci?i=i-GEK8ypRv8:bIqTsFSnd3w:gIN9vFwOqvQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/mndoci/~4/i-GEK8ypRv8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://mndoci.com/2010/07/06/collections-on-nature-precedings/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		<feedburner:origLink>http://mndoci.com/2010/07/06/collections-on-nature-precedings/</feedburner:origLink></item>
		<item>
		<title>Recommendation: Data-intensive text processing with MapReduce</title>
		<link>http://feedproxy.google.com/~r/mndoci/~3/JZKS2_mfuKI/</link>
		<comments>http://mndoci.com/2010/07/02/recommendation-data-intensive-text-processing-with-mapreduce/#comments</comments>
		<pubDate>Sat, 03 Jul 2010 05:21:03 +0000</pubDate>
		<dc:creator>Deepak Singh</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Computing]]></category>
		<category><![CDATA[Programming]]></category>

		<guid isPermaLink="false">http://mndoci.com/?p=2069</guid>
		<description><![CDATA[Staying on my massive data processing theme here is a more practical post. In the world of large scale distributed processing, the original MapReduce paper will probably hold the most important position. Hadoop remains the most well known of all the MapReduce implementations, and is now a proven, battle-tested commodity. Tom White&#8217;s book is a [...]]]></description>
			<content:encoded><![CDATA[<p>Staying on my <a title="Massive data" href="http://mndoci.com/2010/06/30/massive-data/">massive data</a> processing theme here is a more practical post.  In the world of large scale <a class="zem_slink" title="Distributed computing" rel="wikipedia" href="http://en.wikipedia.org/wiki/Distributed_computing">distributed processing</a>, the original <a class="zem_slink" title="MapReduce" rel="homepage" href="http://labs.google.com/papers/mapreduce.html">MapReduce</a> paper will probably hold the most important position.  <a class="zem_slink" title="Hadoop" rel="homepage" href="http://hadoop.apache.org/">Hadoop</a> remains the most well known of all the MapReduce implementations, and is now a proven, battle-tested commodity.  <a href="http://www.amazon.com/gp/product/0596521979?ie=UTF8&amp;tag=businessbyt05-20&amp;linkCode=as2&amp;camp=1789&amp;creative=390957&amp;creativeASIN=0596521979">Tom White&#8217;s book</a><img style="border: none !important; margin: 0px !important;" src="http://www.assoc-amazon.com/e/ir?t=businessbyt05-20&amp;l=as2&amp;o=1&amp;a=0596521979" border="0" alt="" width="1" height="1" /><br />
is a great place to start if you have an interest in the framework itself, but the book I wanted to point out was <a title="Jimmy Lin » Homepage" href="http://www.umiacs.umd.edu/~jimmylin/">Jimmy Lin&#8217;s</a> book on <a href="http://www.amazon.com/gp/product/1608453421?ie=UTF8&amp;tag=businessbyt05-20&amp;linkCode=as2&amp;camp=1789&amp;creative=390957&amp;creativeASIN=1608453421"><em>Data-Intensive Text Processing with MapReduce</em></a><img style="border: none !important; margin: 0px !important;" src="http://www.assoc-amazon.com/e/ir?t=businessbyt05-20&amp;l=as2&amp;o=1&amp;a=1608453421" border="0" alt="" width="1" height="1" /> (there is a pre-production PDF of the book from the <a title="Jimmy Lin » Data-Intensive Text Processing with MapReduce" href="http://www.umiacs.umd.edu/~jimmylin/book.html">homepage</a>)  and it&#8217;s a great dive into algorithm design.  The book talks about general algo design, indexing, graphs and a fabulous section on expectation maximization that is a must read for bioinformaticians who might be interested in analyzing and processing large data sets.</p>
<h6 class="zemanta-related-title">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://atbrox.com/2010/02/12/mapreduce-hadoop-algorithms-in-academic-papers-updated/">Papers on MapReduce algorithms</a> (atbrox.com)</li>
<li class="zemanta-article-ul-li"><a href="http://www.slideshare.net/ydn/7-honu-hadoopsummit2010">Honu &#8211; A Large Scale Streaming Data Collection and Processing Pipeline__HadoopSummit2010</a> (slideshare.net)</li>
</ul>
<div class="zemanta-pixie"><img class="zemanta-pixie-img" src="http://img.zemanta.com/pixy.gif?x-id=4135c11d-4e7a-417d-89a8-4a1407b88066" alt="" /><span class="zem-script more-related pretty-attribution"><script src="http://static.zemanta.com/readside/loader.js" type="text/javascript"></script></span></div>
<div class="feedflare">
<a href="http://feeds.feedburner.com/~ff/mndoci?a=JZKS2_mfuKI:AwKMsIH_5Rw:D7DqB2pKExk"><img src="http://feeds.feedburner.com/~ff/mndoci?i=JZKS2_mfuKI:AwKMsIH_5Rw:D7DqB2pKExk" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=JZKS2_mfuKI:AwKMsIH_5Rw:dnMXMwOfBR0"><img src="http://feeds.feedburner.com/~ff/mndoci?d=dnMXMwOfBR0" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=JZKS2_mfuKI:AwKMsIH_5Rw:JEwB19i1-c4"><img src="http://feeds.feedburner.com/~ff/mndoci?i=JZKS2_mfuKI:AwKMsIH_5Rw:JEwB19i1-c4" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=JZKS2_mfuKI:AwKMsIH_5Rw:EpLpB3ZkKWg"><img src="http://feeds.feedburner.com/~ff/mndoci?d=EpLpB3ZkKWg" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=JZKS2_mfuKI:AwKMsIH_5Rw:F7zBnMyn0Lo"><img src="http://feeds.feedburner.com/~ff/mndoci?i=JZKS2_mfuKI:AwKMsIH_5Rw:F7zBnMyn0Lo" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=JZKS2_mfuKI:AwKMsIH_5Rw:V_sGLiPBpWU"><img src="http://feeds.feedburner.com/~ff/mndoci?i=JZKS2_mfuKI:AwKMsIH_5Rw:V_sGLiPBpWU" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=JZKS2_mfuKI:AwKMsIH_5Rw:qj6IDK7rITs"><img src="http://feeds.feedburner.com/~ff/mndoci?d=qj6IDK7rITs" border="0"></img></a> <a href="http://feeds.feedburner.com/~ff/mndoci?a=JZKS2_mfuKI:AwKMsIH_5Rw:gIN9vFwOqvQ"><img src="http://feeds.feedburner.com/~ff/mndoci?i=JZKS2_mfuKI:AwKMsIH_5Rw:gIN9vFwOqvQ" border="0"></img></a>
</div><img src="http://feeds.feedburner.com/~r/mndoci/~4/JZKS2_mfuKI" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://mndoci.com/2010/07/02/recommendation-data-intensive-text-processing-with-mapreduce/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://mndoci.com/2010/07/02/recommendation-data-intensive-text-processing-with-mapreduce/</feedburner:origLink></item>
	<media:credit role="author">Deepak Singh</media:credit><media:rating>nonadult</media:rating></channel>
</rss>
