<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="http://feeds.feedburner.com/~d/styles/rss2full.xsl" type="text/xsl" media="screen"?><?xml-stylesheet href="http://feeds.feedburner.com/~d/styles/itemcontent.css" type="text/css" media="screen"?><!-- generator="wordpress/2.3.1" --><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Mailund on the Internet</title>
	<link>http://www.mailund.dk</link>
	<description>Computer science, bioinformatics, genetics, and everything in between</description>
	<pubDate>Sun, 11 May 2008 14:27:00 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.3.1</generator>
	<language>en</language>
			<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/MailundOnTheInternet" type="application/rss+xml" /><item>
		<title>A silly meme…</title>
		<link>http://feeds.feedburner.com/~r/MailundOnTheInternet/~3/288079700/</link>
		<comments>http://www.mailund.dk/index.php/2008/05/11/a-silly-meme/#comments</comments>
		<pubDate>Sun, 11 May 2008 14:27:00 +0000</pubDate>
		<dc:creator>Thomas Mailund</dc:creator>
		
		<category><![CDATA[Fun]]></category>

		<category><![CDATA[blog]]></category>

		<guid isPermaLink="false">http://www.mailund.dk/index.php/2008/05/11/a-silly-meme/</guid>
		<description><![CDATA[Ok, so I was tagged to do this by my good old friend Thomas Widmann. Who, by the way, stood me up once in Oxford, claiming he was ill or something&#8230; PJC and myself were meeting up for a drink (Peter coming all the way from Bremen, taking time off his busy schedule building space [...]]]></description>
			<content:encoded><![CDATA[<p>Ok, so I was <a href="http://twid.bibulus.org/2008/05/page-123.html">tagged to do this</a> by my good old friend <a href="http://twid.bibulus.org/hpindex-en.html">Thomas Widmann</a>. Who, by the way, <a href="http://twid.bibulus.org/2006/05/nu-ville-jeg-have-vret-hjemme-igen.html">stood me up</a> once in Oxford, claiming he was ill or something&#8230; <a href="http://www.peterjc.dk/Welcome.html">PJC</a> and myself were meeting up for a drink (Peter coming all the way from Bremen, taking time off his busy schedule building space rockets) and Widmann didn&#8217;t even bother to show up&#8230;</p>
<p>Anyway, apparently the idea now is:</p>
<blockquote>
<ol>
<li>Pick up the nearest book.</li>
<li>Open to page 123.</li>
<li>Find the fifth sentence.</li>
<li>Post the next three sentences.</li>
<li>Tag five people and post a comment here once you post it to your blog so I can come see!</li>
</ol>
</blockquote>
<p>Now, the closest book means I have to dig through the piles of paper on my desk.  I am assuming closest is in Euclidean distance and not in time it takes to get at it.  On my desk I have a few books that I have had here for ages and never use &#8212; a few manuals that I never look at because Googling is much easier, and a <em>dictionary</em> that I haven&#8217;t used since I discovered Webster online.  So, the closest book, once I made it through piles of papers I should have read but never got around to, is: <a href="http://www.amazon.com/Oxford-Advanced-Learners-Dictionary-Albert/dp/0194001164">Oxford Advanced Learner&#8217;s Dictionary</a>.  It was a gift from my sister &#8212; I have the big, red, massive Webster myself, but I got this from here when she finished her studies and decided she would never need it again&#8230;</p>
<p>In my version &#8212; which doesn&#8217;t look like the picture at Amazon I linked to above &#8212; page 123 is <em>bole </em>to <em>bona fide.</em></p>
<p>Sentence five is:</p>
<blockquote><p><strong>bolero</strong> <em>n</em>(<em>pl</em>~s) <strong>1</strong> (music for a) type of Spanish dance.</p></blockquote>
<p>The next three are:</p>
<blockquote><p><strong>2</strong> woman&#8217;s short jacket with no front fastening. <strong>boll</strong> <em>n</em> seed-case of the cotton pants or flax. <strong>bool-weevil</strong>  <em>n</em> destructive insect whose larvae eat cotton bolls. <strong>bollard</strong> <em>n</em> <strong>1</strong> short thick post on a quay or ship&#8217;s deck, to which a ship&#8217;s mooring ropes are tied. <strong>2</strong> short post on a kerb or fraffic island.</p></blockquote>
<p>Wasn&#8217;t that a senseless exercise?</p>
<p>Now I need to pick five others to tag.  I&#8217;ll stick to people <a href="http://www.au.dk">AU</a> people who probably knows Widmann as well:  <a href="http://www.peterjc.dk/Welcome.html">Peter JC</a>, <a href="http://digital-sushi.org/">Peter Ahé</a>, <a href="http://bitplanet.net">Kristian Høgsberg</a>, <a href="http://www.daimi.au.dk/~sandmann">Søren Sandmann</a> and <a href="http://www.klafbang.dk">Michael Westergaard</a>.</p>
<p>Now, where exactly am I supposed to leave a comment, Widmann?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mailund.dk/index.php/2008/05/11/a-silly-meme/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.mailund.dk/index.php/2008/05/11/a-silly-meme/</feedburner:origLink></item>
		<item>
		<title>Connecting the world</title>
		<link>http://feeds.feedburner.com/~r/MailundOnTheInternet/~3/287996403/</link>
		<comments>http://www.mailund.dk/index.php/2008/05/11/connecting-the-world/#comments</comments>
		<pubDate>Sun, 11 May 2008 10:54:07 +0000</pubDate>
		<dc:creator>Thomas Mailund</dc:creator>
		
		<category><![CDATA[Fun]]></category>

		<category><![CDATA[Internet]]></category>

		<category><![CDATA[TED]]></category>

		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://www.mailund.dk/index.php/2008/05/11/connecting-the-world/</guid>
		<description><![CDATA[Just in follow up to my previous post, where I wrote a bit about my amazement of how the Internet has connected the world, here&#8217;s a TED talk on that exact topic:








 The first part is a bit dull, I think, but halfway through Hector Ruiz gets to the 50&#215;15 project &#8212; connecting half the [...]]]></description>
			<content:encoded><![CDATA[<p>Just in follow up to my previous post, where I wrote a bit about my amazement of how the Internet has connected the world, here&#8217;s a TED talk on that exact topic:<br />
<center><br />
<!--cut and paste--><a href="http://static.videoegg.com/ted2/flash/loader.swf" style="left: 318px ! important; top: 0px ! important" title="Click here to block this object with Adblock Plus" class="abp-objtab-06349998257069632 visible ontop"></a><a href="http://static.videoegg.com/ted2/flash/loader.swf" style="left: 0px ! important; top: 16px ! important" title="Click here to block this object with Adblock Plus" class="abp-objtab-06349998257069632 visible ontop"></a><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=8,0,0,0" id="VE_Player" align="middle" height="285" width="320"></p>
<param name="movie" value="http://static.videoegg.com/ted2/flash/loader.swf"></param>
<param name="FlashVars" value="bgColor=FFFFFF&amp;file=http://static.videoegg.com/ted/movies/HectorRuiz_2007G_high.flv&amp;autoPlay=false&amp;fullscreenURL=http://static.videoegg.com/ted/flash/fullscreen.html&amp;forcePlay=false&amp;logo=&amp;allowFullscreen=true"></param>
<param name="quality" value="high"></param>
<param name="allowScriptAccess" value="always"></param>
<param name="bgcolor" value="#FFFFFF"></param>
<param name="scale" value="noscale"></param>
<param name="wmode" value="window"></param><embed src="http://static.videoegg.com/ted2/flash/loader.swf" flashvars="bgColor=FFFFFF&amp;file=http://static.videoegg.com/ted/movies/HectorRuiz_2007G_high.flv&amp;autoPlay=false&amp;fullscreenURL=http://static.videoegg.com/ted/flash/fullscreen.html&amp;forcePlay=false&amp;logo=&amp;allowFullscreen=true" quality="high" allowscriptaccess="always" bgcolor="#FFFFFF" scale="noscale" wmode="window" name="VE_Player" type="application/x-shockwave-flash" pluginspage="http://www.macromedia.com/go/getflashplayer" align="middle" height="285" width="320"></embed></object></center> The first part is a bit dull, I think, but halfway through Hector Ruiz gets to the 50&#215;15 project &#8212; connecting half the world to the Net in 2015.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mailund.dk/index.php/2008/05/11/connecting-the-world/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.mailund.dk/index.php/2008/05/11/connecting-the-world/</feedburner:origLink></item>
		<item>
		<title>Pangea Day</title>
		<link>http://feeds.feedburner.com/~r/MailundOnTheInternet/~3/287971427/</link>
		<comments>http://www.mailund.dk/index.php/2008/05/11/pangea-day/#comments</comments>
		<pubDate>Sun, 11 May 2008 09:49:03 +0000</pubDate>
		<dc:creator>Thomas Mailund</dc:creator>
		
		<category><![CDATA[Fun]]></category>

		<category><![CDATA[Google]]></category>

		<category><![CDATA[Information technology]]></category>

		<category><![CDATA[Internet]]></category>

		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">http://www.mailund.dk/index.php/2008/05/11/pangea-day/</guid>
		<description><![CDATA[Yesterday was Pangea Day.  Imagine a world-wide film festival?


I continue to be baffled about how information technology has changed the world.
  When I started studying computer science, in 1995 (which is centuries ago in Internet years), the web was in the process of replacing Gopher and the Internet was not really something ordinary [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday was <a href="http://www.pangeaday.org/">Pangea Day</a>.  Imagine a world-wide film festival?</p>
<p align="center"><object type="application/x-shockwave-flash" data="http://www.youtube.com/v/JxhRukX-vAw" width="425" height="355" wmode="transparent">
<param name="movie" value="http://www.youtube.com/v/JxhRukX-vAw" /></object></p>
<p align="left">I continue to be baffled about how information technology has changed the world.</p>
<p align="left"> <a href="http://www.twango.com/media/PangeaDay.inspire/pangeaday.10957"><img src="http://media.twango.com/m1/roundedthumbnail/0176/c5c9070577da465a9c6fa247fba04c4f.jpg" alt="Wade Davis, National Geographic explorer, endorses Pangea Day - Share on Ovi" align="left" border="0" height="100" width="100" /></a> When I started studying computer science, in 1995 (which is centuries ago in Internet years), the web was in the process of replacing Gopher and the Internet was not really something ordinary people knew about or used.  Not in Denmark, at least.  That changed <em>fast</em>.</p>
<p align="left">Two-three years later, everyone had an Internet connection at home, and a little bit later, when Google showed up on the scene replacing AltaVista, Yahoo etc., the Internet replaced the encyclopaedia as the place you would find information  (they may not like me saying this, but Google really <em>did</em> blow the competitors out of the water).  I cannot really imagine living without the Internet and search engines any more.  When I am not connected, I feel almost handicapped.  In my work, I need to look up stuff all the time, and going through books and manuals is <em>so</em> much harder than just googling.</p>
<p align="left"> <a href="http://www.twango.com/media/PangeaDay.inspire/pangeaday.10771"><img src="http://media.twango.com/m1/roundedthumbnail/0175/83050799825f4c05b613928448793a35.jpg" alt="Loic Le Meur - Share on Ovi" align="left" border="0" height="100" width="100" /></a><br />
Anyway, that was a &#8220;local&#8221; change to life, and I didn&#8217;t really notice how the world got smaller through the same technology until a few years later.  I was working on a project together with a guy in Australia, and one day it struck me how amazing it was that I could send an email in the evening about my progress so far, and while I slept he would work on the project and in the morning I would have his update.  A letter would take days, but we were working almost interactively together on opposite sides of the world!
</p>
<p align="left"> A little later, I was myself working in Australia, first at the University of Adelaide, a bit later at the University of South Australia &#8212; both times for shorter stays of a few months duration &#8212; and using instant messaging I <em>could</em> work interactively with people at home, and talk to friends.  With my cellphone, I could <em>talk</em> to my friends while I was walking through a park in Adelaide and they were getting ready for bed in Denmark.  It was a bit expensive, but it was <em>possible</em>.</p>
<p align="left"><a href="http://www.twango.com/media/PangeaDay.inspire/pangeaday.10770"><img src="http://media.twango.com/m1/roundedthumbnail/0175/8c012b6e3ae24a6aaa2fb3dffb8d0628.jpg" alt="Kevin Wall PSA - Share on Ovi" align="left" border="0" height="100" width="100" /></a>Today, I frequently communicate over the &#8216;net with people from the North America, Europe, Australia and Asia.  Very rarely South America, although I used to when I was working in a different field, and never really Africa for some reason, although I do get the occasional email from South Africa.  If I want to, I can call people up all over the world, and as the price goes down (or Internet telephony takes over) I will be able to talk to people everywhere, from anywhere, over my cellphone.</p>
<p align="left">Hell, I&#8217;m posting this sitting in my garden, connected over a wireless connection.  I wouldn&#8217;t have believed this possible ten years ago.</p>
<p align="left">There&#8217;s some drawbacks to this as well, of course, such as <a href="http://www.stats.ox.ac.uk/people/academic_staff/jotun_hein">Jotun Hein</a> never really understanding time zones and calling in the middle of the night from New Zealand to discuss a project&#8230; It happened a couple of times while Jotun was on sabbatical there.  <a href="http://www.math.canterbury.ac.nz/~m.steel/">Mike Steel</a> told me at a bbq that he had the same problem when Jotun calls him from the UK.  But all in all, it is a great thing, in my opinion.</p>
<p align="left">Information  technology has made the world into a global village.  It is said a lot, but in some sense it is true, and if you stop and think about it, it really is amazing.</p>
<p align="left">Well, not the <em>entire</em> world has joined the village.  The majority of people are <em>not</em> connected.  I don&#8217;t know the numbers, but I wouldn&#8217;t think a billion people are connected (and we have just passed <a href="http://johnhawks.net/weblog/topics/population_structure/population_6666666666_2008.html">6,666,666 billion people worldwide</a>).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mailund.dk/index.php/2008/05/11/pangea-day/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.mailund.dk/index.php/2008/05/11/pangea-day/</feedburner:origLink></item>
		<item>
		<title>Debugging fun</title>
		<link>http://feeds.feedburner.com/~r/MailundOnTheInternet/~3/287438204/</link>
		<comments>http://www.mailund.dk/index.php/2008/05/10/debugging-fun/#comments</comments>
		<pubDate>Sat, 10 May 2008 11:11:01 +0000</pubDate>
		<dc:creator>Thomas Mailund</dc:creator>
		
		<category><![CDATA[Fun]]></category>

		<category><![CDATA[Work]]></category>

		<category><![CDATA[debugging]]></category>

		<category><![CDATA[software]]></category>

		<category><![CDATA[software development]]></category>

		<guid isPermaLink="false">http://www.mailund.dk/index.php/2008/05/10/debugging-fun/</guid>
		<description><![CDATA[Yesterday I got a simulator from Garrett Hellenthal that I need to simulate some genetics data (combined &#8220;panel&#8221; and &#8220;tagSNP&#8221; data, if you want to know).
The program didn&#8217;t work for me.  On the example data provided with the code, it complained about invalid input.  Right!  Naturally, I expected that Garrett had given [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday I got a simulator from <a href="http://www.stats.ox.ac.uk/people/postdoctoral_researchers/garrett_hellenthal">Garrett Hellenthal</a> that I need to simulate some genetics data (combined &#8220;panel&#8221; and &#8220;tagSNP&#8221; data, if you want to know).</p>
<p>The program didn&#8217;t work for me.  On the example data provided with the code, it complained about invalid input.  Right!  Naturally, I expected that Garrett had given me test data that was in some way outdated.   It happens quie often that the test data isn&#8217;t updated when new features are added to a program, and that test examples no longer function the way they should.  I make that mistake a lot, anyway.</p>
<p>So I got hold of Garrett, and yes the test example was incorrect and he sent me an updated test.  It didn&#8217;t work either.  We emailed back and forth a bit, and this time it was a bit of a puzzle.  Everything worked fine at Garrett&#8217;s end, but the program just wouldn&#8217;t eat the input on my computer.   I didn&#8217;t have the time to figure out why, yesterday, as I had a plane to catch, but I had another look at it today.</p>
<p>I first grep&#8217;ed for the error message I got: &#8220;CV frequencies must be between 0 and 0.50&#8243;.  That line occurred a few places in the code, so I annotated the code with some output before each and that pinpointed this test as the problem:</p>
<pre>
if ((allelefreq[count] &lt; epsilon[count]) || (allelefreq[count] &gt; (0.50-epsilon[count]))) {
   printf("CV frequencies must be between 0 and 0.50\n");
   exit(1);
 }</pre>
<p>Ok, so on Garrett&#8217;s machine, the test was false and on mine it was true, but the input files were the same.  First I tested if the input was read in correctly (Garrett&#8217;s IO code is a bit of a mess, excuse me for saying it).  allelefreq[count] was supposed to be 0.4 &#8212; and it was &#8212; and epsilon[count] should be 0.1 &#8212; and it was.</p>
<p>That&#8217;s when I spotted the problem.  Do you see it?</p>
<p>Clearly 0.4 ≥ 0.1 and 0.4 ≤ 0.5-0.1 so the test <em>should</em> evaluate to false, but it doesn&#8217;t.</p>
<p>The thing is, floating point numbers aren&#8217;t real numbers (as you probably know).  And for one thing, 0.1 cannot be represented in a finite number of bits (as a binary number) so it is <em>approximated</em> in the computer.  So 0.5-0.1 is <em>not</em> 0.4 (although they <em>could</em> be represented by the same bit-patter as 0.4 is represented as).  Whether 0.4 ≤ 0.5-0.1 or 0.4 &gt; 0.5-0.1 on the computer depends on the way the numbers are represented.</p>
<p>Garrett was running his program on a 64 bit architecture (where the numbers were represented using 128 bits)  and I on a 32 bit architecture (with 64 bits per number).  That made all the difference.</p>
<p>Fun, eh?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mailund.dk/index.php/2008/05/10/debugging-fun/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.mailund.dk/index.php/2008/05/10/debugging-fun/</feedburner:origLink></item>
		<item>
		<title>Postdoc life vs PI life</title>
		<link>http://feeds.feedburner.com/~r/MailundOnTheInternet/~3/286825923/</link>
		<comments>http://www.mailund.dk/index.php/2008/05/09/postdoc-life-vs-pi-life/#comments</comments>
		<pubDate>Fri, 09 May 2008 13:28:42 +0000</pubDate>
		<dc:creator>Thomas Mailund</dc:creator>
		
		<category><![CDATA[Work]]></category>

		<category><![CDATA[Research life]]></category>

		<guid isPermaLink="false">http://www.mailund.dk/index.php/2008/05/09/postdoc-life-vs-pi-life/</guid>
		<description><![CDATA[I just saw this post today linked to from Bitesize Bio&#8217;s Around the Blog. It bears the title &#8220;Postdocs always overestimate their intellectual contributions&#8221; but really is about the relationship between postdocs and PIs, as seen from the postdoc perspective (and an under-appreciation of the PIs work).
 Life as a postdoc
The postdoc (and grad student, [...]]]></description>
			<content:encoded><![CDATA[<p>I just saw <a href="http://scienceblogs.com/drugmonkey/2008/04/repost_postdocs_always_overest_1.php">this post</a> today linked to from <a href="http://bitesizebio.com/2008/05/09/around-the-blogs-27/">Bitesize Bio&#8217;s Around the Blog</a>. It bears the title <a href="http://scienceblogs.com/drugmonkey/2008/04/repost_postdocs_always_overest_1.php" id="a075354"><em>&#8220;Postdocs always overestimate their intellectual contributions&#8221;</em></a> but really is about the relationship between postdocs and PIs, as seen from the postdoc perspective (and an under-appreciation of the PIs work).</p>
<h3> Life as a postdoc</h3>
<p>The postdoc (and grad student, for that matter) perspective is &#8212; and I recognize it both from my own experience and my colleagues &#8212; that you come up with most of the ideas, do all the work, and the PI just takes the credit.  It is a bit exaggerated, but the point is that the vast majority of the time spend on a project is time spent by a postdoc or grad student.  The advisor reads the occasional draft paper, comment on the research a bit here and there, but doesn&#8217;t really put in the hours.</p>
<p>I think this is true.  I think this is absolutely true (and on this point I think I disagree with the post I link to above&#8230; or maybe not when I elaborate on the point&#8230;)</p>
<p>Before I elaborate, though, I should probably explain my own situation, so you know where this perspective comes from.</p>
<p>I am somewhere in between being postdoc or PI.  The last couple of years I have been working on my own grants and not really been associated with any particular group, so I haven&#8217;t been a postdoc as such.  Although I&#8217;m currently officially an associate professor, I am not associated to anyone, and I haven&#8217;t got tenure yet so I cannot start my own group.  With the current rules in Aarhus, I cannot have grad students associated either.  Instead, I have associated myself to different other groups, for individual projects, and when doing so take on roles as either &#8220;postdoc&#8221; or &#8220;advisor&#8221;, depending on the project and who else is involved.</p>
<p>Anyway, the situation is that I was relatively recently  a postdoc &#8212; and remember how I felt about academic life then &#8212; and now I have experienced both the advisor and the postdoc world (but so far have never really been <em>responsible</em> for my own group, so my perspective might change later on).</p>
<p>So, as I said, I think it is true that on any one project, the postdoc will be doing the vast majority of the work, at least measured in hours.  This isn&#8217;t really surprising.  The postdoc can focus on one project (or a few projects), while the PI needs to manage many.  Of course the postdoc will contribute the most time &#8212; but who contributes the most <em>research</em>?</p>
<h3>Division of labour</h3>
<p>This is a trickier question&#8230; first of all because it is not easy to define nor measure &#8220;research&#8221;.  The best measure we have is the publications, but within a project you cannot use it to measure the output of the first author vs the last author.</p>
<p>The PI will be more experienced (or at least should be, but I&#8217;ll come back to that) so he will have a better idea about how to approach a problem, he will be able to spot dead-ends long before the postdoc, he will know the literature better, etc.  This is all experienced he gained when <em>he</em> was the one doing the grunt work, and now he should use it to teach the postdoc.  A PI hour might simply be worth more than a postdoc our because of the extra experience.</p>
<p>That is not the whole story, though.  There is also what I would call a trade-off between &#8220;detail work&#8221; and &#8220;big picture&#8221;.</p>
<p>Most research is fiddling with details, trying to make epsilon improvements upon previous work.  We all dream of a great breakthrough, but day to day research is about epsilon improvements, and that means fiddling with details.  It requires a lot of work to fully understand the details of any kind of problem, and it is very easy to get lost in details and lose the big picture.  How does the current work fit into the larger problems in a field, how does it related to previous work, and what will be the consequences of whatever you are trying to prove or solve?  Someone needs to figure out the details, but someone also needs to keep the big picture in mind!</p>
<p>I&#8217;m not sure you can do both at the same time.  At least I find it difficult to do both at the same time.</p>
<p>The postdoc deals with the details, and is probably the only one who fully <em>understands</em> the problem being worked on.  The PI keeps the big picture in mind, trying to relate the current problem to the rest of the field.</p>
<p>The PI will spend less time on the particular problem &#8212; contribute fewer hours to the project &#8212; but at the same time he probably needs to keep up with a broader part of the literature and keep track of related projects to improve on <em>this</em> project.  It doesn&#8217;t feel like he is contributing to the specific problem when he is just reading papers about other peoples work, but the hours he put in there helps him keep an eye on the big picture instead of getting caught in the details.</p>
<p>One of the perks is, of course, that keeping up with the literature pays off for more than one project at a time, so you get more publications that way.  As I said, it is one of the perks of that side to research.</p>
<h3>Experience matters (if you have it)</h3>
<p>One thing is the &#8220;details&#8221; / &#8220;big picture&#8221; division, but another is<em> experience</em>.  The postdoc will be less experienced in doing research.  The whole package, including doing the <em>actual</em> research, writing the papers &#8212; just as important as doing the experiments and analysing data, really &#8212; applying for grants, etc.  This is stuff you learn from experience, and is a lot easier to learn with the assistance of someone who already <em>has</em> the experience.  That would be the PI.</p>
<p>The postdoc is still learning.  He is being taught (or should be taught) by the PI.  It might not feel like a student/teacher relationship, because it is very different from the typical teaching at a university, but it <em>is</em> such a relationship.  It is just more of a master/apprentice  relationship.  The apprentice starts out doing only the grunt work, but gradually does more and more of the complicated work under less and less supervision, until he has mastered the discipline.</p>
<p>Experience is mainly a way to avoid dead ends, in my experience (no pun intended).  When you have worked with something for a long time, you have a good intuition for what might work and what will not, so you avoid a lot of wrong turns.  The experience of the PI will help the postdoc avoid a lot of pitfalls (probably without ever realising that there was a potential pitfall).  There should be room for making mistakes &#8212; that is how we learn &#8212; but by supervising the postdoc, the PI can help to avoid the worst mistakes.  By supervising less and less, leaving room for some mistakes but not until the postdoc is ready to deal with them, the PI allows the postdoc to gain the experience while still being a bit sheltered.</p>
<p>When supervising, it is actually harder to allow someone to make a mistake than it is to spot a mistake and avoid it, but sometimes mistakes are necessary to learn, so they should never be totally avoided&#8230; but that is a topic for another day&#8230;</p>
<p>Will the PI always be more experienced? Many paragraphs ago I promised to get back to this point.  In most ways I would say yes.  Writing papers, applying for grants, that stuff he <em>should</em> be more experienced.  Otherwise he wouldn&#8217;t be the PI, really.  But there is a limit to that experience and that is also important to keep in mind.</p>
<p>There&#8217;s two scenarios where the postdoc will be more experienced than the PI.  If the project he is working on is outside the PIs previous experience, or when the detail-work is important but completely lost to the PI.</p>
<p>Let&#8217;s take the first scenario first.  The PI gets an idea.  &#8220;It could be fun to figure out if &#8230;&#8221; and although it is not really related to anything he has done before, he manages to get a grant for doing research on the project. He has the grant but not the experience.  So he hires a postdoc to work on the problem, but cannot really supervise the science! (This is not as made up as it might sound; I have seen it happen a couple of times&#8230;)</p>
<p>The second scenario is probably more common.  Here the problem is that by focusing on &#8220;the big picture&#8221; for so long, the PI has forgotten the details and ignores that <em>that</em> is where the problems really are.  The big picture might tell you what the important questions are, but the answers are found in the details.  People forget this too often, in my opinion. They ask an interesting question and then suggest &#8220;maybe you can solve it using an MCMC or something&#8230;&#8221; or &#8220;maybe you can solve it using dynamic programming&#8230;&#8221; but that is not a helpful suggestion at all!  It is a complete disregard for the details, and the details are <em>important</em> (damn it)!</p>
<p>You loose the feeling for the details if you keep away from the for too long, and when that happens you cannot teach others how to deal with them any more.</p>
<p>Of course, a PI will never be as much into the details as the postdoc working with them every day, but luckily most problems are somewhat similar so what works in one setting might work in another so some experience is transferable.  This is why experience can be substituted for hard work in many cases.</p>
<p>Sometimes the experience just isn&#8217;t there (for either of the two reasons I mentioned), and in that case the PI isn&#8217;t pulling his own weight on the project.  A few general remarks to a draft and completely generic comments to solutions (&#8221;have you considered an ABC solution?&#8221;) are of little help.  In such cases, I think a postdoc would be right in thinking that the PI just gets the credit but doesn&#8217;t do the work.  Fortunately, it is not something that happens that often, and most PIs would recognise that they are not actually capable of helping with the problems and try to improve on it by getting to know the details a bit better.</p>
<h3>Life as  a PI</h3>
<p>Being in transit from postdoc to PI I&#8217;ve noticed a few things that I never really thought about before. New problems when you are no longer a postdoc in a group.  I&#8217;ve probably only noticed half of the new problems, since I have yet to experience running my own group.</p>
<p>Anyway, the most important difference, the way I see it, is related to the &#8220;big picture&#8221;/&#8221;details&#8221; thing.  I no longer have time to focus on just a single problem for days or weeks.  There is always a lot of different projects that requires my attention, and that means that I cannot put in the time really necessary to solve detailed problems. It is something I really miss, being able to completely focus on a problem for a couple of weeks, until I have it nailed.  If I want to do that now, I have to do it in my vacation.</p>
<p>Another thing is the time it takes to come up with those few suggestions to drafts and projects.  I would have guessed that you would spend half an hour before a meeting and that would be the time you would put into it when supervising a project.  Maybe some people can do that, but I find that to really think about it I need to spend <em>hours</em>.  Checking the literature, doing the math to check out if it works out, looking over the data with my own simple methods to get a feeling for it.  It is not enough to get the feeling for the details that is really needed &#8212; as I said, there simply isn&#8217;t the time for it any more &#8212; but it takes a lot longer than I would have thought.</p>
<p>I&#8217;ve already begun to think of the postdoc days as the good old days, but I fully remember being envious of the PIs at the time.  Over the coming years, if I get tenure, I will get more and more administrative and teaching tasks and I might end up missing the postdoc time even more.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mailund.dk/index.php/2008/05/09/postdoc-life-vs-pi-life/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.mailund.dk/index.php/2008/05/09/postdoc-life-vs-pi-life/</feedburner:origLink></item>
		<item>
		<title>Going home…</title>
		<link>http://feeds.feedburner.com/~r/MailundOnTheInternet/~3/286747546/</link>
		<comments>http://www.mailund.dk/index.php/2008/05/09/going-home/#comments</comments>
		<pubDate>Fri, 09 May 2008 11:04:44 +0000</pubDate>
		<dc:creator>Thomas Mailund</dc:creator>
		
		<category><![CDATA[Fun]]></category>

		<category><![CDATA[Work]]></category>

		<category><![CDATA[travel]]></category>

		<guid isPermaLink="false">http://www.mailund.dk/index.php/2008/05/09/going-home/</guid>
		<description><![CDATA[Despite Thomas Wolfe&#8217;s wild claim, you can go home, and today I am going home.  That is, I am leaving for home in the evening but it will be past midnight before I am actually home.
I&#8217;ve been in Oxford and London the last few days &#8212; last weekend in Oxford and this week in [...]]]></description>
			<content:encoded><![CDATA[<p>Despite Thomas Wolfe&#8217;s wild claim, you <em>can</em> go home, and today I am going home.  That is, I am leaving for home in the evening but it will be past midnight before I am actually home.</p>
<p>I&#8217;ve been in Oxford and London the last few days &#8212; last weekend in Oxford and this week in London &#8212; visiting Jotun Hein in Oxford and David Balding at Imperial College.</p>
<p>The Oxford visit was more social than work. Over the weekend, it couldn&#8217;t really be more, but it was nice meeting the people there (who Jotun had working weekends, of course).</p>
<p>The visit to Imperial was for making progress on a paper I am writing with David.  We didn&#8217;t make as much progress on it as I&#8217;d have liked &#8212; the time was simply too short &#8212; but we have a plan for how to progress from here, so that is fine.  Meeting the group here and chatting with people has been very stimulating.</p>
<p>I&#8217;ve never really been much for going to conferences.  Sometimes they inspire me to some work, but usually I am just bored through most of it.  I&#8217;m just not good at sitting quietly and listening to talks.  Visiting people and chat with them, on the other hand, I find very inspiring.  Plus, you really learn a lot from talking with people that they would never mention in a formal presentation.</p>
<p>Anyway, now I&#8217;m soon on my way home.  Teaching awaits me when I am back, so until the end of term I won&#8217;t be able to go travelling again. Over summer I probably will.  Most likely to DeCODE.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mailund.dk/index.php/2008/05/09/going-home/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.mailund.dk/index.php/2008/05/09/going-home/</feedburner:origLink></item>
		<item>
		<title>Combining alignment inference and phylogenetic footprinting</title>
		<link>http://feeds.feedburner.com/~r/MailundOnTheInternet/~3/285456880/</link>
		<comments>http://www.mailund.dk/index.php/2008/05/07/combining-alignment-inference-and-phylogenetic-footprinting/#comments</comments>
		<pubDate>Wed, 07 May 2008 16:01:38 +0000</pubDate>
		<dc:creator>Thomas Mailund</dc:creator>
		
		<category><![CDATA[Paper reviews]]></category>

		<guid isPermaLink="false">http://www.mailund.dk/index.php/2008/05/07/combining-alignment-inference-and-phylogenetic-footprinting/</guid>
		<description><![CDATA[
Since constructing an alignment is really a statistical inference problem, we shouldn&#8217;t pretend that we can infer it flawlessly and completely ignore the uncertainty in the inference.  Especially since doing so can bias the downstream inference.  See a list of previous posts where I discuss these problems to get an idea about the [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.researchblogging.org"><img src="http://www.researchblogging.org/images/rbicons/ResearchBlogging-Medium-White.png" alt="ResearchBlogging.org" align="left" height="50" width="80" /></a></p>
<p>Since constructing an alignment is really a statistical inference problem, we shouldn&#8217;t pretend that we can infer it flawlessly and completely ignore the uncertainty in the inference.  Especially since doing so can bias the downstream inference.  See a <a href="http://www.mailund.dk/index.php/tag/alignment/">list of previous posts where I discuss these problems</a> to get an idea about the problem.</p>
<p>At the very least, we should take into account the uncertainty associated with alignments.  In many cases, however, we are not really interested in the alignment in the first place.  It is a nuisance parameter in our model, necessary for extracting whatever information we are <em>really</em> interested in.</p>
<p>For example, if we want to annotate a sequence and want to use related sequences for this annotation, we are not really interested in alignments but in the annotation.  It is just that we need alignments to get the annotation.</p>
<p>Or do we?  Couldn&#8217;t we just integrate / sum over all alignments, with some suitable prior, and get rid of the nuisance parameter that way?</p>
<p>At least in theory, you can, and in many cases you can in practise as well.  I have, myself, a paper under review doing this, but some colleagues in Oxford beat me to it (despite them starting their project long after we had the results for our paper &#8230; we have been a bit slow on this thing):</p>
<blockquote><p><a href="http://bioinformatics.oxfordjournals.org/cgi/content/abstract/24/10/1236"><strong>Combining statistical alignment and phylogenetic footprinting to detect regulatory elements</strong></a><br />
Rahul Satija, Lior Pachter and Jotun Hein</p>
<p><font size="-1">Bioinformatics 2008 24(10):1236-1242; doi:10.1093/bioinformatics/btn104   </font></p>
<p align="center"><strong>Abstract</strong></p>
<p> <strong>Motivation:</strong> Traditional alignment-based phylogenetic footprinting<sup> </sup>approaches make predictions on the basis of a single assumed<sup> </sup>alignment. The predictions are therefore highly sensitive to<sup> </sup>alignment errors or regions of alignment uncertainty. Alternatively,<sup> </sup>statistical alignment methods provide a framework for performing<sup> </sup>phylogenetic analyses by examining a distribution of alignments.<sup> </sup></p>
<p><strong>Results:</strong> We developed a novel algorithm for predicting functional<sup> </sup>elements by combining statistical alignment and phylogenetic<sup> </sup>footprinting (SAPF). SAPF simultaneously performs both alignment<sup> </sup>and annotation by combining phylogenetic footprinting techniques<sup> </sup>with an hidden Markov model (HMM) transducer-based multiple<sup> </sup>alignment model, and can analyze sequence data from multiple<sup> </sup>sequences. We assessed SAPF&#8217;s predictive performance on two<sup> </sup>simulated datasets and three well-annotated <em>cis</em>-regulatory modules<sup> </sup>from newly sequenced <em>Drosophila</em> genomes. The results demonstrate<sup> </sup>that removing the traditional dependence on a single alignment<sup> </sup>can significantly augment the predictive performance, especially<sup> </sup>when there is uncertainty in the alignment of functional regions.</p></blockquote>
<p>The idea, quite simply, is this:  You can express the alignment problem statistically as a hidden Markov model (HMM), and you can express the footprinting problem as a hidden Markov model, and it is rather straight forward to combine two hidden Markov models into a combined model (at least as long as you do not worry too much about the probabilities and only consider the topology).  Running the combined HMM, you can look at the marginal states can consider them solutions to the two original problems.  Again, there are some problems with interpreting the marginal states as runs of the marginal HMMs because of the structure imposed on them by the joint HMM, but in most cases it doesn&#8217;t matter much.</p>
<p>In this paper, they happily just combine the alignment HMM with the footprinting HMM and thereby get footprinting annotations while summing away the alignments as a nuisance parameter.</p>
<p>In my opinion, this is the way to go.</p>
<p>There is one complication, though, and that is the running time.  The way the HMM is constructed here, the number of sequences in the data is a problem.  This is a general problem, regardless of whether you consider statistical or &#8220;plain old&#8221; alignment: the running time to get optimal alignments (or sum over them all) grows exponential in the number of sequences.</p>
<p>The way out of this is sampling methods, and there is an obvious Gibbs sampler solution.  Using the forward and backward algorithm from HMM theory, you can sample the alignment of two sequences with the correct probability.  So, in a multiple sequence alignment, you fix all but one sequence, you can use this trick to sample the conditional alignment of one sequence at a time.  That is all you need for a Gibbs sampler.</p>
<p>Unfortunately, the Gibbs approach is left for future work, but I guess we will see something soon &#8230;</p>
<hr /> <span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.aulast=Satija&amp;rft.aufirst=R&amp;rft.au=R+ Satija&amp;rft.au=L+Pachter&amp;rft.au=J+Hein&amp;rft.title=Bioinformatics&amp;rft.atitle=Combining+statistical+alignment+and+phylogenetic+footprinting+to+detect+regulatory+elements&amp;rft.date=2008&amp;rft.volume=24&amp;rft.issue=10&amp;rft.spage=1236&amp;rft.epage=1242&amp;rft.genre=article&amp;rft.id=info:DOI/10.1093%2Fbioinformatics%2Fbtn104"></span>Satija, R., Pachter, L., Hein, J. (2008). Combining statistical alignment and phylogenetic footprinting to detect regulatory elements. <span style="font-style: italic">Bioinformatics, 24</span>(10), 1236-1242. DOI: <a href="http://dx.doi.org/10.1093/bioinformatics/btn104" rev="review">10.1093/bioinformatics/btn104</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.mailund.dk/index.php/2008/05/07/combining-alignment-inference-and-phylogenetic-footprinting/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.mailund.dk/index.php/2008/05/07/combining-alignment-inference-and-phylogenetic-footprinting/</feedburner:origLink></item>
		<item>
		<title>New “research” web-pages at BiRC</title>
		<link>http://feeds.feedburner.com/~r/MailundOnTheInternet/~3/284615050/</link>
		<comments>http://www.mailund.dk/index.php/2008/05/06/new-research-web-pages-at-birc/#comments</comments>
		<pubDate>Tue, 06 May 2008 12:37:29 +0000</pubDate>
		<dc:creator>Thomas Mailund</dc:creator>
		
		<category><![CDATA[Rants]]></category>

		<category><![CDATA[Research]]></category>

		<category><![CDATA[Work]]></category>

		<category><![CDATA[BiRC]]></category>

		<category><![CDATA[Web]]></category>

		<guid isPermaLink="false">http://www.mailund.dk/index.php/2008/05/06/new-research-web-pages-at-birc/</guid>
		<description><![CDATA[Yesterday I updated the &#8220;Research&#8221; page at the BiRC website.  We have been wanting to update the description of our research for a while.  It is after all an important part of the website of a research institution, and ours was hopelessly outdated and a mess of small and large projects, some we [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday I updated the <a href="http://birc.au.dk/Research2/">&#8220;Research&#8221;</a> page at the BiRC website.  We have been wanting to update the description of our research for a while.  It <em>is</em> after all an important part of the website of a research institution, and ours was hopelessly outdated and a mess of small and large projects, some we were actively working on and some that were little more than areas we were interested in.</p>
<p>A year ago, we had a big meeting and discussed, all of BiRC, for an entire afternoon how to update the pages.  It really goes without saying that nothing could be decided in such a large group.  Instead, around Christmas, Storm, our director, called a meeting inviting those interested in updating our web-pages <em>and willing to put in the work to implement the changes</em>.  Requiring that people should back up their ideas with actual work reduced the group significantly.  We were only three, myself, Enette and Storm.</p>
<p>I got the task of coming up with a draft of the research pages and present it to the rest of BiRC.</p>
<h3>What is a research project and what goes on a research center&#8217;s homepage?</h3>
<p>The first thing I had to decide was what I wanted on the page.  The form of the content is usually easier to figure out once you know what the content <em>is</em>.</p>
<p>On a webpage describing the research going on at BiRC, I wanted it to be the actual research going on at BiRC.  This sounds obvious, but I have seen many research institutions listing their research interests with lists I find completely unbelievable.  You might be <em>interested</em> in a lot of different problems, but there is a limit to how many fields you can make an active contribution to.  I find it dishonest if you claim to be doing research in field where, really, all you have done is publish a paper related to the field five years ago, and you occasionally read a paper about the topic.  Too often, that is what I have noticed.</p>
<p>There is nothing wrong with focusing on a few research areas at a research institution.  Sure, it is fun to do a few projects outside your main research area from time to time, but if people read the web pages to find out what you are doing at the institution, then they should find the area where you are spending the majority of your time, not something you think about every second year.  If a post doc wants to come and work at BiRC, he should know what kind of work we <em>actually</em> do, not what kind of work we like to read about.</p>
<p>Anyway, this is turning into a bit of a rant, but that is the kind of thoughts I was having at the time.</p>
<p>So, I wanted the page to contain our <strong><em>active</em></strong> research areas, and I wanted to back up the &#8220;we are actively working on this stuff&#8221; claim we are implicitly making when describing research on our web pages.</p>
<p>How do you prove that you are active?  You put up the money or the work!  Ideally, you want to prove that you are doing research in an area, by showing that you are actively publishing in it.  If you do not publish, <em>are</em> you really doing research there?  If you are, is it any good?  I wanted each project to be able to show at least two-three papers a year to be considered active.  I made an exception if there was funding for a project, but it hadn&#8217;t really produced any results yet&#8230; I am still not sure this is a good idea, but my arm was twisted a bit to allow for this&#8230;</p>
<h3>Designing the web pages</h3>
<p>After these considerations, I started to design the pages.  I made an overview page, where people can scan through all the research areas at BiRC very quickly, and with links to sub-pages &#8212; one for each research area &#8212; that can contain more information.<br />
On the main page I wanted a short description of each research area and a list of the most recent papers published in it (at least three, is my initial choice, but more if there are more papers published within the last year).  This works as a sort of sanity check for whether we are describing an active research area or not, and also tells the world what kind of papers we publish in each area.</p>
<h3>Putting it together</h3>
<p>This design I presented to BiRC and no one objected.  Good.  Then I asked people to send me the projects (description and publication lists) they wanted on the page.  Deadline March 15, which would give me time to set up the sub-pages and such, and then the new page could be put on the on May 1.</p>
<p>Naturally, but March 16 I hadn&#8217;t received anything and by May 1 I had a few half-baked descriptions.  Deadlines are rarely taken that serious when they are internal deadlines like this.  I am pretty bad at meeting them myself, so I cannot really complain.</p>
<p>With a lot of threats I got the material I needed over the weekend &#8212; well, enough to make descriptions of the main research areas at BiRC &#8212; and I put the pages together.  I&#8217;m personally maintaining more of them than I had planned.  I really only wanted to be responsible for the <a href="http://birc.au.dk/Research2/Association_Mapping/">Association Mapping</a> page.  This is my own main area, and the one where I consider myself the local expert.  I ended up maintaining virtually <em>all</em> the pages where I am contributing <em>anything</em>.  I plan to push the responsibility to someone else when I get the chance, but for now this is how it is.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mailund.dk/index.php/2008/05/06/new-research-web-pages-at-birc/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.mailund.dk/index.php/2008/05/06/new-research-web-pages-at-birc/</feedburner:origLink></item>
		<item>
		<title>On the road again…</title>
		<link>http://feeds.feedburner.com/~r/MailundOnTheInternet/~3/284551729/</link>
		<comments>http://www.mailund.dk/index.php/2008/05/06/on-the-road-again/#comments</comments>
		<pubDate>Tue, 06 May 2008 10:33:41 +0000</pubDate>
		<dc:creator>Thomas Mailund</dc:creator>
		
		<category><![CDATA[Fun]]></category>

		<category><![CDATA[hapcluster]]></category>

		<category><![CDATA[travelling]]></category>

		<guid isPermaLink="false">http://www.mailund.dk/index.php/2008/05/06/on-the-road-again/</guid>
		<description><![CDATA[I&#8217;m in London now, working with David Balding on HapCluster.  We had a late dinner yesterday and I am slightly hung over and a bit lacking in sleep, which is not the optimal state of mind to be trying to solve a mixing problem in an MCMC&#8230;
]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m in London now, working with David Balding on HapCluster.  We had a late dinner yesterday and I am slightly hung over and a bit lacking in sleep, which is not the optimal state of mind to be trying to solve a mixing problem in an MCMC&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mailund.dk/index.php/2008/05/06/on-the-road-again/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.mailund.dk/index.php/2008/05/06/on-the-road-again/</feedburner:origLink></item>
		<item>
		<title>Ancestral effective population size</title>
		<link>http://feeds.feedburner.com/~r/MailundOnTheInternet/~3/282592726/</link>
		<comments>http://www.mailund.dk/index.php/2008/05/03/ancestral-effective-population-size/#comments</comments>
		<pubDate>Sat, 03 May 2008 06:59:52 +0000</pubDate>
		<dc:creator>Thomas Mailund</dc:creator>
		
		<category><![CDATA[Research]]></category>

		<category><![CDATA[ancestral effective population size]]></category>

		<category><![CDATA[effective population size]]></category>

		<category><![CDATA[genetics]]></category>

		<guid isPermaLink="false">http://www.mailund.dk/index.php/2008/05/03/ancestral-effective-population-size/</guid>
		<description><![CDATA[John Hawks &#8212; in a discussion of a potential bottleneck in the human ancestral population &#8212; gives a good description of &#8220;effective population size&#8221; and how you go about inferring ancestral effective population sizes.  Worth a read.
]]></description>
			<content:encoded><![CDATA[<p>John Hawks &#8212; <a href="http://johnhawks.net/weblog/reviews/genetics/mtdna_migrations/sub-saharan-africa-population-size-behar-2008.html">in a discussion of a potential bottleneck in the human ancestral population</a> &#8212; gives a good description of &#8220;effective population size&#8221; and how you go about inferring ancestral effective population sizes.  Worth a read.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mailund.dk/index.php/2008/05/03/ancestral-effective-population-size/feed/</wfw:commentRss>
		<feedburner:origLink>http://www.mailund.dk/index.php/2008/05/03/ancestral-effective-population-size/</feedburner:origLink></item>
	</channel>
</rss>
