<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Your bones got a little machine</title>
	
	<link>http://blog.pansapiens.com</link>
	<description />
	<lastBuildDate>Sun, 29 Mar 2009 07:06:02 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<creativeCommons:license>http://creativecommons.org/publicdomain/zero/1.0/</creativeCommons:license>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/YourBonesGotALittleMachine" type="application/rss+xml" /><item>
		<title>Occyd : tagging for locations</title>
		<link>http://feedproxy.google.com/~r/YourBonesGotALittleMachine/~3/L9_vKhsEwBk/</link>
		<comments>http://blog.pansapiens.com/2009/02/14/occyd-tagging-for-locations/#comments</comments>
		<pubDate>Sat, 14 Feb 2009 02:08:16 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[android]]></category>
		<category><![CDATA[code]]></category>
		<category><![CDATA[occyd]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[gae]]></category>
		<category><![CDATA[geotagging]]></category>
		<category><![CDATA[Google App Engine]]></category>

		<guid isPermaLink="false">http://blog.pansapiens.com/?p=109</guid>
		<description><![CDATA[
Those who have been watching may have noticed I quietly started developing an Android application in the last month or so. It&#8217;s still super-buggy and far from feature complete, but I thought it was time to announce it here (&#8221;release early, release often&#8221;). It&#8217;s not ready for real users yet, but developers may like to [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-113" title="Occyd Map View (search results)" src="http://blog.pansapiens.com/wp-content/uploads/2009/02/occyd_mapview.png" alt="Occyd Map View (search results)" width="320" height="480" /></p>
<p>Those who have been watching may have noticed I quietly started developing an Android application in the last month or so. <strong>It&#8217;s still super-buggy and far from feature complete</strong>, but I thought it was time to announce it here (&#8221;release early, release often&#8221;). It&#8217;s not ready for real users yet, but developers may like to take a little look.</p>
<p><span id="more-109"></span></p>
<p>Occyd (<a href="http://www.bartleby.com/61/12.html"><img src="http://www.bartleby.com/images/pronunciation/obreve.gif" alt="" align="absbottom" />-k <img src="http://www.bartleby.com/images/pronunciation/emacr.gif" alt="" align="absbottom" />d</a> <em>.. sounds like rockied or oggied</em>) is an application for tagging geolocations, aimed at GPS-enabled network-connected devices. It currently consists of an Android client, and a server backend running on Google App Engine. The (evolving) API is simple enough that it should be easy to write clients (or servers) for various platforms. The idea is to enable people to tag locations on the surface of the planet with a list of keywords, just like they can tag web pages with <a href="http://delicious.com/">delicious</a>. They should also be able to search for tagged locations, based on tag(s), on distance from their current location and recency of the post.</p>
<p><img class="alignright size-full wp-image-111" title="Occyd posting screen" src="http://blog.pansapiens.com/wp-content/uploads/2009/02/occyd_post.png" alt="Occyd posting screen" width="320" height="480" /></p>
<p>Here&#8217;s one possible elevator pitch (for a very long, slow elevator ride):</p>
<blockquote><p>&#8220;You are a member of a large bird watching club. Your members like to record where they have spotted various species, and use Occyd to share the locations at which they have sighted various birds. You are out in the park, when you spot the rare Orange Bellied Parrot. You pull out your Android phone, fire up the Occyd client which automatically knows your location via GPS, and tag that current location &#8216;orangebelliedparrot parrot birds&#8217;. You then decide to see if others have spotted parrots in the area. You search for &#8216;parrot&#8217; in the Occyd client; a map appears showing the locations of all the other sightings tagged &#8216;parrot&#8217; in your vacinity. You tweak the search settings to show only &#8216;parrot&#8217; sightings within 100 metres and 14 days &#8230; on the map you see that your friend <em>RobHill</em> spotted an Orange Bellied Parrot here last week &#8211; looks like the numbers of this population are recovering !&#8221;</p>
<p><img class="alignright size-full wp-image-112" title="Occyd searching screen" src="http://blog.pansapiens.com/wp-content/uploads/2009/02/occyd_search.png" alt="Occyd searching screen" width="320" height="480" /></p></blockquote>
<p>Ponder for a bit, and I&#8217;m sure you can think up at least a handful of other great uses (tagging good fishing spots, favorite cafes, or maybe even sightings of parking inspectors <img src='http://blog.pansapiens.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  ).</p>
<p>As with any new project, there are lots more ideas than time to implement them (and I have a day job that doesn&#8217;t involve Occyd &#8230;). The <a href="http://github.com/pansapiens/occyd-android/tree/master">Occyd Android client</a> and <a href="http://github.com/pansapiens/occyd-gae-server/tree/master">Occyd GAE server</a> source is currently available under the GPL v3 on GitHub, and I&#8217;m keeping all my documentation and notes on the <a href="http://wiki.github.com/pansapiens/occyd-android">Occyd Android client wiki</a> provided at GitHub. Watch this space &#8230;.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.pansapiens.com/2009/02/14/occyd-tagging-for-locations/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<creativeCommons:license>http://creativecommons.org/publicdomain/zero/1.0/</creativeCommons:license>
	<feedburner:origLink>http://blog.pansapiens.com/2009/02/14/occyd-tagging-for-locations/</feedburner:origLink></item>
		<item>
		<title>QR-code bookmarklets</title>
		<link>http://feedproxy.google.com/~r/YourBonesGotALittleMachine/~3/G-TBtapQJT0/</link>
		<comments>http://blog.pansapiens.com/2009/01/02/qr-code-bookmarklets/#comments</comments>
		<pubDate>Fri, 02 Jan 2009 01:09:48 +0000</pubDate>
		<dc:creator>perry</dc:creator>
				<category><![CDATA[code]]></category>
		<category><![CDATA[howto]]></category>
		<category><![CDATA[android]]></category>
		<category><![CDATA[qrcode]]></category>
		<category><![CDATA[tips]]></category>

		<guid isPermaLink="false">http://blog.pansapiens.com/?p=95</guid>
		<description><![CDATA[A quick post to share some bookmarklets I made.
I&#8217;ve found QR-code &#8220;2D barcodes&#8221; really handy when playing with my Android phone.

Sometimes, I have a web page open on my desktop PC, and I want to quickly load it in the Android Chrome browser to see what it looks like. Rather than re-typing it with my [...]]]></description>
			<content:encoded><![CDATA[<p><em>A quick post to share some bookmarklets I made.</em></p>
<p>I&#8217;ve found <a href="http://en.wikipedia.org/wiki/QRcode">QR-code</a> &#8220;2D barcodes&#8221; really handy when playing with my Android phone.</p>
<p><img class="alignright size-full wp-image-96" title="qrcode" src="http://blog.pansapiens.com/wp-content/uploads/2009/01/qrcode.png" alt="qrcode" width="245" height="245" /></p>
<p>Sometimes, I have a web page open on my desktop PC, and I want to quickly load it in the Android Chrome browser to see what it looks like. Rather than re-typing it with my thumbs, the Barcode Scanner application allows me to scan a QR-code from the screen of my computer, and if the decoded text contains a URL, open it in the Android browser.</p>
<p>These two bookmarklets turn the URL of the current page that is open in your browser into a scannable QR-code:</p>
<p><strong>Google Charts API based bookmarklet:</strong> Drag this link &#8211;&gt;<a href="javascript: location.href='http://chart.apis.google.com/chart?cht=qr&amp;chs=350x350&amp;chl='+escape(location.href);">Current URL to QR-code</a> to your bookmarks toolbar.</p>
<pre>The code is:
javascript:location.href='http://chart.apis.google.com/chart?cht=qr&amp;chs=350x350&amp;chl='+escape(location.href);</pre>
<p>Alternatively, I made a <a href="http://qrcode.kaywa.com">Kaywa QR-code generator</a> version. Drag this link &#8211;&gt;<a href="javascript: location.href='http://qrcode.kaywa.com/img.php?s=8&amp;d='+escape(location.href);">Current URL to QR-code</a> to your bookmarks toolbar.</p>
<pre>The code is:
javascript:location.href='http://qrcode.kaywa.com/img.php?s=8&amp;d='+escape(location.href);</pre>
<p>They both do the same thing, so you probably only want one. Only tested on Firefox.</p>
<p><em>(Update:  seemed Wordpress ate the javascript in the bookmarklet links &#8230; should be okay now)</em></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.pansapiens.com/2009/01/02/qr-code-bookmarklets/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<creativeCommons:license>http://creativecommons.org/publicdomain/zero/1.0/</creativeCommons:license>
	<feedburner:origLink>http://blog.pansapiens.com/2009/01/02/qr-code-bookmarklets/</feedburner:origLink></item>
		<item>
		<title>Delicious geohashes … mmmm … tagging *drool*</title>
		<link>http://feedproxy.google.com/~r/YourBonesGotALittleMachine/~3/IrpDF3V_XGQ/</link>
		<comments>http://blog.pansapiens.com/2008/12/29/delicious-geohashes-mmmm-tagging-drool/#comments</comments>
		<pubDate>Mon, 29 Dec 2008 06:21:53 +0000</pubDate>
		<dc:creator>perry</dc:creator>
				<category><![CDATA[ideas]]></category>
		<category><![CDATA[two-point-oh]]></category>
		<category><![CDATA[web2.0]]></category>
		<category><![CDATA[android]]></category>
		<category><![CDATA[delicious]]></category>
		<category><![CDATA[geohash]]></category>
		<category><![CDATA[gps]]></category>

		<guid isPermaLink="false">http://blog.pansapiens.com/?p=94</guid>
		<description><![CDATA[Since I got a new toy for Christmas, I&#8217;ve become interested in geolocation and the fun things you can do when you have an internet-connected GPS-enabled device in your pocket. I&#8217;m also a compulsive delicious tagger, so I quickly discovered the existing practice for geotagging delicious bookmarks.
Essentially, this seems to be: add the tag &#8216;geotagged&#8216;, [...]]]></description>
			<content:encoded><![CDATA[<p>Since I got a <a href="http://www.flickr.com/photos/pansapiens/3145097863/">new toy</a> for Christmas, I&#8217;ve become interested in geolocation and the fun things you can do when you have an internet-connected GPS-enabled device in your pocket. I&#8217;m also a compulsive delicious tagger, so I quickly discovered the <a href="http://en.wikipedia.org/wiki/Geotagging#Geotagging_in_tag-based_systems">existing practice for geotagging delicious bookmarks</a>.</p>
<p>Essentially, this seems to be: add the tag &#8216;<strong><a href="http://delicious.com/tag/geotagged">geotagged</a></strong>&#8216;, along with the tags &#8216;<strong>geo:lat=<em>X.xxx</em></strong>&#8216; and &#8216;<strong>geo:lon=<em>X.xxx</em></strong>&#8216;, where the <strong><em>X.xxx</em></strong>&#8217;s are the latitude and longtitude numbers that are likely to come straight out of your GPS, in decimal degrees (WGS84).</p>
<p>This is all very nice, but the problem with tags in this format is that there is no easy or efficient way to use them to retrieve all items tagged for a particular <em>locality</em>. Sure, if I&#8217;m standing right on top of the <a href="http://maps.google.com/maps?f=q&amp;hl=en&amp;geocode=&amp;q=eureka+tower&amp;sll=-37.773358,144.946055&amp;sspn=0.010194,0.014913&amp;ll=-37.821362,144.964213&amp;spn=0.010187,0.014913&amp;t=h&amp;z=16">Eureka Tower</a> at <em>-37.821362,144.964213</em>, I can search for tags <strong>geo:lat=-37.821362</strong> and <strong>geo:lon=144.964213</strong> to find all the geotagged links for that <em>exact</em> location, but what if I&#8217;m standing 50 metres across the street looking up at the tower and want to search for links near my current location ?<span id="more-94"></span></p>
<p>Enter the <a href="http://geohash.org/">geohash</a>, a hash function for geolocation coordinates invented by <a href="http://labix.org/">Gustavo Niemeyer</a> (not to be confused with the <a href="http://www.xkcd.com/426/">xkcd Spontaneous Adventure Generation algorithm</a> of the same name). Wikipedia gives a reasonable explanation of <a href="http://en.wikipedia.org/wiki/Geohash">how geohashes work</a> &#8230; essentially the latitude and longitude are encoded as strings like <em><a href="http://geohash.org/r1r0fdzdwg">r1r0fdzdwg</a></em>. Geohashes have the useful property of having arbitrary precision &#8230; geohashes with the same prefix represent locations in the same vicinity. This means that the location across the street from the Eureka tower, at geohash <em><a href="http://geohash.org/r1r0fdy7sm"><strong>r1r0fd</strong>y7sm</a></em>, shares the prefix <em>r1r0fd</em> with the geohash closest to the top of the Eureka Tower, at <em><a href="http://geohash.org/r1r0fdzdwg"><strong>r1r0fd</strong>zdwg</a></em>.</p>
<p>My proposal for delicious geotaggers is that in addition to the <strong>geo:lat</strong> and <strong>geo:lon</strong> tags, several truncated geo:hash tags should also be used. If I were to bookmark something related to the Eureka Tower, I may tag it:</p>
<pre><strong>geotagged
</strong><strong>geo:lat=-37.821362
geo:lon=144.964213
geo:hash=r1r0fdzdwg
geo:hash=</strong><strong><strong>r1r0fdz
</strong>geo:hash=</strong><strong><strong>r1r0f</strong></strong></pre>
<p>Then, anyone searching for the tag <a href="http://delicious.com/pansapiens/geo:hash=r1r0f"><strong>geo:hash=r1r0f</strong></a> will find every item within the area that this geohash covers &#8230; this would include not only the Eureka Tower, but the <a href="http://geohash.org/r1r0fe76n">Rialto Towers</a> too.</p>
<p>For each bookmarked item, the number of truncated geohashes used as tags roughly determine the distance ranges (ie <a href="http://mappinghacks.com/2008/05/29/geohash-implemented-in-python/">bounding boxes</a>) that can be searched. Exactly which truncations, or how many geohash tags to use, is an existing problem that I haven&#8217;t yet decided the best solution for; is it best to &#8216;overload&#8217; with every possible geohash truncation (eg include tags geo:hash=r1r0fdzdwg, geo:hash=r1r0fdzdwg, geo:hash=r1r0fdzdw, geo:hash=r1r0fdzd, geo:hash=r1r0fdz &#8230;etc&#8230; to geo:hash=r) ? This is probably overkill. A better approach would be to choose just a few key truncations that roughly correlate to a range of sensibly sized patches on the Earths surface, eg, bounding boxes with diagonal lengths of:</p>
<ul>
<li>geo:hash=r1r0fdzdwg <strong>~60 cm</strong> ['exact']</li>
<li>geo:hash=r1r0fdzd <strong>~20 m</strong></li>
<li>geo:hash=r1r0fdz <strong>~150 m</strong></li>
<li>geo:hash=r1r0fd <strong>~600 m</strong></li>
<li>geo:hash=r1r0f <strong>~4.8 km</strong></li>
<li>geo:hash=r1r0 <strong>~19.5 km</strong></li>
<li>geo:hash=r1r <strong>~150 km</strong></li>
</ul>
<p>These ranges map loosely to those deemed useful by Brightkite, which lets you search for events around you within 20 m, 200 m, 2 km, 4 km, 10 km, 50 km and 100 km. Maybe we only need a few of these. If only a few truncations were provided by the tagger, the user can always execute multiple searches, starting from the full geohash of their current location and truncating back, character by character, (effectively expanding the search radius) until they start to get hits. There may also be techniques whereby the last character(s) of the truncated hash can be incremented/decremented to search neighboring bounding boxes (eg for r1r0fd, also search for r1r0fc, r1r0fe tags), although I need to think about this a little more.</p>
<p>Of course, the best solution for more useful geotagging within delicious would be for delicious/Yahoo to explicitly support some style of geotagging and provide a geotag-aware search facility &#8230; but until that day, geohashes may well do the job well enough. Next step for me: write a proof of concept application that actually produces and makes use of these types of tags &#8230;.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.pansapiens.com/2008/12/29/delicious-geohashes-mmmm-tagging-drool/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<creativeCommons:license>http://creativecommons.org/publicdomain/zero/1.0/</creativeCommons:license>
	<feedburner:origLink>http://blog.pansapiens.com/2008/12/29/delicious-geohashes-mmmm-tagging-drool/</feedburner:origLink></item>
		<item>
		<title>Naming in molecular biology: get comfortable with meaninglessness !</title>
		<link>http://feedproxy.google.com/~r/YourBonesGotALittleMachine/~3/H_2s4LGTZik/</link>
		<comments>http://blog.pansapiens.com/2008/12/14/naming-in-molecular-biology-get-comfortable-with-meaninglessness/#comments</comments>
		<pubDate>Sun, 14 Dec 2008 01:13:45 +0000</pubDate>
		<dc:creator>perry</dc:creator>
				<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[molecular biology]]></category>
		<category><![CDATA[semantic]]></category>

		<guid isPermaLink="false">http://blog.pansapiens.com/?p=92</guid>
		<description><![CDATA[I noticed an interesting post over on BoingBoing: &#8220;Comfort with meaninglessness the key to good programmers&#8220;. It outlines some research by Dehnadi and Bornat on attributes that can predict aptitude in computer programming. They conclude that a &#8220;deep comfort with meaninglessness&#8221; is an important predictor of programming aptitude.
I think comfort with meaninglessness is an important [...]]]></description>
			<content:encoded><![CDATA[<p>I noticed an interesting post over on BoingBoing: &#8220;<a href="http://www.boingboing.net/2008/12/12/comfort-with-meaning.html">Comfort with meaninglessness the key to good programmers</a>&#8220;. It outlines some <a href="http://www.cs.mdx.ac.uk/research/PhDArea/saeed/">research by Dehnadi and Bornat</a> on attributes that can predict aptitude in computer programming. They conclude that a &#8220;deep comfort with meaninglessness&#8221; is an important predictor of programming aptitude.</p>
<p>I think comfort with meaninglessness is an important skill in studying biology (and probably other sciences too). Many times, during the description of a system, various acronyms are thrown about as labels for entities (or &#8216;actors&#8217;) in that system. An important skill of the scientist is being able to follow how all the actors in the system relate to each other, without necessarily knowing anything about the specific properties of those actors. There are lots of protein and gene names which often bear very little meaning relative to the biological entity that they label, and fixating on what &#8216;the name&#8217; means simply distracts from the true nature of the entity.</p>
<p><span id="more-92"></span></p>
<p><strong>Example:</strong> TPR proteins are a superfamily of protein fold, often involved in protein-protein interactions. I have sometimes been asked at poster presentations, or the occasional talk: &#8220;What does TPR stand for ?&#8221;. &#8220;TPR&#8221; is an acronym for &#8220;<em>t</em>etratrico<em>p</em>eptide <em>r</em>epeat&#8221; &#8230; you may be able to glean from that expansion that the protein fold is composed of repeat sequences 34 amino acids long &#8211; but that is only one small aspect of the family, and isn&#8217;t the important point. Yet many molecular biologists appear uncomfortable with an &#8220;undefined&#8221; acronym, and insist on having it expanded to reveal the full name. TPR is just a convenient label for the superfamily &#8230; it could equally have been called <em>GrratBlat</em> or <em>5450520A, </em>it would still be the same thing. The point is, you <em>shouldn&#8217;t have to ask</em> what TPR stands for. Sure, it&#8217;s a curiosity, and some protein names can be amusing (Sonic Hedgehog, or &#8220;Just Another Kinase&#8221; come to mind), it may also contain some meaning, but first and foremost it&#8217;s a label &#8211; something to link the entity to all the other descriptive information about it&#8217;s structure, function, localisation and regulation. Like many classes of protein, the original name was given at a time when little was actually known about the thing, and typically the meaning embedded in the name should be ignored lest it bias our interpretation about what that protein really does.</p>
<p><strong>Summary of opinion:</strong> Molecular biologists should become comfortable with the notion that a name is just a label &#8211; meaningless without the associated metadata.</p>
<p>All of this is probably second nature those who studied philosophy (or computer science, or linguistics) &#8230; I&#8217;m guessing it is an issue of <a href="http://en.wikipedia.org/wiki/Semantic">semantics</a>. I really should have taken some of those subjects back in my undergrad days &#8230; <img src='http://blog.pansapiens.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://blog.pansapiens.com/2008/12/14/naming-in-molecular-biology-get-comfortable-with-meaninglessness/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<creativeCommons:license>http://creativecommons.org/publicdomain/zero/1.0/</creativeCommons:license>
	<feedburner:origLink>http://blog.pansapiens.com/2008/12/14/naming-in-molecular-biology-get-comfortable-with-meaninglessness/</feedburner:origLink></item>
		<item>
		<title>That Science Blog Meme Thing Going Around</title>
		<link>http://feedproxy.google.com/~r/YourBonesGotALittleMachine/~3/SDXkJeIl3LY/</link>
		<comments>http://blog.pansapiens.com/2008/11/17/that-science-blog-meme-thing-going-around/#comments</comments>
		<pubDate>Sun, 16 Nov 2008 23:14:36 +0000</pubDate>
		<dc:creator>perry</dc:creator>
				<category><![CDATA[meta]]></category>
		<category><![CDATA[random]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[blogging]]></category>

		<guid isPermaLink="false">http://blog.pansapiens.com/?p=89</guid>
		<description><![CDATA[Something broke out of the picket-fence surrounding Nature Networks Blogs. I&#8217;ve never participated in a blog meme &#8230; but this one appealed to me simply because I liked the questions. I haven&#8217;t read anyone else&#8217;s answers yet, to avoid biasing my own.
1. What is your blog about?
I often ask myself this question. Being a scientist, [...]]]></description>
			<content:encoded><![CDATA[<p>Something broke out of the picket-fence surrounding Nature Networks Blogs. I&#8217;ve never participated in a blog meme &#8230; <a href="http://network.nature.com/groups/nnbloggername/forum/topics/3392">but this one</a> appealed to me simply because I liked the questions. I haven&#8217;t read anyone else&#8217;s answers yet, to avoid biasing my own.</p>
<p><strong>1. What is your blog about?</strong></p>
<p>I often ask myself this question. Being a scientist, I always feel like it <em>should</em> be about science, and the various biological systems I work with. I reality, it is about programming, bioinformatics, the web with the tiniest bit of structural biology thrown in. I used to blog about Linux related things occasionally, but I split that off into <a href="http://linuxblog.pansapiens.com/">another blog</a> (which gets 10 times more traffic <img src='http://blog.pansapiens.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> ).</p>
<p><span id="more-89"></span></p>
<p><strong>2. What will you never write about?</strong></p>
<p>I&#8217;d rather not rule anything out. I&#8217;ve avoided politics, largely because all my &#8220;blogging peers&#8221; also largely avoid it. Sometime I wonder why, but I think I&#8217;m beginning to understand; blogging about political issues may seem important, but eventually it just becomes a constant &#8216;<em>rah rah rah</em>&#8216;. That would get boring. So I&#8217;m saving it for when it <em>really</em> matters (the <a href="http://www.computerworld.com.au/index.php/id;355409327">proposed &#8216;filtering&#8217; of internet connections at the ISP level by the Australian government</a> is coming close. There, I did it <img src='http://blog.pansapiens.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  ).</p>
<p><strong>3. Have you ever considered leaving science?</strong></p>
<p>Yes. But I did cheat and read <a href="http://scienceblogs.com/clock/2008/11/the_science_blog_meme.php">Bora&#8217;s answer to this one</a> (by accident).<br />
I think it would be impossible to leave &#8217;science&#8217; unless I became permanently brain damaged or something. Leave research &#8230; sure, it crosses my mind from time to time.</p>
<p><strong>4. What would you do instead?</strong></p>
<p>I&#8217;d try my best to get a job in video game development, or web application development (or both &#8230; developing a web-based game !). I&#8217;d try to work my experience as a scientist into this endeavour. Not sure how successful I would be, but I&#8217;m trying to <a href="http://omgwtfgames.com/">slowly build up a portfolio</a>, just in case <img src='http://blog.pansapiens.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p><strong>5. What do you think will science blogging be like in 5 years?</strong></p>
<p>I think research groups will be tied together more and more by their blogs. A bit like the way <a href="http://rrresearch.blogspot.com/">RRRosie Redfield</a> runs things. Not necessarily blanket Open Notebook Science (give it 10 years), but certainly a greater level of open, public communication of ideas.</p>
<p><strong>6. What is the most extraordinary thing that happened to you because of blogging?</strong></p>
<p>It&#8217;s not that exciting, but it&#8217;s the best I&#8217;ve got; I was contacted by an <em>real</em> journalist with some questions about the role of cloud computing for academic research. I didn&#8217;t really feel qualified to comment a length, since it largely stemmed from a <a href="http://blog.pansapiens.com/2007/03/04/an-amazon-ec2-cluster-for-blast-searching/">&#8216;what if&#8217; type idea I blogged about</a>. But I gave my perspective anyhow.</p>
<p><strong>7. Did you write a blog post or comment you later regretted?</strong></p>
<p>Yes, quite often. I have a tendancy to always be in a hurry when commenting, coupled with a desire to not be overly self-censoring. I&#8217;m never trying to sound like a kook, troll or A-hole &#8230; it just turns out that way sometimes. Personally I find it takes a long time to write a concise, clear comment, beyond &#8220;LOL&#8221;. The FriendFeed comment box is always too small for me, but that&#8217;s a good thing, since it forces me to summarise.</p>
<p><strong>8. When did you first learn about science blogging?</strong></p>
<p>Not exactly sure, but I think it was via <a href="http://nodalpoint.org/">nodalpoint.org</a> the bioinformatics weblog. I wrote one or two posts for nodalpoint. Around the time I was writing my PhD thesis, a few netziens of nodalpoint started blogs (or I discovered that they had blogs outside of nodalpoint). The most influential for me were <a href="http://nsaunders.wordpress.com/">Neil Saunders&#8217; &#8220;What You&#8217;re Doing is Rather Desparate&#8221;</a>, and well as <a href="http://pbeltrao.blogspot.com/">Pedro Beltrao&#8217;s &#8220;Public Rambling&#8221;</a> and <a href="http://duncan.hull.name/">Duncan Hull&#8217;s &#8220;O&#8217;Really ?&#8221;</a>. I thought, &#8220;I can do that&#8221;, so I started a Blogger blog. One year later I moved to a self-hosted Wordpress blog. But lets not get side tracked with a boring meta-discussion about blogging software.</p>
<p><strong>9. What do your colleagues at work say about your blogging?</strong></p>
<p>Mostly, they are unaware. Occasionally I have dropped a comment about &#8220;my blog&#8221;, but no one has really jumped on it. Which is fine, since I don&#8217;t really blog about anything that most of them have any interest in.</p>
<p>Well, that&#8217;s it. Now to read everyone else&#8217;s responses &#8230;&#8230;</p>
<p>(Originally via <a href="http://scienceblogs.com/clock/2008/11/the_science_blog_meme.php">Blog Around the Clock [Bora Zivkovic]</a>, via <a href="http://friendfeed.com/e/8905af47-caf7-35c7-5396-ab2dab3e62e7/The-Science-Blog-Meme/">FriendFeed</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.pansapiens.com/2008/11/17/that-science-blog-meme-thing-going-around/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	<creativeCommons:license>http://creativecommons.org/publicdomain/zero/1.0/</creativeCommons:license>
	<feedburner:origLink>http://blog.pansapiens.com/2008/11/17/that-science-blog-meme-thing-going-around/</feedburner:origLink></item>
		<item>
		<title>Searching bioinformatic databases with YubNub</title>
		<link>http://feedproxy.google.com/~r/YourBonesGotALittleMachine/~3/pJKICa681nE/</link>
		<comments>http://blog.pansapiens.com/2008/11/12/searching-bioinformatic-databases-with-yubnub/#comments</comments>
		<pubDate>Wed, 12 Nov 2008 11:29:16 +0000</pubDate>
		<dc:creator>perry</dc:creator>
				<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[howto]]></category>
		<category><![CDATA[two-point-oh]]></category>
		<category><![CDATA[web2.0]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[yubnub]]></category>

		<guid isPermaLink="false">http://blog.pansapiens.com/?p=87</guid>
		<description><![CDATA[You may already be familiar with YubNub; it describes itself as &#8220;the social command line for the web&#8221;. Most commands consist of two (or more) words &#8230; one for the search engine, the other for the query.
For example, typing:
gg open science on friendfeed
into the YubNub search box searches Google for &#8220;open science on friendfeed&#8220;, via [...]]]></description>
			<content:encoded><![CDATA[<p>You may already be familiar with <a href="http://yubnub.org/">YubNub</a>; it describes itself as &#8220;the social command line for the web&#8221;. Most commands consist of two (or more) words &#8230; one for the search engine, the other for the query.</p>
<p>For example, typing:</p>
<blockquote><p><em><strong>gg open science on friendfeed</strong></em></p></blockquote>
<p>into the YubNub search box searches Google for &#8220;<em>open science on friendfeed</em>&#8220;, via YubNub.</p>
<p>I thought I&#8217;d highlight a few life science- and bioinformatics-related YubNub commands I find myself using quite often in my day-to-day work. Some are commands I created, others someone else created. This is the beauty of YubNub &#8230; often someone has already made the &#8216;obvious&#8217; command &#8230; it&#8217;s worth just trying to search with a command you expect to exist, since it often does.</p>
<p>Onward, with the list:</p>
<p><span id="more-87"></span></p>
<ul>
<li><a href="http://yubnub.org/kernel/man?args=pdb"><strong>pubmed</strong></a> &#8212; Searches PubMed</li>
<li><a href="http://yubnub.org/kernel/man?args=hubmed"><strong>hubmed</strong></a> &#8212; Searches <a href="http://www.hubmed.org/">HubMed</a> (Alf Eatons featureful alternative interface to PubMed)</li>
<li><a href="http://yubnub.org/kernel/man?args=gopubmed"><strong>gopubmed</strong></a> &#8212; Searches <a href="http://www.gopubmed.org/">GoPubMed</a> (an ontology enhanced PubMed search)</li>
<li><a href="http://yubnub.org/kernel/man?args=doi"><strong>doi</strong></a> &#8212; Redirects you based on a Digital Object Identifier (DOI), via <span class="muted">http://dx.doi.org/</span></li>
<li><a href="http://yubnub.org/kernel/man?args=pdb"><strong>pdb</strong></a> &#8212; Searches the Protein DataBank for 3D structures. Usually the search term should be a 4 letter pdb code.</li>
<li><a href="http://yubnub.org/kernel/man?args=uniprot"><strong>uniprot</strong></a> &#8212; Searches the Uniprot database (use an accession, id or keyword as the query).</li>
<li><a href="http://yubnub.org/kernel/man?args=ihop"><strong>ihop</strong></a> &#8212; Searches <a href="http://www.ihop-net.org">iHOP</a>, information Hyperlinked over Proteins, for views of the biomedical literature guided by gene networks. Nothing to do with <a href="http://www.google.com/search?q=ihop">pancakes (or prayer)</a>.</li>
</ul>
<p>There is also a class of more general, non-biomedical commands which I often use:</p>
<ul>
<li><a href="http://yubnub.org/kernel/man?args=gg"><strong>gg</strong></a> &#8212; The Google.</li>
<li><strong><a href="http://yubnub.org/kernel/man?args=gim">gim</a> &#8212; </strong>The Google Image Search.</li>
<li><a href="http://yubnub.org/kernel/man?args=wp"><strong>wp</strong></a> &#8212; Good ol&#8217; Wikipedia.</li>
<li><strong><a href="http://yubnub.org/kernel/man?args=ucc">ucc</a> </strong>&#8211; The universal currency converter at XE.com. Use it like <strong><em>ucc 399 aud usd</em></strong>, to convert $399 Australian dollars to US dollars. Then, if you have your cash in Australian dollars, weep about the recent drop in the exchange rate <img src='http://blog.pansapiens.com/wp-includes/images/smilies/icon_razz.gif' alt=':P' class='wp-smiley' /> </li>
<li><strong><a href="http://yubnub.org/kernel/man?args=man">man</a></strong> &#8212; Like *nix man &#8216;manual pages&#8217;, but for YubNub commands. Eg, <strong><em>man ucc</em></strong> will give the manual page describing how to used the <em>ucc</em> command.</li>
<li><strong><a href="http://yubnub.org/kernel/man?args=ls">ls</a></strong> &#8212; A bit like the *nix shell ls, this command lists existing YubNub commands that contain your query in their name, description or url. eg. searching <strong><em><a href="http://yubnub.org/kernel/ls?args=protein">ls protein</a></em></strong> gives you a short list of all the commands related to proteins.</li>
</ul>
<p>I&#8217;ve installed the <a href="http://mycroft.mozdev.org/search-engines.html?name=yubnub">YubNub opensearch plugin</a> so I can search directly from the search box (or location bar) in Firefox. Maybe one day <a href="https://wiki.mozilla.org/Labs/Ubiquity">Ubiquity</a> will fulfill this purpose, since in many way it is the natural progression of the YubNub idea. But for the moment YubNub is the fastest, most streamlined way I&#8217;ve found to quickly fire off a search when I need to hunt down a reference, protein sequence or 3D structure. Nothing like instant gratification <img src='http://blog.pansapiens.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://blog.pansapiens.com/2008/11/12/searching-bioinformatic-databases-with-yubnub/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<creativeCommons:license>http://creativecommons.org/publicdomain/zero/1.0/</creativeCommons:license>
	<feedburner:origLink>http://blog.pansapiens.com/2008/11/12/searching-bioinformatic-databases-with-yubnub/</feedburner:origLink></item>
		<item>
		<title>Synthetic Biology 4.0 : highlights and reflections</title>
		<link>http://feedproxy.google.com/~r/YourBonesGotALittleMachine/~3/TwKow2laR5Y/</link>
		<comments>http://blog.pansapiens.com/2008/10/30/synthetic-biology-40-highlights-and-reflections/#comments</comments>
		<pubDate>Thu, 30 Oct 2008 03:55:46 +0000</pubDate>
		<dc:creator>perry</dc:creator>
				<category><![CDATA[meetings]]></category>
		<category><![CDATA[synthetic biology]]></category>

		<guid isPermaLink="false">http://blog.pansapiens.com/?p=84</guid>
		<description><![CDATA[Update: The videos of the talks from Synthetic Biology 4.0 are here !
Around three weeks ago I attended the Synthetic Biology 4.0 meeting in Hong Kong, hosted by the Hong Kong University of Science and Technology. I&#8217;ve taken a little time to allow all the new and exciting ideas to sink in. I really enjoyed [...]]]></description>
			<content:encoded><![CDATA[<p><em><strong>Update:</strong> The <a href="http://www.youtube.com/user/BioBricksFoundation">videos of the talks from Synthetic Biology 4.0 are here</a> !</em></p>
<p>Around three weeks ago I attended the <a href="http://sb4.biobricks.org/">Synthetic Biology 4.0 meeting</a> in Hong Kong, hosted by the Hong Kong University of Science and Technology. I&#8217;ve taken a little time to allow all the new and exciting ideas to sink in. I really enjoyed the meeting, and while it was a little short it was an effective way to quickly sample the current developments in synthetic biology, as it stands.</p>
<p><span id="more-84"></span></p>
<h3>Some highlights</h3>
<p>Here are some summaries on a few highlight presentations. Due to parallel sessions, I couldn&#8217;t see every talk; luckily I&#8217;ll be able to catch up on those I missed once the videos appear online.</p>
<p><strong>Clyde Hutchison (J. Craig Venter Institute)</strong>, spoke about genome construction, specifically about rebuilding the minimal genome on <a href="http://en.wikipedia.org/wiki/Mycoplasma_genitalium"><em>Mycoplasma genitalium</em></a> by synthesis of large fragments and subsequent stepwise assembly. This is probably not brand new work, but it was the first time I&#8217;d seen it presented. Turns out they could fully assemble the genome into a yeast vector from 25 large fragments using a &#8217;shot-gun&#8217; approach with subsequent PCR screening to identify the correctly assembled construct.</p>
<p><strong>Christopher Voight (UCSF) (and a poster presentation by Travis Bayer <em>et al</em>): Bio-MeX: A Novel Route from Biomass to Gasoline and Petrochemicals. </strong>Chris spoke about the work his lab is doing on producing methyl halides in <em>E. coli</em> and yeast. Methyl halides are a common feedstock in the petrochemical industry for producing many other organic chemicals, so a non-fossil fuel source will be useful in the future. I&#8217;ve always been a little skeptical about ocean metagenomic studies that sell themselves as the solution to the worlds energy problems, based on the notion that an amazing new enzyme will be discovered in marine bacteria. Well, this work coming out of Chris&#8217; lab could prove me wrong &#8230; after screening 89 putative homologs of a methyl halide transferases from the various kingdoms of life (most annotated &#8216;methyl transferase&#8217; in sequence database), the best one turned out the be from an &#8220;uncultured marine bacteria&#8221;, discovered by the Sargasso Sea Sequencing Project.</p>
<p><strong>Eric Winfree (CalTech) </strong>spoke about his experiments with DNA tile-based crystals, as a potential model system for pre-biotic life. Crystal growth, breakage and regrowth models replication, the tiled &#8216;layers&#8217; of different crystal variants form the genome. Don&#8217;t be mislead and assume that Eric is proposing that DNA of this nature was actually around in the primordial soup &#8230; it&#8217;s simply being used as a well understood system enabling specific molecular complementarity and template based replication (with errors); in theory this type of system could be built using any &#8216;crystal&#8217; with similar properties.</p>
<p><strong>Jay Keasling (UC Berkley, LBNL): Fuel and Drug Production:</strong> Jay gave some facts and figures about biofuel production from the metabolically engineered organisms which are ultimately being commercialised at Amyris. It was noted in the opening slides of several talks at this conference: Ethanol is not considered a very good gasoline replacement (lower energy density, too hydroscopic for existing pipeline infrastructure, high octane rating) when compared with n-butanol (higher energy density, less hydroscopic, controlled volitility, similar octane rating to gasoline). Watch the video for some well handled but potentially hairy questions about the source of feedstocks for biofuels production. On the science-side, Jay showed some interesting results from studies testing the effect of scaffolds for co-localising enzymes in a biosynthetic pathway of interest (as fusion proteins to PDZ, SH3 etc domains). After screening a library of scaffolds that would arrange different numbers of enzymes in different orders, one specific arrangment (&#8221;A-B-B-C-C&#8221;) worked better than other variations. No rational explaination, but interesting nonetheless.</p>
<p><strong>Patrick Boyle <em>et al</em> from Pamela Silvers lab:Â  (poster presentation, </strong><em><strong>The Synthetic Hydrogenosome: Subcellular Engineering for Biohydrogen production</strong></em><strong>):</strong> showed how he was engineering yeast mitochondria for hydrogen production by introducing components of the pathway responsible from <a href="http://en.wikipedia.org/wiki/Hydrogenosome">hydrogenosomes</a>. This work caught my interest since for the past few years I&#8217;ve worked on aspects of protein import into mitochondria and related organelles (including a little bit of collaborative work on hydrogenosomes recently). Early versions of the engineered strain were producing small amounts of hydrogen compared with wildtype yeast &#8230; it will be interesting to see how far this can be optimized in the future (I don&#8217;t expect it to fuel a hydrogen economy anytime soon, but it&#8217;s early days).</p>
<p><strong>Priscilla Purnick (Yale)</strong> spoke about her work helping engineer embryonic stem cells with genetic circuits to precisely control proliferation, differentiation and cell death, with the ultimate goal of treating Type I Diabetes with self-regulating beta-like insulin secreting cells. I couldn&#8217;t do justice to the technical details in a short summary here, needless to say it&#8217;s complex and impressive looking work. This was a spin off project from the Princeton iGEM team using many &#8216;off the shelf&#8217; <a href="http://partsregistry.org/Main_Page">BioBricks</a>(tm). While still in the early stages, the talk showed a nice mix of systems biology style modeling and hard experimental data.</p>
<p>I sat in on an open discussion session, entitled <strong>Legal Schemes and Rights</strong>. The key point that surprised me here was how little had actually been figured out surrounding the &#8216;intellectual property&#8217; law of sharing, modifying, combining and (potentially) commercializing individual BioBrick[tm]-style parts, and devices built with such parts. I&#8217;d been under the delusion that the legal side was figured out early on, and that BioBrick parts were shared under some sort of MIT-like or BSD-like license. This is apparently not the case. Various possible legal frameworks were outlined, but I think the most insightful advice I heard was something to the effect of (paraphrased) .. <em>&#8220;Don&#8217;t sit around treating this like an interesting academic problem .. pretty soon (~12 months ?) community norms for licensing parts will emerge by necessity and under the force of commercial interests, not by careful or considered design, and if you haven&#8217;t worked to establish things in the way that is best for proliferation of the technology and all stakeholders, it will be too late to change course. We have seen this has happen in other industries, it will happen here too&#8221;</em>. Watch the videos once they appear if you want the correct quote, I can&#8217;t remember who said it now <img src='http://blog.pansapiens.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>I&#8217;ve half written a &#8220;part 2&#8243; to this post, outlining some ideas on the current state of Synthetic Biology as a field &#8230; but I&#8217;ll probably never publish it since it&#8217;s mostly just me organizing my slightly opinionated thoughts out loud. The summary is: there is currently a lot of excitement, potentially a little hype (see <a href="http://en.wikipedia.org/wiki/Technology_hype">Gartner&#8217;s hype cycle</a>, pick where you think Synthetic Biology currently sits on the curve). It will be interesting how this pans out, particularly if expectations have been unrealistically inflated. There has also been a bit of <a href="http://blog.openwetware.org/freegenes/2008/04/13/synthetic-biology-rant-link/">quibbling</a> over the <a href="http://blog-msb.embo.org/blog/2008/03/synthetic_biology_nsabb_and_cr.html#comment-1651">definition of Synthetic Biology</a>; those engineers that have forged the field are probably pretty bored of the old &#8216;definition debate&#8217; by now, but many biologists still don&#8217;t really know what Synthetic Biology is. Based on discussions I&#8217;ve had with colleagues about the conference content upon returning, the average molecular biologist doesn&#8217;t always see how Synthetic Biology is very different to what various bioscientists have been doing under the banner of Biotechnology for many years. My simplest explaination is that the underlying technologies are essentially the same, but the approach, the application of <em>formal engineering principles</em> and their use in a rational way to create useful &#8216;devices&#8217;, is different from past practices that have largely focused on discovering new knowledge about biological systems with many useful technologies arising in a more &#8216;undirected&#8217; and opportunistic fashion. It&#8217;s great to be living in a time when the foundational knowledge in biology, built on decades (centuries?) of basic research, are slowly beginning to come to fruition in rationally designed technologies.</p>
<p>For another perspective, and little more on the buzz outside of the formal presentations, see <a href="http://88proof.com/synthetic_biology/blog/archives/tag/sb40"><span class="entry-author author vcard"><span class="url fn n">Jonathan Cline</span></span>s &#8220;Word on the Street @ SB 4.0&#8243; posts</a>.<abbr class="published posted_date" title="2008-10-12T07:04:30-06:00" /></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.pansapiens.com/2008/10/30/synthetic-biology-40-highlights-and-reflections/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	<creativeCommons:license>http://creativecommons.org/publicdomain/zero/1.0/</creativeCommons:license>
	<feedburner:origLink>http://blog.pansapiens.com/2008/10/30/synthetic-biology-40-highlights-and-reflections/</feedburner:origLink></item>
		<item>
		<title>Count the number of sequences in a FASTA format file: a Unix shell snippet</title>
		<link>http://feedproxy.google.com/~r/YourBonesGotALittleMachine/~3/XttIOC7bqfg/</link>
		<comments>http://blog.pansapiens.com/2008/09/01/count-the-number-of-sequences-in-a-fasta-format-file-a-unix-shell-snippet/#comments</comments>
		<pubDate>Mon, 01 Sep 2008 05:49:38 +0000</pubDate>
		<dc:creator>perry</dc:creator>
				<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[linux]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[bash]]></category>
		<category><![CDATA[code]]></category>
		<category><![CDATA[scripts]]></category>
		<category><![CDATA[unix]]></category>

		<guid isPermaLink="false">http://blog.pansapiens.com/?p=82</guid>
		<description><![CDATA[Sometimes it&#8217;s nice to quickly check how many sequences are in a FASTA format sequence file.
It barely warrants it&#8217;s own blog post, but here we go anyhow: my one-liner shell script for counting the number of sequences in a FASTA &#8220;flat-file database&#8221;, based on the presence of the &#8220;&#62;&#8221; header symbol.
#!/bin/sh
# ~/bin/countseqs
# Counts the number [...]]]></description>
			<content:encoded><![CDATA[<p>Sometimes it&#8217;s nice to quickly check how many sequences are in a <a href="http://en.wikipedia.org/wiki/FASTA_format">FASTA format</a> sequence file.</p>
<p>It barely warrants it&#8217;s own blog post, but here we go anyhow: my one-liner shell script for counting the number of sequences in a FASTA &#8220;flat-file database&#8221;, based on the presence of the &#8220;&gt;&#8221; header symbol.</p>
<div class="dean_ch" style="white-space: wrap;"><span class="re3">#!/bin/sh</span><br />
<span class="re3"># ~/bin/countseqs</span><br />
<span class="re3"># Counts the number of sequences <span class="kw1">in</span> a FASTA format file</span><br />
<span class="kw2">grep</span> <span class="st0">&quot;&gt;&quot;</span> $<span class="nu0">1</span> | <span class="kw2">wc</span> -l</div>
<p>Dead easy huh ? I put this in <em>~/bin/countseqs,</em> make it executable (<em>chmod +x ~/bin/countseqs</em>) and use it in lots of situations, as a quick sanity check.</p>
<p><em>(oh, btw, this is not public domain and u can&#8217;t use it for commercial gain without paying me a license fee. academic users can fax me something for a free license. k thx bye).</em></p>
<p><span style="color: #000000;">Couldn&#8217;t help myself &#8230; everyone else is doing it <img src='http://blog.pansapiens.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </span></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.pansapiens.com/2008/09/01/count-the-number-of-sequences-in-a-fasta-format-file-a-unix-shell-snippet/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	<creativeCommons:license>http://creativecommons.org/publicdomain/zero/1.0/</creativeCommons:license>
	<feedburner:origLink>http://blog.pansapiens.com/2008/09/01/count-the-number-of-sequences-in-a-fasta-format-file-a-unix-shell-snippet/</feedburner:origLink></item>
		<item>
		<title>Setting up NCBI wwwblast on Ubuntu 8.04 (Hardy), Apache 2</title>
		<link>http://feedproxy.google.com/~r/YourBonesGotALittleMachine/~3/6WcRqYOJslE/</link>
		<comments>http://blog.pansapiens.com/2008/08/25/setting-up-wwwblast-on-ubuntu-apache/#comments</comments>
		<pubDate>Mon, 25 Aug 2008 07:28:35 +0000</pubDate>
		<dc:creator>perry</dc:creator>
				<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[linux]]></category>
		<category><![CDATA[apache]]></category>
		<category><![CDATA[blast]]></category>
		<category><![CDATA[howto]]></category>
		<category><![CDATA[tutorial]]></category>

		<guid isPermaLink="false">http://blog.pansapiens.com/?p=80</guid>
		<description><![CDATA[
Recently I needed to install NCBI wwwblast on my local workstation to enable some software that needed to interface with BLAST via the web service. It was straightforward to install, but I took some notes, because there were a few changes required with respect to the official wwwblast documentation at NCBI. These instructions are for [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.pansapiens.com/wp-content/uploads/2008/08/wwwblast.png" rel="lightbox[80]"><img class="alignright size-medium wp-image-81" title="wwwblast" src="http://blog.pansapiens.com/wp-content/uploads/2008/08/wwwblast-300x262.png" alt="" width="300" height="262" /></a></p>
<p>Recently I needed to install NCBI wwwblast on my local workstation to enable some software that needed to interface with BLAST via the web service. It was straightforward to install, but I took some notes, because there were a few changes required with respect to the <a href="http://www.ncbi.nlm.nih.gov/staff/tao/URLAPI/wwwblast/wwwblast.html">official wwwblast documentation at NCBI</a>. These instructions are for Ubuntu 8.04, but probably will work with many recent flavours of Debian.<br />
<span id="more-80"></span></p>
<h3>Download and install</h3>
<p>Download NCBI wwwblast from <a href="ftp://ftp.ncbi.nih.gov/blast/executables/LATEST/">ftp://ftp.ncbi.nih.gov/blast/executables/LATEST/</a> (The version <span class="file">I used was wwwblast-2.2.18-ia32-linux.tar.gz).</span></p>
<p>Untar it into /var/www/ , preserving permissions.</p>
<pre><strong>$</strong> cd /var/www/</pre>
<pre><strong>$</strong> sudo tar zxvpf <span class="file">wwwblast-2.2.18-ia32-linux.tar.gz
</span></pre>
<p><span class="file">You will also need to make sure <em>csh</em> (the &#8220;C-shell&#8221;) is installed, since the <em>blast.cgi</em> script needs this to run:</span></p>
<pre><span class="file"><strong>$</strong> sudo apt-get install csh
</span></pre>
<p><em><span class="file">(Thanks to </span><span class="commentauthor">jpopesku for spotting this missing package [comments below])</span></em></p>
<h3>Set up Apache2</h3>
<p><a href="http://www.ncbi.nlm.nih.gov/staff/tao/URLAPI/wwwblast/node10.html">The instructions in the official wwwblast manual</a> didn&#8217;t seem to work for Apache2 .. (could be a typo: <em>Follow SymLinks </em>should be <em>FollowSymLinks</em> &#8230; one camel-case word without the space). I put this in into the VirtualHost definition in the standard <em>/etc/apache2/sites-available/default</em> file used by Apache2 in Ubuntu 8.04.</p>
<div class="dean_ch" style="white-space: wrap;"> &nbsp; &nbsp;# modified slightly from:<br />
&nbsp; &nbsp; #http://www.ncbi.nlm.nih.gov/staff/tao/URLAPI/wwwblast/node10.html<br />
&nbsp; &nbsp; &lt;Directory &quot;/var/www&quot;&gt;<br />
&nbsp; &nbsp; &nbsp; &nbsp; AddHandler cgi-script cgi REAL<br />
&nbsp; &nbsp; &nbsp; &nbsp; Options Indexes FollowSymLinks MultiViews +ExecCGI<br />
&nbsp; &nbsp; &nbsp; &nbsp; Order allow,deny<br />
&nbsp; &nbsp; &nbsp; &nbsp; Allow from all<br />
&nbsp; &nbsp; &lt;/Directory&gt;</div>
<p>The code above was inserted just before the &lt;/VirtualHost&gt; closing tag.</p>
<p>Restart Apache for good measure.</p>
<pre><strong>$</strong> sudo /etc/init.d/apache2 restart</pre>
<h3>Configure your wwwblast to see your databases</h3>
<p>Assuming you already have some existing <a href="http://www.ncbi.nlm.nih.gov/staff/tao/URLAPI/blastdb.html">BLAST formatted databases made with <em>formatdb</em></a>, you will then need to configure wwwblast to find them, <a href=" http://www.ncbi.nlm.nih.gov/staff/tao/URLAPI/wwwblast/node12.html">as per the manual</a>. In short, this involves putting the BLAST database files into <em>/var/www/blast/db/ </em>(or symlinking to the files), modifying <em>/var/www/blast/blast.rc </em>(or one of the other *.rc files, for other BLAST programs, eg psiblast) to associate the database name with a BLAST program, and modifying the dropdown database list in <em>/var/www/blast/blast.html </em>(or one of the other *.html files for other BLAST programs) to add the name of the database(s).</p>
<p>My BLAST databases live in <em>/data/databases/blast/</em>, so I simply moved the example database directory <em>/var/www/blast/db </em>to <em>/var/www/blast/db.orig<strong>:</strong></em></p>
<pre><strong>$</strong> sudo mv  /var/www/blast/db /var/www/blast/db.orig</pre>
<p>then symlinked <em>/var/www/blast/db</em> to <em>/data/databases/blast</em>:</p>
<pre><strong>$</strong> sudo ln -s /data/databases/blast /var/www/blast/db</pre>
<p>In <em>/var/www/blast/blast.rc</em> I changed the line:</p>
<pre>blastp test_aa_db</pre>
<p>to:</p>
<pre>blastp nr swissprot pdbaa</pre>
<p><em>(you may also want to configure blastn etc in the same way)</em></p>
<p>In <em>/var/www/blast/blast.html</em>, I changed the test database code:</p>
<div class="dean_ch" style="white-space: wrap;">&lt;select name = &quot;DATALIB&quot;&gt;<br />
&nbsp; &nbsp; &lt;option VALUE = &quot;test_aa_db&quot;&gt; test_aa_db<br />
&nbsp; &nbsp; &lt;option VALUE = &quot;test_na_db&quot;&gt; test_na_db<br />
&lt;/select&gt;</div>
<p>to:</p>
<div class="dean_ch" style="white-space: wrap;">&lt;select name = &quot;DATALIB&quot;&gt;<br />
&nbsp; &nbsp; &lt;option VALUE = &quot;nr&quot;&gt; nr<br />
&nbsp; &nbsp; &lt;option VALUE = &quot;swissprot&quot;&gt; swissprot<br />
&nbsp; &nbsp; &lt;option VALUE = &quot;pdbaa&quot;&gt; pdbaa<br />
&lt;/select&gt;</div>
<p><em>(you may want to do this for other BLAST programs too, eg also edit psiblast.html)</em></p>
<p>This will allow be to search the <em>nr</em>, <em>swissprot</em> and <em>pdbaa</em> databases I have installed using plain-vanilla BLAST.</p>
<h3>Test it out</h3>
<p>Go to <a href="http://localhost/blast/blast.html" target="_blank">http://localhost/blast/blast.html</a>.</p>
<p>You should find the wwwblast interface, with a dropdown box featuring the database(s) you added. Run a test search with your favorite sequences (ensure you also select the correct blast program, eg <em>blastp</em> !). Enjoy using your own CPU time !</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.pansapiens.com/2008/08/25/setting-up-wwwblast-on-ubuntu-apache/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
	<creativeCommons:license>http://creativecommons.org/publicdomain/zero/1.0/</creativeCommons:license>
	<feedburner:origLink>http://blog.pansapiens.com/2008/08/25/setting-up-wwwblast-on-ubuntu-apache/</feedburner:origLink></item>
		<item>
		<title>A “Web2.0 for Scientists Week”: signup-fest ?</title>
		<link>http://feedproxy.google.com/~r/YourBonesGotALittleMachine/~3/CMJeKxRAhzU/</link>
		<comments>http://blog.pansapiens.com/2008/08/01/a-web20-for-scientists-week-signup-fest/#comments</comments>
		<pubDate>Fri, 01 Aug 2008 03:30:24 +0000</pubDate>
		<dc:creator>perry</dc:creator>
				<category><![CDATA[science]]></category>
		<category><![CDATA[web2.0]]></category>
		<category><![CDATA[ideas]]></category>

		<guid isPermaLink="false">http://blog.pansapiens.com/?p=77</guid>
		<description><![CDATA[So, here is my half-baked idea:Why not have a special week where new online collaboration tools are promoted to scientists globally ?Similar to the way local Linux user groups sometimes run &#8220;installfests&#8221; to install Linux for users, (just to get them over the initial interia), why not have a global &#8220;signup-fest&#8221; for the core set [...]]]></description>
			<content:encoded><![CDATA[<p>So, here is my half-baked idea:<br id="df2e" /><br id="df2e0" />Why not have a special week where new online collaboration tools are promoted to scientists globally ?<a href="http://blog.pansapiens.com/wp-content/uploads/2008/08/signupfest.png" rel="lightbox[77]"><img class="alignright alignnone size-medium wp-image-78" style="float: right;" title="Signupfest" src="http://blog.pansapiens.com/wp-content/uploads/2008/08/signupfest-300x137.png" alt="Silly logo Web2.0 for a Signupfest" width="300" height="137" /></a><br id="pq9z" /><br id="pq9z0" />Similar to the way local Linux user groups sometimes run &#8220;installfests&#8221; to install Linux for users, (just to get them over the initial interia), why not have a global &#8220;signup-fest&#8221; for the core set of online tools useful for scientific collaboration ? In each local area / University / Institute, a few Web2.0 savvy scientists could promote the event and hand-hold new users through the signup process, and help with basic usage and integration of tools. Ideally, lab groups could signup together in the same room, and make an event of it. Or in places with Wifi &#8230; a &#8220;bring your laptop&#8221; signup lunchtime event. I think you get the idea.<br id="ecrz" /><span id="more-77"></span><br id="ecrz0" />To create buzz, I expect there should be an impressive opening talk about why these new ways of communicating matter, and how they will help scientists do their jobs better. A listing of other places where &#8220;signup-fests&#8221; where happening during the talk could encourage scientists to get involved (peer pressure helps <img src='http://blog.pansapiens.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  ). In places where someone cannot speak, the local &#8220;evangelist&#8221; could host a screening of the online video in a lecture theatre, then host the Q&amp;A.<br id="y:.6" /><br id="l.ii" /> Working some prizes into the overall event, while a being little &#8220;carrot on stick&#8221;-y, could also help spark some initial interest and get people looking at flyers.<br id="l.ii0" /> <br id="y:.60" />The week could culminate with all the &#8220;signup-fest&#8221; scientists from around the world meeting each other: maybe in a &#8220;Web2.0 for Science Week&#8221; FriendFeed room. Maybe encouraging everyone to make and answer at least one &#8220;Dear LazyWeb&#8221; request would help scientists that are new to these tools to connect and collaborate.<br id="o-5d0" /> <br id="fabb" />What would be the &#8220;core set&#8221; of web applications ? This could be contentious; among others I like FriendFeed, Twitter, Google Reader, CiteULike, del.icio.us and RSS/ATOM feeds generally. Honestly I think that for a &#8220;signup-fest&#8221; chosing the best established services in each space that work well together is the way to go. Too many options will just confuse new users and negate the whole intention of the event. While I understand that this is brutal to the various competitors, the other options can be listed as footnotes &#8211; scientists that are keen will check them out anyway and decide for themselves.<br id="ecrz1" /><br id="fcz3" /> Finally: Is the movement toward global, online collaboration for scientists too early for something like this (ie Are the tools ready) ? Is &#8220;pushing the issue&#8221; slightly as opposed to relying on organic adoption a bad idea ?<br id="xtp2" /></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.pansapiens.com/2008/08/01/a-web20-for-scientists-week-signup-fest/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	<creativeCommons:license>http://creativecommons.org/publicdomain/zero/1.0/</creativeCommons:license>
	<feedburner:origLink>http://blog.pansapiens.com/2008/08/01/a-web20-for-scientists-week-signup-fest/</feedburner:origLink></item>
	</channel>
</rss>
