<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>hilarymason.com - Hilary Mason</title>
	
	<link>http://www.hilarymason.com</link>
	<description>I'm a computer science professor, data scientist, and web geek.</description>
	<lastBuildDate>Thu, 12 Nov 2009 15:11:17 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/3greeneggs" type="application/rss+xml" /><feedburner:browserFriendly>This is an XML content feed. It is intended to be viewed in a newsreader or syndicated to another site, subject to copyright and fair use.</feedburner:browserFriendly><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com" /><item>
		<title>My code is on TV (and so am I)!</title>
		<link>http://feedproxy.google.com/~r/3greeneggs/~3/jr64hHZ5EIM/</link>
		<comments>http://www.hilarymason.com/blog/my-code-is-on-tv-and-so-am-i/#comments</comments>
		<pubDate>Wed, 11 Nov 2009 00:43:28 +0000</pubDate>
		<dc:creator>Hilary Mason</dc:creator>
				<category><![CDATA[blog]]></category>
		<category><![CDATA[code]]></category>
		<category><![CDATA[media]]></category>
		<category><![CDATA[press]]></category>
		<category><![CDATA[television]]></category>

		<guid isPermaLink="false">http://www.hilarymason.com/?p=373</guid>
		<description><![CDATA[FoxNY did a piece featuring me and Diana as hackers who use our technical powers for good, not evil.

There are way too few female technologists on television, and I&#8217;m happy to do what I can to show that women kick ass with code! Look for my mischievous I&#8217;m-writing-infinite-nested-loops grin in the clip where I&#8217;m programming.
If [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.myfoxny.com/dpp/good_day_ny/091109-helpful-hackers">FoxNY</a> did a piece featuring me and <a href="http://www.dianaeng.com/">Diana</a> as hackers who use our technical powers for good, not evil.</p>
<p><object id="video" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="320" height="280" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="FlashVars" value="&amp;skin=MP1ExternalAll-MFL.swf&amp;embed=true&amp;adSrc=http%3A%2F%2Fad%2Edoubleclick%2Enet%2Fadx%2Ftsg%2Ewnyw%2Fwildcard%5F1%2Flanding%3Bdcmt%3Dtext%2Fxml%3Bpos%3D%3Btile%3D2%3Bfname%3Dgood%5Fday%3Bloc%3Dsite%3Bsz%3D320x240%3Bord%3D21998848898459156%3Frand%3D0%2E7780692178907891&amp;flv=%2Ffeeds%2FoutboundFeed%3FobfType%3DVIDEO%5FPLAYER%5FSMIL%5FFEED%26componentId%3D130967743&amp;img=http%3A%2F%2Fmedia2%2Emyfoxny%2Ecom%2F%2Fphoto%2F2009%2F11%2F09%2F091109hackers%5Ftmb0000%5F20091109092559%5F640%5F480%2EJPG&amp;story=http%3A%2F%2Fwww%2Emyfoxny%2Ecom%2Fdpp%2Fgood%5Fday%5Fny%2F091109%2Dhelpful%2Dhackers" /><param name="allowNetworking" value="all" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://www.myfoxny.com/video/videoplayer.swf" /><embed id="video" type="application/x-shockwave-flash" width="320" height="280" src="http://www.myfoxny.com/video/videoplayer.swf" allowscriptaccess="always" allownetworking="all" flashvars="&amp;skin=MP1ExternalAll-MFL.swf&amp;embed=true&amp;adSrc=http%3A%2F%2Fad%2Edoubleclick%2Enet%2Fadx%2Ftsg%2Ewnyw%2Fwildcard%5F1%2Flanding%3Bdcmt%3Dtext%2Fxml%3Bpos%3D%3Btile%3D2%3Bfname%3Dgood%5Fday%3Bloc%3Dsite%3Bsz%3D320x240%3Bord%3D21998848898459156%3Frand%3D0%2E7780692178907891&amp;flv=%2Ffeeds%2FoutboundFeed%3FobfType%3DVIDEO%5FPLAYER%5FSMIL%5FFEED%26componentId%3D130967743&amp;img=http%3A%2F%2Fmedia2%2Emyfoxny%2Ecom%2F%2Fphoto%2F2009%2F11%2F09%2F091109hackers%5Ftmb0000%5F20091109092559%5F640%5F480%2EJPG&amp;story=http%3A%2F%2Fwww%2Emyfoxny%2Ecom%2Fdpp%2Fgood%5Fday%5Fny%2F091109%2Dhelpful%2Dhackers"></embed></object></p>
<p>There are way too few female technologists on television, and I&#8217;m happy to do what I can to show that women kick ass with code! Look for my mischievous I&#8217;m-writing-infinite-nested-loops grin in the clip where I&#8217;m programming.</p>
<p>If this looks like fun to you, come join us at <a href="http://www.nycresistor.com">NYC Resistor</a> (where the segment was filmed!) for Thursday night craft nights or for one of <a href="http://www.eventbrite.com/org/52408308?s=1406470">many awesome classes</a>.</p>
<img src="http://feeds.feedburner.com/~r/3greeneggs/~4/jr64hHZ5EIM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.hilarymason.com/blog/my-code-is-on-tv-and-so-am-i/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		<feedburner:origLink>http://www.hilarymason.com/blog/my-code-is-on-tv-and-so-am-i/</feedburner:origLink></item>
		<item>
		<title>Yahoo OpenHackNYC: The Del.icio.us Cake</title>
		<link>http://feedproxy.google.com/~r/3greeneggs/~3/8dXP2Qwnsr0/</link>
		<comments>http://www.hilarymason.com/blog/yahoo-openhacknyc-the-del-icio-us-cake/#comments</comments>
		<pubDate>Sun, 18 Oct 2009 01:43:17 +0000</pubDate>
		<dc:creator>Hilary Mason</dc:creator>
				<category><![CDATA[blog]]></category>
		<category><![CDATA[api]]></category>
		<category><![CDATA[cake]]></category>
		<category><![CDATA[conference]]></category>
		<category><![CDATA[delicious]]></category>
		<category><![CDATA[hack]]></category>
		<category><![CDATA[openhacknyc]]></category>

		<guid isPermaLink="false">http://www.hilarymason.com/?p=344</guid>
		<description><![CDATA[
Last weekend Yahoo came to New York for an Open Hack Day, and it was great!
I was invited to speak on a panel on semantic metadata, moderated by Paul Ford (harpers.org) along with Marco Neumann (KONA) and Paul Tarjan (Yahoo/Search Monkey). The panel was a lively discussion, and we got some great questions from the [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.flickr.com/photos/yodelanecdotal/3998619064/"><img class="alignnone" title="delicious cake" src="http://farm3.static.flickr.com/2441/3998619064_ea47758bda.jpg" alt="" width="360" height="240" /></a></p>
<p>Last weekend <a href="http://developer.yahoo.com">Yahoo</a> came to New York for an <a href="http://developer.yahoo.net/blogs/theater/archives/2009/09/open_hack_day_new_york.html">Open Hack Day</a>, and it was great!</p>
<p>I was invited to speak on a panel on <em>semantic metadata</em>, moderated by Paul Ford (harpers.org), alongside Marco Neumann (KONA) and Paul Tarjan (Yahoo/Search Monkey). The panel was a lively discussion, and we got some great questions from the audience.</p>
<p>After the panel, I stayed around to participate in the hack competition. Yahoo! provided a fantastic space, with free-flowing coffee, snacks, comfy chairs and plenty of Yahoo folks and other hackers around to give advice and play foosball with. I teamed up with <a href="http://www.dianaeng.com/">Diana Eng</a>, <a href="http://aliciagibb.com/">Alicia Gibb</a>, and <a href="http://wwward.typepad.com/blog/">Bill Ward</a> to create the Del.icio.us Cake!</p>
<p>The cake is attached to a laptop via USB. A program running on the laptop accepts a delicious tag and retrieves a list of recent popular sites for that tag from the delicious API. It then iterates through the URLs, downloads each page, and computes the sentiment of that page relative to the tag: basically, is the content of the page positive, neutral or negative?</p>
<p><a href="http://www.flickr.com/photos/rlerdorf/3998611469/"><img class="alignnone" title="Presenting the Delicious Cake" src="http://farm3.static.flickr.com/2477/3998611469_71b6985fe3.jpg" alt="" width="400" height="266" /></a></p>
<p>The signal is output to an <a href="http://www.arduino.cc/">Arduino</a> (hidden in the middle of the cake), which turns on the appropriate set of LEDs. There are four sets of LEDs on the cake, one in each quadrant of the delicious logo: one each for positive sentiment, neutral or inconclusive sentiment, and negative sentiment, and, of course, one to let us know that the cake is turned on.</p>
<p>I wrote the sentiment classifiers between about 3am and 6am Saturday morning, so they really were a hack! I trained them on movie review data, working with the assumption that 5-star reviews contain positive terms and 1-star reviews contain negative terms. I wouldn&#8217;t recommend this approach for a serious attempt at sentiment analysis, but it worked well enough.</p>
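<p>For the curious, here is a minimal sketch of that kind of classifier in Python. It is not the code we ran on the cake: the function names, the tiny made-up training set, and the <code>margin</code> threshold are all assumptions for illustration. It counts words in 5-star and 1-star reviews, then scores new text by summing smoothed log-odds per word.</p>

```python
import math
import re
from collections import Counter

# Made-up reviews for illustration only; a real run would need far more data.
SAMPLE_REVIEWS = [
    (5, "a wonderful, delightful film with great acting"),
    (1, "a terrible, boring mess with awful acting"),
    (3, "an average film"),
]

def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())

def train(reviews):
    """Count word occurrences per class from (stars, text) pairs.

    Following the assumption in the post, 5-star reviews count as
    positive and 1-star reviews as negative; other ratings are skipped.
    """
    counts = {"pos": Counter(), "neg": Counter()}
    for stars, text in reviews:
        if stars == 5:
            counts["pos"].update(tokenize(text))
        elif stars == 1:
            counts["neg"].update(tokenize(text))
    return counts

def score(counts, text):
    """Sum smoothed log-odds per word; above zero leans positive."""
    pos, neg = counts["pos"], counts["neg"]
    vocab = len(set(pos) | set(neg))
    pos_total = sum(pos.values()) + vocab
    neg_total = sum(neg.values()) + vocab
    total = 0.0
    for word in tokenize(text):
        # Add-one smoothing keeps unseen words from producing log(0).
        p = (pos[word] + 1) / pos_total
        n = (neg[word] + 1) / neg_total
        total += math.log(p / n)
    return total

def classify(counts, text, margin=0.5):
    """Map the score to one of the three labels shown on the cake."""
    s = score(counts, text)
    if s > margin:
        return "positive"
    if -s > margin:
        return "negative"
    return "neutral"
```

<p>Calling <code>classify(train(SAMPLE_REVIEWS), page_text)</code> returns one of the three labels; anything that scores within the margin of zero is treated as neutral or inconclusive.</p>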
<p>We won the food/hardware hack prize, shared with the <a href="http://makerbot.com/">awesome MakerBot team</a>!</p>
<p>We had a great time creating and presenting the hack. Thanks, Yahoo, and most of all, thanks to Alicia, Bill, and Diana for a really fantastic, silly weekend.</p>
<p><a href="http://www.flickr.com/photos/littlegreenfroggy/3998757320/"><img class="alignnone" title="Team Delicious Cake" src="http://farm4.static.flickr.com/3510/3998757320_4fb19c70cd.jpg" alt="" width="400" height="300" /></a></p>
<p>Further coverage:</p>
<ul>
<li>Yahoo&#8217;s <a href="http://ycorpblog.com/2009/10/12/new-york-has-been-hacked/">summary of the Open Hack NYC event</a></li>
<li>Diana&#8217;s <a href="http://eyebeam.org/feeds/uncategorized/yahoo-open-hackday-nyc">writeup for Eyebeam</a></li>
<li>CNN.com: <a href="http://us.cnn.com/video/?/video/tech/2009/10/22/morris.yahoo.hacker.contest.cnn">Hackers Take Over Times Square</a></li>
</ul>
<img src="http://feeds.feedburner.com/~r/3greeneggs/~4/8dXP2Qwnsr0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.hilarymason.com/blog/yahoo-openhacknyc-the-del-icio-us-cake/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		<feedburner:origLink>http://www.hilarymason.com/blog/yahoo-openhacknyc-the-del-icio-us-cake/</feedburner:origLink></item>
		<item>
		<title>Data: first and last names from the US Census</title>
		<link>http://feedproxy.google.com/~r/3greeneggs/~3/GJqfqPGo1yU/</link>
		<comments>http://www.hilarymason.com/blog/data-first-and-last-names-from-the-us-census/#comments</comments>
		<pubDate>Fri, 16 Oct 2009 18:26:21 +0000</pubDate>
		<dc:creator>Hilary Mason</dc:creator>
				<category><![CDATA[blog]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[dataset]]></category>
		<category><![CDATA[mysql]]></category>
		<category><![CDATA[sql]]></category>

		<guid isPermaLink="false">http://www.hilarymason.com/?p=352</guid>
		<description><![CDATA[I&#8217;ve found myself in need of a name distribution for a few projects recently, so I thought I would post it here so I won&#8217;t have to go looking for it again.
The data is available from the US Census Bureau (from the 1990 census) here, and I have it here in a friendly MySQL *.sql format [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve found myself in need of a name distribution for a few projects recently, so I thought I would post it here so I won&#8217;t have to go looking for it again.</p>
<p>The data is available from the US Census Bureau (from the 1990 census) <a href="http://www.census.gov/genealogy/names/">here</a>, and I have it here in a friendly MySQL *.sql format (it will create the tables and insert the data). There are three tables: male first names, female first names, and surnames.</p>
<p>I&#8217;ve noted several issues in the data that are likely the result of typos, so make sure to do your own validation if your application requires it.</p>
<p>The format is simple:</p>
<ol>
<li>the name</li>
<li>frequency (percentage of people in the sampled population with that name)</li>
<li>cumulative frequency (as you read down the list, the percentage of total population covered)</li>
<li>rank</li>
</ol>
<p>If you want to use this to generate a random name, you can do so very easily with a query like this:</p>

<div class="wp_syntax"><div class="code"><pre class="sql" style="font-family:monospace;"><span style="color: #993333; font-weight: bold;">SELECT</span> name <span style="color: #993333; font-weight: bold;">FROM</span> ref_census_surnames n <span style="color: #993333; font-weight: bold;">ORDER</span> <span style="color: #993333; font-weight: bold;">BY</span> <span style="color: #66cc66;">&#40;</span>RAND<span style="color: #66cc66;">&#40;</span><span style="color: #66cc66;">&#41;</span> <span style="color: #66cc66;">*</span> <span style="color: #66cc66;">&#40;</span>n<span style="color: #66cc66;">.</span>freq <span style="color: #66cc66;">+</span> <span style="color: #66cc66;">.</span>01<span style="color: #66cc66;">&#41;</span><span style="color: #66cc66;">&#41;</span> <span style="color: #993333; font-weight: bold;">LIMIT</span> <span style="color: #cc66cc;">0</span><span style="color: #66cc66;">,</span><span style="color: #cc66cc;">1</span>;</pre></div></div>
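<p>If you would rather sample in application code, the cumulative frequency column makes a frequency-weighted draw straightforward. Here is a minimal Python sketch; the rows and the <code>pick_name</code> helper are made up for illustration, and it assumes you have already loaded the table rows in cumulative-frequency order.</p>

```python
import bisect
import random

# Made-up rows in the (name, frequency, cumulative frequency, rank) layout
# described above; these are illustrative values, not the Census figures.
SAMPLE_ROWS = [
    ("ALPHA", 50.0, 50.0, 1),
    ("BRAVO", 30.0, 80.0, 2),
    ("CHARLIE", 20.0, 100.0, 3),
]

def pick_name(rows, rng=random):
    """Return a name with probability proportional to its frequency."""
    cumulative = [row[2] for row in rows]
    # The last cumulative value is the total covered by the list, which
    # need not be exactly 100, so scale the random draw to it.
    point = rng.random() * cumulative[-1]
    # Find the first row whose cumulative frequency exceeds the point.
    index = bisect.bisect_right(cumulative, point)
    # Guard against a floating-point draw landing exactly on the total.
    return rows[min(index, len(rows) - 1)][0]
```

<p>Passing a seeded <code>random.Random</code> instance as <code>rng</code> makes the draws repeatable, which is handy for tests.</p>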

<p>Download it here: <a href="http://www.hilarymason.com/wp-content/uploads/2009/10/census_names.tar.gz">census_names.tar.gz</a></p>
<img src="http://feeds.feedburner.com/~r/3greeneggs/~4/GJqfqPGo1yU" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.hilarymason.com/blog/data-first-and-last-names-from-the-us-census/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.hilarymason.com/blog/data-first-and-last-names-from-the-us-census/</feedburner:origLink></item>
		<item>
		<title>Hadoop World NYC</title>
		<link>http://feedproxy.google.com/~r/3greeneggs/~3/uowO7nweCp0/</link>
		<comments>http://www.hilarymason.com/blog/hadoop-world-nyc/#comments</comments>
		<pubDate>Sat, 03 Oct 2009 17:33:45 +0000</pubDate>
		<dc:creator>Hilary Mason</dc:creator>
				<category><![CDATA[blog]]></category>
		<category><![CDATA[conference]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[hadoop]]></category>

		<guid isPermaLink="false">http://www.hilarymason.com/?p=327</guid>
		<description><![CDATA[Yesterday, I attended the first Hadoop World NYC conference. Hadoop is a platform for scalable distributed computing. In essence, it makes analyzing large quantities of data much faster, and analyzing very large quantities of data possible.
Cloudera did a great job organizing the conference, and managed to assemble a diverse set of speakers. The sessions covered [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday, I attended the first <a href="http://www.cloudera.com/hadoop-world-nyc">Hadoop World NYC</a> conference. <a href="http://hadoop.apache.org/">Hadoop</a> is a platform for scalable distributed computing. In essence, it makes analyzing large quantities of data much faster, and analyzing very large quantities of data possible.</p>
<p><a href="http://www.cloudera.com/">Cloudera</a> did a great job organizing the conference, and managed to assemble a diverse set of speakers. The sessions covered everything from academic research to fraud detection to bioinformatics and even helping people fall in love (eHarmony uses Hadoop)!</p>
<p>I&#8217;m not going to review every session, but I saw several themes emerging from the content and conversations.</p>
<h3><span style="color: #000000;">Hadoop is Getting Easier</span></h3>
<p>New integrated UIs like <a href="http://www.cloudera.com/desktop">Cloudera Desktop</a> and <a href="http://www.karmasphere.com/">Karmasphere</a> mean that developers will no longer be required to use a command-line interface to configure and execute Hadoop jobs. <a href="http://www-01.ibm.com/software/ebusiness/jstart/m2/">IBM&#8217;s M2</a> project hides Hadoop behind a spreadsheet metaphor, making the collection, analysis and visualization of data as easy as using Excel.</p>
<p>This doesn&#8217;t just speed up development; it puts the tools for manipulating the data directly in the hands of the people who need the results, without requiring them to talk to a database programmer.</p>
<h3><span style="color: #000000;">Hadoop is a Utility</span></h3>
<p>The only organizations that talked about building their own Hadoop clusters were those that deal with very sensitive data (VISA) and those that deal with very, very large quantities of data (Yahoo, Facebook, eBay). Organizations with more manageable data sets, such as eHarmony and the New York Times, use EC2 and Amazon&#8217;s Elastic Map-Reduce. <a href="http://aws.amazon.com/elasticmapreduce/">Amazon</a>, <a href="http://www.rackspacecloud.com/">Rackspace</a>, and <a href="http://www.softlayer.com/">Softlayer</a> have offerings in this area and were all event sponsors.</p>
<p>Yes, you can turn on a cluster of nodes from your living room in your PJs!</p>
<h3><span style="color: #000000;">Hadoop Can Talk to Your Existing Systems</span></h3>
<p>Hadoop has an ecosystem of supporting products that allow organizations to adapt their existing infrastructure. <a href="http://www.cloudera.com/blog/2009/06/01/introducing-sqoop/">Cloudera&#8217;s Sqoop</a> (which is just fun to say out loud) is a tool for importing data from SQL databases, <a href="http://hadoop.apache.org/hbase/">HBase</a> is a Hadoop database, and <a href="http://hadoop.apache.org/pig/">Pig</a> lets you talk to the system in a SQL-like language.</p>
<p>I expect we&#8217;ll see more information available in the near future to clarify which systems are more appropriate for which kinds of users (an ecosystem decision tree?).</p>
<h3><span style="color: #000000;">Hadoop is Changing Things</span></h3>
<p>I heard the phrase &#8220;an order of magnitude improvement in speed&#8221; so many times that I lost count. Speaking from personal experience, the difference in productivity between waiting minutes or hours for results and waiting days is immense. When you can see the answer to a question shortly after you ask it, you preserve the context you need to <em>act on that answer immediately</em>, without having to spend time figuring out why you were asking that question in the first place.</p>
<p>Most of the projects were doing fairly simple analysis over data like web user sessions or transactions. I was intrigued by Deepak Singh&#8217;s talk on bioinformatics and genome sequencing (<a href="http://bit.ly/3gXMOn">slides</a>) and <a href="http://www.jakehofman.com/">Jake Hofman</a>&#8217;s talk on social network analysis (<a href="http://bit.ly/hadoopworldjmh">slides</a>). More and more massive datasets are becoming available and will drive techniques for new analysis. I do wish there had been a talk about <a href="http://lucene.apache.org/mahout/">Mahout</a>, which is a very promising approach to developing machine learning algorithms on the Hadoop platform.</p>
<p>I left the event more excited about the technology and very enthusiastic about the community. Thanks for a great day!</p>
<p><strong>Update</strong>: A few other people have written up their notes and impressions from the event:</p>
<ul>
<li>Stephen O&#8217;Grady posted <a href="http://redmonk.com/sogrady/2009/10/02/hadoopworld/">The View from HadoopWorld</a></li>
<li>Deepak Singh&#8217;s <a href="http://mndoci.com/2009/10/03/post-hadoop-world-thoughts/">Post-HadoopWorld Thoughts</a></li>
<li>HubSpot Dev Blog has two write-ups, by <a href="http://dev.hubspot.com/bid/27047/Hadoop-World-NYC-2009?source=BlogTwitter_[Hadoop+World,+NYC+20]">Dan</a> and <a href="http://dev.hubspot.com/bid/27054/Hadoop-World-impressions">Steve</a></li>
<li>Atbrox has notes from the <a href="http://atbrox.com/2009/10/02/hadoop-world-2009-some-notes-from-morning-session/">morning session</a> and the <a href="http://atbrox.com/2009/10/03/hadoop-world-2009-notes-from-application-session/">application session</a></li>
<li>Alexander Sicular&#8217;s <a href="http://siculars.posterous.com/are-you-new-to-hadoop-settle-in">Are You New to Hadoop? Settle in&#8230;</a></li>
<li>Pete Skomoroch posted his <a href="http://www.datawrangling.com/slides-thoughts-from-hadoop-world-nyc">slides and thoughts</a></li>
</ul>
<img src="http://feeds.feedburner.com/~r/3greeneggs/~4/uowO7nweCp0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.hilarymason.com/blog/hadoop-world-nyc/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		<feedburner:origLink>http://www.hilarymason.com/blog/hadoop-world-nyc/</feedburner:origLink></item>
		<item>
		<title>Do you do human subject research?</title>
		<link>http://feedproxy.google.com/~r/3greeneggs/~3/XDD6IJXMzmE/</link>
		<comments>http://www.hilarymason.com/blog/do-you-do-human-subject-research/#comments</comments>
		<pubDate>Sat, 29 Aug 2009 20:20:39 +0000</pubDate>
		<dc:creator>Hilary Mason</dc:creator>
				<category><![CDATA[blog]]></category>
		<category><![CDATA[headlamp]]></category>
		<category><![CDATA[human]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[science]]></category>

		<guid isPermaLink="false">http://www.hilarymason.com/?p=324</guid>
		<description><![CDATA[Dear friends and colleagues,
Do you do research that involves gathering data from human participants? This can be anything from marketing surveys to psychology experiments to medical science. If so, please take a short (5 to 10 minute) survey:
research tool survey
The results of the survey will help us design a new platform for online human research!
I&#8217;m [...]]]></description>
			<content:encoded><![CDATA[<p>Dear friends and colleagues,</p>
<p>Do you do research that involves gathering data from human participants? This can be anything from marketing surveys to psychology experiments to medical science. If so, please take a short (5 to 10 minute) survey:</p>
<p><a href="http://bit.ly/hlamp">research tool survey</a></p>
<p>The results of the survey will help us design <a href="http://www.headlampresearch.com">a new platform for online human research</a>!</p>
<p>I&#8217;m very excited about this project and would very much appreciate your input. If you have colleagues who do this kind of work, please pass it on.</p>
<p>Thank you!</p>
<img src="http://feeds.feedburner.com/~r/3greeneggs/~4/XDD6IJXMzmE" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.hilarymason.com/blog/do-you-do-human-subject-research/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		<feedburner:origLink>http://www.hilarymason.com/blog/do-you-do-human-subject-research/</feedburner:origLink></item>
		<item>
		<title>My NYC Python Meetup Presentation: Practical Data Analysis in Python</title>
		<link>http://feedproxy.google.com/~r/3greeneggs/~3/s_Hz25cC6fE/</link>
		<comments>http://www.hilarymason.com/blog/my-nyc-python-meetup-presentation-practical-data-analysis-in-python/#comments</comments>
		<pubDate>Wed, 12 Aug 2009 15:28:57 +0000</pubDate>
		<dc:creator>Hilary Mason</dc:creator>
				<category><![CDATA[blog]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[nltk]]></category>
		<category><![CDATA[presentations]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[spam]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://www.hilarymason.com/?p=318</guid>
		<description><![CDATA[I gave a talk at the NYC Python Meetup on July 29 on Practical Data Analysis in Python.
I tend to use my slides for visual representations of the concepts I&#8217;m discussing, so there&#8217;s a lot of content that was in the presentation that you unfortunately won&#8217;t see here.
The talk starts with the immense opportunities for [...]]]></description>
			<content:encoded><![CDATA[<p>I gave a talk at the <a href="http://www.meetup.com/nycpython/calendar/10536092/?from=list&#038;offset=0">NYC Python Meetup</a> on July 29 on <em>Practical Data Analysis in Python</em>.</p>
<p>I tend to use my slides for visual representations of the concepts I&#8217;m discussing, so there&#8217;s a lot of content that was in the presentation that you unfortunately won&#8217;t see here.</p>
<p>The talk starts with the immense opportunities for knowledge derived from data. I spent some time showing data systems &#8216;in the wild&#8217; along with the appropriate algorithmic vocabulary (for example, <a href="http://www.amazon.com">amazon.com</a>&#8217;s &#8216;books you might like&#8217; feature is a <a href="http://en.wikipedia.org/wiki/Recommender_system">recommender system</a>).</p>
<p>Once we can describe the problems properly, we can look for tools, and Python has many! Finally, in the fun part of the presentation, I demoed working code that uses <a href="http://www.nltk.org/">NLTK</a> to build a Twitter spam filter with 90% accuracy*.</p>
<p>Please let me know if you have questions or comments.</p>
<div style="width:425px;text-align:left" id="__ss_1845576"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/hmason/practical-data-analysis-in-python" title="Practical Data Analysis in Python">Practical Data Analysis in Python</a><object style="margin:0px" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=nycpython-090811161045-phpapp02&#038;stripped_title=practical-data-analysis-in-python" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=nycpython-090811161045-phpapp02&#038;stripped_title=practical-data-analysis-in-python" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object>
<div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">presentations</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/hmason">Hilary Mason</a>.</div>
</div>
<p style="font-size:90%;color:#999;">* I&#8217;ll post the code and training data shortly</p>
<img src="http://feeds.feedburner.com/~r/3greeneggs/~4/s_Hz25cC6fE" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.hilarymason.com/blog/my-nyc-python-meetup-presentation-practical-data-analysis-in-python/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.hilarymason.com/blog/my-nyc-python-meetup-presentation-practical-data-analysis-in-python/</feedburner:origLink></item>
		<item>
		<title>My Barcamp Presentation: Have Data? What Now?!</title>
		<link>http://feedproxy.google.com/~r/3greeneggs/~3/H5IttMte1G8/</link>
		<comments>http://www.hilarymason.com/blog/my-barcamp-presentation-have-data-what-now/#comments</comments>
		<pubDate>Mon, 01 Jun 2009 15:29:29 +0000</pubDate>
		<dc:creator>Hilary Mason</dc:creator>
				<category><![CDATA[blog]]></category>
		<category><![CDATA[barcamp]]></category>
		<category><![CDATA[barcampnyc]]></category>
		<category><![CDATA[classifier]]></category>
		<category><![CDATA[clustering]]></category>
		<category><![CDATA[conference]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[presentations]]></category>

		<guid isPermaLink="false">http://www.hilarymason.com/?p=304</guid>
		<description><![CDATA[I gave a talk at BarCampNYC4 on Saturday on common data problems and a very light overview of algorithms that address them.
I delivered the majority of the content verbally, by talking through examples of problems and how to solve them, so there&#8217;s no guarantee that these slides will make sense, but they might be funny!
Have [...]]]></description>
			<content:encoded><![CDATA[<p>I gave a talk at BarCampNYC4 on Saturday on common data problems and a very light overview of algorithms that address them.</p>
<div class="wp-caption alignleft" style="width: 510px"><a href="http://www.flickr.com/photos/jaxuk/3580269452/in/photostream/"><img alt="My talk at barcampnyc4. - photo courtesy of dynamist on flickr." src="http://farm4.static.flickr.com/3344/3580269452_2c89eb1a0a.jpg?v=0" title="Hilary Mason on Data" width="250" height="188" /></a><p class="wp-caption-text">My talk at barcampnyc4 - photo courtesy of dynamist on flickr.</p></div>
<p>I delivered the majority of the content verbally, by talking through examples of problems and how to solve them, so there&#8217;s no guarantee that these slides will make sense, but they might be funny!</p>
<div style="width:425px;text-align:left" id="__ss_1513427"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/hmason/have-data-what-now-1513427?type=presentation" title="Have data? What now?!">Have data? What now?!</a><object style="margin:0px" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=barcampnyc4-090531135219-phpapp01&#038;stripped_title=have-data-what-now-1513427" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=barcampnyc4-090531135219-phpapp01&#038;stripped_title=have-data-what-now-1513427" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object>
<div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">presentations</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/hmason">Hilary Mason</a>.</div>
</div>
<p>Sanford took some excellent <a href="http://sanford.blogspot.com/2009/05/hillary-mason-at-barcampnyc4.html">notes</a> during the presentation.</p>
<p>There were some <a href="http://twitter.com/ianlandsman/statuses/1975242296">very</a> <a href="http://twitter.com/maximka/statuses/1975002430">nice</a> <a href="http://twitter.com/rebeccaforever/statuses/1975210619">comments</a> on Twitter.</p>
<p>The discussion was so lively and engaging that I&#8217;m planning to expand on this content &#8212; I really welcome your suggestions and comments!</p>
<img src="http://feeds.feedburner.com/~r/3greeneggs/~4/H5IttMte1G8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.hilarymason.com/blog/my-barcamp-presentation-have-data-what-now/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.hilarymason.com/blog/my-barcamp-presentation-have-data-what-now/</feedburner:origLink></item>
		<item>
		<title>I’m on Jon Udell’s Interviews with Innovators!</title>
		<link>http://feedproxy.google.com/~r/3greeneggs/~3/26aDx8gw-f8/</link>
		<comments>http://www.hilarymason.com/blog/im-on-jon-udells-interviews-with-innovators/#comments</comments>
		<pubDate>Fri, 08 May 2009 03:50:00 +0000</pubDate>
		<dc:creator>Hilary Mason</dc:creator>
				<category><![CDATA[blog]]></category>
		<category><![CDATA[media]]></category>
		<category><![CDATA[path101]]></category>
		<category><![CDATA[podcast]]></category>

		<guid isPermaLink="false">http://www.hilarymason.com/?p=297</guid>
		<description><![CDATA[Jon Udell hosted Charlie and me on his Interviews with Innovators podcast.
We discussed Path101&#8217;s approach to career advice through data, and how the high availability of data is changing the way we make decisions.
Listen here.
]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.jonudell.net/">Jon Udell</a> hosted <a href="http://www.thisisgoingtobebig.com">Charlie</a> and me on his <a href="http://itc.conversationsnetwork.org/series/innovators.html">Interviews with Innovators</a> podcast.</p>
<p>We discussed <a href="http://www.path101.com">Path101</a>&#8217;s approach to career advice through data, and how the high availability of data is changing the way we make decisions.</p>
<p>Listen <a href="http://itc.conversationsnetwork.org/shows/detail4091.html">here</a>.</p>
<img src="http://feeds.feedburner.com/~r/3greeneggs/~4/26aDx8gw-f8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.hilarymason.com/blog/im-on-jon-udells-interviews-with-innovators/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.hilarymason.com/blog/im-on-jon-udells-interviews-with-innovators/</feedburner:origLink></item>
		<item>
		<title>LSL: AOL IM Status Indicator</title>
		<link>http://feedproxy.google.com/~r/3greeneggs/~3/dYMx3LBW-_M/</link>
		<comments>http://www.hilarymason.com/blog/lsl-aol-im-status-indicator/#comments</comments>
		<pubDate>Wed, 29 Apr 2009 03:10:09 +0000</pubDate>
		<dc:creator>Hilary Mason</dc:creator>
				<category><![CDATA[blog]]></category>
		<category><![CDATA[aim]]></category>
		<category><![CDATA[lsl]]></category>
		<category><![CDATA[second life]]></category>

		<guid isPermaLink="false">http://www.hilarymason.com/?p=281</guid>
		<description><![CDATA[I think this might be my very first LSL script, from back in 2005! This script indicates whether your AIM (AOL Instant Messenger) account is online by changing the color of an object. You can configure it to either share your AIM ID publicly, or keep it private.
This script uses the AIM web services API [...]]]></description>
			<content:encoded><![CDATA[<p>I think this might be my very first LSL script, from back in 2005! This script indicates whether your AIM (AOL Instant Messenger) account is online by changing the color of an object. You can configure it to either share your AIM ID publicly, or keep it private.</p>
<div id="attachment_287" class="wp-caption alignleft" style="width: 378px"><a href="http://www.hilarymason.com/wp-content/uploads/2009/04/sl_aim_indicator.jpg"><img src="http://www.hilarymason.com/wp-content/uploads/2009/04/sl_aim_indicator.jpg" alt="AIM Indicators in Second Life" title="AIM Indicators in Second Life" width="368" height="283" class="size-full wp-image-287" /></a><p class="wp-caption-text">AIM Indicators in Second Life</p></div>
<p>This script uses the AIM web services API to check your online status &#8212; you only need to give it your username, not your password! This is not a proxy service. You can&#8217;t send messages through this script, just show your online status in SL.</p>
<p>To use this script, <a href="http://3greeneggs.com/autoscript/howto.html">create an object in your favorite shape, create a new script inside of it, paste this code into it and save</a>.</p>

<div class="wp_syntax"><div class="code"><pre class="cpp" style="font-family:monospace;">key request_id<span style="color: #008080;">;</span>
string aim_id<span style="color: #008080;">;</span>
string av_name<span style="color: #008080;">;</span>
key data_card<span style="color: #008080;">;</span>
integer nLine <span style="color: #000080;">=</span> <span style="color: #0000dd;">0</span><span style="color: #008080;">;</span>
integer <span style="color: #0000ff;">public</span> <span style="color: #000080;">=</span> TRUE<span style="color: #008080;">;</span>
&nbsp;
<span style="color: #0000ff;">default</span>
<span style="color: #008000;">&#123;</span>
    state_entry<span style="color: #008000;">&#40;</span><span style="color: #008000;">&#41;</span>
    <span style="color: #008000;">&#123;</span>
        llSetText<span style="color: #008000;">&#40;</span><span style="color: #FF0000;">&quot;AIM Indicator... setting up&quot;</span>,<span style="color: #000080;">&lt;</span><span style="color: #0000dd;">1</span>,<span style="color: #0000dd;">1</span>,<span style="color: #0000dd;">1</span><span style="color: #000080;">&gt;</span>,<span style="color: #0000dd;">1</span><span style="color: #008000;">&#41;</span><span style="color: #008080;">;</span>
        data_card <span style="color: #000080;">=</span> llGetNotecardLine<span style="color: #008000;">&#40;</span><span style="color: #FF0000;">&quot;Settings&quot;</span>,nLine<span style="color: #008000;">&#41;</span><span style="color: #008080;">;</span> <span style="color: #666666;">// load settings</span>
        llSetTimerEvent<span style="color: #008000;">&#40;</span><span style="color: #0000dd;">60</span><span style="color: #008000;">&#41;</span><span style="color: #008080;">;</span> <span style="color: #666666;">// check once per min</span>
    <span style="color: #008000;">&#125;</span>
&nbsp;
    <span style="color: #666666;">// reset when owner touches it</span>
    touch_start<span style="color: #008000;">&#40;</span>integer total_number<span style="color: #008000;">&#41;</span> <span style="color: #008000;">&#123;</span>
        <span style="color: #0000ff;">if</span> <span style="color: #008000;">&#40;</span>llDetectedKey<span style="color: #008000;">&#40;</span><span style="color: #0000dd;">0</span><span style="color: #008000;">&#41;</span> <span style="color: #000080;">==</span> llGetOwner<span style="color: #008000;">&#40;</span><span style="color: #008000;">&#41;</span><span style="color: #008000;">&#41;</span> <span style="color: #008000;">&#123;</span>
            llResetScript<span style="color: #008000;">&#40;</span><span style="color: #008000;">&#41;</span><span style="color: #008080;">;</span>
        <span style="color: #008000;">&#125;</span>
    <span style="color: #008000;">&#125;</span>
&nbsp;
    <span style="color: #666666;">// read settings from notecard</span>
    dataserver<span style="color: #008000;">&#40;</span>key query_id, string data<span style="color: #008000;">&#41;</span> <span style="color: #008000;">&#123;</span>
        <span style="color: #0000ff;">if</span> <span style="color: #008000;">&#40;</span>query_id <span style="color: #000080;">==</span> data_card<span style="color: #008000;">&#41;</span> <span style="color: #008000;">&#123;</span>
            <span style="color: #0000ff;">if</span> <span style="color: #008000;">&#40;</span>data <span style="color: #000040;">!</span><span style="color: #000080;">=</span> <span style="color: #0000ff;">EOF</span><span style="color: #008000;">&#41;</span> <span style="color: #008000;">&#123;</span> <span style="color: #666666;">// not at the end of the notecard</span>
                <span style="color: #0000ff;">if</span> <span style="color: #008000;">&#40;</span>nLine <span style="color: #000080;">==</span> <span style="color: #0000dd;">0</span><span style="color: #008000;">&#41;</span> <span style="color: #008000;">&#123;</span>
                    av_name <span style="color: #000080;">=</span> data<span style="color: #008080;">;</span>
                <span style="color: #008000;">&#125;</span> <span style="color: #0000ff;">else</span> <span style="color: #0000ff;">if</span> <span style="color: #008000;">&#40;</span>nLine <span style="color: #000080;">==</span> <span style="color: #0000dd;">1</span><span style="color: #008000;">&#41;</span> <span style="color: #008000;">&#123;</span>
                    aim_id <span style="color: #000080;">=</span> data<span style="color: #008080;">;</span>
                <span style="color: #008000;">&#125;</span> <span style="color: #0000ff;">else</span> <span style="color: #0000ff;">if</span> <span style="color: #008000;">&#40;</span>nLine <span style="color: #000080;">==</span> <span style="color: #0000dd;">2</span><span style="color: #008000;">&#41;</span> <span style="color: #008000;">&#123;</span>
                    <span style="color: #0000ff;">if</span> <span style="color: #008000;">&#40;</span>data <span style="color: #000080;">==</span> <span style="color: #FF0000;">&quot;private&quot;</span><span style="color: #008000;">&#41;</span> <span style="color: #008000;">&#123;</span> <span style="color: #666666;">// if they do not want their ID shared</span>
                        <span style="color: #0000ff;">public</span> <span style="color: #000080;">=</span> FALSE<span style="color: #008080;">;</span>
                    <span style="color: #008000;">&#125;</span>
                <span style="color: #008000;">&#125;</span>
                <span style="color: #000040;">++</span>nLine<span style="color: #008080;">;</span> <span style="color: #666666;">// increase line count</span>
                data_card <span style="color: #000080;">=</span> llGetNotecardLine<span style="color: #008000;">&#40;</span><span style="color: #FF0000;">&quot;Settings&quot;</span>, nLine<span style="color: #008000;">&#41;</span><span style="color: #008080;">;</span> <span style="color: #666666;">// request next line</span>
            <span style="color: #008000;">&#125;</span>
        <span style="color: #008000;">&#125;</span>
    <span style="color: #008000;">&#125;</span>
&nbsp;
    timer<span style="color: #008000;">&#40;</span><span style="color: #008000;">&#41;</span> <span style="color: #008000;">&#123;</span>
        string url <span style="color: #000080;">=</span> <span style="color: #FF0000;">&quot;http://api.oscar.aol.com/SOA/key=hi15uD79wuEukQTS/resource-lists/users/*anonymous*/presence/~~/resource-lists/list%5Bname=%22users%22%5D/entry%5B@uri=%22user:&quot;</span><span style="color: #000040;">+</span>aim_id<span style="color: #000040;">+</span><span style="color: #FF0000;">&quot;%22%5D&quot;</span><span style="color: #008080;">;</span>
        request_id <span style="color: #000080;">=</span> llHTTPRequest<span style="color: #008000;">&#40;</span>url,<span style="color: #008000;">&#91;</span>HTTP_METHOD, <span style="color: #FF0000;">&quot;GET&quot;</span><span style="color: #008000;">&#93;</span>,<span style="color: #FF0000;">&quot;&quot;</span><span style="color: #008000;">&#41;</span><span style="color: #008080;">;</span>
    <span style="color: #008000;">&#125;</span>
&nbsp;
    http_response<span style="color: #008000;">&#40;</span>key req_id, integer status, list metadata, string body<span style="color: #008000;">&#41;</span> <span style="color: #008000;">&#123;</span>
        <span style="color: #0000ff;">if</span> <span style="color: #008000;">&#40;</span>req_id <span style="color: #000080;">==</span> request_id<span style="color: #008000;">&#41;</span> <span style="color: #008000;">&#123;</span>
            list result <span style="color: #000080;">=</span> llParseString2List<span style="color: #008000;">&#40;</span>body,<span style="color: #008000;">&#91;</span><span style="color: #FF0000;">&quot;&lt;&quot;</span>,<span style="color: #FF0000;">&quot;&gt;&quot;</span>,<span style="color: #FF0000;">&quot;<span style="color: #000099; font-weight: bold;">\n</span>&quot;</span><span style="color: #008000;">&#93;</span>,<span style="color: #008000;">&#91;</span><span style="color: #008000;">&#93;</span><span style="color: #008000;">&#41;</span><span style="color: #008080;">;</span>
&nbsp;
            <span style="color: #666666;">// if user is online</span>
            <span style="color: #0000ff;">if</span> <span style="color: #008000;">&#40;</span>llList2String<span style="color: #008000;">&#40;</span>result,<span style="color: #0000dd;">24</span><span style="color: #008000;">&#41;</span> <span style="color: #000040;">!</span><span style="color: #000080;">=</span> <span style="color: #FF0000;">&quot;http://cdn.digitalcity.com/presence/offline.gif&quot;</span><span style="color: #008000;">&#41;</span> <span style="color: #008000;">&#123;</span>
                <span style="color: #0000ff;">if</span> <span style="color: #008000;">&#40;</span><span style="color: #0000ff;">public</span> <span style="color: #000080;">==</span> FALSE<span style="color: #008000;">&#41;</span> <span style="color: #008000;">&#123;</span> 
                    llSetText<span style="color: #008000;">&#40;</span>av_name<span style="color: #000040;">+</span><span style="color: #FF0000;">&quot;'s AIM account is online&quot;</span>,<span style="color: #000080;">&lt;</span><span style="color: #0000dd;">1</span>,<span style="color: #0000dd;">1</span>,<span style="color: #0000dd;">1</span><span style="color: #000080;">&gt;</span>,<span style="color: #0000dd;">1</span><span style="color: #008000;">&#41;</span><span style="color: #008080;">;</span>
                <span style="color: #008000;">&#125;</span> <span style="color: #0000ff;">else</span> <span style="color: #008000;">&#123;</span>
                    llSetText<span style="color: #008000;">&#40;</span><span style="color: #FF0000;">&quot;AIM &quot;</span><span style="color: #000040;">+</span>av_name<span style="color: #000040;">+</span><span style="color: #FF0000;">&quot;: &quot;</span><span style="color: #000040;">+</span>aim_id<span style="color: #000040;">+</span><span style="color: #FF0000;">&quot; is online&quot;</span>,<span style="color: #000080;">&lt;</span><span style="color: #0000dd;">1</span>,<span style="color: #0000dd;">1</span>,<span style="color: #0000dd;">1</span><span style="color: #000080;">&gt;</span>,<span style="color: #0000dd;">1</span><span style="color: #008000;">&#41;</span><span style="color: #008080;">;</span>
                <span style="color: #008000;">&#125;</span>
                llSetColor<span style="color: #008000;">&#40;</span>llVecNorm<span style="color: #008000;">&#40;</span><span style="color: #000080;">&lt;</span><span style="color: #0000dd;">160</span>,<span style="color: #0000dd;">237</span>,<span style="color: #0000dd;">160</span><span style="color: #000080;">&gt;</span><span style="color: #008000;">&#41;</span>,ALL_SIDES<span style="color: #008000;">&#41;</span><span style="color: #008080;">;</span>
            <span style="color: #008000;">&#125;</span> <span style="color: #0000ff;">else</span> <span style="color: #008000;">&#123;</span> <span style="color: #666666;">// user is offline</span>
                llSetText<span style="color: #008000;">&#40;</span>av_name<span style="color: #000040;">+</span><span style="color: #FF0000;">&quot;'s AIM account is offline&quot;</span>,<span style="color: #000080;">&lt;</span><span style="color: #0000dd;">1</span>,<span style="color: #0000dd;">1</span>,<span style="color: #0000dd;">1</span><span style="color: #000080;">&gt;</span>,<span style="color: #0000dd;">1</span><span style="color: #008000;">&#41;</span><span style="color: #008080;">;</span>
                llSetColor<span style="color: #008000;">&#40;</span>llVecNorm<span style="color: #008000;">&#40;</span><span style="color: #000080;">&lt;</span><span style="color: #0000dd;">237</span>,<span style="color: #0000dd;">160</span>,<span style="color: #0000dd;">160</span><span style="color: #000080;">&gt;</span><span style="color: #008000;">&#41;</span>,ALL_SIDES<span style="color: #008000;">&#41;</span><span style="color: #008080;">;</span>
            <span style="color: #008000;">&#125;</span>
        <span style="color: #008000;">&#125;</span>
    <span style="color: #008000;">&#125;</span>
<span style="color: #008000;">&#125;</span></pre></div></div>

<p>This script is released under <a rel="license" href="http://creativecommons.org/licenses/by-sa/3.0/us/"><img alt="Creative Commons License" style="border-width:0" src="http://i.creativecommons.org/l/by-sa/3.0/us/80x15.png" /></a>. Have fun!</p>
<p>Create a notecard called &#8220;Settings&#8221; with your avatar name on the first line* and your AIM username on the second line. If you do not want your AIM username shared, put the word &#8220;private&#8221; on the third line. If you <em>do</em> want it shared, change the line to anything else (&#8220;public&#8221;, or even a blank line, will do). For example:</p>
<pre>
Ann Enigma
hilm1
private
</pre>
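<p>If you want to check a notecard before dropping it into the object, here is a rough Python sketch of how the script interprets the three lines and which hover text it would show. The function names <code>parse_settings</code> and <code>hover_text</code> are hypothetical and exist only for this illustration; the sketch also trims surrounding whitespace, which the LSL script itself does not do.</p>

```python
def parse_settings(notecard_text):
    """Interpret a Settings notecard the way the script's dataserver event does."""
    # One entry per notecard line, mirroring the nLine counter in the script.
    lines = [line.strip() for line in notecard_text.strip().splitlines()]
    # Pad with empty strings so missing lines fall back to defaults.
    lines = (lines + ["", "", ""])[:3]
    av_name, aim_id, visibility = lines
    return {
        "av_name": av_name,
        "aim_id": aim_id,
        # Only the exact word "private" hides the AIM ID; anything else is public.
        "public": visibility != "private",
    }


def hover_text(settings, online):
    """Build the same hover text the script passes to llSetText."""
    if not online:
        return settings["av_name"] + "'s AIM account is offline"
    if settings["public"]:
        return "AIM " + settings["av_name"] + ": " + settings["aim_id"] + " is online"
    return settings["av_name"] + "'s AIM account is online"


example = parse_settings("Ann Enigma\nhilm1\nprivate\n")
print(hover_text(example, True))  # Ann Enigma's AIM account is online
```

<p>With &#8220;private&#8221; on the third line, the AIM username never appears in the hover text, which matches the behavior described above.</p>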
<p>The script polls the AIM server every 60 seconds, so give it a minute to update. It will reset if the owner clicks on it.</p>
<p><span style="font-size:80%">* I&#8217;m aware that you can get the avatar name via LSL &#8212; this was my first script, be nice!</span></p>
<img src="http://feeds.feedburner.com/~r/3greeneggs/~4/dYMx3LBW-_M" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.hilarymason.com/blog/lsl-aol-im-status-indicator/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		<feedburner:origLink>http://www.hilarymason.com/blog/lsl-aol-im-status-indicator/</feedburner:origLink></item>
		<item>
		<title>From the ACM: Learning More About Active Learning</title>
		<link>http://feedproxy.google.com/~r/3greeneggs/~3/9EZUBMfqKPk/</link>
		<comments>http://www.hilarymason.com/blog/from-the-acm-learning-more-about-active-learning/#comments</comments>
		<pubDate>Thu, 02 Apr 2009 14:39:32 +0000</pubDate>
		<dc:creator>Hilary Mason</dc:creator>
				<category><![CDATA[blog]]></category>
		<category><![CDATA[acm]]></category>
		<category><![CDATA[active learning]]></category>
		<category><![CDATA[machine learning]]></category>

		<guid isPermaLink="false">http://www.hilarymason.com/?p=272</guid>
		<description><![CDATA[
The April edition of Communications of the ACM has an interesting article on recent advances in active learning by Graeme Stemp-Morlock.
In passive learning (a more traditional approach), you build a large training set of classified data by (often) manually assigning labels. This data is used as the basis of your analysis. 
In the real world, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://mags.acm.org/communications/200904/?pg=12" target="_blank" onclick="name='w'+Math.round(Math.random()*(1000));w=screen.width-10;h=screen.height-40;window.open('http://mags.acm.org/communications/200904/?pg=12',name,'toolbar=no,menubar=no,resizable=yes,scrollbars=yes,left=0,top=0,width='+w+',height='+h);return false;" title="View Magazine"><img src="http://mags.acm.org/tcprojects/acm/communication/inbox/60517/imgpages/tn/communications200904_0013.gif" border="0" style="float:left;margin:10px;"></a></p>
<p>The April edition of Communications of the ACM has an interesting article on recent advances in active learning by Graeme Stemp-Morlock.</p>
<p>In <em>passive learning</em> (a more traditional approach), you build a large training set of classified data by (often) manually assigning labels. This data is used as the basis of your analysis. </p>
<p>In the real world, we find that generating these large sets of labeled data is often expensive and time-consuming. With <em>active learning</em>, you identify the most ambiguous data to label, resulting in a much higher payoff for each label defined (and fewer headaches for your labelers).</p>
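<p>To make &#8220;most ambiguous&#8221; concrete, here is a minimal Python sketch of uncertainty sampling, one common active learning strategy (not necessarily the one the article has in mind). It assumes a binary classifier that already gives a probability for each unlabeled item; the function name <code>most_ambiguous</code> and the scores are made up for illustration.</p>

```python
def most_ambiguous(probabilities, budget):
    """Return the indices of the items a binary classifier is least sure about."""
    # An item is most ambiguous when its predicted probability sits near 0.5.
    ranked = sorted(
        range(len(probabilities)),
        key=lambda i: abs(probabilities[i] - 0.5),
    )
    # Only the top `budget` items are sent to the human labelers.
    return ranked[:budget]


# Hypothetical model outputs for six unlabeled items.
scores = [0.97, 0.52, 0.10, 0.45, 0.80, 0.03]
print(most_ambiguous(scores, 2))  # [1, 3]
```

<p>The labelers would see items 1 and 3 first, since the model is nearly undecided about them, while the confident predictions (0.97, 0.03) are left alone.</p>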
<p>The article goes on to mention that active learning is being used in practice with excellent results (for example in music identification, text classification, and even bioinformatics), but that the theory lags. This is another example of a gap between the world of the practitioner and the academic work behind it.</p>
<img src="http://feeds.feedburner.com/~r/3greeneggs/~4/9EZUBMfqKPk" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.hilarymason.com/blog/from-the-acm-learning-more-about-active-learning/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		<feedburner:origLink>http://www.hilarymason.com/blog/from-the-acm-learning-more-about-active-learning/</feedburner:origLink></item>
	</channel>
</rss>
