<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>The CrowdFlower Blog</title>
	
	<link>http://blog.crowdflower.com</link>
	<description />
	<lastBuildDate>Wed, 09 May 2012 19:52:19 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/doloreslabs" /><feedburner:info uri="doloreslabs" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><item>
		<title>Real Time Foto Moderator – The Rambling Anti-Elevator Pitch</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/t3ll_lxIeT0/</link>
		<comments>http://blog.crowdflower.com/2012/05/real-time-foto-moderator/#comments</comments>
		<pubDate>Wed, 09 May 2012 19:46:06 +0000</pubDate>
		<dc:creator>Lukas Biewald</dc:creator>
				<category><![CDATA[Miscellaneous]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=5285</guid>
		<description><![CDATA[I never really understood the need for photo moderation until Chris and I built our first webapp together. This webapp was called FaceStat, a website where you could upload photos of yourself and get “feedback” from other people on things like whether or not you looked friendly and if you could win a fight with [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright" title="Crowdflower_bkt_4487" src="http://blog.crowdflower.com/wp-content/uploads/2012/05/Crowdflower_bkt_4487.jpg" alt="" width="170" height="170" /></p>
<p>I never really understood the need for photo moderation until Chris and I built our first webapp together. This webapp was called FaceStat, a website where you could upload photos of yourself and get “feedback” from other people on things like whether or not you looked friendly and if you could win a fight with a medium-sized dog. It was a surprisingly popular, extremely viral website with horrible retention and no prayer of a viable business model.</p>
<p>At first we just let people upload whatever photos they wanted and comment however they wanted. Not surprisingly, things quickly got out of hand. Hours after we released the website we started seeing extremely disturbing uploads. We tried to use a user-moderation system but there were lots of false positives and it pained me to subject unsuspecting users to some of the things people uploaded. We tried automated systems but they were extremely unreliable.</p>
<p>This experience caused us to build CrowdSifter, a content moderation crowdsourcing tool, where our FaceStat app was the first customer. It worked really well and we got a few early customers. But our customers seemed interested in lots of tasks besides content moderation, so we decided to build a general purpose crowdsourcing system that we called CrowdFlower. Eventually CrowdSifter fell into disrepair and we shut it down and moved the existing customers over to our general crowdsourcing platform. CrowdFlower became so popular that we renamed our company CrowdFlower and we got in the business of helping companies get very high quality work done through crowdsourcing. We signed up over 50 channels, over 3 million contributors and hundreds of name-brand customers. We built out an enterprise sales team and learned how to deal with procurement departments. We even did some work for the US Government.</p>
<p>But as our engagement sizes rose, our sales costs rose too, and we stopped being able to service anyone except customers with the largest crowdsourcing jobs. Which in some ways completely contradicts the promise of crowdsourcing – a super-flexible workforce. We have a self-service site with some happy customers, but the learning curve is steep and while we’re working on improvements, I’m not sure if it’s possible to make a general-purpose crowdsourcing tool that’s also easy for everyone to use.</p>
<p>So we decided to build specific self-service products that would be easy to use, reliably fast and high-quality. I wanted to start with an application that we had lots of experience with and one where we felt like we really understood our customer’s needs. So I chose content moderation. The customers have expanded in the past few years with iPhone developers getting lots and lots of user generated images and living in constant fear of apple shutting them down for objectionable content. I’ve seen several indie app developers sign up for our beta and in particular I hope that this is a great product for them.</p>
<p>We plan on releasing several of these self-service applications over the next few months. If there are any crowdsourcing applications you would especially like to see, please let me know.</p>
<p>TL;DR We’ve offered crowdsourced photo moderation as a service for years, but now you can buy it with clear pricing, no sign-up fee and without ever talking to a sales person. And you should! Go to <a href="http://crowdflower.com/rtfm">crowdflower.com/rtfm</a>.</p>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/t3ll_lxIeT0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/05/real-time-foto-moderator/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/05/real-time-foto-moderator/</feedburner:origLink></item>
		<item>
		<title>What Motivates Crowdsource Workers on CrowdFlower &amp; Kaggle – Highlights of SXSW 2012′s “Pay or Play” Crowdsourcing Talk</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/dv1v9HLUUTs/</link>
		<comments>http://blog.crowdflower.com/2012/03/what-motivates-crowdsource-workers-on-crowdflower-kaggle-highlights-of-sxsw-2012s-pay-or-play-crowdsourcing-talk/#comments</comments>
		<pubDate>Wed, 28 Mar 2012 21:03:35 +0000</pubDate>
		<dc:creator>Wagner James Au</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[CrowdFlower]]></category>
		<category><![CDATA[Human Behavior]]></category>
		<category><![CDATA[Sentiment Analysis]]></category>
		<category><![CDATA[SXSW]]></category>
		<category><![CDATA[Tweets]]></category>
		<category><![CDATA[Twitter]]></category>
		<category><![CDATA[Virtual Goods]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=5211</guid>
		<description><![CDATA[This is a summary of South by Southwest&#8217;s talk &#8220;Getting a Crowd to Work for You: For Pay or Play?&#8221; featuring CrowdFlower founder Lukas Biewald and Kaggle founder Anthony Goldsbloom. They discussed how crowdsource work is transforming traditional business and research, provided background on their companies, and offered insights on the motivations of people involved [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.crowdflower.com/2012/03/what-motivates-crowdsource-workers-on-crowdflower-kaggle-highlights-of-sxsw-2012s-pay-or-play-crowdsourcing-talk/lukas-biewald-crowdflower-anthony-goldbloom-kaggle/" rel="attachment wp-att-5220"><img class="alignright size-full wp-image-5220" title="Lukas Biewald CrowdFlower Anthony Goldbloom Kaggle" src="http://blog.crowdflower.com/wp-content/uploads/2012/03/Lukas-Biewald-CrowdFlower-Anthony-Goldbloom-Kaggle.jpg" alt="" width="314" height="262" /></a><em>This is a summary of South by Southwest&#8217;s talk &#8220;<a href="http://bit.ly/zjzGHs">Getting a Crowd to Work for You: For Pay or Play?</a>&#8221; featuring <a href="http://bit.ly/wtLZql">CrowdFlower</a> founder Lukas Biewald and <a href="http://bit.ly/wnEt1P">Kaggle</a> founder Anthony Goldsbloom. They discussed how crowdsource work is transforming traditional business and research, provided background on their companies, and offered insights on the motivations of people involved with crowdsourcing.</em></p>
<p><strong>The Rise of Crowdsourcing</strong></p>
<p>CrowdFlower&#8217;s company name was inspired by a 2006 article by Wired author Jeff Howe, who coined the term &#8220;crowdsourcing&#8221;. Since then, the concept has rapidly gained mainstream recognition.</p>
<p>These graphs below show the growth of Google searches of the term, and the rising number of academic journal articles which also reference crowdsourcing:<br />
<a href="http://blog.crowdflower.com/2012/03/what-motivates-crowdsource-workers-on-crowdflower-kaggle-highlights-of-sxsw-2012s-pay-or-play-crowdsourcing-talk/crowdsourcing-google-2/" rel="attachment wp-att-5215"><img class="aligncenter size-full wp-image-5215" title="CrowdSourcing Google" src="http://blog.crowdflower.com/wp-content/uploads/2012/03/CrowdSourcing-Google.jpg" alt="" width="682" height="355" /></a></p>
<p>The crowdsourcing industry has grown in size and diversity, and now involves hundreds of companies in at least eight sub-sectors, <a href="http://bit.ly/GKlngz">as this chart by crowdsourcing.org shows</a>. CrowdFlower and Kaggle fit within the Cloud Labor segment, providing on-demand services from a virtual pool of workers. In line with broader industry trends, both CrowdFlower and Kaggle have enjoyed very strong growth in recent years: <strong>The number of judgements performed for enterprise and self-service clients by CrowdFlower&#8217;s workers  is now over 300 million, and the growth of registered members on Kaggle now stands at about 30,000.</strong></p>
<p>The two companies have very different approaches to cloud labor:</p>
<p><strong>CrowdFlower deploys massive groups of workers to complete complex but simple jobs, while Kaggle leverages smaller (and competing) groups of specialists to solve difficult technical challenges.</strong></p>
<p>More on this below:<br />
<span id="more-5211"></span></p>
<p><a href="http://blog.crowdflower.com/2012/03/what-motivates-crowdsource-workers-on-crowdflower-kaggle-highlights-of-sxsw-2012s-pay-or-play-crowdsourcing-talk/kaggle-dark-matter-competition/" rel="attachment wp-att-5226"><img class="alignright size-full wp-image-5226" title="Kaggle Dark Matter Competition" src="http://blog.crowdflower.com/wp-content/uploads/2012/03/Kaggle-Dark-Matter-Competition.jpg" alt="" width="300" height="143" /></a></p>
<p><strong>How Kaggle Crowdsourcing Works</strong></p>
<p>Kaggle&#8217;s basic operation resembles the famed Netflix competition, in which thousands of people vied to improve the video service&#8217;s search algorithm. Kaggle posts challenges to highly technical problems, and contestants around the world compete to solve them. One notable example is the search for a better algorithm to map &#8220;dark matter&#8221; in the universe. (NASA, the European Space Agency, and the Royal Astronomical Society posted this challenge on Kaggle.) To everyone&#8217;s surprise, the entry which outperformed the others came not from an astrophysicist, but Martin O&#8217;Leary, a PhD student in glaciology. (O&#8217;Leary created an algorithim similar to one used for mapping glaciers.) <strong>Thanks to Kaggle, someone from a totally different field of science was able to come forward and provide this leap of insight.</strong> Sometimes, the winning entry doesn&#8217;t even come from a scientist &#8212; in another Kaggle competition involving HIV research, the winner was an English professor.<br />
<a href="http://blog.crowdflower.com/2012/03/what-motivates-crowdsource-workers-on-crowdflower-kaggle-highlights-of-sxsw-2012s-pay-or-play-crowdsourcing-talk/kaggle-competition/" rel="attachment wp-att-5225"><img class="aligncenter size-full wp-image-5225" title="Kaggle competition" src="http://blog.crowdflower.com/wp-content/uploads/2012/03/Kaggle-competition.jpg" alt="" width="670" height="352" /></a></p>
<p>As noted, Kaggle&#8217;s challenges are competitive. So in the weeks after O&#8217;Leary&#8217;s entry was submitted, other challengers presented their own dark matter mapping algorithms. Not to be outdone, O&#8217;Leary improved his own original entry. <strong>As the chart above shows, competition pushed subsequent entries to grow ever more accurate.</strong></p>
<p><strong>How CrowdFlower Crowdsourcing Works</strong></p>
<p>CrowdFlower&#8217;s system leverages large groups of workers to solve massive but technically simple tasks. One of these services is sentiment analysis, the study of public opinion on a given subject. Other sentiment analysis programs use computational linguistics to perform this task, but computers cannot always discern nuances of tone, such as sarcasm or irony. <strong>CrowdFlower&#8217;s version is unique in that actual humans perform the sentiment analysis.</strong></p>
<p><a href="http://blog.crowdflower.com/2012/03/what-motivates-crowdsource-workers-on-crowdflower-kaggle-highlights-of-sxsw-2012s-pay-or-play-crowdsourcing-talk/crowdsourcing-weather/" rel="attachment wp-att-5227"><img class="aligncenter size-full wp-image-5227" title="Crowdsourcing weather" src="http://blog.crowdflower.com/wp-content/uploads/2012/03/Crowdsourcing-weather.png" alt="" width="873" height="501" /></a></p>
<p>This service can be applied to a variety of jobs. For example, an academic used CrowdFlower&#8217;s self-service site to perform a sentiment analysis of Twitter messages about the weather. (See chart above.) By tracking thousands of Tweets from people expressing their feelings about the weather (positive or negative) across the country, the researcher was able to create an accurate map of climate conditions through the entire continental US.</p>
<p><a href="http://blog.crowdflower.com/2012/03/what-motivates-crowdsource-workers-on-crowdflower-kaggle-highlights-of-sxsw-2012s-pay-or-play-crowdsourcing-talk/crowdflower-quality-control/" rel="attachment wp-att-5228"><img class="aligncenter size-full wp-image-5228" title="CrowdFlower Quality Control" src="http://blog.crowdflower.com/wp-content/uploads/2012/03/CrowdFlower-Quality-Control.jpg" alt="" width="663" height="477" /></a><br />
To insure accuracy of the crowd&#8217;s work on a given job, CrowdFlower&#8217;s system also leverages the crowd. By comparing individual results with the aggregate, final results are 95% accurate. At SXSW, Biewald said that CrowdFlower is so confident of the company&#8217;s quality control, they have an open API for their crowdsourcing service.</p>
<p><a href="http://blog.crowdflower.com/2012/03/what-motivates-crowdsource-workers-on-crowdflower-kaggle-highlights-of-sxsw-2012s-pay-or-play-crowdsourcing-talk/crowdsource-game-paywall/" rel="attachment wp-att-5229"><img class="aligncenter size-full wp-image-5229" title="Crowdsource game paywall" src="http://blog.crowdflower.com/wp-content/uploads/2012/03/Crowdsource-game-paywall.png" alt="" width="604" height="504" /></a></p>
<p>While some CrowdFlower workers perform tasks to earn a side income, many are motivated by other goals. <strong>Notably, 50% of them come to CrowdFlower through online games, where they can earn game money and other rewards by performing crowdsource jobs. </p>
<p><strong>CrowdFlower and Kaggle Crowdsource Worker Demographics</strong></p>
<p>Much of CrowdFlower&#8217;s crowd workers are based in the US and India, while Kaggle has a strong presence in the US, and Australia as well (owing to its origin in that nation.) Both companies&#8217; workforces are based in over half the countries of the world.</p>
<p><a href="http://blog.crowdflower.com/2012/03/what-motivates-crowdsource-workers-on-crowdflower-kaggle-highlights-of-sxsw-2012s-pay-or-play-crowdsourcing-talk/crowdflower-kaggle-demographics-2/" rel="attachment wp-att-5242"><img class="aligncenter size-full wp-image-5242" title="CrowdFlower Kaggle Demographics" src="http://blog.crowdflower.com/wp-content/uploads/2012/03/CrowdFlower-Kaggle-Demographics1.jpg" alt="" width="590" height="457" /></a></p>
<p>By gender, CrowdFlower&#8217;s work force skews heavily female. (<strong>According to Biewald, this may be because CrowdFlower jobs can be performed successfully without the worry of gender bias</strong>.) In contrast, Kaggle would make a poor dating site, since the vast majority of its competitors are male. This could be due to the over-representation of men in the sciences, Goldsbloom suggested. Some have also argued that women are generally less motivated by the kind of competition that Kaggle leverages.</p>
<p><a href="http://blog.crowdflower.com/2012/03/what-motivates-crowdsource-workers-on-crowdflower-kaggle-highlights-of-sxsw-2012s-pay-or-play-crowdsourcing-talk/kaggle-workforce-by-occupation-2/" rel="attachment wp-att-5244"><img class="alignnone size-full wp-image-5244" title="Kaggle workforce by occupation" src="http://blog.crowdflower.com/wp-content/uploads/2012/03/Kaggle-workforce-by-occupation1.png" alt="" width="800" height="594" /></a></p>
<p><strong>By occupation and education, about 80% of Kaggle&#8217;s registered users have a technical background, such as computer science, economics, and the hard sciences</strong>.</p>
<p><strong>Crowdsourcing Work Motivations on CrowdFlower and Kaggle</strong></p>
<p><a href="http://blog.crowdflower.com/2012/03/what-motivates-crowdsource-workers-on-crowdflower-kaggle-highlights-of-sxsw-2012s-pay-or-play-crowdsourcing-talk/crowdflower-motivations/" rel="attachment wp-att-5245"><img class="aligncenter size-full wp-image-5245" title="CrowdFlower Motivations" src="http://blog.crowdflower.com/wp-content/uploads/2012/03/CrowdFlower-Motivations.jpg" alt="" width="658" height="481" /></a></p>
<p>In two surveys, CrowdFlower workers cited multiple motivations for performing tasks on the system. While a large percent cite &#8220;money&#8221;, about as many also list &#8220;fun&#8221;. <strong>Interestingly, about 40% cite a &#8220;sense of purpose&#8221;.</strong> For CrowdFlower&#8217;s part, <strong>Biewald said he believes it&#8217;s important that crowdsourced jobs are paid for, since it indicates they are valuable, and assures the workers that the company cares enough about the task to pay for it</strong>.</p>
<p><a href="http://blog.crowdflower.com/2012/03/what-motivates-crowdsource-workers-on-crowdflower-kaggle-highlights-of-sxsw-2012s-pay-or-play-crowdsourcing-talk/crowdflower-kaggle-work-motivations-2/" rel="attachment wp-att-5253"><img class="aligncenter size-full wp-image-5253" title="CrowdFlower Kaggle work motivations" src="http://blog.crowdflower.com/wp-content/uploads/2012/03/CrowdFlower-Kaggle-work-motivations1.jpg" alt="" width="800" height="555" /></a></p>
<p>In individual interviews, Kaggle competitors and CrowdFlower workers cite a number of motives: <strong>&#8220;I like it is that you are solely awarded on your quality of work. Unlike a normal work situation where your social skills and appearance greatly affect the rewards that you receive</strong>,&#8221; one CrowdFlower worker said, while a Kaggle competition winner cited his motive as: “I was just bored one Saturday afternoon.&#8221; Others mention the desire to exercise their brain, learn new things, and in the case of CrowdFlower, to earn virtual goods and currency.</p>
<p>Unsurprisingly, competition is also a major driver for many Kaggle competitors: “Initially it was a Can I do it? motivation,&#8221; as one put it. &#8220;Almost immediately, that was replaced by Can I do it better then others?” The prospect of winning a competition cash prizes is also a motivator. <strong>Indeed, Kaggle hopes that teams will eventually make a full-time income through their competitons</strong>.</p>
<p><em>Slides courtesy CrowdFlower and Kaggle</em></p>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/dv1v9HLUUTs" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/03/what-motivates-crowdsource-workers-on-crowdflower-kaggle-highlights-of-sxsw-2012s-pay-or-play-crowdsourcing-talk/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/03/what-motivates-crowdsource-workers-on-crowdflower-kaggle-highlights-of-sxsw-2012s-pay-or-play-crowdsourcing-talk/</feedburner:origLink></item>
		<item>
		<title>REPOST: Training the Cloud with the Crowd: Training A Google Prediction API Model Using CrowdFlower’s Workforce</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/jwfK3Z10JsY/</link>
		<comments>http://blog.crowdflower.com/2012/03/repost-training-the-cloud-with-the-crowd-training-a-google-prediction-api-model-using-crowdflowers-workforce/#comments</comments>
		<pubDate>Wed, 21 Mar 2012 22:13:17 +0000</pubDate>
		<dc:creator>Josh Eveleth</dc:creator>
				<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[Google Predictions]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[sentiment analysis]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=4994</guid>
		<description><![CDATA[NOTE: The following post, by Kevin Cocco, was reposted from the Dialogue Earth blog. &#160; Can a machine be taught to determine the sentiment of a Twitter message about weather? With the data from over 1 million crowd sourced human judgements the goal was to use this data to train a predictive model and use [...]]]></description>
			<content:encoded><![CDATA[<h3><strong>NOTE: The following post, by <a href="http://www.dialogueearth.org/author/kevin-cocco/">Kevin Cocco</a>, was reposted from  the Dialogue Earth blog.</strong></h3>
<p>&nbsp;</p>
<p><img class="ngg-singlepic ngg-center" src="http://www.dialogueearth.org/wp-content/gallery/crowd-model-training-with-crowdflower-and-google-prediction-api/kcocco_twitter_data_google_prediction_api.png" alt="kcocco_twitter_data_google_prediction_api" width="500" height="197" /></p>
<p>Can a machine be taught to determine the sentiment of a Twitter message about weather?  With the data from over <a href="http://www.dialogueearth.org/2011/06/27/sentiment-analysis-milestone-more-than-one-million-human-judgments/" >1 million crowd sourced human judgements</a> the goal was to use this data to train a predictive model and use this machine learning system to make judgements.  Below are the highlights from the research and development of a machine learning model in the cloud that predicts the sentiment of text regarding the weather.  The following are the major technologies used in this research:  <a title="Google Prediction API" href="https://developers.google.com/prediction/" target="_blank">Google Prediction API</a>, <a title="CrowdFlower" href="http://crowdflower.com/" target="_blank">CrowdFlower</a>, <a title="Twitter" href="https://dev.twitter.com/" target="_blank">Twitter</a>,  <a href="http://code.google.com/apis/maps/index.html" target="_blank">Google Maps</a>.</p>
<p>The only person that can really determine the true sentiment of a tweet is the person who wrote it.  When the human crowd worker makes tweet sentiment judgements only 44% of the time do all 5 humans make the same judgement.  CrowdFlower&#8217;s crowd sourcing processes are great for managing the art and science of sentiment analysis.  You can scale up CrowdFlower&#8217;s number of crowd workers per record to increase accuracy, of course at a scaled up cost.</p>
<p><span id="more-4994"></span></p>
<p>The results of this study show that when all 5 crowd workers agree on the sentiment of tweet the predictive model makes the same judgement 90% of the time.  When you take all tweets the CrowdFlower and Predictive model return the same judgement 71% of the time.  Both CrowdFlower and Google Predictions supplement rather than substitute each other.  As shown in this study, CrowdFlower can successfully be used to build a domain/niche specific data set to train a Google Predicition model.  I see the power of integrating machine learning into  crowd sourcing systems like CrowdFlower.  CrowdFlower users could have the option of automatically training a predictive model as the crowd workers make their judgements.  CrowdFlower could continually monitor the models trending accuracy and then progressively include machine workers into the worker pool.  Once the model hit X accuracy you could have a majority of data stream routed to predictive judgments while continuing to feed a small percentage of data the crowd to refresh current topics and continually validate accuracy.  <a href="http://www.mturk.com/">MTurk</a> hits may only be pennies but <a href="http://code.google.com/apis/predict/docs/pricing.html">Google Prediction ‘hits’</a> cost even less.</p>
<p><span id="more-3414"></span></p>
<p>&nbsp;</p>
<p><strong>Weather Sentiment Prediction Demo Application:</strong><br />
      <img class="ngg-singlepic ngg-none" src="http://www.dialogueearth.org/wp-content/gallery/crowd-model-training-with-crowdflower-and-google-prediction-api/kcocco_weather_sentiment_prediction_app.png" alt="kcocco_weather_sentiment_prediction_app" /></p>
<h2><strong><a title="Weather Sentiment Prediction Application" href="http://www.sproutloop.com/prediction_demo/" target="_blank">LIVE Demo Link: www.SproutLoop.com/prediction_demo</a> </strong></h2>
<p>Note, this demo uses server side Twitter feed that is throttled, retry later if you get no results. Contact me regarding high volume applications and integrations with full Twitter firehouse.</p>
<p>&nbsp;</p>
<p><strong>Match Rate/ Accuracy Findings:</strong><br />
        Below are the highlighted match rates of CrowdFlower human judgements to Google Prediction machine judgements.  A match rate compares the resulting predicted sentiment labels from one method up to those from another:</p>
<ul>
<li>Google Prediction API matching CrowdFlower Sentiment Analysis  = 71% match rate</li>
<li>Mirroring DialogueEarth Plus filtering of lowest 22% confidence scores  = 79% match rate</li>
<li>Tweets sentiment can be confusing for humans and machines.  Google predictions of only the tweets in which all the crowd workers agreed (CrowdFlower confidence score = 1)  =  90% match rate</li>
</ul>
<p>&nbsp;</p>
<p><strong>About Google Predication API:</strong><br />
          During the <a href="http://www.youtube.com/watch?v=FJDP_0Mrb-w&amp;feature=youtu.be" >May, 2011 Google IO conference</a> Google released a new version of their <a href="http://code.google.com/apis/predict/">Google Prediciton API </a>with open access to Google’s <a href="http://en.wikipedia.org/wiki/Machine_learning">machine learning</a> systems in the cloud.  The basic process to creating predictive models is to upload training data to <a href="http://code.google.com/apis/storage/">Google Cloud Storage</a> and then use Google Prediction API to train a machine learning model from the training data set.  Once you have a trained model in the cloud you can write code with their API to submit data for sub second (avg 0.62 sec per) predictions.</p>
<p>&nbsp;</p>
<p><img class="ngg-singlepic ngg-none" src="http://www.dialogueearth.org/wp-content/gallery/crowd-model-training-with-crowdflower-and-google-prediction-api/kcocco_crowdflower_and_twitter.png" alt="kcocco_crowdflower_and_twitter" /></p>
<p><strong>About The Data:</strong><br />
            Much has been written about <a href="http://www.dialogueearth.org/pulse/" >DialogueEarth.org&#8217;s Weather Mood Pulse system</a>.  Pulse has collected 200k+ <a href="http://www.dialogueearth.org/2011/06/27/sentiment-analysis-milestone-more-than-one-million-human-judgments/" >dataset of tweets</a> that have been assigned one of five labels regarding the tweets sentiment related to the weather.  This labeling of tweets is crowd sourced with the <a href="http://crowdflower.com/">CrowdFlower</a> system that presents each tweet with a survey for the workers in the crowd to decision.  CrowdFlower has quality control processes in place to present the same tweet to several people in the crowd.  DialogueEarth’s crowd jobs were configured so that each tweet was graded by 5 different people.  CrowdFlower uses this 5 person matching and each person’s <a href="http://blog.crowdflower.com/2011/10/stopworrying/">CrowdFlower “Gold”</a> score to calculate a confidence score between 0 and 1 for each tweet.  About 44% of the tweets have a confidence score of 1 or 100% of the graders agreed on the sentiment label for the tweet, while some of the other tweets have low scores like 0.321 meaning very little agreement in tweet sentiment plus some influence from each of the graders Gold scores.  The Pulse system has chosen to use only the tweets that have a CrowdFlower confidence score equal or greater than 0.6.  Dozens of models where build using various segments of CrowdFlower confidence score ranges.  Testing showed that the best model used the full confidence range of CrowdFlower records.    <strong></strong></p>
<p>&nbsp;</p>
<p><strong>Weather Sentiment Tweet Labels From CrowdFlower:</strong><br />
              The CrowdFlower scored tweet data contains the tweet text, weather sentiment label and the CrowdFlower confidence score.  The tweet data set was randomized into two segments: 111k(~90%) rows used to train model and 12k rows held out for testing the model.</p>
<p><strong></p>
<table id="wp-table-reloaded-id-3-no-1" class="wp-table-reloaded wp-table-reloaded-id-3">
<thead>
<tr class="row-1 odd">
<th class="column-1">Sentiment</th>
<th class="column-2">Modeled Tweets 90%</th>
<th class="column-3">Test Tweets 10%</th>
<th class="column-4">% of Total</th>
</tr>
</thead>
<tfoot>
<tr class="row-7 odd">
<th class="column-1">TOTAL</th>
<th class="column-2">111,651</th>
<th class="column-3">12,389</th>
<th class="column-4">124,040</th>
</tr>
</tfoot>
<tbody>
<tr class="row-2 even">
<td class="column-1">negative</td>
<td class="column-2">23,976</td>
<td class="column-3">2,578</td>
<td class="column-4">21.4%</td>
</tr>
<tr class="row-3 odd">
<td class="column-1">positive</td>
<td class="column-2">21,688</td>
<td class="column-3">2,384</td>
<td class="column-4">19.4%</td>
</tr>
<tr class="row-4 even">
<td class="column-1">not weather related</td>
<td class="column-2">34,232</td>
<td class="column-3">3,780</td>
<td class="column-4">30.6%</td>
</tr>
<tr class="row-5 odd">
<td class="column-1">neutral</td>
<td class="column-2">29,333</td>
<td class="column-3">3,340</td>
<td class="column-4">26.3%</td>
</tr>
<tr class="row-6 even">
<td class="column-1">cannot tell</td>
<td class="column-2">2,422</td>
<td class="column-3">307</td>
<td class="column-4">2.2%</td>
</tr>
</tbody>
</table>
<p>              </strong></p>
<p>&nbsp;</p>
<p><strong>CrowdFlower Confidence Scores Correlation to Google Prediction Match Rate / Accuracy:</strong></p>
<p>Running Match Rate &#8211; Ordered by CrowdFlower Confidence, Google Prediction Confidence</p>
<p><img class="ngg-singlepic ngg-none" src="http://www.dialogueearth.org/wp-content/gallery/crowd-model-training-with-crowdflower-and-google-prediction-api/kcocco_cf_conf_vs_goog_accuracy_v2.png" alt="kcocco_cf_conf_vs_goog_accuracy_v2" width="624" height="344" /></p>
<ul>
<li>X- axis shows the distribution of CF confidence scores in 10% random test data set 12k rows</li>
<li>Google is better at predicting Tweets that have a higher CrowdFlower confidence score</li>
<li>The Google confidence score correlates with accuracy/match rate, on average higher Google confidence = higher accuracy of matching</li>
</ul>
<p>&nbsp;</p>
<p><strong>Google’s Prediction Confidence Score and Correlation to Match Rate Accuracy</strong><br />
                Running Match Rate and Google Prediction Score &#8211; Ordered by Prediction Score, Random</p>
<p><img class="ngg-singlepic ngg-none" src="http://www.dialogueearth.org/wp-content/gallery/crowd-model-training-with-crowdflower-and-google-prediction-api/kcocco_goog_prediction_score_vs_matchrate_v2.png" alt="kcocco_goog_prediction_score_vs_matchrate_v2" width="634" height="342" /></p>
<ul>
<li>Higher Google confidence scores correlate with higher matching/accuracy rate.</li>
<li>Filtering results at Google conf score &gt; 0.8290 will result in 80% accuracy and filtering/loss of 24.41% of data.</li>
</ul>
<p><strong></p>
<table id="wp-table-reloaded-id-4-no-1" class="wp-table-reloaded wp-table-reloaded-id-4">
<thead>
<tr class="row-1 odd">
<th class="column-1">Accuracy/Match</th>
<th class="column-2">Google Confidence</th>
<th class="column-3">% Data Filtered</th>
<th class="column-4">Rows out of 12390</th>
</tr>
</thead>
<tbody>
<tr class="row-2 even">
<td class="column-1">98.47%</td>
<td class="column-2">score = 1</td>
<td class="column-3">86.88%</td>
<td class="column-4">1627</td>
</tr>
<tr class="row-3 odd">
<td class="column-1">90%</td>
<td class="column-2">score > 0.99537</td>
<td class="column-3">55.16%</td>
<td class="column-4">5543</td>
</tr>
<tr class="row-4 even">
<td class="column-1">85%</td>
<td class="column-2">score > 0.95688</td>
<td class="column-3">38.78%</td>
<td class="column-4">7586</td>
</tr>
<tr class="row-5 odd">
<td class="column-1">80%</td>
<td class="column-2">score > 0.82900</td>
<td class="column-3">24.41%</td>
<td class="column-4">9367</td>
</tr>
<tr class="row-6 even">
<td class="column-1">78.93%</td>
<td class="column-2">score > 0.79122</td>
<td class="column-3">21.60% **</td>
<td class="column-4">9715</td>
</tr>
<tr class="row-7 odd">
<td class="column-1">75.00%</td>
<td class="column-2">score > 0.61647</td>
<td class="column-3">11.06%</td>
<td class="column-4">11021</td>
</tr>
<tr class="row-8 even">
<td class="column-1">70.91%</td>
<td class="column-2">score > 0.25495</td>
<td class="column-3">0.0%</td>
<td class="column-4">12390</td>
</tr>
</tbody>
</table>
<p>                </strong></p>
<p>** Note, 21.6% is the current % of data that Pulse filters by excluding CF conf. scores &lt;= 0.6</p>
<p>&nbsp;</p>
<p><strong>Effect of Model Size on Match Rate / Accuracy</strong></p>
<p><strong></strong><img class="ngg-singlepic ngg-none" src="http://www.dialogueearth.org/wp-content/gallery/crowd-model-training-with-crowdflower-and-google-prediction-api/kcocco_model_size_matchrate_1.png" alt="kcocco_model_size_matchrate" /></p>
<ul>
<li>The larger the data model training data set the higher match accuracy</li>
<li>95% (5k of 111k) decrease in dataset set size decreases match rate by 7.1% (64% &#8211; 71%)</li>
<li>The classificationAccuracy returned from Google’s model build was between 5%(model 5k) to 0.06%(model 111k) different than tested accuracy rates.</li>
</ul>
<p>&nbsp;</p>
<p><strong>Formatting Tweet Text for modeling:</strong><br />
                  When preparing text for modeling <a href="http://code.google.com/apis/predict/docs/developer-guide.html" >Google recommends</a> removing all punctuation because “Punctuation rarely add meaning to the training data but are treated as meaningful elements by the learning engine”.  The tweet text was lowercased, stripped of all punctuation, special characters, returns, tabs,.. These were replaced by a space to prevent two words from joining.  With the unique 140 character limit and the use of emoticons it might be interesting to replace <a href="http://www.cool-smileys.com/text-emoticons" >emoticons</a> with words like replacing  :&#8217;-(  with something like a specific replacement  ‘ emoticon_i_am_crying ‘ or general  ‘emoticon_negative’ before building the model.  Here is an example of tweet hygiene below:<br />
                  <strong>BEFORE:</strong>  83 degrees n Atl @mention:59pm :&gt; I LOOOOVE this &#8230;feels like flawda<br />
                  <strong>AFTER:</strong>  83 degrees n atl mention 59pm i loooove this feels like flawda</p>
<p>&nbsp;</p>
<p dir="ltr"><img class="ngg-singlepic ngg-none" src="http://www.dialogueearth.org/wp-content/gallery/crowd-model-training-with-crowdflower-and-google-prediction-api/kcocco_google_prediction_api_logo.png" alt="kcocco_google_prediction_api_logo" /></p>
<p dir="ltr"><strong>Training The Google Prediction Model<br />
                  </strong>Here are the basic <a href="http://code.google.com/apis/predict/docs/hello_world.html" >steps for training a model</a>.  The training time can take a few hours for a 100k/10MB training data set, this seems to depend on the sever load.  When Google is finishes building the model the trainedmodels.get method will return a <a href="http://code.google.com/apis/predict/docs/reference/v1.4/reference.html#property_trainedmodels_modelInfo2">confusion matrix</a> and also the model’s “classificationAccuracy” score.  Note, Google’s classificationAccuracy score and the testing match rate  accuracy scores below are statistically the same (0.71 vs 0.709). <strong></strong></p>
<p>&nbsp;</p>
<p><strong>Google Model building confusion matrix with a classificationAccuracy of 0.71</strong></p>
<p dir="ltr"><strong></p>
<table id="wp-table-reloaded-id-5-no-1" class="wp-table-reloaded wp-table-reloaded-id-5">
<tbody>
<tr class="row-1">
<td class="column-1">SENTIMENT</td>
<td class="column-2">negative</td>
<td class="column-3">positive</td>
<td class="column-4">not weather related</td>
<td class="column-5">neutral</td>
<td class="column-6">cannot tell</td>
</tr>
<tr class="row-2">
<td class="column-1">negative</td>
<td class="column-2">1627</td>
<td class="column-3">191</td>
<td class="column-4">261.5</td>
<td class="column-5">351</td>
<td class="column-6">88</td>
</tr>
<tr class="row-3">
<td class="column-1">positive</td>
<td class="column-2">200.5</td>
<td class="column-3">1631.5</td>
<td class="column-4">220</td>
<td class="column-5">236</td>
<td class="column-6">40.5</td>
</tr>
<tr class="row-4">
<td class="column-1">not weather related</td>
<td class="column-2">239.5</td>
<td class="column-3">163.5</td>
<td class="column-4">268</td>
<td class="column-5">1965.5</td>
<td class="column-6">66</td>
</tr>
<tr class="row-5">
<td class="column-1">neutral</td>
<td class="column-2">262.5</td>
<td class="column-3">163.5</td>
<td class="column-4">268</td>
<td class="column-5">1965.5</td>
<td class="column-6">66</td>
</tr>
<tr class="row-6">
<td class="column-1">cannot tell</td>
<td class="column-2">5</td>
<td class="column-3">3</td>
<td class="column-4">1</td>
<td class="column-5">8.5</td>
<td class="column-6">3.5</td>
</tr>
</tbody>
</table>
<p>                  </strong></p>
<p dir="ltr"><strong><br />
                    Testing 12k hold out tweets against the model above with a match rate / accuracy of 0.709<br />
                    Crowd Flower Actual   </strong></p>
<p dir="ltr"><strong></p>
<table id="wp-table-reloaded-id-6-no-1" class="wp-table-reloaded wp-table-reloaded-id-6">
<tbody>
<tr class="row-1">
<td class="column-1">SENTIMENT</td>
<td class="column-2">negative</td>
<td class="column-3">positive</td>
<td class="column-4">not weather related</td>
<td class="column-5">neutral</td>
<td class="column-6">cannot tell</td>
</tr>
<tr class="row-2">
<td class="column-1">negative</td>
<td class="column-2">1824</td>
<td class="column-3">232</td>
<td class="column-4">294</td>
<td class="column-5">413</td>
<td class="column-6">94</td>
</tr>
<tr class="row-3">
<td class="column-1">positive</td>
<td class="column-2">196</td>
<td class="column-3">1732</td>
<td class="column-4">216</td>
<td class="column-5">200</td>
<td class="column-6">61</td>
</tr>
<tr class="row-4">
<td class="column-1">not weather related</td>
<td class="column-2">251</td>
<td class="column-3">200</td>
<td class="column-4">2967</td>
<td class="column-5">460</td>
<td class="column-6">64</td>
</tr>
<tr class="row-5">
<td class="column-1">neutral</td>
<td class="column-2">300</td>
<td class="column-3">215</td>
<td class="column-4">303</td>
<td class="column-5">2263</td>
<td class="column-6">86</td>
</tr>
<tr class="row-6">
<td class="column-1">cannot tell</td>
<td class="column-2">7</td>
<td class="column-3">5</td>
<td class="column-4">0</td>
<td class="column-5">4</td>
<td class="column-6">2</td>
</tr>
</tbody>
</table>
<p>                    </strong></p>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/jwfK3Z10JsY" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/03/repost-training-the-cloud-with-the-crowd-training-a-google-prediction-api-model-using-crowdflowers-workforce/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/03/repost-training-the-cloud-with-the-crowd-training-a-google-prediction-api-model-using-crowdflowers-workforce/</feedburner:origLink></item>
		<item>
		<title>Ray Kurzweil’s SXSW Keynote Speech Draws Mixed Reaction, According to CrowdFlower’s Sentiment Analysis of 1100+ Tweets</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/eq9OhUCWE9U/</link>
		<comments>http://blog.crowdflower.com/2012/03/ray-kurzweils-sxsw-keynote-speech-draws-mixed-reaction-according-to-crowdflowers-sentiment-analysis-of-1100-tweets/#comments</comments>
		<pubDate>Wed, 14 Mar 2012 17:49:39 +0000</pubDate>
		<dc:creator>Wagner James Au</dc:creator>
				<category><![CDATA[Miscellaneous]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=5166</guid>
		<description><![CDATA[On Monday we conducted a sentiment analysis of futurist Ray Kurzweil&#8217;s SXSW 2012 keynote address, based on Tweets about his talk sent out during and immediately after his talk. In roughly two-three hours, our crowdsource-powered workforce analyzed 1,188 such Tweets, determining the sub-topic of each Tweet and its general sentiment (negative, positive, or neutral). While [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.crowdflower.com/2012/03/ray-kurzweils-sxsw-keynote-speech-draws-mixed-reaction-according-to-crowdflowers-sentiment-analysis-of-1100-tweets/ray-kurzweil-sxsw-2012-speech/" rel="attachment wp-att-5167"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/03/Ray-Kurzweil-SXSW-2012-speech.jpg" alt="" title="Ray Kurzweil SXSW 2012 speech" width="100" height="100" class="alignright size-full wp-image-5167" /></a>On Monday we conducted a sentiment analysis of futurist <a href="http://schedule.sxsw.com/2012/events/event_IAP992058">Ray Kurzweil&#8217;s SXSW 2012 keynote address</a>, based on Tweets about his talk sent out during and immediately after his talk. In roughly two-three hours, our crowdsource-powered workforce analyzed 1,188 such Tweets, determining the sub-topic of each Tweet and its general sentiment (negative, positive, or neutral). While Kurzweil spoke to a capacity crowd of thousands, the reaction (according to our sentiment analysis) was decidedly mixed.</p>
<p>Here&#8217;s the results:</p>
<p><a href="http://blog.crowdflower.com/2012/03/ray-kurzweils-sxsw-keynote-speech-draws-mixed-reaction-according-to-crowdflowers-sentiment-analysis-of-1100-tweets/kurzweil-sxsw-sentiment-analysis-by-crowdflower/" rel="attachment wp-att-5170"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/03/Kurzweil-SXSW-sentiment-analysis-by-CrowdFlower.jpg" alt="" title="Kurzweil SXSW sentiment analysis by CrowdFlower" width="609" height="311" class="aligncenter size-full wp-image-5170" /></a></p>
<p>These answers were selected by our contributors from a number of options we put to them, including:</p>
<ul>
 It&#8217;s about expanding our intelligence or IQ<br />
 It&#8217;s about Ray Kurzweil&#8217;s current projects<br />
 It&#8217;s about the singularity<br />
 It&#8217;s about artificial intelligence<br />
 It&#8217;s about life extension<br />
 It&#8217;s about Ray Kurzweil&#8217;s thoughts and predictions on the future<br />
 It&#8217;s about virtual reality<br />
 It&#8217;s about Transhumanism<br />
 It&#8217;s about nanotechnology<br />
 It&#8217;s about leaving his SXSW talk<br />
 It&#8217;s about SOMETHING ELSE (Other)</ul>
<p>34% of the Tweets fell into the Other category, 34% were about AI, 10% were related to Kurzweil&#8217;s predictions of the future and current projects, and 2% about virtual reality. <strong>Interestingly, though Kurzweil&#8217;s talk was entitled &#8220;Expanding Our Intelligence Without Limit&#8221;, only a fraction of Tweets were related to that theme.</strong> We saw a similar pattern with Rainn Wilson&#8217;s SXSW speech, which was meant to promote his new site SoulPancake, but <a href="http://bit.ly/yL1NRm">which also garnered minimal Tweets</a>.</p>
<p>Having selected the sub-topic of each Tweet, our contributors then judged the sentiment &#8212; in other words, the author&#8217;s emotional tone regarding these Kurzweil topics. Notably, 38% were neutral, while 32% were positive, and 30% negative:</p>
<p><a href="http://blog.crowdflower.com/2012/03/ray-kurzweils-sxsw-keynote-speech-draws-mixed-reaction-according-to-crowdflowers-sentiment-analysis-of-1100-tweets/reaction-to-ray-kurzweil-sxsw-2012-keynote/" rel="attachment wp-att-5171"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/03/Reaction-to-Ray-Kurzweil-SXSW-2012-keynote.jpg" alt="" title="Reaction to Ray Kurzweil SXSW 2012 keynote" width="630" height="354" class="aligncenter size-full wp-image-5171" /></a></p>
<p>It&#8217;s interesting that both Kurzweil&#8217;s keynote and <a href="http://blog.crowdflower.com/2012/03/what-did-people-think-about-rainn-wilsons-sxsw-keynote-speech-heres-crowdflowers-sentiment-analysis-of-600-tweets/">Rainn Wilson&#8217;s keynote </a> drew such a significant negative reaction, despite the popularity of both celebrities in their respective fields. Perhaps if both speakers directly responded to the Twitter back channel throughout their talk (which typically happens in many SXSW panels and presentations), the overall response might have been more positive. It&#8217;s certainly worth considering for future SXSW keynotes.</p>
<p>How did we get this data? <a href="http://bit.ly/wzNiGE">As we explained in a post last year</a>, CrowdFlower’s crowdsource-driven sentiment analyses are fairly unique, in that they aggregate human judgements of Tweets, as opposed to an automated, computerized sampling, which is more likely to miss nuances of expression. </p>
<p>Again, <a href="http://bit.ly/yL1NRm ">check out our sentiment analysis of Rainn Wilson&#8217;s SXSW 2012</a> talk using the same methodology. Speaking of Tweets, don&#8217;t miss <a href="http://bit.ly/z2Yvlo">our top twelve Tweets from the SXSW talk of CrowdFlower founder Lukas Biewald</a>.</p>
<p><em>Thanks to CrowdFlower’s Josh Eveleth for all the help putting this project together!</em></p>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/eq9OhUCWE9U" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/03/ray-kurzweils-sxsw-keynote-speech-draws-mixed-reaction-according-to-crowdflowers-sentiment-analysis-of-1100-tweets/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/03/ray-kurzweils-sxsw-keynote-speech-draws-mixed-reaction-according-to-crowdflowers-sentiment-analysis-of-1100-tweets/</feedburner:origLink></item>
		<item>
		<title>Top Twelve Tweets from Kaggle &amp; CrowdFlower’s SXSW Talk on Crowdsourcing</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/4F9V37KGS7Y/</link>
		<comments>http://blog.crowdflower.com/2012/03/top-twelve-tweets-from-kaggle-crowdflowers-sxsw-talk-on-crowdsourcing/#comments</comments>
		<pubDate>Tue, 13 Mar 2012 22:19:37 +0000</pubDate>
		<dc:creator>Wagner James Au</dc:creator>
				<category><![CDATA[CrowdFlower]]></category>
		<category><![CDATA[Events]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=5132</guid>
		<description><![CDATA[Thanks to everyone who came to yesterday&#8217;s SXSW talk, “Getting a Crowd to Work for You: For Pay or Play?&#8221; featuring Lukas Biewald of CrowdFlower and Anthony Goldbloom of Kaggle. It was great to get such a solid and engaged crowd; no mean feat, as our talk overlapped with one by Al Gore and Sean [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.crowdflower.com/2012/03/top-twelve-tweets-from-kaggle-crowdflowers-sxsw-talk-on-crowdsourcing/sxsw-crowdsourcing-talk-kaggle-crowdflower/" rel="attachment wp-att-5144"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/03/SXSW-crowdsourcing-talk-Kaggle-CrowdFlower.jpg" alt="" title="SXSW crowdsourcing talk Kaggle CrowdFlower" width="300" height="297" class="alignright size-full wp-image-5144" /></a>Thanks to everyone who came to <a href="http://bit.ly/zjzGHs">yesterday&#8217;s SXSW talk</a>, “Getting a Crowd to Work for You: For Pay or Play?&#8221; featuring Lukas Biewald of <a href="http://bit.ly/wtLZql">CrowdFlower</a> and Anthony Goldbloom of <a href="http://bit.ly/wnEt1P">Kaggle</a>. It was great to get such a solid and engaged crowd; no mean feat, as our talk overlapped <a href="http://techcrunch.com/2012/03/12/sean-parker-defeating-sopa-was-the-nerd-spring/">with one by Al Gore and Sean Parker</a>. In case you couldn&#8217;t catch it, <a href="http://bit.ly/wl1aFG">click here to read the Tweet feed for the talk, hashtagged under #sxsw #wrkrmotive</a>. And below, read our selected top twelve Tweets from the talk:</p>
<ul>
<strong>@kaggle specializes in high level projects, @crowdflower specializes in jobs that anyone can do.</strong></p>
<p><strong>Crowdsourcing was once unknown and now it’s become a buzzword. <a href="http://bit.ly/zxsWYH">See Google Trends</a>.<br />
</strong><br />
<strong>A metereologist used @crowdflower to measure weather by tracking people’s sentiment of weather on Twitter.</strong></p>
<p><strong>A PhD student in glaciology solved NASA&#8217;s dark matter algorithm challenge via a @kaggle competition. </strong></p>
<p><strong>Leaderboard competition led to leapfrogging improvement of NASA&#8217;s dark matter challenge @kaggle</strong></p>
<p><strong>More than 50% of @crowdflower’s crowdsourcing workforce do real work for virtual goods.</strong></p>
<p><strong>@CrowdFlower is the largest buyer of virtual goods. Pays for real work crowdsourced jobs. </strong></p>
<p><strong>Paying $ for a crowdsourced job makes it special &#038; shows workers the company cares enough to pay for it.</strong></p>
<p><strong>@kaggle aims for its workforce members to be making a real life living from competitions.</strong></p>
<p><strong>@CrowdFlower crowdsource workforce has a slightly higher median income than average Americans.</strong></p>
<p><strong>@CrowdFlower has many more women in its workforce &#8212; maybe because a lack of gender bias in doing tasks.</strong></p>
<p><strong>@crowdflower has a sandbox-style self-service site where people can create their own crowdsourcing jobs</strong>.</ul>
<p><a href="http://bit.ly/wU8T4z">Read all of Kaggle and CrowdFlower&#8217;s SXSW talk Tweets here</a>. We&#8217;ll be blogging more about some of these topics in future posts, and plan to post the slide deck from the SXSW talk soon.</p>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/4F9V37KGS7Y" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/03/top-twelve-tweets-from-kaggle-crowdflowers-sxsw-talk-on-crowdsourcing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/03/top-twelve-tweets-from-kaggle-crowdflowers-sxsw-talk-on-crowdsourcing/</feedburner:origLink></item>
		<item>
		<title>What Did People Think About Rainn Wilson’s SXSW Keynote Speech? Here’s CrowdFlower’s Sentiment Analysis of 600+ Tweets</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/3eilByLpY2M/</link>
		<comments>http://blog.crowdflower.com/2012/03/what-did-people-think-about-rainn-wilsons-sxsw-keynote-speech-heres-crowdflowers-sentiment-analysis-of-600-tweets/#comments</comments>
		<pubDate>Sun, 11 Mar 2012 02:15:58 +0000</pubDate>
		<dc:creator>Wagner James Au</dc:creator>
				<category><![CDATA[CrowdFlower]]></category>
		<category><![CDATA[Sentiment Analysis]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[Rainn Wilson]]></category>
		<category><![CDATA[sentiment analysis]]></category>
		<category><![CDATA[Soul Pancake]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=5072</guid>
		<description><![CDATA[Last night we conducted a sentiment analysis of actor Rainn Wilson&#8217;s SXSW 2012 keynote address, based on hundreds of Tweets related to his talk sent out during and shortly afterward. Our crowdsource-powered workforce analyzed 637 such Tweets, determining the sub-topic of each Tweet and its general sentiment (negative, positive, or neutral). The task was completed [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.crowdflower.com/2012/03/what-did-people-think-about-rainn-wilsons-sxsw-keynote-speech-heres-crowdflowers-sentiment-analysis-of-600-tweets/rainn-wilson-sxsw-sentiment-analysis/" rel="attachment wp-att-5073"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/03/Rainn-Wilson-SXSW-Sentiment-Analysis.jpg" alt="" title="Rainn Wilson SXSW Sentiment Analysis" width="106" height="106" class="alignright size-full wp-image-5073" /></a>Last night we conducted a sentiment analysis of actor <a href="http://bit.ly/wQWTEn">Rainn Wilson&#8217;s SXSW 2012 keynote address</a>, based on hundreds of Tweets related to his talk sent out during and shortly afterward. Our crowdsource-powered workforce analyzed 637 such Tweets, determining the sub-topic of each Tweet and its general sentiment (negative, positive, or neutral). The task was completed about 90 minutes after Rainn Wilson left the SXSW stage.</p>
<p>Here&#8217;s the results:</p>
<p><a href="http://blog.crowdflower.com/2012/03/what-did-people-think-about-rainn-wilsons-sxsw-keynote-speech-heres-crowdflowers-sentiment-analysis-of-600-tweets/sentiment-analysis-chart-rainn-wilson-sxsw-keynote-2/" rel="attachment wp-att-5097"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/03/Sentiment-analysis-chart-Rainn-Wilson-SXSW-keynote1.jpg" alt="" title="Sentiment analysis chart Rainn Wilson SXSW keynote" width="612" height="313" class="aligncenter size-full wp-image-5097" /></a></p>
<p>These answers were selected by our crowdforce from a number of options, including:</p>
<ul>
 It&#8217;s about Going to or attending his SXSW talk [Going]<br />
 It&#8217;s about his site SoulPancake<br />
 It&#8217;s about his thoughts on Online Community<br />
 It&#8217;s about his thoughts on Art and Creativity<br />
 It&#8217;s about his thoughts on Social Media<br />
 It&#8217;s about SOMETHING ELSE [Other]
</ul>
<p>As you can see, 38% were about social media, 28% about going to his talk, 10% related to Rainn&#8217;s opinions on online community or &#8220;Other&#8221;, 8% were specifically about Wilson&#8217;s Internet startup, <a href="http://bit.ly/yANooA">SoulPancake</a>, and finally, 6% on art and creativity. <strong>Interestingly, though Wilson&#8217;s talk was largely about SoulPancake, an online community, most of the Tweets were not about it.</strong></p>
<p>Having selected the sub-topic of each Tweet, our crowdforce then judged the sentiment (in other words, the author&#8217;s emotional tone). Notably, 58% were positive or neutral, while a plurality of 42% were negative:</p>
<p><a href="http://blog.crowdflower.com/2012/03/what-did-people-think-about-rainn-wilsons-sxsw-keynote-speech-heres-crowdflowers-sentiment-analysis-of-600-tweets/rainn-wilson-sxsw-tweet-sentiment/" rel="attachment wp-att-5081"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/03/Rainn-Wilson-SXSW-Tweet-Sentiment.jpg" alt="" title="Rainn Wilson SXSW Tweet Sentiment" width="625" height="360" class="aligncenter size-full wp-image-5081" /></a> </p>
<p>How did we get this data? <a href="http://bit.ly/wzNiGE">As we explained in a post last year</a>, CrowdFlower’s crowdsource-driven sentiment analyses are fairly unique, in that they aggregate human judgements of Tweets, as opposed to an automated, computerized sampling, which is more likely to miss nuances of expression. To be sure, sentiment analysis cannot cover all nuances: For example, at the end of the talk, Rainn Wilson unexpectedly invited a member of the audience up on stage to smash two guitars together. Not exactly an option one can anticipate beforehand!</p>
<p>In future posts, we plan to do more sentiment analyses of SXSW-related Tweets, so check back soon to see what other surprises our crowdforce find. And join us for CrowdFlower founder Lukas Biewald’s SXSW in-person presentation on <strong>Monday at 5pm March 12</strong>, which you can follow in person (<a href="http://bit.ly/zjzGHs">click here for all the details</a>) or, of course, on this blog and Twitter — <strong><a href="http://bit.ly/wKG0d4">follow @CrowdFlower for updates</a></strong>.</p>
<p><em>Thanks to CrowdFlower’s Josh Eveleth for all the help putting this project together!</em></p>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/3eilByLpY2M" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/03/what-did-people-think-about-rainn-wilsons-sxsw-keynote-speech-heres-crowdflowers-sentiment-analysis-of-600-tweets/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/03/what-did-people-think-about-rainn-wilsons-sxsw-keynote-speech-heres-crowdflowers-sentiment-analysis-of-600-tweets/</feedburner:origLink></item>
		<item>
		<title>What SXSW Topics Are People Tweeting About? According to CrowdFlower’s Sentiment Analysis, Surprisingly Little About Tech</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/oSWN8e1yvX8/</link>
		<comments>http://blog.crowdflower.com/2012/03/what-sxsw-topics-are-people-tweeting-about-according-to-cfs-sentiment-analysis-not-so-much-about-tech/#comments</comments>
		<pubDate>Thu, 08 Mar 2012 19:58:51 +0000</pubDate>
		<dc:creator>Wagner James Au</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Sentiment Analysis]]></category>
		<category><![CDATA[Tweets]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[sentiment analysis]]></category>
		<category><![CDATA[SXSW]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=4937</guid>
		<description><![CDATA[We recently ran a sentiment analysis of 1000 Tweets related to SXSW*, and it generated some fairly interesting results. Assuming our sample is typical, it will probably surprise this year’s 20,000 attendees of SXSW Interactive, where Twitter itself first gained buzz about six years ago. Where the digerati might expect that most SXSW Tweets would [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.crowdflower.com/2012/03/what-sxsw-topics-are-people-tweeting-about-according-to-cfs-sentiment-analysis-not-so-much-about-tech/sxsw-2012-sentiment-analysis-twitter-2/" rel="attachment wp-att-5040"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/03/SXSW-2012-Sentiment-Analysis-Twitter1.png" alt="" title="SXSW 2012 Sentiment Analysis Twitter" width="300" height="182" class="alignright size-full wp-image-5040" /></a>We recently ran a sentiment analysis of 1000 Tweets related to SXSW*, and it generated some fairly interesting results. Assuming our sample is typical, it will probably surprise this year’s 20,000 attendees of SXSW Interactive, where Twitter itself first gained buzz about six years ago. Where the digerati might expect that most SXSW Tweets would be about attending startups at SXSW like Pinterest or tech celebrities like Twitter co-founder Biz Stone, our sample suggests something quite different. </p>
<p>To generate this data, our workforce analyzed the content and sentiment in each of the 1000 Tweets. Most were judged to be neutral or positive about SXSW &#8212; no surprise there. The workforce then analyzed the SXSW-related sub-topic of each particular Tweet, selecting from a set of 11 options, including interactive and tech, music, film, parties, hotels, details about SXSW, whether or not the Tweeter was going to SXSW, etc. </p>
<p>Take a look at the pie chart below, which categorizes the 1000 Tweets by SXSW sub-topic:<br />
<a href="http://blog.crowdflower.com/2012/03/what-sxsw-topics-are-people-tweeting-about-according-to-cfs-sentiment-analysis-not-so-much-about-tech/sxsw-sentiment-analysis-crowdflower/" rel="attachment wp-att-5021"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/03/SXSW-sentiment-analysis-CrowdFlower.jpg" alt="" title="SXSW sentiment analysis CrowdFlower" width="704" height="537" class="aligncenter size-full wp-image-5021" /></a><br />
The striking result: We found that people in our sample were rarely Tweeting about SXSW tech subjects in particular. Even though Interactive now has many more attendees than Music, and Twitter is still the default media channel for tech folks at SXSW. Tech doesn&#8217;t even show up as a topic in our pie chart. To be sure, there were many tangentially tech-related Tweets in the 1000 Tweets, but our workforce judged them to be more about parties, talks, or going (for example), than tech per se. </p>
<p>What does this suggest? One hypothesis: Twitter may have reached a point of critical mass where it&#8217;s now used far more by non-techies than the early adopters associated with SXSW Interactive, and this chart strongly hints at that transition. As someone who’s been attending SXSW since before Twitter launched there, I was personally surprised to go through dozens of Tweets in our sample, and see how few were about tech at all (start-ups, speakers, etc.) and how much much more were about bands playing at SXSW, etc. Perhaps we&#8217;re seeing a cultural shift, where Twitter is no longer driven by the tech/Internet community, as it&#8217;s embraced by the mainstream. (A passing of an era?)</p>
<p>How did we get these results?<br />
<span id="more-4937"></span><br />
<a href="http://bit.ly/wzNiGE">As we explained in a post last year</a>, CrowdFlower&#8217;s crowdsource-driven sentiment analyses are fairly unique, in that they aggregate human judgements of Tweets, as opposed to an automated, computerized sampling, which is more likely to miss nuances of expression.</p>
<p>To get a sense of how that works, take a look at this Tweet, which we used as a &#8220;Gold&#8221; response, testing our judgement of it against the answers of the CrowdFlower workforce:</p>
<p><a href="http://blog.crowdflower.com/2012/03/what-sxsw-topics-are-people-tweeting-about-according-to-cfs-sentiment-analysis-not-so-much-about-tech/sxsw-twitter-sentiment-analysis/" rel="attachment wp-att-5026"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/03/SXSW-Twitter-Sentiment-Analysis.jpg" alt="" title="SXSW Twitter Sentiment Analysis" width="600" height="88" class="aligncenter size-full wp-image-5026" /></a></p>
<p>This Gold test checks to assure that answers aren&#8217;t being haphazardly selected. The particular Tweet appears to be about someone who’s attending a New York tech meet-up at SXSW. A plurality of our workforce analyzed the topic of this Tweet to be about “Going to SXSW”, and by implication, judged the mention of tech to be incidental. </p>
<p>*In future posts, we plan to do more sentiment analyses of SXSW-related Tweets, so check back soon to see what other surprises our crowdforce find. And join us for CrowdFlower founder Lukas Biewald&#8217;s SXSW in-person presentation on Monday at 5pm March 12, which you can follow in person (<strong><a href="http://bit.ly/zjzGHs">all the details here</a></strong>) or, of course, on this blog and Twitter &#8212; follow @CrowdFlower for updates.</p>
<p><em>Thanks to CrowdFlower&#8217;s Josh Eveleth for all the help putting this project together!</em></p>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/oSWN8e1yvX8" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/03/what-sxsw-topics-are-people-tweeting-about-according-to-cfs-sentiment-analysis-not-so-much-about-tech/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/03/what-sxsw-topics-are-people-tweeting-about-according-to-cfs-sentiment-analysis-not-so-much-about-tech/</feedburner:origLink></item>
		<item>
		<title>Going to SXSW? March 12 at 5pm, Come Talk about the Future of Work and Crowdsourcing with Kaggle and CrowdFlower</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/1EMoxbjyhf0/</link>
		<comments>http://blog.crowdflower.com/2012/03/going-to-sxsw-march-12-come-talk-about-the-future-of-work-and-crowdsourcing-with-kaggle-and-crowdflower/#comments</comments>
		<pubDate>Tue, 06 Mar 2012 22:37:55 +0000</pubDate>
		<dc:creator>Wagner James Au</dc:creator>
				<category><![CDATA[Conference]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Wisdom of Small Crowds]]></category>
		<category><![CDATA[Anthony Goldbloom]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[Kaggle]]></category>
		<category><![CDATA[Lukas Biewald]]></category>
		<category><![CDATA[SXSW]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=4940</guid>
		<description><![CDATA[If you’re going to SXSW this year, mark your calendar: On Monday, March 12, CrowdFlower founder Lukas Biewald will be there to talk about the future of work and crowdsourcing with Anthony Goldsbloom, founder of Kaggle, in a dual presentation called &#8220;Getting a Crowd to Work for You: For Pay or Play?&#8221; (Click here for [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.crowdflower.com/2012/03/going-to-sxsw-march-12-come-talk-about-the-future-of-work-and-crowdsourcing-with-kaggle-and-crowdflower/crowdflower-crowdsourcing-sxsw-talk-2/" rel="attachment wp-att-4984"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/03/CrowdFlower-crowdsourcing-SXSW-talk1.jpg" alt="" title="CrowdFlower crowdsourcing SXSW talk" width="335" height="432" class="alignright size-full wp-image-4984" /></a>If you’re going to SXSW this year, mark your calendar:</p>
<p><strong>On Monday, March 12, <a href="http://bit.ly/wtLZql">CrowdFlower</a> founder Lukas Biewald will be there to talk about the future of work and crowdsourcing with Anthony Goldsbloom, founder of <a href="http://bit.ly/wnEt1P">Kaggle</a>, in a dual presentation called &#8220;Getting a Crowd to Work for You: For Pay or Play?&#8221;</strong> (<a href="http://bit.ly/xnftO8">Click here for the SXSW listing</a>.)</p>
<p>Anthony and Lukas will discuss how crowdsourced-powered companies are changing the way individuals and organizations work online &#8212; and what motivates crowdworkers to accurately solve major research challenges and rapidly complete massive work tasks that take traditional workforces years and millions of dollars to accomplish.</p>
<p>If you can’t join us in Austin, don’t worry &#8212; you can still participate in the talk live on Twitter, and afterward, on this blog. Here’s how:</p>
<p><a href="http://blog.crowdflower.com/2012/03/going-to-sxsw-march-12-come-talk-about-the-future-of-work-and-crowdsourcing-with-kaggle-and-crowdflower/crowdflower-talk-location-2/" rel="attachment wp-att-4961"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/03/CrowdFlower-Talk-location1.jpg" alt="" title="CrowdFlower Talk location" width="318" height="444" class="alignright size-full wp-image-4961" /></a><strong>Participating in Austin:</strong><br />
The talk happens at SXSW on Monday, March 12 from 5:00-6:00PM in the Brazos room of the Courtyard Marriott on 300 East 4th Street, one block from the Austin Convention Center. (Map at right.) Immediately afterward, we’ll continue the conversation in a nearby meeting spot (TBA), and hope you join us there as well.</p>
<p><strong>Participating live on Twitter:</strong><br />
Follow <a href=" http://bit.ly/wP83hH">CrowdFlower’s Twitter account @crowdflower</a> , where the talk will be live-Tweeted. You can participate by <strong>using the hashtags #sxsw #wrkrmotive</strong>. (Before and during the talk, use those hashtags to send questions and comments to Lukas and Anthony.) <a href="http://bit.ly/xC4mLp">Also be sure to follow Kaggle’s account @kaggle</a>.</p>
<p><strong>Participating on This Blog</strong><br />
After the talk, we’ll blog the key points from the SXSW conversation here, and invite Lukas and Anthony to participate in Comments. RSS this blog at blog.crowdflower.com, <a href=" http://bit.ly/wP83hH">or follow us on Twitter @crowdflower</a> for updates.</p>
<p><strong>More About CrowdFlower and Kaggle</strong><br />
Founded by Lukas Biewald, CrowdFlower is the leader in enterprise crowdsourcing, with clients including Twitter, Yahoo!, and eBay, and is ranked among Lead411’s Hottest San Francisco Companies for 2012.<br />
<a href="http://blog.crowdflower.com/2012/03/going-to-sxsw-march-12-come-talk-about-the-future-of-work-and-crowdsourcing-with-kaggle-and-crowdflower/lukas-biewald-crowdflower/" rel="attachment wp-att-4970"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/03/Lukas-Biewald-CrowdFlower.jpg" alt="" title="Lukas Biewald CrowdFlower" width="250" height="105" class="alignleft size-full wp-image-4970" /></a>CrowdFlower&#8217;s technology platform offers quality-ensured business process crowdsourcing at massive scale. The company solves problems ranging from product categorization to business lead verification to content creation.</p>
<p><a href="http://blog.crowdflower.com/2012/03/going-to-sxsw-march-12-come-talk-about-the-future-of-work-and-crowdsourcing-with-kaggle-and-crowdflower/anthony-goldbloom-kaggle/" rel="attachment wp-att-4971"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/03/Anthony-Goldbloom-Kaggle.jpg" alt="" title="Anthony Goldbloom Kaggle" width="289" height="110" class="alignright size-full wp-image-4971" /></a>Founded by Anthony Goldbloom, Kaggle offers a platform for predictive modeling competitions that helps companies, governments, and researchers identify solutions to some of the world&#8217;s hardest problems. By using a competition format to introduce challenges to a community of nearly 30,000 data scientists, Kaggle helps organizations achieve the best possible predictive accuracy. Kaggle has helped solve problems for NASA, Deloitte, and Ford, and it is currently running the $3 million Heritage Health Prize, the largest medical prize ever, to help prevent unnecessary hospitalization. Kaggle’s investors include Index Ventures and Khosla Ventures. It was founded in 2010 and is based in San Francisco, California.</p>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/1EMoxbjyhf0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/03/going-to-sxsw-march-12-come-talk-about-the-future-of-work-and-crowdsourcing-with-kaggle-and-crowdflower/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/03/going-to-sxsw-march-12-come-talk-about-the-future-of-work-and-crowdsourcing-with-kaggle-and-crowdflower/</feedburner:origLink></item>
		<item>
		<title>Lead411 2012 Hottest Tech Companies: CrowdFlower is on Fire!</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/gyL1GJ6cuVY/</link>
		<comments>http://blog.crowdflower.com/2012/02/lead411-2012-hottest-tech-companies-crowdflower-is-on-fire/#comments</comments>
		<pubDate>Mon, 27 Feb 2012 20:46:01 +0000</pubDate>
		<dc:creator>Mollie Allick</dc:creator>
				<category><![CDATA[Media]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[awards]]></category>
		<category><![CDATA[Lead411]]></category>
		<category><![CDATA[recognition]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=4860</guid>
		<description><![CDATA[Lead411 recently released their 2012 list of the Hottest San Francisco Companies, and we’re proud to report that CrowdFlower made the cut! The 2012 Hottest Companies List marks the third annual iteration of Lead411’s award.  Given to software, wireless technology, hardware, internet, or media companies, the award recognizes the fastest growing businesses in the US.  [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" title="Lead411" src="http://blog.crowdflower.com/wp-content/uploads/2012/02/San_Francisco_20121.jpg" alt="" width="220" height="210" /></p>
<p style="text-align: left;">Lead411 recently released their 2012 list of the Hottest San Francisco Companies, and we’re proud to report that CrowdFlower <a title="Hottest San Francisco Companies" href="http://www.lead411.com/awards/2012/san-francisco.html" target="_blank">made the cut</a>!</p>
<p>The 2012 Hottest Companies List marks the third annual iteration of Lead411’s award.  Given to software, wireless technology, hardware, internet, or media companies, the award recognizes the fastest growing businesses in the US.  CEO of Lead411, Tom Blue explained, “We’ve been tracking fast companies for our customers for the past 10 years and we felt it was important to recognize these growing brands publicly.”  Selection criteria include achieving at least a 100% revenue increase or $5 million or more in funding over the past two years.  This year the Lead411 research team selected 80 top contenders from a field of over 2,600 submissions.</p>
<p>CrowdFlower’s place on the 2012 Hot List, which includes companies like Twitter, Scribd, and Dropbox, is a testament to the rapid growth we’ve seen since the beginning in 2008.  We’ve completed over 300 million tasks to date, and in 2011 <a title="CrowdFlower Press Release 02-07-12" href="http://crowdflower.com/images/marketing/press-releases/CrowdCensus-press-release-02-07-12.pdf" target="_blank">increased revenue by 300%</a>.  This is on the heels of our #6 ranking on the <a title="Lead411 Tech 200" href="http://www.lead411.com/tech200/2011/list.php" target="_blank">2011 Lead411’s Tech 200</a>.  So we’ve been showing impressive growth for two years running!</p>
<p>It’s an honor to make the grade.  Our rapid growth is proof positive of the increasing enterprise demand for the ultra-scalable, efficient outsourcing alternative that crowdsourcing provides.</p>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/gyL1GJ6cuVY" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/02/lead411-2012-hottest-tech-companies-crowdflower-is-on-fire/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/02/lead411-2012-hottest-tech-companies-crowdflower-is-on-fire/</feedburner:origLink></item>
		<item>
		<title>How to Survey This Year’s Top Oscar Picks (or Anything Else) Using CrowdFlower’s Self-Service Platform</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/kdVgHRYBmnE/</link>
		<comments>http://blog.crowdflower.com/2012/02/how-to-survey-this-year%e2%80%99s-top-oscar-picks-or-anything-else-using-crowdflower%e2%80%99s-self-service-platform/#comments</comments>
		<pubDate>Fri, 24 Feb 2012 23:11:24 +0000</pubDate>
		<dc:creator>Wagner James Au</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Wisdom of Small Crowds]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[George Clooney]]></category>
		<category><![CDATA[Oscar predictions]]></category>
		<category><![CDATA[Oscars]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=4839</guid>
		<description><![CDATA[As a fun demonstration of CrowdFlower’s self-service platform, we just generated a survey asking 500 members of our workforce to predict the likeliest winners of this Sunday’s 84th Annual Academy Awards, in the top seven categories. After the rundown, I’ll explain how similar surveys can be created in just a few hours at nominal costs. [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.crowdflower.com/2012/02/how-to-survey-this-year%e2%80%99s-top-oscar-picks-or-anything-else-using-crowdflower%e2%80%99s-self-service-platform/george-clooney-oscar-pick-descendants-2/" rel="attachment wp-att-4846"><img class="alignright size-full wp-image-4846" title="George Clooney Oscar pick Descendants" src="http://blog.crowdflower.com/wp-content/uploads/2012/02/George-Clooney-Oscar-pick-Descendants1.jpg" alt="" width="350" height="238" /></a>As a fun demonstration of <a href="http://bit.ly/yISN0f">CrowdFlower’s self-service platform</a>, we just generated a survey asking 500 members of our workforce to predict the likeliest winners of this Sunday’s 84th Annual Academy Awards, in the top seven categories. After the rundown, I’ll explain how similar surveys can be created in just a few hours at nominal costs.</p>
<blockquote><p><strong>Likeliest 84th Academy Award Winners, According to CrowdFlower’s Workforce:</strong></p>
<ol><strong>Best Picture</strong>: <em>The Help</em> (or <em>The Artist</em>)<br />
<strong>Best Actor</strong>: George Clooney, <em>The Descendants</em><br />
<strong>Best Actress</strong>: Viola Davis, <em>The Help</em> (or Meryl Streep for <em>The Iron Lady</em>)<br />
<strong>Best Director</strong>: Martin Scorsese for <em>Hugo</em> (or Alexander Payne for <em>The Descendants</em>, or Michel Hazanavicius for <em>The Artist</em>.)<br />
<strong>Best Supporting Actor</strong>: Jonah Hill, <em>Moneyball</em> (or Max von Sydow for <em>Extremely Loud and Incredibly Close</em>)<br />
<strong>Best Supporting Actress</strong>: Octavia Spencer, <em>The Help</em><br />
<strong>Best Animated Feature</strong>: <em>Puss in Boots</em></ol>
</blockquote>
<p>We listed alternatives in the several categories where the very top predictions were within 5% of each other &#8212; clearly, many of these races will be tight. One surprise is the strong showing of <em>The Help</em> against <em>The Artist</em>, which many Hollywood insiders assume is already a lock for Best Picture. It’ll be interesting to see how well our workforce predicts the Academy’s often unpredictable voting behavior.</p>
<p>Whatever the ultimate Oscar winners, what’s pretty impressive is how fast we were able to generate these results. I just started working with CrowdFlower, and this was the first job I ever created on the self-service platform, but I was still able to build and deploy it in under an hour. Here’s what the survey editing window looks like &#8212; it’ll be familiar to anyone who’s edited HTML:</p>
<p><a href="http://blog.crowdflower.com/2012/02/how-to-survey-this-year%e2%80%99s-top-oscar-picks-or-anything-else-using-crowdflower%e2%80%99s-self-service-platform/creating-crowdflower-self-service-survey/" rel="attachment wp-att-4901"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/02/Creating-CrowdFlower-self-service-Survey.jpg" alt="" title="Creating CrowdFlower self service Survey" width="600" height="382" class="aligncenter size-full wp-image-4901" /></a></p>
<p>Once the project was set up (including the pay rate for each individual judgement), we put it in the available job queue for our 2 million workforce members. Results can be tracked dynamically on the self-service account dashboard:</p>
<p><a href="http://blog.crowdflower.com/2012/02/how-to-survey-this-year%e2%80%99s-top-oscar-picks-or-anything-else-using-crowdflower%e2%80%99s-self-service-platform/crowdflower-self-service-job-meter/" rel="attachment wp-att-4902"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/02/CrowdFlower-self-service-job-meter.jpg" alt="" title="CrowdFlower self-service job meter" width="600" height="223" class="aligncenter size-full wp-image-4902" /></a></p>
<p>Within 2-3 hours, we had 500 completed surveys. As a journalist and analyst, I’ve run countless polls on blogs with massive traffic, and I’ve never seen so many solid responses generated anywhere near as fast.</p>
<p>Once completed, we downloaded the survey results as a .csv file, and processed the data through Excel, counting the number of votes for each movie/actor/etc. using pivot tables.</p>
<p>Cost to run this? For someone using <a href="http://bit.ly/yISN0f">CrowdFlower’s self-service platform</a>, running a survey with similar parameters, total expenses were pretty nominal &#8212; probably less than what George Clooney tips at his favorite lunch joint.</p>
<p><em>Much thanks to CrowdFlower’s Joseph Childress and Patrick Philips for help with this post! </em></p>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/kdVgHRYBmnE" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/02/how-to-survey-this-year%e2%80%99s-top-oscar-picks-or-anything-else-using-crowdflower%e2%80%99s-self-service-platform/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/02/how-to-survey-this-year%e2%80%99s-top-oscar-picks-or-anything-else-using-crowdflower%e2%80%99s-self-service-platform/</feedburner:origLink></item>
		<item>
		<title>Superbowl Commercials – The Good, The Bad and The Ugly</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/hYq6BkQQk9o/</link>
		<comments>http://blog.crowdflower.com/2012/02/crowdsourcing-superbowl-commercials/#comments</comments>
		<pubDate>Thu, 16 Feb 2012 20:04:01 +0000</pubDate>
		<dc:creator>Patrick Philips</dc:creator>
				<category><![CDATA[Miscellaneous]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=4769</guid>
		<description><![CDATA[With consumers increasingly recording their experiences with brands online, Superbowl XLVI unleashed a trove of valuable consumer data. As the volume increases, though, it becomes harder and harder to sift though the quantities of data. In the words of Brad Jakeman, Chief Creative Officer of Pepsi, Smart listening tools that generate real-time cultural insights and [...]]]></description>
			<content:encoded><![CDATA[<p>With consumers increasingly recording their experiences with brands online, Superbowl XLVI unleashed a trove of valuable consumer data. As the volume increases, though, it becomes harder and harder to sift though the quantities of data. In the words of Brad Jakeman, Chief Creative Officer of Pepsi,</p>
<blockquote><p>Smart listening tools that generate real-time cultural insights and consumer feedback are critical in establishing and retaining relevance, especially for pop-culture brands like ours.</p>
<p><a title="Marketer Resolutions: PepsiCo's Jakeman Will Be Busy, Fit and Talkative" href="http://adage.com/article/cmo-strategy/jakeman-busy-fit-talkative/231957/" target="_blank">Source: AdAge, &#8220;Marketer Resolutions: PepsiCo&#8217;s Jakeman Will Be Busy, Fit and Talkative</a>&#8220;</p></blockquote>
<p>Enter CrowdFlower. We use native speakers &#8211; online gamers, stay-at-home moms, really anyone with a computer and a few minutes of free time &#8211; to analyze social media content. Because we&#8217;re using real people, we can achieve higher levels of accuracy than automated solutions, and with much greater flexibility. Is your brand trending unexpectedly? We can quickly sift through social media content for topics faster (and with much less pain) than it would take to refine an existing NLP process.</p>
<div class="wp-caption aligncenter" style="width: 404px"><a href="http://www.etsy.com/listing/53769772/people-are-good-at-stuff"><img class=" " src="http://img1.etsystatic.com/il_fullxfull.165915853.jpg" alt="People are good at stuff" width="394" height="600" /></a><p class="wp-caption-text">A Favorite Decoration at CrowdFlower HQ </p></div>
<p>Back to Superbowl Ads. One big difference this year was the flurry of pre-Superbowl ads activity, with many companies premiering their ads online in the weeks leading up to the Superbowl. We thought it would be fun and interesting to look at this early activity on Twitter to try to predict which ads woud be the top performers this year.</p>
<p>We used our platform to hand-tag a random sample of thousands of tweets from each of the 30+ companies that purchased a Superbowl ad this year. We looked at whether the tweets were related to the company as a whole, about a specific product or about the Superbowl commercial. We also looked at the overall tone of the message, whether it was positive, negative or neutral. A sample of these results is presented below:</p>
<p style="text-align: center"><a href="http://blog.crowdflower.com/2012/02/crowdsourcing-superbowl-commercials/tweets_products-2/" rel="attachment wp-att-4780"><img class="aligncenter size-full wp-image-4780" src="http://blog.crowdflower.com/wp-content/uploads/2012/02/tweets_products1.jpg" alt="" width="549" height="378" /></a></p>
<p style="text-align: left">With respect to the products of  companies buying ads in this year&#8217;s Superbowl, the social media content was generally positive. The one exception is goDaddy, who had some lingering dissatisfaction related to SOPA. It&#8217;s also worth noting that the content about Coca-Cola was overwhelmingly related to the product, rather than the company or its commercials.</p>
<p style="text-align: center"><a href="http://blog.crowdflower.com/2012/02/crowdsourcing-superbowl-commercials/sentiment_tweet1/" rel="attachment wp-att-4805"><img class="aligncenter size-full wp-image-4805" src="http://blog.crowdflower.com/wp-content/uploads/2012/02/sentiment_tweet1.jpg" alt="Sentiment of Tweets about Commercials" width="549" height="378" /></a></p>
<p>Looking to the commercials, it&#8217;s pretty clear that the H&amp;M commercial featuring David Beckham was a big hit in terms of driving conversations. Jerry Seinfeld also seems to have been a pretty big hit for Acura. Interestingly, the one commercial to get a predominantly negative response from the pre-Superbowl screening was CareerBuilder, for their use of chimps.</p>
<p>Things get more interesting when you get more granular, so if Disney wanted to dig into the John Carter ads to see what aspects people are liking (&#8220;Swoon. OBSESSED with Woola from John Carter&#8221;) or hating (&#8220;John Carter is a terrible movie title. John Carter of Mars is an awesome title. Why is this so hard?!&#8221;), that&#8217;s where the crowd really excels. And for anyone caught up in the immortal creative struggle between Agency and Client, we offer a quick and easy way to find 10,000 people who are ready to help decide who is right. But we all know there will never be a real answer to that question.</p>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/hYq6BkQQk9o" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/02/crowdsourcing-superbowl-commercials/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/02/crowdsourcing-superbowl-commercials/</feedburner:origLink></item>
		<item>
		<title>CrowdFlower Highest Ranked Industry Leader by Daily Crowdsource</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/2CB2-YT_z_o/</link>
		<comments>http://blog.crowdflower.com/2012/02/crowdflower-highest-ranked-industry-leader-by-daily-crowdsource/#comments</comments>
		<pubDate>Wed, 15 Feb 2012 18:53:28 +0000</pubDate>
		<dc:creator>Roger Nunn</dc:creator>
				<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[press]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=4830</guid>
		<description><![CDATA[The Daily CrowdSource recently released CrowdCensus, a milestone report on the crowdsourcing industry.  The researchers rated 6 BPCs (business process crowdsourcers) on 16 different categories, producing a four digit CrowdCensus score.  We’re proud to report that CrowdFlower was ranked #1 in the industry, with a score of 9-9-8-7! CrowdCensus is the first report of its [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" title="CrowdCensus" src="http://dailycrowdsource.com/unjunglables/crowdsourcing-images/article-images/crowdcensus-book.jpg" alt="" width="221" height="282" />The Daily CrowdSource recently released CrowdCensus, a milestone report on the crowdsourcing industry.  The researchers rated 6 BPCs (business process crowdsourcers) on 16 different categories, producing a four digit CrowdCensus score.  We’re proud to report that <a href="http://crowdflower.com/images/marketing/press-releases/CrowdCensus-press-release-02-07-12.pdf">CrowdFlower was ranked #1 in the industry</a>, with a score of 9-9-8-7!</p>
<p>CrowdCensus is the first report of its kind, providing an independently produced resource for business leaders.  David Bratvold, one of the authors of the report, explained, “CrowdCensus is the first of many quarterly reports on the microtasking space…when we return to the microtasking research report next quarter, we plan to analyze even more metrics and create more focus for [them].  The CrowdCensus helps business leaders make informed decisions for their companies without incurring wasteful expenses testing less effective platforms.”</p>
<p>Not to brag (too much), but it’s very exciting to be recognized as the industry leader, especially given the growth we’ve seen over the past four years.  Being able to say that we’re the best, <em>with evidence</em>, is really motivating, especially for the team members that have been on since the beginning.  It’s also assurance for our clients, like eBay, Microsoft, and AT&amp;T, knowing that they are doing business with the industry leader.  As we continue to grow, our services and solutions are only going to improve.</p>
<p><img class="aligncenter" title="Rocky" src="http://darrenhardy.success.com/wp-content/uploads/2011/05/rocky-victory.jpg" alt="" width="480" height="342" /></p>
<p>Daily CrowdSource is planning on releasing these reports on a quarterly basis, and we look forward to the upcoming iterations.</p>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/2CB2-YT_z_o" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/02/crowdflower-highest-ranked-industry-leader-by-daily-crowdsource/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/02/crowdflower-highest-ranked-industry-leader-by-daily-crowdsource/</feedburner:origLink></item>
		<item>
		<title>Real Work for Virtual Goods: Growing Share of CrowdFlower’s Microtasks Performed by Facebook Gamers!</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/bNVHhoj4Rl0/</link>
		<comments>http://blog.crowdflower.com/2012/02/real-work-for-virtual-goods-growing-share-of-crowdflower%e2%80%99s-microtasks-performed-by-facebook-gamers/#comments</comments>
		<pubDate>Wed, 08 Feb 2012 19:18:25 +0000</pubDate>
		<dc:creator>Wagner James Au</dc:creator>
				<category><![CDATA[Facebook Games]]></category>
		<category><![CDATA[Virtual Goods]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=4750</guid>
		<description><![CDATA[Everyone in tech is talking about Facebook’s IPO filing, which revealed that the company is making a lot of money from virtual goods sales &#8212; half a billion last year, according to Wired, with Facebook users spending around $1.85 billion to buy items in their favorite games. So here’s an interesting detail behind those numbers: [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.crowdflower.com/2012/02/real-work-for-virtual-goods-growing-share-of-crowdflower%e2%80%99s-microtasks-performed-by-facebook-gamers/facebook-credits-for-crowdflower-micro-tasks/" rel="attachment wp-att-4751"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/02/facebook-credits-for-Crowdflower-micro-tasks.jpg" alt="" title="facebook credits for Crowdflower micro tasks" width="315" height="302" class="alignright size-full wp-image-4751" /></a>Everyone in tech is talking about Facebook’s IPO filing, which revealed that the company is making a lot of money from virtual goods sales &#8212; half a billion last year, <a href="http://bit.ly/A8GIww">according to Wired</a>, with Facebook users spending around $1.85 billion to buy items in their favorite games. So here’s an interesting detail behind those numbers:</p>
<p>Quite a bit of real work went into getting those virtual goods. Last year, CrowdFlower had about 2 million microtask workers, who helped complete a grand total of 300 million tasks to date. (<a href="http://bit.ly/xyyDpI">Click here to see our latest press release for more growth numbers</a>.) A growing share of those microtask projects were completed by players of Facebook games and other entertainment apps, who found our projects through offer walls. </p>
<p>In Facebook games, players buy virtual goods with Facebook Credits, the social network’s official currency. Credits can be bought for real cash, but can also be earned by completing microtask projects. So for example, when a gamer wants to get a super-duper tractor in her favorite Facebook farming game, she could use her credit card to buy it with Credits &#8212; or instead, spend a few minutes doing some real world microtasks, such as online research or analysis. (<a href="http://bit.ly/xKEVVD">Click here for more about how that’s done through CrowdFlower</a>.)</p>
<p>As Facebook expands the market for Facebook Credits, and developers look for new ways to monetize their games, we’re probably going to see many more game-driven microtask workers in 2012. While few folks work on real farms any more, a lot of people are doing real work to keep their virtual farms growing.</p>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/bNVHhoj4Rl0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/02/real-work-for-virtual-goods-growing-share-of-crowdflower%e2%80%99s-microtasks-performed-by-facebook-gamers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/02/real-work-for-virtual-goods-growing-share-of-crowdflower%e2%80%99s-microtasks-performed-by-facebook-gamers/</feedburner:origLink></item>
		<item>
		<title>TechCrunch Calls Online Work Efficiency the Next Great Internet Disruption – CrowdFlower Couldn’t Agree More!</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/hZC8Awups-o/</link>
		<comments>http://blog.crowdflower.com/2012/02/techcrunch-calls-online-work-efficiency-the-next-great-internet-disruption-crowdflower-couldnt-agree-more/#comments</comments>
		<pubDate>Tue, 07 Feb 2012 00:02:01 +0000</pubDate>
		<dc:creator>Wagner James Au</dc:creator>
				<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[crowdsource]]></category>
		<category><![CDATA[micro-tasks]]></category>
		<category><![CDATA[TechCrunch]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=4732</guid>
		<description><![CDATA[TechCrunch ran a very interesting editorial last weekend, arguing that online labor efficiency is going to be the next big Internet disruption. Noting that 20% of the US workforce are already independent contractors (and forecast to be 50% by 2020), Nick Cronin proclaims the rise of online labor services, many of them crowdsourced-powered: [W]e are [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.crowdflower.com/2012/02/techcrunch-calls-online-work-efficiency-the-next-great-internet-disruption-crowdflower-couldnt-agree-more/techcrunch-crowdsourcing-2/" rel="attachment wp-att-4743"><img src="http://blog.crowdflower.com/wp-content/uploads/2012/02/TechCrunch-Crowdsourcing1.jpg" alt="TechCrunch Crowdsourcing" title="TechCrunch Crowdsourcing" width="256" height="208" class="alignright size-full wp-image-4743" /></a>TechCrunch ran <a href="http://tcrn.ch/y51UwM">a very interesting editorial last weekend</a>, arguing that online labor efficiency is going to be the next big Internet disruption. Noting that 20% of the US workforce are already independent contractors (and forecast to be 50% by 2020), Nick Cronin proclaims the rise of online labor services, many of them crowdsourced-powered:</p>
<blockquote><p>[W]e are just beginning to scratch the surface of how the Internet is going to disrupt labor. The real change will come as more and more of the traditional job creators, small businesses all the way up to the Fortune 500s, realize the benefits of flexible workforces and more and more individuals take the plunge into independent, free-agent land — whether by necessity or choice.</p></blockquote>
<p>Call us biased, but we&#8217;re seeing a similar future from the CrowdFlower office. Not just because we&#8217;re counting over 2 million Contributors who regularly do micro-tasks through our site for Fortune 500 companies and individuals alike, or because the number of jobs handled by CrowdFlower increased exponentially last year. Even more interesting to us is how diverse these micro-tasks are, everything from doing research on local business listings, to analyzing social media sentiment for Presidential candidates. So we tend to agree with TechCrunch that we’re scratching the surface of what online labor can do.</p>
<p>Stay tuned &#8212; much more on that in coming weeks.</p>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/hZC8Awups-o" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/02/techcrunch-calls-online-work-efficiency-the-next-great-internet-disruption-crowdflower-couldnt-agree-more/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/02/techcrunch-calls-online-work-efficiency-the-next-great-internet-disruption-crowdflower-couldnt-agree-more/</feedburner:origLink></item>
		<item>
		<title>2011 Retrospective: Good Begets Good</title>
		<link>http://feedproxy.google.com/~r/doloreslabs/~3/kTlOB0jbHK0/</link>
		<comments>http://blog.crowdflower.com/2012/01/2011-retrospective-good-begets-good/#comments</comments>
		<pubDate>Tue, 10 Jan 2012 20:00:35 +0000</pubDate>
		<dc:creator>Vaughn Hester and Lukas Biewald</dc:creator>
				<category><![CDATA[Disaster Relief]]></category>
		<category><![CDATA[Health]]></category>
		<category><![CDATA[Holidays]]></category>
		<category><![CDATA[Miscellaneous]]></category>
		<category><![CDATA[Al Jazeera]]></category>
		<category><![CDATA[crowdflower]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[disaster relief]]></category>
		<category><![CDATA[Pakreport]]></category>
		<category><![CDATA[ushahidi]]></category>

		<guid isPermaLink="false">http://blog.crowdflower.com/?p=4640</guid>
		<description><![CDATA[One of the most exciting things about working at CrowdFlower is the ongoing discovery of the wide range of crowdsourcing applications. While our core focus is enterprise solutions, we&#8217;re also involved in a number of social innovation projects. At recent meetups and in recent blog posts, we&#8217;ve described CrowdFlower implementations that help create unprecedented social impact. Many [...]]]></description>
			<content:encoded><![CDATA[<p>One of the most exciting things about working at CrowdFlower is the ongoing discovery of the wide range of crowdsourcing applications. While our core focus is enterprise solutions, we&#8217;re also involved in a number of social innovation projects. At recent meetups and in recent <a href="http://blog.crowdflower.com/2011/11/scientific-research/">blog posts</a>, we&#8217;ve described CrowdFlower implementations that help create unprecedented social impact. Many of these projects involve processing data in support of crisis relief or <a href="http://poptech.org/popcasts/fortune_and_biewald_crowdsourcing_tb_cell_annotation">public health research</a>.<a title="" href="#_ftn1">[1]</a> What we continue to see over time is that there are truly inspirational ripple effects emerging from these efforts.</p>
<p><a href="http://blog.crowdflower.com/2012/01/2011-retrospective-good-begets-good/crowdflower-job-67330-preview/" rel="attachment wp-att-4654"><img class="alignright size-full wp-image-4654" title="CrowdFlower/Somalia Speaks Task" src="http://blog.crowdflower.com/wp-content/uploads/2012/01/CrowdFlower-Job-67330-Preview.jpg" alt="" width="401" height="432" /></a></p>
<p>&nbsp;</p>
<div>
<p>In the fall of 2011, the Nexus for ICTs, Climate Change and Development (NICCD) project at the University of Manchester released a <a href="http://www.niccd.org/casestudies.htm">series of case studies</a> on innovative uses of technology for development. <a href="http://www.niccd.org/NICCD_Disasters_Case_Study_Pakreport.pdf">Pakreport was featured</a> as a tool for reporting among flood-affected communities; but it was also highlighted for its contributions to climate change awareness and natural disaster monitoring in a country at very high risk.</p>
<p><span id="more-4640"></span></p>
<p><em>&#8220;Crowdsourcing was critical to the success of this disaster response system,&#8221; the report says. &#8220;It was integral to the data input model, which would otherwise have relied on much more limited inputs from individual relief agency workers.&#8221;</em></p>
<p>As a result of our involvement with Pakreport, our partners in Pakistan signed up to be a CrowdFlower contributor channel. Since April 2011, we have sent 313,030 microtasks to a pool of 500 Pakistani contributors from underserved communities. Our partners in Pakistan also recently launched <a href="http://pakreport.org/dowevote/">DoWeVote</a>, a map-based effort to improve future civic engagement by visualizing 2008 voter turnout data from across Pakistan. Pakreport continues to be an engine for social change, and its core structure is relatively simple to replicate in any setting or modify for similar projects.</p>
<p>After the <a href="http://poptech.org/world_rebalancing">2011 PopTech conference</a>, our partners at <a href="http://www.ushahidi.com">Ushahidi</a> reached out regarding a project to collect reports from the ground in Somalia. In conjunction with Al Jazeera, Souktel, and the African Diaspora Institute, the <a href="http://www.aljazeera.com/indepth/spotlight/somaliaconflict/somaliaspeaks.html">Somalia Speaks project</a> &#8220;seeks to echo the voices of ordinary Somalis in Somalia so they can be heard in the international media.&#8221; Text messages from the ground are collected, translated, categorized and mapped. The translated messages and maps are shared on Al Jazeera. It is a very powerful experience to read the words of a refugee or survivor on one of the largest news websites in the world; examples of mainstream media including the voices of these populations are few and far between.</p>
<div>
<div>
<div>
<p>Beyond all the do-gooder self-congratulation, however, it is important to note  that these efforts create new challenges and move our colleagues and us into new ethical territory. While CrowdFlower maintains rigorous confidentiality and security measures as part of our standard enterprise engagements, the recent collaborations described here are more creative, collaborative and high-profile. As more data flows through open source software and multistep workflows involving collaboration among multiple organizations, critical questions arise as to the confidentiality of the data involved as it is shared by wider audiences. For example, how can you reconcile the flow of sensitive or personal information with the use of software that emphasizes transparency above all else? How can we be certain that, particularly in conflict situations, these workflows do not create a risk of exposure or vulnerability for the people submitting reports from the ground? Who defines the standards for privacy as you amplify voices from vulnerable groups? A key lesson learned through these projects is that confidentiality is essential when dealing with personal information, but that it can be a challenge to protect confidentiality at every step of these multistep workflows. Finally, as we see more demand for these types of crowdsourcing projects, how can we reduce the start up time and the learning curve for organizations who wish to replicate these projects?</p>
<p>These are small but important achievements. The examples from 2010 directly inspired the examples we&#8217;ve seen in 2011. The successful implementation of these projects and the evolution of the discourse surrounding these disruptive tools are the result of substantial collaborative efforts, often by teams comprised entirely of volunteers located around the world. We feel incredibly privileged to work with the clients, partners, researchers, and visionaries who are redefining the ways that technology benefits society. To all of our partners and supporters, thank you for helping us discover new ways in which data can change the world. We can&#8217;t wait to see what 2012 will bring.</p>
<div>
<p><em>The entire team at CrowdFlower wishes you a joyful and peaceful 2012.</em></p>
</div>
</div>
</div>
</div>
</div>
<div>
<p>&nbsp;</p>
<hr align="left" size="1" width="33%" />
<div>
<p><a title="" href="#_ftnref1">[1]</a> In early 2010, we were part of the <a href="http://www.mission4636.org/">Mission 4636</a> collaboration to translate, categorize, and map SMS messages sent from survivors of the first earthquakes in Haiti. We repurposed the Mission 4636 workflow for another deployment of an Ushahidi instance with the <a href="http://www.pakreport.org">Pakreport</a> group in the wake of heavy flooding in Pakistan in the summer of 2010.</p>
</div>
</div>
<img src="http://feeds.feedburner.com/~r/doloreslabs/~4/kTlOB0jbHK0" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://blog.crowdflower.com/2012/01/2011-retrospective-good-begets-good/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://blog.crowdflower.com/2012/01/2011-retrospective-good-begets-good/</feedburner:origLink></item>
	</channel>
</rss>

