<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0"><channel><description>Director of Community at FirstMark Capital. Like to eat, run, mash data, sleep.</description><title>Data Diversions</title><generator>Tumblr (3.0; @dfkoz)</generator><link>https://dfkoz.tumblr.com/</link><item><title>Join Me on Medium</title><description>&lt;p&gt;Over the past few years, I’ve had a lot of fun writing on dfkoz.tumblr.com. Beginning in 2012, however, I started a new adventure with FirstMark, which left less and less time to write here.&lt;/p&gt;&lt;p&gt;While I intend to occasionally share a data-driven post here from time to time, I’ll be focusing a lot more of my writing time on sharing lessons from my work at FirstMark.&lt;/p&gt;&lt;p&gt;If you want to come along for the ride, you can join me at https://medium.com/@dfkoz/.&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/145255230941</link><guid>https://dfkoz.tumblr.com/post/145255230941</guid><pubDate>Wed, 01 Jun 2016 09:26:05 -0400</pubDate></item><item><title>A 15% Tip Button Could Cost NYC Cabbies $60 Million</title><description>&lt;p&gt;This post is a response to a question posed by &lt;a href="https://twitter.com/chris_whong/status/550385970366132224" title="https://twitter.com/chris_whong/status/550385970366132224" target="_blank"&gt;Chris Whong on Twitter&lt;/a&gt;: &amp;ldquo;I am curious to know what the $ implications of adding a 15% button would be.&amp;rdquo; Chris is the source of the raw data for my previous post, &lt;a href="http://dfkoz.tumblr.com/post/106719206826/analysis-of-nyc-taxi-tip-data-44-of-passengers" title="http://dfkoz.tumblr.com/post/106719206826/analysis-of-nyc-taxi-tip-data-44-of-passengers" target="_blank"&gt;Analysis of NYC Taxi Tip Data: 44% of Passengers Hit the 20% Button&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;By my math, a 15% tip button reduces total taxi tips by between $35MM and $60MM. This works out to a range of $2,600 to $4,600 for each of the &lt;a href="http://www.nyc.gov/html/tlc/downloads/pdf/2014_taxicab_fact_book.pdf" title="http://www.nyc.gov/html/tlc/downloads/pdf/2014_taxicab_fact_book.pdf" target="_blank"&gt;13,437 medallions active in 2014&lt;/a&gt; and $700 to $1,200 for each of the roughly 50,000 drivers in the city.&lt;/p&gt;
&lt;p&gt;I calculated the impact four different ways. On the low end, I assumed that only those who default to the cheapest button (currently the 20% button) migrate to the 15% button. This reduces total tips by $26MM, but seems like unlikely behavior in practice.&lt;/p&gt;
&lt;p&gt;The second approach reduces all of the exact 20%, 25%, or 30% tips by 5%, with the assumption that all &amp;ldquo;non-standard&amp;rdquo; tipping behavior remains consistent. This leads to a $35MM reduction in tips.&lt;/p&gt;
&lt;p&gt;The third calculation lowers all credit card tips by 5% across the board, for an aggregate impact of $60MM.&lt;/p&gt;
&lt;p&gt;Finally, it is possible that the introduction of a 15% button would completely change tipping norms. Cash tips go almost completely unreported, but I assume in the base case that cash tipping behavior matches credit card tipping behavior. If true, the total impact impact of the introduction of a 15% button would be $112MM.&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/106817778241</link><guid>https://dfkoz.tumblr.com/post/106817778241</guid><pubDate>Thu, 01 Jan 2015 14:30:07 -0500</pubDate></item><item><title>Analysis of NYC Taxi Tip Data: 44% of Passengers Hit the 20% Button</title><description>&lt;p&gt;Thanks to Ben Wellington, the man behind &lt;a href="http://iquantny.tumblr.com/" title="http://iquantny.tumblr.com/" target="_blank"&gt;I Quant NY&lt;/a&gt;, I discovered that &lt;a href="http://chriswhong.com/" title="http://chriswhong.com/" target="_blank"&gt;Chris Whong&lt;/a&gt; posted &lt;a href="http://www.andresmh.com/nyctaxitrips/" title="http://www.andresmh.com/nyctaxitrips/" target="_blank"&gt;a year&amp;rsquo;s worth of NYC taxi fare data&lt;/a&gt;, which he obtained from the New York City Taxi and Limousine Commision by FOIA request. What follows is an analysis of 578,233 rides, or about 1/300th of the roughly 175 million yellow taxi rides taken in 2013. &lt;br/&gt;&lt;br/&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Tipping Behavior&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;45% of taxi rides were paid for with cash. Of these 264,154 transactions, only 29 (0.00931%) included a tip. As a result, the following graphs depict only credit card transactions, totaling 311,623 rides. &lt;/p&gt;
&lt;p&gt;The average tip was $2.53 with a standard deviation of $2.43. Excluding riders who did not tip at all, the average tip was $2.61 with a standard deviation of $2.43. As a fraction of the total fare, the average tip was 18.9% for all rides, and 19.5% for all rides that included a tip. 3.16% of riders did not tip at all. 1.75% of riders tip more than 30% and 0.21% tip more than 50%.&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/58f9e7c27e69014f7f7a0c394dd2fbba/tumblr_inline_nhggzdL7gy1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/26044af2366ce94ef7362c480b871f85/tumblr_inline_pahc4trFda1qkcgdm_540.png" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/58f9e7c27e69014f7f7a0c394dd2fbba/tumblr_inline_nhggzdL7gy1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;

&lt;p&gt;Unsurprisingly, there are some extreme data points in the raw data. In this sample of 1/300th of the records, there were about 20 negative fares and tips that (supposedly) went as high as 4067%.&lt;/p&gt;
&lt;p&gt;As NYC residents know, the in-cab credit card interface offers three default tip levels: 20%, 25%, and 30%, once described as the &amp;ldquo;&lt;a href="http://notes.gross.is/post/22958656041/the-144-146-165-button" title="http://notes.gross.is/post/22958656041/the-144-146-165-button" target="_blank"&gt;The $144,146,165 Button&lt;/a&gt;.&amp;rdquo; 44% of riders defaulted to the 20% button, 10% to the 25% button, and 4% to the 30% button.&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/7b918553c1e725d4c4637617228c3491/tumblr_inline_nhge5bfiU91qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/0fbbf5e8ff1fd2425ccb44588bcd3031/tumblr_inline_pahc4uMmcX1qkcgdm_540.png" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/7b918553c1e725d4c4637617228c3491/tumblr_inline_nhge5bfiU91qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&lt;strong&gt;Tipping Behavior by Time of Day&lt;/strong&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Riders apparently feel most generous shortly before lunch and through the end of the work day. That, or - more likely - employers are picking up tab, resulting in higher average tips during the day.&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/d2d41c9e52f2e9a7db8fb377f5fa84a1/tumblr_inline_nhgezlNfSH1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/9637482c351dd5bfe39ad006dd254cf7/tumblr_inline_pahc4uoMs11qkcgdm_540.png" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/d2d41c9e52f2e9a7db8fb377f5fa84a1/tumblr_inline_nhgezlNfSH1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;Here is how rides are concentrated across an average day, an average weekday (defined as 6 PM Sunday to 6 PM Friday), and an average weekend (6 PM Friday to 6 PM Sunday). One hour that stands out is 4-5 PM, which - as any resident will tell you - is when New York cab drivers change shifts, leading to fewer available cabs.&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/16f3474ebcaeda0ead0f3fc16fe985f3/tumblr_inline_nhggdwLcg41qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/a41f400e852525795b3deb37c9a9758b/tumblr_inline_pahc4u9tzD1qkcgdm_540.png" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/16f3474ebcaeda0ead0f3fc16fe985f3/tumblr_inline_nhggdwLcg41qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&lt;strong&gt;Tipping Behavior by Day of Week&lt;/strong&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;There is also a relationship between day of week and tipping behavior: Monday (the slowest taxi day of the week) is the best day for tips, and Friday (the busiest day) is the worst.&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/15949121fdbe40ba7e2ea958db7f569e/tumblr_inline_nhgf4m8QbF1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/f0f1134a578ef1aef4bbba681fde8a56/tumblr_inline_pahc4vjI4z1qkcgdm_540.png" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/15949121fdbe40ba7e2ea958db7f569e/tumblr_inline_nhgf4m8QbF1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/d16afe11b90e66edc3fda5bcd1b7707c/tumblr_inline_nhgfbgBMns1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/0b6da32e35049ea8e941ef92be54a1dd/tumblr_inline_pahc4vLTMy1qkcgdm_540.png" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/d16afe11b90e66edc3fda5bcd1b7707c/tumblr_inline_nhgfbgBMns1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;Again, my theory is that this has more to do with the type of trip (professional vs. personal) than anything else. Both Sunday and Monday are lower-volume days, but the tipping behavior is different.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Tipping Behavior by Month&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;There also appears to be a relationship between time of year and tipping behavior. December is the most generous month for tips:&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="301" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/c0a85ce8d74216af86b5e5c678bc186d/tumblr_inline_nhgfjm5TEk1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/da6df02d534129ccd6873bf3b2e17c71/tumblr_inline_pahc4vGqRV1qkcgdm_540.png" data-orig-height="301" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/c0a85ce8d74216af86b5e5c678bc186d/tumblr_inline_nhgfjm5TEk1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Distribution of Total Fares&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;The average taxi fare in 2013 was $14.80, with a standard deviation of $12.13. About 30% of all taxi fares are less than $10, and nearly 80% are less than $20. 3.79% of fares were greater than $50 and 0.15% were greater than $100. The most expensive trip recorded in my sample was a whopping $544, paid in cash. Here&amp;rsquo;s what the distribution of total fares, cash included, looks like:&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/ae2c4fe770782c2dacc47f64b6b425eb/tumblr_inline_nhgef4fYMm1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/38d926283aedf36c8b1dcb5b1ae8f320/tumblr_inline_pahc4vUreQ1qkcgdm_540.png" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/ae2c4fe770782c2dacc47f64b6b425eb/tumblr_inline_nhgef4fYMm1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;Zoomed in a bit:&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/8fe1211a0350b1f3564360abc1a66e56/tumblr_inline_nhgehj2QjO1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/a13922760b63015cfb86d0bc07650382/tumblr_inline_pahc4ww7tJ1qkcgdm_540.png" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/8fe1211a0350b1f3564360abc1a66e56/tumblr_inline_nhgehj2QjO1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;And the distribution of total fares:&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/12ace63cf9cfb50ab2c632484ecdc340/tumblr_inline_nhgejeZcSO1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/d8136e7fbf99b4df2e873e4161df3e36/tumblr_inline_pahc4wkERc1qkcgdm_540.png" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/12ace63cf9cfb50ab2c632484ecdc340/tumblr_inline_nhgejeZcSO1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;That&amp;rsquo;s a wrap for me. But if you&amp;rsquo;re hungry for more taxi data visualization, you can check out Chris Whong&amp;rsquo;s &lt;a href="http://nyctaxi.herokuapp.com/"&gt;NYC Taxis: A Day in the Life&lt;/a&gt; or &lt;a href="http://iquantny.tumblr.com/post/106587793299/brooklyn-leads-bronx-lags-in-green-cab-tipping" title="http://iquantny.tumblr.com/post/106587793299/brooklyn-leads-bronx-lags-in-green-cab-tipping" target="_blank"&gt;I Quant NY&amp;rsquo;s analysis of Green Taxi tipping behavior by neighborhood&lt;/a&gt; (which is evidently the kickoff to a series of posts.)&lt;/p&gt;
&lt;p&gt;Happy New Year to all of you! And remember to tip your cab driver an extra 1.25% tonight. &lt;/p&gt;

&lt;p&gt;&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/106719206826</link><guid>https://dfkoz.tumblr.com/post/106719206826</guid><pubDate>Wed, 31 Dec 2014 11:37:00 -0500</pubDate></item><item><title>Generate $315MM of Economic Impact by Wishing People an “HT” Today</title><description>&lt;p&gt;There are currently &lt;a href="http://en.wikipedia.org/wiki/Demographics_of_the_United_States"&gt;319,151,000&lt;/a&gt; people in the United States. As of October 31, 2014, &lt;a href="http://www.bls.gov/news.release/pdf/empsit.pdf"&gt;150,000,000&lt;/a&gt; of them were employed in full-time positions. At this wonderful time of year, most or all of them are presumably wishing colleagues, friends, and family a “Happy Thanksgiving”, whether they’re doing it from their PCs or phones. What would happen, however, if we collectively agreed to wish each other a “Happy Turkey Day”?&lt;/p&gt;
&lt;p&gt;If the average adult types at &lt;a href="http://en.wikipedia.org/wiki/Words_per_minute"&gt;40 WPM&lt;/a&gt; and average word length is &lt;a href="http://english.stackexchange.com/questions/2998/do-most-languages-need-more-space-than-english"&gt;5 characters&lt;/a&gt;, then an adult produces 200 characters in a minute, or one character every 0.3 seconds. Assuming that an average employee produces &lt;a href="http://en.wikipedia.org/wiki/List_of_countries_by_GDP_%28PPP%29_per_hour_worked"&gt;$67.32 of GDP per hour worked&lt;/a&gt;, one extra character &amp;ndash; repeated across, say, 25 holiday greetings &amp;ndash; amounts to $0.14025 of GDP. With 150,000,000 workers, that one-character difference costs a whopping total of $21,037,500 of GDP.&lt;/p&gt;
&lt;p&gt;So if we all wished each other a “Happy Turkey Day,” we would reclaim two characters and $42,075,000 of GDP. “Happy T Day,” which eliminates seven characters, represents a savings of $147,262,500. Taken to the extreme, we would all simply write “HT,” saving 15 characters and $315,562,500 of GDP.&lt;/p&gt;
&lt;p&gt;&lt;span&gt;I’ll do my part: wishing all of you an HT :)&lt;/span&gt;&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/103651550886</link><guid>https://dfkoz.tumblr.com/post/103651550886</guid><pubDate>Wed, 26 Nov 2014 13:17:00 -0500</pubDate></item><item><title>The Best of TV: Analyzing IMDB Data on 90,000 TV Series</title><description>&lt;p&gt;Founded in 1990, &lt;a href="http://www.imdb.com/" title="http://www.imdb.com/" target="_blank"&gt;IMDB&lt;/a&gt; is a treasure trove of data with &amp;ldquo;&lt;a href="http://en.wikipedia.org/wiki/Imdb" title="http://en.wikipedia.org/wiki/Imdb" target="_blank"&gt;&amp;hellip;2,882,569 titles (includes episodes) and 5,909,020 personalities in its database&lt;/a&gt;.&amp;rdquo; Using a simple scraper written in Ruby, I was able to grab data on the &amp;gt;90,000 TV shows in the IMDB database. This is what I found.&lt;/p&gt;
&lt;p&gt; &lt;br/&gt;There has been significant growth in the number of new television series created each year:&lt;br/&gt;&lt;figure class="tmblr-full" data-orig-height="301" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/c8e1db143541cfbd122676bcf6456a6f/tumblr_inline_n786e0eFch1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/9317d76a7016b6b864b23a164f58ae18/tumblr_inline_pahc4uvAT11qkcgdm_540.png" data-orig-height="301" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/c8e1db143541cfbd122676bcf6456a6f/tumblr_inline_n786e0eFch1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt; &lt;br/&gt;Here is the same chart from 2000 forward:&lt;br/&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/00a6b94447bc354ee8a527a5743b123f/tumblr_inline_n786ofJ8nA1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/0b48f00066ca6bc627e26f16c4bd7a9a/tumblr_inline_pahc4u6PH91qkcgdm_540.png" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/00a6b94447bc354ee8a527a5743b123f/tumblr_inline_n786ofJ8nA1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;&lt;br/&gt;It is important to note that our collective record-keeping has improved since IMDB&amp;rsquo;s launch in 1990, and with the rise of internet usage generally (and IMDB specifically) over the past decade. &lt;a href="http://www.google.com/trends/explore#q=imdb&amp;amp;cmpt=date" title="http://www.google.com/trends/explore#q=imdb&amp;amp;cmpt=date" target="_blank"&gt;IMDB&amp;rsquo;s Google Trends chart&lt;/a&gt;: &lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="201" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/09ce10ba098ff9f470526b4faa1531fa/tumblr_inline_n786l1rp7D1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/40454b3f8b9dfd3285c0ce89f2b640f1/tumblr_inline_pahc4vUQ5K1qkcgdm_540.png" data-orig-height="201" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/09ce10ba098ff9f470526b4faa1531fa/tumblr_inline_n786l1rp7D1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;War and history series are the highest-rated genres on average. Among common genres, documentaries, animated series, and dramas are the highest-rated. Game shows and reality TV are rated significantly lower than other genres:&lt;br/&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/798a326cd8615001dd6eaedf466f2aec/tumblr_inline_n786wobXLf1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/b546afd38744cfd02b8056e7f2f4c494/tumblr_inline_pahc4vTPnZ1qkcgdm_540.png" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/798a326cd8615001dd6eaedf466f2aec/tumblr_inline_n786wobXLf1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;

&lt;p&gt;Certain genres have grown or faded significantly since 1950:&lt;br/&gt;&lt;figure class="tmblr-full" data-orig-height="301" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/e460255b5f7052fdbeb6e89e9f0c2ddd/tumblr_inline_n786z6sB2R1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/4da755c1a194201b2973d616690593d8/tumblr_inline_pahc4w0wQc1qkcgdm_540.png" data-orig-height="301" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/e460255b5f7052fdbeb6e89e9f0c2ddd/tumblr_inline_n786z6sB2R1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;

&lt;p&gt;For example, westerns have almost completely disappeared:&lt;br/&gt;&lt;figure class="tmblr-full" data-orig-height="302" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/4568a4a412df67857f218f6747a030d5/tumblr_inline_n7870hVeCH1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/f9042b709b7c4488b115d4ad9fc486b7/tumblr_inline_pahc4wbfPq1qkcgdm_540.png" data-orig-height="302" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/4568a4a412df67857f218f6747a030d5/tumblr_inline_n7870hVeCH1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;

&lt;p&gt;The number of reality TV shows has had an astonishing rise starting in the late 90s, but has thankfully flattened out:&lt;br/&gt;&lt;figure class="tmblr-full" data-orig-height="303" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/852221c467692a2b8a30a979e315e4ab/tumblr_inline_n78723BDEl1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/a3920f7552aa333741d63715662caf8c/tumblr_inline_pahc4wAZSF1qkcgdm_540.png" data-orig-height="303" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/852221c467692a2b8a30a979e315e4ab/tumblr_inline_n78723BDEl1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;

&lt;p&gt;The prevalence of fantasy shows has been relatively steady over time, but has made a comeback in the past few years:&lt;br/&gt;&lt;figure class="tmblr-full" data-orig-height="302" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/9f27d734c217324a1c195be22e929d4a/tumblr_inline_n7874uHrIy1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/5d1f573a2040b86e302097a79e5498ba/tumblr_inline_pahc4xDTzl1qkcgdm_540.png" data-orig-height="302" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/9f27d734c217324a1c195be22e929d4a/tumblr_inline_n7874uHrIy1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;

&lt;p&gt;The overall quality of TV peaked in the 1960s, declined through 2004, and has since steadily improved (although I would take this chart with several grains of salt):&lt;br/&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/9e3bae53bd47cba6862bdd0140fc629e/tumblr_inline_n7878my5JB1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/c7b3bcf8f4650c0c8f2415d71469a071/tumblr_inline_pahc4x2kxB1qkcgdm_540.png" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/9e3bae53bd47cba6862bdd0140fc629e/tumblr_inline_n7878my5JB1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;Here is the same chart from 1990 through today:&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="301" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/12637e9169043a09f75ba207c2eaa7e3/tumblr_inline_n7879mdtmp1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/9d5c7b9f241fefab0d88a034f0f853c2/tumblr_inline_pahc4xf4ib1qkcgdm_540.png" data-orig-height="301" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/12637e9169043a09f75ba207c2eaa7e3/tumblr_inline_n7879mdtmp1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;

&lt;p&gt;The top ten highest-rated television shows of all time are as follows:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;1. Breaking Bad  (2008): 9.6&lt;br/&gt;2. Game of Thrones  (2011): 9.5&lt;br/&gt;3. Planet Earth  (2006): 9.5&lt;br/&gt;4. Cosmos: A SpaceTime Odyssey  (2014): 9.5&lt;br/&gt;5. The World at War  (1973): 9.5&lt;br/&gt;6. The Wire  (2002): 9.4&lt;br/&gt;7. True Detective  (2014): 9.4&lt;br/&gt;8. Sherlock  (2010): 9.3&lt;br/&gt;9. The Sopranos  (1999): 9.3&lt;br/&gt;10. Cosmos  (1980): 9.3&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;strong&gt;&lt;br/&gt;&lt;/strong&gt;The top TV series that have debuted since 2010 (note that I did *not* filter out shows with a small number of ratings):&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;1. Game of Thrones  (2011): 9.5&lt;br/&gt; 2. Cosmos: A SpaceTime Odyssey  (2014): 9.5&lt;br/&gt; 3. True Detective  (2014): 9.4&lt;br/&gt; 4. Sherlock  (2010): 9.3&lt;br/&gt; 5. Rick and Morty  (2013): 9.3&lt;br/&gt; 6. Africa  (2013): 9.3&lt;br/&gt; 7. Kardes Payi  (2014): 9.3&lt;br/&gt; 8. Leyla and Mecnun  (2011): 9.2&lt;br/&gt; 9. Fargo  (2014): 9.2&lt;br/&gt; 10. 24: Live Another Day  (2014): 9.2&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;&lt;br/&gt;Best of the 2000s:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;1. Breaking Bad  (2008): 9.6&lt;br/&gt; 2. Planet Earth  (2006): 9.5&lt;br/&gt; 3. The Wire  (2002): 9.4&lt;br/&gt; 4. The Life of Mammals  (2002): 9.3&lt;br/&gt; 5. Al-Taghriba Al-Filistinia  (2004): 9.2&lt;br/&gt; 6. Arrested Development  (2003): 9.2&lt;br/&gt; 7. Firefly  (2002): 9.2&lt;br/&gt; 8. Life in the Undergrowth  (2005): 9.2&lt;br/&gt; 9. Life in Cold Blood  (2008): 9.2&lt;br/&gt; 10. Avatar: The Last Airbender  (2005): 9.1&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Best of the 90s:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;1. The Sopranos  (1999): 9.3&lt;br/&gt; 2. The Private Life of Plants  (1995): 9.2&lt;br/&gt; 3. The Trials of Life  (1990): 9.2&lt;br/&gt; 4. The Life of Birds  (1998): 9.1&lt;br/&gt; 5. Friends  (1994): 9.0&lt;br/&gt; 6. Twin Peaks  (1990): 9.0&lt;br/&gt; 7. Freaks and Geeks  (1999): 9.0&lt;br/&gt; 8. Batman: The Animated Series  (1992): 9.0&lt;br/&gt; 9. Cowboy Bebop  (1998): 9.0&lt;br/&gt; 10. Ice TV  (1996): 9.0&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Best of the 80s:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;1. Cosmos  (1980): 9.3&lt;br/&gt; 2. The Living Planet  (1984): 9.2&lt;br/&gt; 3. The Simpsons  (1989): 9.0&lt;br/&gt; 4. Seinfeld  (1989): 9.0&lt;br/&gt; 5. Dragon Ball Z  (1989): 8.9&lt;br/&gt; 6. Only Fools and Horses&amp;hellip;.  (1981): 8.9&lt;br/&gt; 7. Blackadder Goes Forth  (1989): 8.9&lt;br/&gt; 8. Fauji  (1988): 8.9&lt;br/&gt; 9. Solunski patrdii  (1985): 8.9&lt;br/&gt; 10. The Morning Program  (1987): 8.9&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Best of the 70s:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;1. The World at War  (1973): 9.5&lt;br/&gt; 2. Life on Earth  (1979): 9.2&lt;br/&gt; 3. I, Claudius  (1976): 9.1&lt;br/&gt; 4. La zulianita  (1977): 9.0&lt;br/&gt; 5. Doña Bárbara (1975): 9.0&lt;br/&gt; 6. Fawlty Towers  (1975): 8.9&lt;br/&gt; 7. Matador  (1978): 8.9&lt;br/&gt; 8. El chavo del ocho  (1972): 8.8&lt;br/&gt; 9. The Man and the City  (1971): 8.8&lt;br/&gt; 10. The Julie Andrews Hour  (1972): 8.8&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Best of the 60s:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;1. Monty Python&amp;rsquo;s Flying Circus  (1969): 9.0&lt;br/&gt; 2. Fujimaru of the Wind  (1964): 9.0&lt;br/&gt; 3. The Prisoner  (1967): 8.8&lt;br/&gt; 4. Nu, pogodi!  (1969): 8.8&lt;br/&gt; 5. Ferien in Lipizza  (1966): 8.8&lt;br/&gt; 6. The Great Adventure  (1963): 8.8&lt;br/&gt; 7. Alle Hunde lieben Theobald  (1969): 8.8&lt;br/&gt; 8. Hafenpolizei  (1963): 8.8&lt;br/&gt; 9. Landarzt Dr. Brock  (1967): 8.8&lt;br/&gt; 10. Die Gäste des Felix Hechinger (1964): 8.8&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Best of the 50s:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;1. Twilight Zone  (1959): 8.9&lt;br/&gt; 2. Walt Disney Presents: Annette  (1958): 8.8&lt;br/&gt; 3. Corky and White Shadow  (1956): 8.8&lt;br/&gt; 4. Mayor of the Town  (1954): 8.8&lt;br/&gt; 5. The Spade Cooley Show  (1958): 8.8&lt;br/&gt; 6. Alfred Hitchcock Presents  (1955): 8.7&lt;br/&gt; 7. Mr. Squiggle and Friends  (1959): 8.7&lt;br/&gt; 8. Studio 57  (1954): 8.7&lt;br/&gt; 9. J.P. Patches  (1958): 8.7&lt;br/&gt; 10. The Honeymooners  (1955): 8.6&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;Best of the 40s:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;1. Mama  (1949): 8.5&lt;br/&gt; 2. Mary Kay and Johnny  (1947): 8.5&lt;br/&gt; 3. Kukla, Fran and Ollie  (1947): 8.2&lt;br/&gt; 4. The Howdy Doody Show  (1947): 8.1&lt;br/&gt; 5. Stump the Stars  (1947): 8.1&lt;br/&gt; 6. Fireside Theatre  (1949): 8.1&lt;br/&gt; 7. The Lone Ranger  (1949): 8.0&lt;br/&gt; 8. The Goldbergs  (1949): 8.0&lt;br/&gt; 9. Time for Beany  (1949): 8.0&lt;br/&gt; 10. The Aldrich Family  (1949): 8.0&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;/p&gt;

&lt;p&gt;&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/88884293461</link><guid>https://dfkoz.tumblr.com/post/88884293461</guid><pubDate>Sun, 15 Jun 2014 16:04:00 -0400</pubDate></item><item><title>Movie Scripts Ranked by Flesch-Kincaid Grade Level</title><description>&lt;p&gt;Remember the Flesch Kincaid Grade Level? If you ever ran Microsoft Word's spell check in in the 90&amp;rsquo;s, you probably do:&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="302" data-orig-width="320" data-orig-src="https://64.media.tumblr.com/ac7e3f9f98d5a4f78a17d7fbbafc99f6/tumblr_inline_n509jqDCZ31qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/ac7e3f9f98d5a4f78a17d7fbbafc99f6/tumblr_inline_pahc4th4sW1qkcgdm_540.png" data-orig-height="302" data-orig-width="320" data-orig-src="https://64.media.tumblr.com/ac7e3f9f98d5a4f78a17d7fbbafc99f6/tumblr_inline_n509jqDCZ31qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;

&lt;p&gt;The &lt;a href="http://en.wikipedia.org/wiki/Flesch%E2%80%93Kincaid_readability_tests" title="http://en.wikipedia.org/wiki/Flesch%E2%80%93Kincaid_readability_tests" target="_blank"&gt;Flesch and Flesch-Kincaid tests&lt;/a&gt; are &amp;ldquo;readability tests designed to indicate comprehension difficulty when reading a passage of contemporary academic English.&amp;rdquo; The scores incorporate two primary factors: the average number of words per sentence, and the average number of syllables per word. A higher Flesch-Kincaid Grade Level is supposed to reflect more difficult text, with longer sentences and more multi-syllabic words.&lt;/p&gt;
&lt;p&gt;The Ruby gem &lt;a href="http://www.ruby-doc.org/gems/docs/l/lingua-0.6.2/Lingua/EN/Readability.html" title="http://www.ruby-doc.org/gems/docs/l/lingua-0.6.2/Lingua/EN/Readability.html" target="_blank"&gt;Lingua&lt;/a&gt; makes it quite easy to perform readability analysis against text. And with more than 950 scripts available on the &lt;a href="http://www.imsdb.com/" title="http://www.imsdb.com/" target="_blank"&gt;Internet Movie Script Database&lt;/a&gt;, I had the raw ingredients necessary to run a fun and entirely inane analysis of the Flesch-Kincaid scores of movies. Here are the results.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Movies with the lowest Flesch-Kincaid grade level (simpler):&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0104348/" title="http://www.imdb.com/title/tt0104348/" target="_blank"&gt;Glengarry Glen Gross&lt;/a&gt;: 0.11&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt1606392/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt1606392/?ref_=nv_sr_1" target="_blank"&gt;Win Win&lt;/a&gt;: 0.13&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0070849/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt0070849/?ref_=nv_sr_1" target="_blank"&gt;Last Tango in Paris&lt;/a&gt;: 0.25&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt1045658/" title="http://www.imdb.com/title/tt1045658/" target="_blank"&gt;Silver Linings Playbook&lt;/a&gt;: 0.45 [Oscar winner, Best Adapted Screenplay]&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt1205489/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt1205489/?ref_=nv_sr_1" target="_blank"&gt;Gran Torino&lt;/a&gt;: 0.54&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0195945/?ref_=fn_al_tt_1" title="http://www.imdb.com/title/tt0195945/?ref_=fn_al_tt_1" target="_blank"&gt;Next Friday&lt;/a&gt;: 0.60&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0212985/?ref_=nv_sr_2" title="http://www.imdb.com/title/tt0212985/?ref_=nv_sr_2" target="_blank"&gt;Hannibal&lt;/a&gt;: 0.64&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0117666/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt0117666/?ref_=nv_sr_1" target="_blank"&gt;Sling Blade&lt;/a&gt;: 0.65&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt2294629/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt2294629/?ref_=nv_sr_1" target="_blank"&gt;Frozen&lt;/a&gt;: 0.66&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0118749/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt0118749/?ref_=nv_sr_1" target="_blank"&gt;Boogie Nights&lt;/a&gt;: 0.68 [Oscar nominee, Best Original Screenplay]&lt;/li&gt;
&lt;/ol&gt;&lt;p&gt;&lt;figure data-orig-height="257" data-orig-width="180" data-orig-src="https://64.media.tumblr.com/01704f09bb09461908aaaf63ef7f7b5a/tumblr_inline_n50c32Xd7c1qkcgdm.jpg"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/01704f09bb09461908aaaf63ef7f7b5a/tumblr_inline_pahc4umlyn1qkcgdm_540.jpg" data-orig-height="257" data-orig-width="180" data-orig-src="https://64.media.tumblr.com/01704f09bb09461908aaaf63ef7f7b5a/tumblr_inline_n50c32Xd7c1qkcgdm.jpg"/&gt;&lt;/figure&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;br/&gt;Movies with the highest Flesch-Kincaid grade level (more complex):&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0089155/?ref_=nv_sr_4" title="http://www.imdb.com/title/tt0089155/?ref_=nv_sr_4" target="_blank"&gt;Fletch&lt;/a&gt;: 5.94&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0086856/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt0086856/?ref_=nv_sr_1" target="_blank"&gt;The Adventures of Buckaroo Banzai Across the Eighth Dimension&lt;/a&gt;: 5.48&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0142688/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt0142688/?ref_=nv_sr_1" target="_blank"&gt;The Ninth Gate&lt;/a&gt;: 5.07&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0053619/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt0053619/?ref_=nv_sr_1" target="_blank"&gt;L'Avventura (The Adventure)&lt;/a&gt;:  4.74&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0120762/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt0120762/?ref_=nv_sr_1" target="_blank"&gt;Mulan&lt;/a&gt;: 4.31&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0067525/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt0067525/?ref_=nv_sr_1" target="_blank"&gt;Omega Man&lt;/a&gt;: 4.18&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0325710/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt0325710/?ref_=nv_sr_1" target="_blank"&gt;The Last Samurai&lt;/a&gt;: 4.17&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0119177/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt0119177/?ref_=nv_sr_1" target="_blank"&gt;Gattaca&lt;/a&gt;: 3.99&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0120053/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt0120053/?ref_=nv_sr_1" target="_blank"&gt;The Saint&lt;/a&gt;: 3.76&lt;/li&gt;
&lt;li&gt;&lt;a href="http://www.imdb.com/title/tt0144084/?ref_=nv_sr_1" title="http://www.imdb.com/title/tt0144084/?ref_=nv_sr_1" target="_blank"&gt;American Psycho&lt;/a&gt;: 3.76&lt;/li&gt;
&lt;/ol&gt;&lt;p&gt;&lt;strong&gt;&lt;figure class="tmblr-full" data-orig-height="750" data-orig-width="495" data-orig-src="https://64.media.tumblr.com/37a5e8360a361a2e5c00e7d7fcb9cc53/tumblr_inline_n50c0sZcI81qkcgdm.jpg"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/3427dc1cdee0a6f2b5828993a975a2c1/tumblr_inline_pahc4uHP1Y1qkcgdm_540.jpg" width="100" data-orig-height="750" data-orig-width="495" data-orig-src="https://64.media.tumblr.com/37a5e8360a361a2e5c00e7d7fcb9cc53/tumblr_inline_n50c0sZcI81qkcgdm.jpg"/&gt;&lt;/figure&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;br/&gt;If you&amp;rsquo;re interested in seeing the full list, &lt;a href="https://docs.google.com/spreadsheets/d/1IloYS6IU-sw85ii9E8dEsVSdsMoYYH8_2810bF8Q9U0/edit?usp=sharing" title="https://docs.google.com/spreadsheets/d/1IloYS6IU-sw85ii9E8dEsVSdsMoYYH8_2810bF8Q9U0/edit?usp=sharing" target="_blank"&gt;click here&lt;/a&gt;. This list includes two additional scores: the Flesch Reading Ease score, which uses slightly different weights (with a lower score indicating more readable text), and the &lt;a href="http://en.wikipedia.org/wiki/Gunning_fog_index" title="http://en.wikipedia.org/wiki/Gunning_fog_index" target="_blank"&gt;Gunning fog index&lt;/a&gt;, which incorporates average sentence length and the density of words with three or more syllables.&lt;/p&gt;
&lt;p&gt;A few caveats: first, this list is far from complete. In fact, there were ~100 scripts on the IMSDb that have been taken down (e.g., most of the Batman and all of the Harry Potter movies).&lt;/p&gt;
&lt;p&gt;Second, the analysis is unfairly favorable to scripts with detailed stage directions, which tend to be more complex than dialogue. Here is the first passage from the &amp;ldquo;most sophisticated&amp;rdquo; movie, Fletch:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Seagulls squawk, and the waves pound, but we’re not talking about Malibu Colony, here. This is a fairly rundown beach area, catering to lower-echelon surfers, vagrants, and strung out druggies of all ages, several of whom stand or sit on their haunches by a dilapidated old hamburger stand. Over the stand is a faded sign: &amp;ldquo;FAT SAM’S HAMBURGERS&amp;rdquo;.&lt;/p&gt;
&lt;/blockquote&gt;
&lt;p&gt;In any case, enjoy. In the &lt;a href="http://www.tumblr.com/blog/dfkoz/new/text" title="http://www.tumblr.com/blog/dfkoz/new/text" target="_blank"&gt;immortal words of John McClane&lt;/a&gt;:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Yippe-ki-yea&lt;/p&gt;
&lt;/blockquote&gt;</description><link>https://dfkoz.tumblr.com/post/84628568976</link><guid>https://dfkoz.tumblr.com/post/84628568976</guid><pubDate>Sat, 03 May 2014 12:50:00 -0400</pubDate></item><item><title>The Most Linked-To Wikipedia Articles on Stack Overflow</title><description>&lt;p&gt;I discovered the &lt;a href="http://data.stackexchange.com/stackoverflow/query/new" target="_blank"&gt;Stack Overflow Data Explorer&lt;/a&gt; this morning, which allows you to query the entire catalogue of Stack Overflow posts. After playing with it, I decided to figure out which Wikipedia articles have been linked to most often by Stack users over the past two years.&lt;/p&gt;
&lt;p&gt;Over a two-year period, there were 10,831,467 total Stack Overflow posts. 93,290 (0.86%) posts contained the text &amp;lsquo;en.wikipedia.org&amp;rsquo;, representing 115,591 links. These links pointed to about 25,000 unique articles. 14,600 articles (58.25%) appeared just once. 91.76% of the articles appeared fewer than ten times, accounting for 38.31% of all links.&lt;/p&gt;
&lt;p&gt;The most common link, &lt;a href="http://en.wikipedia.org/wiki/SQL_injection" title="http://en.wikipedia.org/wiki/SQL_injection" target="_blank"&gt;SQL Injection&lt;/a&gt;, was cited over 1,000 times. Another 42* links appeared more than 200 times, and 113 total links appeared more than 100 times. The top 10 most linked-to articles are below. &lt;a href="https://docs.google.com/spreadsheets/d/13G75Edx9qXS2v9ktc1Xl8Fb38P0Nu3DdGs4LmKOGzC0/edit?usp=sharing" title="https://docs.google.com/spreadsheets/d/13G75Edx9qXS2v9ktc1Xl8Fb38P0Nu3DdGs4LmKOGzC0/edit?usp=sharing" target="_blank"&gt;You can see the full list here&lt;/a&gt;. &lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/SQL_Injection" title="en.wikipedia.org/wiki/SQL_Injection" target="_blank"&gt;en.wikipedia.org/wiki/SQL_Injection&lt;/a&gt; (1001 links)&lt;/li&gt;
&lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Same_origin_policy" title="en.wikipedia.org/wiki/Same_origin_policy"&gt;en.wikipedia.org/wiki/Same_origin_policy&lt;/a&gt; (909)&lt;/li&gt;
&lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/JSONP" title="en.wikipedia.org/wiki/JSONP" target="_blank"&gt;en.wikipedia.org/wiki/JSONP&lt;/a&gt; (501)&lt;/li&gt;
&lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Cross-origin_resource_sharing" title="en.wikipedia.org/wiki/Cross-origin_resource_sharing" target="_blank"&gt;en.wikipedia.org/wiki/Cross-origin_resource_sharing&lt;/a&gt; (492)&lt;/li&gt;
&lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Singleton_pattern" title="en.wikipedia.org/wiki/Singleton_pattern" target="_blank"&gt;en.wikipedia.org/wiki/Singleton_pattern&lt;/a&gt; (468)&lt;/li&gt;
&lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Undefined_behavior" title="en.wikipedia.org/wiki/Undefined_behavior" target="_blank"&gt;en.wikipedia.org/wiki/Undefined_behavior&lt;/a&gt; (420)&lt;/li&gt;
&lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller" title="en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller" target="_blank"&gt;en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller&lt;/a&gt; (406)&lt;/li&gt;
&lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Single_responsibility_principle" title="en.wikipedia.org/wiki/Single_responsibility_principle" target="_blank"&gt;en.wikipedia.org/wiki/Single_responsibility_principle&lt;/a&gt; (402)&lt;/li&gt;
&lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Don%27t_repeat_yourself" title="en.wikipedia.org/wiki/Don%27t_repeat_yourself" target="_blank"&gt;en.wikipedia.org/wiki/Don%27t_repeat_yourself&lt;/a&gt; (398)&lt;/li&gt;
&lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Database_normalization" title="en.wikipedia.org/wiki/Database_normalization" target="_blank"&gt;en.wikipedia.org/wiki/Database_normalization&lt;/a&gt; (358)&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;[1] There were about 5 references to the &lt;a href="http://en.wikipedia.org/wiki/Phrases_from_The_Hitchhiker's_Guide_to_the_Galaxy#Answer_to_the_Ultimate_Question_of_Life.2C_the_Universe.2C_and_Everything_.2842.29" title="http://en.wikipedia.org/wiki/Phrases_from_The_Hitchhiker's_Guide_to_the_Galaxy#Answer_to_the_Ultimate_Question_of_Life.2C_the_Universe.2C_and_Everything_.2842.29"&gt;answer to the ultimate question of Life, the Universe, and Everything appeared.&lt;/a&gt;&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/83927509717</link><guid>https://dfkoz.tumblr.com/post/83927509717</guid><pubDate>Sat, 26 Apr 2014 12:57:00 -0400</pubDate></item><item><title>When Do Students Apply for Jobs?</title><description>&lt;p&gt;Over at &lt;a href="http://firstmarkcap.com/" title="http://firstmarkcap.com/" target="_blank"&gt;FirstMark&lt;/a&gt;, we&amp;rsquo;re running an internship program called &lt;a href="http://firstmarkelite.com/" title="http://firstmarkelite.com/" target="_blank"&gt;FirstMark Elite&lt;/a&gt; that connects the best undergraduate and graduate computer science students with summer internships at some of the most exciting startups in the world.&lt;/p&gt;
&lt;p&gt;We have received an incredible response from colleges all over the country. I know because I receive an email each and every time a student applies for the program. Over the past few months, I have noticed a distinct pattern in the days and time that we receive applications. So, as part of the &amp;ldquo;data exhaust&amp;rdquo; of running FirstMark Elite, here is an overview of &amp;ldquo;when students apply for jobs&amp;rdquo;. Presumably, these are also graphs of &amp;ldquo;when students aren&amp;rsquo;t eating or having fun.&amp;rdquo;&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/c9a1d8faaf11576a93ab08db13ef5401/tumblr_inline_n00tmzX0cp1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/e9d625b7e111036bcbb2f71917eef92f/tumblr_inline_pahc4tyRMT1qkcgdm_540.png" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/c9a1d8faaf11576a93ab08db13ef5401/tumblr_inline_n00tmzX0cp1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/ca6a87e9c1ae3d04fd78c6fb20553653/tumblr_inline_n00tn69PGB1qkcgdm.png"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/c90bb88831f15ead9e7620ff7f6433cf/tumblr_inline_pahc4ugmwF1qkcgdm_540.png" data-orig-height="300" data-orig-width="500" data-orig-src="https://64.media.tumblr.com/ca6a87e9c1ae3d04fd78c6fb20553653/tumblr_inline_n00tn69PGB1qkcgdm.png"/&gt;&lt;/figure&gt;&lt;/p&gt;

&lt;p&gt;&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/74627765597</link><guid>https://dfkoz.tumblr.com/post/74627765597</guid><pubDate>Sun, 26 Jan 2014 13:22:00 -0500</pubDate><category>data</category><category>analysis</category><category>internship</category></item><item><title>I created an interactive visualization of VC-backed Bitcoin...</title><description>&lt;img src="https://64.media.tumblr.com/4f03c8d606be68af376dce29d6074ea4/tumblr_mwstwatHSP1qmb8i4o1_500.png"/&gt;&lt;br/&gt;&lt;br/&gt;&lt;p&gt;I created an interactive visualization of VC-backed Bitcoin startups using Highcharts and Crunchbase data. See it live here: &lt;a href="http://dfkoz.com/bitcoin/"&gt;http://dfkoz.com/bitcoin/&lt;/a&gt;.&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/68025542173</link><guid>https://dfkoz.tumblr.com/post/68025542173</guid><pubDate>Sun, 24 Nov 2013 22:04:00 -0500</pubDate><category>bitcoin</category><category>data visualization</category></item><item><title>Real-Time Thermometer with SmartThings and PHP</title><description>&lt;p&gt;Here is a &lt;a href="http://dfkoz.com/fmc/temperature.php" title="http://dfkoz.com/fmc/temperature.php"&gt;real-time FirstMark temperature monitor&lt;/a&gt;, created with a &lt;a href="http://www.smartthings.com/" title="http://www.smartthings.com/"&gt;SmartThings&lt;/a&gt; starter kit, &lt;a href="http://www.highcharts.com/" title="http://www.highcharts.com/"&gt;Highcharts.js&lt;/a&gt;, a little bit of PHP, and MySQL. (Doesn&amp;rsquo;t look great on mobile yet.)&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="368" data-orig-width="500"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/f8f3f0a714a41d1d0e0634e3f3b4f9eb/2db72272fc5e9853-46/s540x810/603d8b36d625dfdddcbad510b1ea005e6cd1c37e.png" data-orig-height="368" data-orig-width="500"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;Now we&amp;rsquo;ll always know whether it's &lt;a href="http://firstmarkcap.com/team/lawrence-lenihan/" title="http://firstmarkcap.com/team/lawrence-lenihan/"&gt;LDL&lt;/a&gt; or &lt;a href="http://firstmarkcap.com/team/rick-heitzmann/" title="http://firstmarkcap.com/team/rick-heitzmann/"&gt;RH&lt;/a&gt; temperature! This is how crazy simple the SmartThings SmartApp code is:&lt;/p&gt;
&lt;div class="gist"&gt;&lt;a href="https://gist.github.com/dfkoz/7442030"&gt;https://gist.github.com/dfkoz/7442030&lt;/a&gt;&lt;/div&gt;</description><link>https://dfkoz.tumblr.com/post/66830792722</link><guid>https://dfkoz.tumblr.com/post/66830792722</guid><pubDate>Tue, 12 Nov 2013 20:35:00 -0500</pubDate></item><item><title>The Most Popular Classic Rock Songs </title><description>&lt;p&gt;Using the &lt;a href="http://api.yes.com/" title="http://api.yes.com/"&gt;yes.com API&lt;/a&gt;, I was able to load playlists for over 1,000 different radio stations. There is a lot of data to analyze here, but one of the first questions I sought to answer was &amp;ldquo;What are the most popular songs and artists on classic rock radio?&amp;rdquo;&lt;/p&gt;
&lt;p&gt;Originally, I categorized any station that played Aerosmith and Led Zeppelin at least once per day as a classic rock station. An astute friend pointed out that I was &amp;ldquo;sampling on the dependent variable.&amp;rdquo; To address this, I filtered for stations that played (i) at least one song by Pink Floyd, Led Zeppelin, The Beatles, Fleetwood Mac, the Rolling Stones, and Aerosmith, and (ii) played at least one song by all of those artists. The source data do not have reliable genre tags, so this is the most appropriate test I could think of; I am open to alternative suggestions.&lt;/p&gt;
&lt;p&gt;Of the 1,000 radio stations, 91 are considered &amp;lsquo;classic rock.' Across 1 week, 111,000 songs, 5,421 unique songs, and 1,448 unique artists, here are the twenty most popular songs:&lt;/p&gt;

&lt;p&gt;  &lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Sweet Home Alabama, Lynyrd Skynyrd (0.55 plays/day)&lt;/li&gt;
&lt;li&gt;Sweet Emotion, Aerosmith (0.53 plays/day)&lt;/li&gt;
&lt;li&gt;Carry On Wayward Son, Kansas (0.51 plays/day)&lt;/li&gt;
&lt;li&gt;More Than A Feeling, Boston (0.49 plays/day)&lt;/li&gt;
&lt;li&gt;Dream On, Aerosmith (0.47 plays/day)&lt;/li&gt;
&lt;li&gt;Another Brick In The Wall (Pt II), Pink Floyd (0.47 plays/day)&lt;/li&gt;
&lt;li&gt;The Joker, Steve Miller (0.46 plays/day)&lt;/li&gt;
&lt;li&gt;Don&amp;rsquo;t Stop Believin&amp;rsquo;, Journey (0.45 plays/day)&lt;/li&gt;
&lt;li&gt;We Will Rock You, Queen (0.45 plays/day)&lt;/li&gt;
&lt;li&gt;Another One Bites The Dust, Queen (0.45 plays/day)&lt;/li&gt;
&lt;li&gt;You Shook Me All Night Long, AC/DC (0.45 plays/day)&lt;/li&gt;
&lt;li&gt;Renegade, Styx (0.45 plays/day)&lt;/li&gt;
&lt;li&gt;Wanted Dead Or Alive, Bon Jovi (0.44 plays/day)&lt;/li&gt;
&lt;li&gt;Magic Man, Heart (0.44 plays/day)&lt;/li&gt;
&lt;li&gt;I Want You To Want Me (Live), Cheap Trick (0.44 plays/day)&lt;/li&gt;
&lt;li&gt;We Are The Champions, Queen (0.43 plays/day)&lt;/li&gt;
&lt;li&gt;Barracuda, Heart (0.43 plays/day)&lt;/li&gt;
&lt;li&gt;Centerfold, J. Geils Band (0.43 plays/day)&lt;/li&gt;
&lt;li&gt;I Love Rock 'N Roll, Joan Jett (0.43 plays/day)&lt;/li&gt;
&lt;li&gt;Free Fallin&amp;rsquo;, Tom Petty (0.42 plays/day)&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;These are the twenty most popular classic rock artists:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;Led Zeppelin (5.30 plays/day)&lt;/li&gt;
&lt;li&gt;Van Halen (4.36 plays/day)&lt;/li&gt;
&lt;li&gt;Rolling Stones (4.18 plays/day)&lt;/li&gt;
&lt;li&gt;Pink Floyd (3.92 plays/day)&lt;/li&gt;
&lt;li&gt;Aerosmith (3.63 plays/day)&lt;/li&gt;
&lt;li&gt;Queen (3.59 plays/day)&lt;/li&gt;
&lt;li&gt;Tom Petty (3.55 plays/day)&lt;/li&gt;
&lt;li&gt;AC/DC (3.14 plays/day)&lt;/li&gt;
&lt;li&gt;Journey (3.09 plays/day)&lt;/li&gt;
&lt;li&gt;Eagles (2.98 plays/day)&lt;/li&gt;
&lt;li&gt;Boston (2.82 plays/day)&lt;/li&gt;
&lt;li&gt;ZZ Top (2.74 plays/day)&lt;/li&gt;
&lt;li&gt;Steve Miller (2.60 plays/day)&lt;/li&gt;
&lt;li&gt;Foreigner (2.59 plays/day)&lt;/li&gt;
&lt;li&gt;Def Leppard (2.57 plays/day)&lt;/li&gt;
&lt;li&gt;Who (2.42 plays/day)&lt;/li&gt;
&lt;li&gt;Beatles (2.42 plays/day)&lt;/li&gt;
&lt;li&gt;U2 (2.26 plays/day)&lt;/li&gt;
&lt;li&gt;Police (2.16 plays/day)&lt;/li&gt;
&lt;li&gt;Lynyrd Skynyrd (2.15 plays/day)&lt;/li&gt;
&lt;/ol&gt;</description><link>https://dfkoz.tumblr.com/post/64038985073</link><guid>https://dfkoz.tumblr.com/post/64038985073</guid><pubDate>Mon, 14 Oct 2013 14:49:00 -0400</pubDate></item><item><title>Send a Random Vacation Photo Each Day with Honeymailer</title><description>&lt;p&gt;I was backing up some photos the other day and spent 20 wonderful minutes flipping through pictures from our honeymoon. I thought it was a shame that we rarely took time to enjoy old photos. We snapped 1,700 on a month-long honeymoon in Spain, and have thousands more from other trips. Then, inspiration struck: why not get a regular email with a random photo from our personal archives? Thus was born Honeymailer, a completely passive and spontaneous way to enjoy some old memories.&lt;/p&gt;
&lt;p&gt;Now, my wife and I get gems once a day, like this photo from Ainsa, Spain:&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="300" data-orig-width="400"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/f9b3effaa2b7a0c6febe3edb448020e1/4dc1078107a45483-6a/s540x810/071db3a066c82d34cf3c7a869bb4af44ae5c54f9.jpg" data-orig-height="300" data-orig-width="400"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;Here is the &lt;a href="https://gist.github.com/dfkoz/5860786" target="_blank"&gt;code for Honeymailer&lt;/a&gt;. It runs as a Google Apps Script. If you have any trouble setting this up, don&amp;rsquo;t hesitate to leave a comment.&lt;/p&gt;
&lt;div class="gist"&gt;&lt;a href="https://gist.github.com/dfkoz/5860786"&gt;https://gist.github.com/dfkoz/5860786&lt;/a&gt;&lt;/div&gt;</description><link>https://dfkoz.tumblr.com/post/53857988296</link><guid>https://dfkoz.tumblr.com/post/53857988296</guid><pubDate>Tue, 25 Jun 2013 14:21:00 -0400</pubDate><category>photos</category><category>email</category><category>google apps</category><category>code</category></item><item><title>According to Twitter, Chris Bosh looks like...</title><description>&lt;p&gt;Based on a sample of over 2,000 tweets with the text &amp;ldquo;Chris Bosh looks like&amp;rdquo;:&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="405" data-orig-width="480"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/37612d45d940c685d7cf9c2c8f3caadf/f802c8dfce728e9e-c6/s540x810/0ea6549b0fc3c67cdbd8d470923c968de31c4d02.png" data-orig-height="405" data-orig-width="480"/&gt;&lt;/figure&gt;&lt;/p&gt;

&lt;p&gt;&lt;/p&gt;

&lt;p&gt;&lt;/p&gt;

&lt;p&gt;&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/53452979482</link><guid>https://dfkoz.tumblr.com/post/53452979482</guid><pubDate>Thu, 20 Jun 2013 14:20:00 -0400</pubDate><category>chris bosh</category><category>heat</category><category>nba</category><category>finals</category></item><item><title>Two days in the Citi Bike program. Red areas indicate that more...</title><description>&lt;img src="https://64.media.tumblr.com/b75b318e2441106e293c4175a2ab9e3e/tumblr_mo150sojYb1qmb8i4o1_250.gifv"/&gt;&lt;br/&gt;&lt;br/&gt;&lt;p&gt;Two days in the Citi Bike program. Red areas indicate that more bikes are available relative to the green areas. Full heatmap at &lt;a href="http://dfkoz.com/citibike/" target="_blank"&gt;dfkoz.com/citibike/&lt;/a&gt;.&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/52384343656</link><guid>https://dfkoz.tumblr.com/post/52384343656</guid><pubDate>Fri, 07 Jun 2013 11:43:40 -0400</pubDate><category>citibike</category><category>maps</category><category>data visualization</category></item><item><title>Heatmap of Citi Bike density over time</title><description>&lt;a href="http://dfkoz.com/citibike/"&gt;Heatmap of Citi Bike density over time&lt;/a&gt;</description><link>https://dfkoz.tumblr.com/post/51895881419</link><guid>https://dfkoz.tumblr.com/post/51895881419</guid><pubDate>Sat, 01 Jun 2013 13:38:07 -0400</pubDate></item><item><title>The Rock and Roll Hall of Fame: From Acid Rock to Zydeco</title><description>&lt;p class="MsoNormal"&gt;The &lt;a href="http://www.rockhall.com/"&gt;Rock and Roll Hall of Fame&lt;/a&gt; completed its 26&lt;sup&gt;th&lt;/sup&gt; induction ceremony about a week ago, honoring Rush, Heart, Randy Newman, Donna Summer, Albert King, and Public Enemy. This year’s crop of inductees struck me as more appropriate for a “Popular Music Hall of Fame” than one dedicated specifically to rock. In fact, if I were a betting man, I would put money down on a name change in the next 10 years.&lt;/p&gt;

&lt;p class="MsoNormal"&gt;The first hip hop artist to be inducted into the Hall was Grandmaster Flash and The Furious Five in 2007. Since then, Run D.M.C. (2009) and the Beastie Boys (2012) have also been inducted. Artists are eligible for election beginning 25 years after their first record, so a large cohort of popular hip hop artists will become eligible over the next decade; the Grammy for &lt;a href="http://en.wikipedia.org/wiki/Grammy_Award_for_Best_Rap_Album"&gt;Best Rap Album&lt;/a&gt; was first awarded (to Naughty by Nature) in 1996. &lt;/p&gt;
&lt;p class="MsoNormal"&gt;There are also &lt;a href="http://countrymusichalloffame.org/trivia-for-group-tours/"&gt;11 artists&lt;/a&gt; that have been inducted into both the Country Music Hall of Fame and the Rock and Roll Hall of Fame, including Elvis and Johnny Cash. Of course, there are at least three reasons this is not surprising: genres are fluid, artists’ styles evolve, and—perhaps most importantly—rock was born, in part, from country music. &lt;/p&gt;
&lt;p class="MsoNormal"&gt;Even Miles Davis, who “flirted with rock from the late 1960&amp;rsquo;s on” was inducted as a performer, which the New York Times described as “&lt;a href="http://www.nytimes.com/2006/03/13/arts/music/13mile.html?_r=0"&gt;provocative for a second, and then a little meaningless&lt;/a&gt;.” In any case, this made me wonder about the breadth of genres represented across the Hall’s &lt;a href="http://www.rockhall.com/inductees/"&gt;279 inductees&lt;/a&gt;, &lt;a href="http://en.wikipedia.org/wiki/List_of_Rock_and_Roll_Hall_of_Fame_inductees"&gt;179 of whom are performers&lt;/a&gt;. &lt;/p&gt;
&lt;p class="MsoNormal"&gt;I wrote a short script to crawl Wikipedia pages for each inducted performer and tabulate the genres mentioned. Neil Young’s page, for example, lists “rock” in the &lt;a href="http://investmentbankinginsight.com/investment-banking-terminology-lead-left-bookrunner/"&gt;lead left&lt;/a&gt; position followed by “folk rock, country rock, experimental rock, hard rock”. This surfaced 144 unique genres, 50% of which are attached to only one artist, including baroque pop (Bee Gees), bubblegum pop (The Jackson 5), drone (The Velvet Underground), glam metal (Alice Cooper), jangle pop (R.E.M.), krautrock (David Bowie), spoken word (Leonard Cohen), and zydeco (Dr. John).&lt;/p&gt;
&lt;p class="MsoNormal"&gt;There is nothing extraordinarily surprising about the overall distribution of genres. The Rock and Roll Hall of Fame has, for the most part, been true to its name &amp;ndash; although we will see how much longer that remains true.&lt;/p&gt;

&lt;p&gt;&lt;/p&gt;
&lt;p class="MsoNormal"&gt;&lt;strong&gt;&amp;ldquo;Lead left&amp;rdquo; genre, all genres with at least 5 inductees&lt;br/&gt;&lt;figure class="tmblr-full" data-orig-height="325" data-orig-width="500"&gt;&lt;img src="https://64.media.tumblr.com/f0ad44396ff70279a4282eccb928b314/f89421c1e2d6d697-48/s540x810/74b7ae6acbd5e2c7a65a84a0a0a3ab940cd94b90.png" data-orig-height="325" data-orig-width="500"/&gt;&lt;/figure&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p class="MsoNormal"&gt;&lt;strong&gt;% of inductees tagged with most common genres&lt;br/&gt;&lt;figure class="tmblr-full" data-orig-height="314" data-orig-width="500"&gt;&lt;img src="https://64.media.tumblr.com/b753cbdddce340b0ea5065fe628634de/f89421c1e2d6d697-25/s540x810/ea29961849e44f0f9558826b422b302f1ef76500.png" data-orig-height="314" data-orig-width="500"/&gt;&lt;/figure&gt;&lt;/strong&gt;&lt;/p&gt;

&lt;p class="MsoNormal"&gt;&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/51605648264</link><guid>https://dfkoz.tumblr.com/post/51605648264</guid><pubDate>Tue, 28 May 2013 21:22:11 -0400</pubDate></item><item><title>Peak social? Numbers show new social startups are declining</title><description>&lt;p&gt;The following graph shows the total percentage of startups listed in Crunchbase that include the word &amp;ldquo;social&amp;rdquo; in either the company&amp;rsquo;s description or tags, by year founded.&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="290" data-orig-width="482"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/793863ff054a4ff30ee9b382ea6e5cb2/40beff00a07bc7e7-cd/s540x810/51f77ee0b28378fed7b59e2448830b873eb1c5aa.png" data-orig-height="290" data-orig-width="482"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;In response to a comment regarding percentages versus count of startups, I cooked up the following graph. This shows the total number of &amp;ldquo;social&amp;rdquo; startups founded in a given year (blue) and the total number of startups founded according (blue + gray).&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="316" data-orig-width="482"&gt;&lt;img src="https://64.media.tumblr.com/e3f0be065bd4c67889247be1fe111300/40beff00a07bc7e7-08/s540x810/16d0bdbdb258689190cc423ca204ca753388b4b9.png" data-orig-height="316" data-orig-width="482"/&gt;&lt;/figure&gt;&lt;/p&gt;

&lt;p&gt;This chart shows an increasing number of startups founded between 2005 and 2011, with a 33% decline from 2011 to 2012. I am not sure whether entrepreneurial activity has increased or decreased over the past two years, but I am fairly certain that the startup activity did not drop by a third. Instead, my guess is that either (i) data quality is higher for startups that have been around longer, or (ii) that Crunchbase is becoming a less popular place to list one&amp;rsquo;s startup (see &lt;a href="https://angel.co/" title="https://angel.co/" target="_blank"&gt;AngelList&lt;/a&gt;). This is why the original analysis looked at the data on a percentage basis.&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/46366906094</link><guid>https://dfkoz.tumblr.com/post/46366906094</guid><pubDate>Tue, 26 Mar 2013 17:58:00 -0400</pubDate></item><item><title>sxBlock: A Chrome Extension That Hides All SXSW-Related Tweets </title><description>&lt;p&gt;I have seen a lot of demand on Twitter for a way to filter out SXSW-related tweets from one&amp;rsquo;s timeline. Well, now you can do that with sxBlock. You can &lt;a href="https://chrome.google.com/webstore/detail/truthinbrowsing/bpkmbefecpfodkkcmophemddibhnldpa" title="sxBlock" target="_blank"&gt;download it here&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Also, if you would like to help refine the extension, &lt;a href="https://github.com/dfkoz/sxBlock" target="_blank"&gt;here is the source code&lt;/a&gt;.&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/44978033431</link><guid>https://dfkoz.tumblr.com/post/44978033431</guid><pubDate>Sat, 09 Mar 2013 18:50:00 -0500</pubDate></item><item><title>CPM and Price of a Super Bowl Ad Over Time</title><description>&lt;p&gt;Yesterday the &lt;a href="http://www.nytimes.com/2013/01/09/business/media/a-sellout-for-super-bowl-commercial-time.html?_r=0" title="Super Bowl Commercial Time Is a Sellout" target="_blank"&gt;New York Times reported&lt;/a&gt; that the average price for a Super Bowl ad was between $3.7 million and $3.8 million this year. Here is the average price of a Super Bowl spot over time, adjusted for inflation:&lt;/p&gt;
&lt;p&gt;&lt;img src="https://64.media.tumblr.com/c973a1f4e304cab253baceb42f5171d4/tumblr_inline_mgd6a6tSFk1qkcgdm.bmp"/&gt;&lt;/p&gt;
&lt;p&gt;CPM represents the cost for every thousand viewers. The ratings for each game are from &lt;a href="http://tvbythenumbers.zap2it.com/2012/02/04/will-super-bowl-xlvi-tv-viewership-set-another-record-poll-ratings-history/118656/" title="Super Bowl Ratings History" target="_blank"&gt;Zap2It&lt;/a&gt;. I found historical price data &lt;a href="http://en.wikipedia.org/wiki/File:Superbowl_30sec_2011_adjusted.png" title="Super Bowl Ad Price Data" target="_blank"&gt;on Wikipedia&lt;/a&gt;, although I do not completely trust the numbers; I edited what appears to be an overly precipitous price drop after the dot com bubble burst. That said, others have been through this exercise before and the graphs look &lt;a href="http://www.businessinsider.com/chart-the-incredible-inflation-of-super-bowl-ad-prices-since-67-2012-1" title="CHART: The Incredible Inflation Of Super Bowl Ad Prices Since '67  Read more: http://www.businessinsider.com/chart-the-incredible-inflation-of-super-bowl-ad-prices-since-67-2012-1#ixzz2HUY1whOo" target="_blank"&gt;quite similar&lt;/a&gt;, so I believe the data are approximately correct.&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/40096021369</link><guid>https://dfkoz.tumblr.com/post/40096021369</guid><pubDate>Wed, 09 Jan 2013 10:20:30 -0500</pubDate><category>data</category><category>visualization</category><category>analysis</category><category>football</category><category>super bowl</category><category>ad</category><category>prices</category><category>cost</category><category>advertisement</category></item><item><title>Fewer Crimes Are Committed on Christmas</title><description>&lt;p&gt;Based on data from &lt;a href="http://opendataphilly.org/opendata/resource/215/philadelphia-police-part-one-crime-incidents/" target="_blank"&gt;the Philadelphia Police Department&lt;/a&gt;, 1,640 crimes are committed each day in Philadelphia. That number falls to 838, or about half of the average, on Christmas day.&lt;/p&gt;
&lt;p&gt;The following chart shows the number of crimes reported on each calendar day between 2006 and 2012. Note that crime data for the last few days of December 2012 is projected based on 2006-2011 data.&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="203" data-orig-width="500"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/4fbf72d8d04e21c34fb1e72fa8582b41/4be095388341357a-31/s540x810/ddf3da6204125872f419d495fd76939cb4af6b6e.png" data-orig-height="203" data-orig-width="500"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;The graph does not show a post-Christmas spike, so the drop in crime appears to be real (i.e., the PD does not defer the reporting and timestamping of a crime until the next working day). The next nine lowest-crime days are all in the cold of February, which is &lt;a href="http://online.wsj.com/article/SB10001424052748703995104575389461974136120.html" target="_blank"&gt;true of New York City&lt;/a&gt;, too.&lt;/p&gt;
&lt;p&gt;Other holidays show significant drops in crime. For the following graph, 11/1 corresponds to the first Monday in November, 11/2 to the first Tuesday, 11/8 to the second Monday and so on. For simplicity, only 28 days are plotted for each month.&lt;/p&gt;
&lt;p&gt;&lt;figure class="tmblr-full" data-orig-height="203" data-orig-width="500"&gt;&lt;img alt="image" src="https://64.media.tumblr.com/d5f0cea2acad192eac04a67acea3fe2e/4be095388341357a-1e/s540x810/0573cf253120db38d3158f13febcb2d94dc4e3af.png" data-orig-height="203" data-orig-width="500"/&gt;&lt;/figure&gt;&lt;/p&gt;
&lt;p&gt;Here, the day with the fewest crimes is the first Wednesday in March, which shows a 38% drop from the average. I am not sure why. Thanksgiving is the sixth safest day with a 32% reduction. Labor Day, the first Monday in September, exhibits a 12% drop while the first Sunday in September (the day before Labor Day in six of the past seven years) shows an 11% drop. Crime is at or above average across other days in early September.&lt;/p&gt;
&lt;p&gt;Wishing everyone a happy and safe holiday season! And in case you are wondering, New Year&amp;rsquo;s Eve and Day are in fact 17% and 11% safer than an average day.&lt;/p&gt;</description><link>https://dfkoz.tumblr.com/post/38977249758</link><guid>https://dfkoz.tumblr.com/post/38977249758</guid><pubDate>Thu, 27 Dec 2012 16:09:00 -0500</pubDate><category>data</category><category>visualization</category><category>crime</category><category>statistics</category><category>holidays</category><category>police</category></item></channel></rss>
