<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/atom10full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><feed xmlns="http://www.w3.org/2005/Atom" xmlns:openSearch="http://a9.com/-/spec/opensearch/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:gd="http://schemas.google.com/g/2005" xmlns:thr="http://purl.org/syndication/thread/1.0" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" gd:etag="W/&quot;AkACRn89cSp7ImA9WhRaFEk.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876</id><updated>2012-02-16T19:19:27.169-08:00</updated><title>変化球　Henkakyuu</title><subtitle type="html">Communication by design</subtitle><link rel="http://schemas.google.com/g/2005#feed" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/posts/default" /><link rel="alternate" type="text/html" href="http://henkakyuu.blogspot.com/" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><generator version="7.00" uri="http://www.blogger.com">Blogger</generator><openSearch:totalResults>16</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/atom+xml" href="http://feeds.feedburner.com/Henkakyuu" /><feedburner:info uri="henkakyuu" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><entry gd:etag="W/&quot;C08CQnY7eCp7ImA9WhZbF0k.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-7316413883894703552</id><published>2011-06-22T04:24:00.000-07:00</published><updated>2011-06-22T04:24:23.800-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-06-22T04:24:23.800-07:00</app:edited><title>I Was Wrong</title><content type="html">It's been a while. Over 3 months since the quake, a lot of new information on the issues at Fukushima have been released. I can clearly say that I was incredibly wrong.&lt;br /&gt;
&lt;br /&gt;
I still believe in Nuclear Power. I believe that the graph from &lt;a href="http://nextbigfuture.com/2011/03/deaths-per-twh-by-energy-source.html"&gt;this site&lt;/a&gt;&amp;nbsp;says it best:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/-sK6W-1c4_HQ/TgHQSAlFtKI/AAAAAAAAATQ/pUswvavyYQQ/s1600/deathperwatts.jpg" imageanchor="1" style="margin-left:1em; margin-right:1em"&gt;&lt;img border="0" height="218" width="320" src="http://1.bp.blogspot.com/-sK6W-1c4_HQ/TgHQSAlFtKI/AAAAAAAAATQ/pUswvavyYQQ/s320/deathperwatts.jpg" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
Nuclear has clear dangers. We have seen that. And the damage is incredibly scary. But there are also clear dangers from coal and oil. They are just less sudden and scary.&lt;br /&gt;
&lt;br /&gt;
I'm sorry it took so long to admit my mistake, but my mistake has to be corrected. I apologize for any confusion, and hope that people take this horrible accident as an opportunity to educate themselves about the dangers of power generation, regardless of the source. We depend on electricity, and we need to generate it. We also need to manage the risks.&lt;br /&gt;
&lt;br /&gt;
That is our job as consumers of electricity. Let's get to it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-7316413883894703552?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/bZufuPOLchQAujQZDM6QNb807BU/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/bZufuPOLchQAujQZDM6QNb807BU/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/bZufuPOLchQAujQZDM6QNb807BU/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/bZufuPOLchQAujQZDM6QNb807BU/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/c9_OYiKOn7Y" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/7316413883894703552/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2011/06/i-was-wrong.html#comment-form" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/7316413883894703552?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/7316413883894703552?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/c9_OYiKOn7Y/i-was-wrong.html" title="I Was Wrong" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/-sK6W-1c4_HQ/TgHQSAlFtKI/AAAAAAAAATQ/pUswvavyYQQ/s72-c/deathperwatts.jpg" height="72" width="72" /><thr:total>1</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2011/06/i-was-wrong.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D08HSHo9eyp7ImA9WhZTEU0.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-3460219109866991139</id><published>2011-03-14T05:29:00.000-07:00</published><updated>2011-03-14T05:30:39.463-07:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-03-14T05:30:39.463-07:00</app:edited><title>A Brief Intermission on Nuclear Power</title><content type="html">Full Disclosure: I work for a Japanese manufacturer that creates nuclear power plants (among all sorts of other wonderful things, though not the manufacturer of the plants in question). I strongly believe in the safety of nuclear power. After the massive earthquake and Tsunami in Eastern Japan, I was utterly shocked by the doom and gloom journalism from the US, especially with newscasters throwing around the word "Meltdown" as a segue to calling Fukushima a potential Chernobyl.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;There is virtually no possibility of any of the Fukushima Reactors turning in to Chernobyl&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
There's this thing called the &lt;a href="http://en.wikipedia.org/wiki/International_Nuclear_Event_Scale"&gt;International Nuclear Event Scale&lt;/a&gt;&amp;nbsp;that measures how bad a nuclear accident is. 0 is finding a gum wrapper on the floor. 7 is Chernobyl. So where is a "Meltdown"? Well, it isn't on the scale. See, a "Meltdown" is when the reactor of the plant melts. It can melt some (Three Mile Island) or melt a lot and then explode (Chernobyl). The scale indicates what's important to us -- what effect does the incident have on the people and the environment?&lt;br /&gt;
&lt;br /&gt;
From all the information I've read on the Fukushima incidents, this would be rated as a 3 currently. That's less than Three Mile Island (INES 5), and certainly less than Chernobyl (INES 7). The official report isn't out as the incident isn't over, but so far there have been no deaths and no exposure to the deadliest radioactive materials in the reactor.&lt;br /&gt;
&lt;br /&gt;
Japanese plants have been really good up until now. If you can read Japanese, you can read the reports on all incidents that rank on the INES scale on the &lt;a href="http://www.nucia.jp/"&gt;Japanese NUCIA resource&lt;/a&gt;. The worst up until now on the INES scale was a 4 at &lt;a href="http://en.wikipedia.org/wiki/Tokaimura_nuclear_accident"&gt;Tokaimura&lt;/a&gt;&amp;nbsp;where two workers died.&lt;br /&gt;
&lt;br /&gt;
If you want technical details on the accident, I suggest reading &lt;a href="http://morgsatlarge.wordpress.com/2011/03/13/why-i-am-not-worried-about-japans-nuclear-reactors/"&gt;this article&lt;/a&gt;.&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;
&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;What is a "Meltdown"?&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
Nuclear power plants have Uranium in them that fire off neutrons in a reaction. Those neutrons are slowed down by water so that they can hit more Uranium and create more neutrons. And so on. That reaction creates a lot of heat. That heat is used to turn water to steam and generate electricity in giant turbines. The water, in turn, helps cool the reactor.&lt;br /&gt;
&lt;br /&gt;
When nuclear power plants want to stop, they insert "control rods" which catch the neutrons flying around to slow down the reaction. That is not instant. It takes days for the reaction to totally stop. When the earthquake hit, the reactor automatically inserted the control rods to stop the reaction.&lt;br /&gt;
&lt;br /&gt;
While the reaction slows, it still generates heat. Water is still circulated to cool the reactor. When the electricity went off, the pumps stopped working. The backup is a set of diesel generators. The tsunami damaged the generators. The third backup is 8 hours of battery power. After 8 hours, they could no longer circulate water to the core, so it started heating up.&lt;br /&gt;
&lt;br /&gt;
The building the reactor is a really thick steel pressure vessel center to contain the reactor and the nuclear material. The nuclear material is contained in fuel rods. As long as those rods stay sealed (and they are designed to withstand over 2000 degrees celsius), the really radioactive stuff is contained.&lt;br /&gt;
&lt;br /&gt;
If the reactor gets hot enough, those fuel rods melt. That is the start of a meltdown. When the rods melt, the radioactive stuff inside is released. In this case it was Iodine and Cesium. In addition, when the temperature gets that high, water gets separated into hydrogen and oxygen gas. Both of those are explosive, so to prevent an explosion, pressure was released from the reactor into the atmosphere.&lt;br /&gt;
Along with the gases was some of the iodine and cesium. That is bad, of course, but the levels were not high enough to cause lasting health consequences.&lt;br /&gt;
&lt;br /&gt;
The fear is if the core continues to melt. If all attempts to cool the core are unsuccessful and the fuel rods melt entirely and continue to react after they melt, then theoretically they could melt through the giant steel container holding them, and release the fuel (uranium/plutonium) into the atmosphere. That would be bad, but it is predicated on so many worst case scenarios. While this started because of a worse-than-worst case scenario, the likelihood of things continuing to go that bad is very very small.&lt;br /&gt;
&lt;br /&gt;
This reactor was built 40 years ago using 40+ year-old technology. It was designed to withstand a Magnitude 8.0 quake. It has been hit by a 9.0 quake, a several meter tsunami, and the accident has still be contained within the safety measures it was designed with.&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;
&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;Nuclear Power is Safe&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
Yes, nuclear accidents are really really scary. The potential of Chernobyl is a huge scary cloud looming over our heads. And at times when the news is showing explosions and radiation released to the atmosphere, it's hard to think of nuclear power as safe.&lt;br /&gt;
&lt;br /&gt;
But it is also unfair to nuclear plants. In the US the worst accident was Three Mile Island. Nobody died in that accident, yet no nuclear power plants have been built in the US since.&amp;nbsp;By comparison, &lt;a href="http://www.msha.gov/accinj/accinj.htm"&gt;50 people died from coal mining in the US in 2010 alone&lt;/a&gt;. Coal accounts for almost 50% of US electricity generation.&lt;br /&gt;
&lt;br /&gt;
Nuclear Power is CO2-emission and NOx-emission free. It has an incredible capacity for base load&amp;nbsp;generation, and countries like France are able to produce over 80% of their electricity from Nuclear alone. &lt;a href="http://web.mit.edu/nuclearpower/"&gt;MIT conducted a great study &lt;/a&gt;on the future of Nuclear Power in 2003 if you'd like to learn more.&lt;br /&gt;
&lt;br /&gt;
Please educate yourself rather than making a knee-jerk reaction. Electricity demand is growing as the population of the world increases, and that means a need to match that demand. Nuclear is one option we need to consider seriously as an emission-free source of power.&lt;br /&gt;
&lt;b&gt;&lt;/b&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-3460219109866991139?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/S5kSmB2TH4IkMsLDd5hPMyOTVRI/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/S5kSmB2TH4IkMsLDd5hPMyOTVRI/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/S5kSmB2TH4IkMsLDd5hPMyOTVRI/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/S5kSmB2TH4IkMsLDd5hPMyOTVRI/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/aBq9cilxB1M" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/3460219109866991139/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2011/03/brief-intermission-on-nuclear-power.html#comment-form" title="8 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/3460219109866991139?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/3460219109866991139?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/aBq9cilxB1M/brief-intermission-on-nuclear-power.html" title="A Brief Intermission on Nuclear Power" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><thr:total>8</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2011/03/brief-intermission-on-nuclear-power.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CkMERHg7fSp7ImA9Wx9VGEs.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-4865401469006341860</id><published>2011-02-04T15:00:00.000-08:00</published><updated>2011-02-04T15:00:05.605-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-02-04T15:00:05.605-08:00</app:edited><title>Top 500 Position Players by WAR</title><content type="html">My series of top 500 Position Players by WAR is finished and up on Fangraphs. If you missed it, please take a minute to check it out if it interests you:&lt;br /&gt;
&lt;br /&gt;
&lt;p style="text-align: center"&gt;&lt;em&gt;&lt;a href="http://www.fangraphs.com/blogs/index.php/position-player-fwar-19th-century/"&gt;Baseball Prehistory&lt;/a&gt; | &lt;a href="http://www.fangraphs.com/blogs/index.php/position-players-by-war-deadball-era/"&gt;Deadball Era&lt;/a&gt; | &lt;a href="http://www.fangraphs.com/blogs/index.php/position-players-by-war-liveball-era/"&gt;Liveball Era&lt;/a&gt; | &lt;a href="http://www.fangraphs.com/blogs/index.php/position-players-by-war-post-war-era/"&gt;Post-War&lt;/a&gt;&lt;br /&gt;
&lt;a href="http://www.fangraphs.com/blogs/index.php/position-players-by-war-expansion-era/"&gt;Expansion&lt;/a&gt; | &lt;a href="http://www.fangraphs.com/blogs/index.php/position-players-by-war-free-agency/"&gt;Free Agency&lt;/a&gt; | &lt;a href="http://www.fangraphs.com/blogs/index.php/position-players-by-war-modern-era/"&gt;Modern Era&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-4865401469006341860?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/k_VspSlI8-W28NDtec8toKeTpGI/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/k_VspSlI8-W28NDtec8toKeTpGI/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/k_VspSlI8-W28NDtec8toKeTpGI/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/k_VspSlI8-W28NDtec8toKeTpGI/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/h1RfIuOGi-4" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/4865401469006341860/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2011/02/top-500-position-players-by-war.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/4865401469006341860?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/4865401469006341860?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/h1RfIuOGi-4/top-500-position-players-by-war.html" title="Top 500 Position Players by WAR" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><thr:total>0</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2011/02/top-500-position-players-by-war.html</feedburner:origLink></entry><entry gd:etag="W/&quot;A0YCSH88cSp7ImA9Wx9VEUo.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-6334192247706017620</id><published>2011-01-27T17:46:00.000-08:00</published><updated>2011-01-27T17:46:09.179-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-01-27T17:46:09.179-08:00</app:edited><title>A Correction for Jon Peltier</title><content type="html">The way the web works is funny. I have been reading &lt;a href="http://peltiertech.com/WordPress/"&gt;Jon Peltier's blog&lt;/a&gt; for several years now, and learned a lot of what I can do in Excel, and about principles of design from his blog (and ones like it). I was shocked when he commented on my blog when &lt;a href="http://henkakyuu.blogspot.com/2011/01/rules-of-thumb-for-visualization.html"&gt;I wrote about basic principles of data visualization for baseball&lt;/a&gt;. I was possibly more shocked when I read his comment and realized that he was very right and I'd made a charting faux-pas.&lt;br /&gt;
&lt;br /&gt;
Here is the chart he takes issue with:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSbsSqq3dYI/AAAAAAAAARA/Bcx18-L0PLM/s1600/Pie+Graphs.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="115" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSbsSqq3dYI/AAAAAAAAARA/Bcx18-L0PLM/s320/Pie+Graphs.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
Jon doesn't like pie charts much. In the data visualization community, there's been a lot of back and forth over whether pie charts are useful. Since they are used everywhere, people are familiar with them, so I look on them a bit more favorably than Jon does, but that wasn't the essence of his complaint.&lt;br /&gt;
&lt;br /&gt;
You should never use multiple pie charts if you're going to compare them. It's just no easy to compare the size of slices across multiple pie charts.&lt;br /&gt;
&lt;br /&gt;
So instead, Jon recommends using bar charts, like this:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_AtlVw4SMQfg/TUIeI6c7U5I/AAAAAAAAASY/e0IHCy0qPy8/s1600/Reimagined+Pie+Chart.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TUIeI6c7U5I/AAAAAAAAASY/e0IHCy0qPy8/s320/Reimagined+Pie+Chart.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;br /&gt;
&lt;/div&gt;And I think I have to admit it looks a lot better, and is a lot easier to compare between the various players this way. I still think pie charts are okay if you keep them simple, but I didn't follow my own advice and keep them simple enough. This is probably more the proper simplicity:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_AtlVw4SMQfg/TUIf0Hz1wOI/AAAAAAAAASc/FxTQOECzqK8/s1600/Jon+Pie+Chart.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://4.bp.blogspot.com/_AtlVw4SMQfg/TUIf0Hz1wOI/AAAAAAAAASc/FxTQOECzqK8/s320/Jon+Pie+Chart.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-6334192247706017620?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/nXlUoyXP6R4Sb2nQVWyuUSGD5ds/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/nXlUoyXP6R4Sb2nQVWyuUSGD5ds/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/nXlUoyXP6R4Sb2nQVWyuUSGD5ds/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/nXlUoyXP6R4Sb2nQVWyuUSGD5ds/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/MiOpFmdcZsw" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/6334192247706017620/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2011/01/correction-for-jon-peltier.html#comment-form" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/6334192247706017620?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/6334192247706017620?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/MiOpFmdcZsw/correction-for-jon-peltier.html" title="A Correction for Jon Peltier" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSbsSqq3dYI/AAAAAAAAARA/Bcx18-L0PLM/s72-c/Pie+Graphs.png" height="72" width="72" /><thr:total>2</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2011/01/correction-for-jon-peltier.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DkYNQXY9eyp7ImA9Wx9XF0k.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-3556297819763374665</id><published>2011-01-11T03:09:00.000-08:00</published><updated>2011-01-11T03:09:50.863-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-01-11T03:09:50.863-08:00</app:edited><title>Rules of Thumb for Complex Visualization</title><content type="html">Last time I discussed some simple graphing techniques that will make visualizations easier to grasp. This time I want to discuss some slightly more complicated techniques involving graphing more complicated data.&lt;br /&gt;
&lt;br /&gt;
Basic graphs usually tackle only a few pieces of information:&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Bar Graph:&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_AtlVw4SMQfg/TSbsAVbj3mI/AAAAAAAAAQ0/dOMTakuGO94/s1600/Good+Bar+Graph.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://1.bp.blogspot.com/_AtlVw4SMQfg/TSbsAVbj3mI/AAAAAAAAAQ0/dOMTakuGO94/s320/Good+Bar+Graph.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;b&gt;&amp;nbsp;&lt;/b&gt; &lt;br /&gt;
&lt;br /&gt;
#1: Players&lt;br /&gt;
#2: Home Runs&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Pie Graphs:&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSbsSqq3dYI/AAAAAAAAARA/Bcx18-L0PLM/s1600/Pie+Graphs.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="115" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSbsSqq3dYI/AAAAAAAAARA/Bcx18-L0PLM/s320/Pie+Graphs.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
#1: Players&lt;br /&gt;
#2: Plate Appearance Outcomes&lt;br /&gt;
&lt;br /&gt;
Both of these graphs were simple. The players are constants, so they don't really need much special treatment (they are just labels), so we're really only graphing a single variable for each.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Line Graphs:&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_AtlVw4SMQfg/TSbsFLCkbjI/AAAAAAAAAQ4/ImeEKIv9qZM/s1600/Line+Graph+Sample.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://1.bp.blogspot.com/_AtlVw4SMQfg/TSbsFLCkbjI/AAAAAAAAAQ4/ImeEKIv9qZM/s320/Line+Graph+Sample.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
#1: Players&lt;br /&gt;
#2: Home Runs&lt;br /&gt;
#3: Over Time&lt;br /&gt;
&lt;br /&gt;
This one is slightly more complicated, but we are all so used to seeing graphs over time like this that most of us wouldn't find it too complicated since while time varies, it only varies in a predictable way (we all know when tomorrow will be).&lt;br /&gt;
&lt;br /&gt;
Lots of times we have to deal with more complicated data sets. Instead of just graphing a single unpredictable variable, what if we have to graph two of them?&lt;br /&gt;
&lt;br /&gt;
To give a simple baseball example, let's say you want to see how many wins each team got in 2010, and how much money they paid in salaries.&lt;br /&gt;
&lt;br /&gt;
There are two main ways you can do it. You can label the various points with colors and leave a legend, or you can label the points on the chart and use the same color.&lt;br /&gt;
&lt;br /&gt;
Here are examples:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_AtlVw4SMQfg/TSw3Yagx9VI/AAAAAAAAASE/vB-GO_bIGZQ/s1600/Confetti.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://3.bp.blogspot.com/_AtlVw4SMQfg/TSw3Yagx9VI/AAAAAAAAASE/vB-GO_bIGZQ/s320/Confetti.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
Looks like confetti, doesn't it?&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_AtlVw4SMQfg/TSw3YuahaAI/AAAAAAAAASI/DdsdL0NnKFU/s1600/Labels.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://4.bp.blogspot.com/_AtlVw4SMQfg/TSw3YuahaAI/AAAAAAAAASI/DdsdL0NnKFU/s320/Labels.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
When dealing with this much data, there are just too many different colors to handle a different color for each, so if the point is to show each team, then it's probably better to go with labels. Note that the labels are a lighter color and pretty small -- if I made them black, they would draw attention away from the data, and focus the eyes on the labels -- that would make it harder to see the trends.&lt;br /&gt;
&lt;br /&gt;
So when is color good? Let's say you wanted to make this chart to show the differences between AL and NL teams. That would be great for color.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_AtlVw4SMQfg/TSw3WzSYKJI/AAAAAAAAAR4/DnzzE12HuDA/s1600/AL+v.+NL+1.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://4.bp.blogspot.com/_AtlVw4SMQfg/TSw3WzSYKJI/AAAAAAAAAR4/DnzzE12HuDA/s320/AL+v.+NL+1.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
The benefit of adding color here is that it tells a story. The AL has a much wider gap between the top teams in salary and the bottom teams. Boston and New York are just way ahead of the group, and the top NL teams in payroll are between #2 and #3 in the AL. Clearly there's a gap between the leagues.&lt;br /&gt;
&lt;br /&gt;
You can adjust the colors to tell whatever story you want. Let's say we just want to focus on the AL East and how absurdly unbalanced they are.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_AtlVw4SMQfg/TSw3VA-pFNI/AAAAAAAAARo/Uea9ipTjDHY/s1600/AL+East+1.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://3.bp.blogspot.com/_AtlVw4SMQfg/TSw3VA-pFNI/AAAAAAAAARo/Uea9ipTjDHY/s320/AL+East+1.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
I just made the AL East jump out by making them really dark, and all the other teams grey. This tells the story of how incredibly well Toronto and Tampa did considering their financial disadvantage. It also shows the huge gap between the "Haves" and the "Have Nots" in the game's most competitive division.&lt;br /&gt;
&lt;br /&gt;
But let's say we want to make this even more complicated. Let's say we want to not only show data for one year, but show data for 3 years. If we throw all that data on a single graph, we get a giant mess.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSw3ZDszhuI/AAAAAAAAASM/4ISwXTxk00I/s1600/Labels+2.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSw3ZDszhuI/AAAAAAAAASM/4ISwXTxk00I/s320/Labels+2.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
Too many labels, no real trending, just a giant blotch of stuff. It becomes pretty hard to tell what is what. So we need to really focus on what we want to say rather than just dumping data on a graph.&lt;br /&gt;
&lt;br /&gt;
For instance, if we want to take a look at AL vs. NL from 2008-2010 in payroll and wins, we can do that. But time doesn't really come into the picture if we do it this way, it just looks like a giant mess of dots, and doesn't help us with trending.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSw3XSz8OcI/AAAAAAAAAR8/-B9GNqhfI0w/s1600/AL+v.+NL+2.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSw3XSz8OcI/AAAAAAAAAR8/-B9GNqhfI0w/s320/AL+v.+NL+2.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
One way we can make things a bit clearer is by changing our colors around. The more recent the year, the more solid we make the color -- that way we can see trending visually over time.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSw3X6Ar6HI/AAAAAAAAASA/5zmA_RWnPW0/s1600/AL+v.+NL+3.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSw3X6Ar6HI/AAAAAAAAASA/5zmA_RWnPW0/s320/AL+v.+NL+3.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
For another example, if we just want to see where the AL East is flying around to in the grand scheme of things, we can color them in, connect the dots, and give an idea of how the division is trending.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_AtlVw4SMQfg/TSw3VtdMbUI/AAAAAAAAARs/SY3SsWtqa3M/s1600/AL+East+2.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://3.bp.blogspot.com/_AtlVw4SMQfg/TSw3VtdMbUI/AAAAAAAAARs/SY3SsWtqa3M/s320/AL+East+2.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
That shows us a bit more. Baltimore and Toronto are cutting salaries but have improved their win totals in 2010. Tampa Bay is spending more and more each year. The Yankees are essentially treading water. And the Red Sox made a huge jump in salary for 2010.&lt;br /&gt;
&lt;br /&gt;
The other teams are all grey, and they aren't color coded by year. That is a judgment call. Is that information important to your story? In fact, if you only care about the AL East teams, you could even remove all of the excess dots and show just the AL East information. This shows how the division is working internally without cluttering it up with all those extra details.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSw3WPo718I/AAAAAAAAARw/yH41Rm6gDs0/s1600/AL+East+3.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSw3WPo718I/AAAAAAAAARw/yH41Rm6gDs0/s320/AL+East+3.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
Another alternative is to make them even less visible so that the AL East data stands out more, but the other information is still out there.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_AtlVw4SMQfg/TSw3WtktljI/AAAAAAAAAR0/UbZLqVdpQE4/s1600/AL+East+4.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://3.bp.blogspot.com/_AtlVw4SMQfg/TSw3WtktljI/AAAAAAAAAR0/UbZLqVdpQE4/s320/AL+East+4.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
This all comes back to the first rule of graphing: what story are you trying to tell? You should leave in information that is important to your story, and take out anything that doesn't matter. Personally I think that showing Toronto as a more-or-less middle of the pack team, Baltimore as among the perennial losers, and Tampa Bay as the thrifty winners is useful, so I like the last graph best. That tells the story of the AL East best in my mind. But always decide for yourselves.&lt;br /&gt;
&lt;br /&gt;
The basic lesson here is that you can use color to add extra information to a graph, but if you have too much information jumbled together, no amount of color will save you.&lt;br /&gt;
&lt;br /&gt;
Think about your message, make sure that the message is the most obvious thing in the graph, and play around until you find something that works well for you.&lt;br /&gt;
&lt;br /&gt;
If you want to code more data than this, you need to move into the world of interactive graphing like &lt;a href="http://www.gapminder.org/"&gt;Gapminder.org&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
But that's a story for another day...&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;References:&lt;/b&gt;&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Win Data from &lt;a href="http://www.baseball-reference.com/leagues/MLB/2008-standings.shtml"&gt;Baseball Reference&lt;/a&gt;&lt;b&gt;&amp;nbsp;&lt;/b&gt;&lt;/li&gt;
&lt;li&gt;2010 Salary Data from &lt;a href="http://www.cbssports.com/mlb/salaries/teams"&gt;CBS Sports&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;2008-2009 Salary Data from &lt;a href="http://baseball.about.com/od/newsrumors/a/09teamsalaries.htm"&gt;About.com&lt;/a&gt; &lt;b&gt;&lt;/b&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;i&gt;As a side note, I will be on vacation in the US for the next couple weeks, so there won't be many blog posts here. I'll try to check in regularly with twitter and e-mail and the like, and have two scheduled columns going up at Fangraphs each Friday.&lt;/i&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-3556297819763374665?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/uiStkRkZ-L-ERh56TkcuIjD8ve8/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/uiStkRkZ-L-ERh56TkcuIjD8ve8/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/uiStkRkZ-L-ERh56TkcuIjD8ve8/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/uiStkRkZ-L-ERh56TkcuIjD8ve8/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/8Giin_jU8_o" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/3556297819763374665/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2011/01/rules-of-thumb-for-complex.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/3556297819763374665?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/3556297819763374665?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/8Giin_jU8_o/rules-of-thumb-for-complex.html" title="Rules of Thumb for Complex Visualization" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/_AtlVw4SMQfg/TSbsAVbj3mI/AAAAAAAAAQ0/dOMTakuGO94/s72-c/Good+Bar+Graph.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2011/01/rules-of-thumb-for-complex.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DUEGR306eip7ImA9Wx9XE0Q.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-1463889181191787961</id><published>2011-01-07T02:47:00.000-08:00</published><updated>2011-01-07T02:53:46.312-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-01-07T02:53:46.312-08:00</app:edited><title>Rules of Thumb for Visualization</title><content type="html">If you're going to make a visualization of any data (baseball or otherwise), here are a few recommendations to make it more effective.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;#1 Know Your Story&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
Every graph should tell a story. If it's not, then why are you making it? What story are you trying to tell? What do you want the reader to get out of it? If you aren't clear about what the reader should take away from the graph, usually it's time to think about what you're trying to express before trying to make a graph of it. So for this example, I am going to compare the three career home run leaders of all time.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;#2 Pick an Appropriate Graph Type&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
There are three main graph types:&lt;br /&gt;
- Bar Graphs&lt;br /&gt;
- Line Graphs&lt;br /&gt;
- Pie Charts&lt;br /&gt;
&lt;br /&gt;
Bar Graphs are a good default. 98% of the time, if you're comparing things, a bar graph is a good place to start. When you use bar graphs, there is one major rule that you have to follow: always start the main axis at 0. We judge the categories by the height of the bar, so if you don't start from 0, it warps the data. Here's an example:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSbrxb8CsGI/AAAAAAAAAQw/DVbfW6BoSY4/s1600/Bad+Bar+Graph.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSbrxb8CsGI/AAAAAAAAAQw/DVbfW6BoSY4/s320/Bad+Bar+Graph.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
Wow! Bonds DESTROYED Ruth's record, didn't he? Only he didn't. Here's the same graph with the axis starting at 0:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_AtlVw4SMQfg/TSbsAVbj3mI/AAAAAAAAAQ0/dOMTakuGO94/s1600/Good+Bar+Graph.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://1.bp.blogspot.com/_AtlVw4SMQfg/TSbsAVbj3mI/AAAAAAAAAQ0/dOMTakuGO94/s320/Good+Bar+Graph.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
Tells quite a different story, doesn't it?&lt;br /&gt;
&lt;br /&gt;
One of the times you don't want to use bar graphs is when you're talking about something that happened over time. For instance, if you are going to look at how many home runs each of those players had by age, a line graph is a better bet:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_AtlVw4SMQfg/TSbsFLCkbjI/AAAAAAAAAQ4/ImeEKIv9qZM/s1600/Line+Graph+Sample.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://1.bp.blogspot.com/_AtlVw4SMQfg/TSbsFLCkbjI/AAAAAAAAAQ4/ImeEKIv9qZM/s320/Line+Graph+Sample.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
If I do it with a bar graph, it gets messy, and harder to read:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_AtlVw4SMQfg/TSbsK_TR5UI/AAAAAAAAAQ8/eo18Pt_-8WQ/s1600/Line+Graph+as+Bar+Sample.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://4.bp.blogspot.com/_AtlVw4SMQfg/TSbsK_TR5UI/AAAAAAAAAQ8/eo18Pt_-8WQ/s320/Line+Graph+as+Bar+Sample.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
Pie charts are the oddball of the group. Humans don't judge areas of circles too well, so pie charts are really only good if you're dealing with 2-3 categories, so that it's easy to eyeball. And the percentages always have to add up to 100%. For instance, if we want to look at how often a player strikes out, walks, or hits the ball, a pie graph is a mighty fine choice:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSbsSqq3dYI/AAAAAAAAARA/Bcx18-L0PLM/s1600/Pie+Graphs.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="115" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSbsSqq3dYI/AAAAAAAAARA/Bcx18-L0PLM/s320/Pie+Graphs.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;#3 Don't Overdo It&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
Sometimes you have a lot of data, and you get tempted to try to put it all on one graph. Let's say you want to show how many home runs each player got along with how many plate appearances he had each season. You could throw them on the same graph like this:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSbsWWsYI6I/AAAAAAAAARE/n4zHUFYEBww/s1600/Bad+Combo+Graph.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSbsWWsYI6I/AAAAAAAAARE/n4zHUFYEBww/s320/Bad+Combo+Graph.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
Now the problem is that the message gets muddied. What exactly are you trying to say with that added data? If you want to show the different in PA/HR by age, then maybe it's a better bet to make a second graph that focuses just on that:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSbsWWsYI6I/AAAAAAAAARE/n4zHUFYEBww/s1600/Bad+Combo+Graph.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_AtlVw4SMQfg/TSbsaCrTfcI/AAAAAAAAARI/5x8zBREjxaU/s1600/Good+Combo+Graph.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://3.bp.blogspot.com/_AtlVw4SMQfg/TSbsaCrTfcI/AAAAAAAAARI/5x8zBREjxaU/s320/Good+Combo+Graph.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
More graphs isn't necessarily a bad thing if it helps you tell your message better.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;#4 Focus the Message&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
When you create a chart in Excel, it really makes it ugly. Look at the first graph I created using Excel Defaults:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_AtlVw4SMQfg/TSbshRbGSnI/AAAAAAAAARM/ZCRGdHZ7b5Y/s1600/Excel+Default+Bar+Graph.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://4.bp.blogspot.com/_AtlVw4SMQfg/TSbshRbGSnI/AAAAAAAAARM/ZCRGdHZ7b5Y/s320/Excel+Default+Bar+Graph.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
Here are some of the problems:&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;The background is too dark&lt;/li&gt;
&lt;li&gt;The gridlines are too strong&lt;/li&gt;
&lt;li&gt;The axis is messed up (it doesn't start from zero!)&lt;/li&gt;
&lt;li&gt;The title and legend duplicate information&lt;/li&gt;
&lt;li&gt;The player names are tiny &lt;/li&gt;
&lt;li&gt;The bar graphs all have shadows for some reason&lt;/li&gt;
&lt;/ul&gt;&lt;br /&gt;
What is the story of this graph? It shows how many home runs the three home run leaders have. So we want to focus on those three blue bars, who did them, and what they mean. So I eliminated the background of the graph, fixed the axis, lightened the grid lines, gave a proper title, and made the names of each player more visible:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_AtlVw4SMQfg/TSbsAVbj3mI/AAAAAAAAAQ0/dOMTakuGO94/s1600/Good+Bar+Graph.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="208" src="http://1.bp.blogspot.com/_AtlVw4SMQfg/TSbsAVbj3mI/AAAAAAAAAQ0/dOMTakuGO94/s320/Good+Bar+Graph.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;#5 Add Color (where appropriate)&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
If you'll notice, most of my graphs use the same color. When we look at a graph and see a lot of color, we assume that the color means something. So color is another tool to help us tell the story. If I made the three bars in the career home run graphs different colors, our brain would tell us "Hey, the bars must be a different color for a reason!" and it would spend a little time trying to figure out what the color is telling us.&lt;br /&gt;
&lt;br /&gt;
When there's a reason to add color (for instance, in the pie graph) it's a good idea to be consistent and to make sure that the color adds to the story, rather than to distract it. I used red for strikeouts because they are "bad", blue for walks because they are "good", and grey for the rest because they are neutral (and not part of the story, but need to be there to make sure the pies add up to 100%).&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;#6 Resist the Urge to Pretty-up the Graph with Chart Junk&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
A lot of the graphs we see on a daily basis are prettied up. People add 3D effects, drop shadows, gradients, etc. But those things don't typically add to the story we're trying to tell. They just "look cool" so people want to use them. Adding chart junk to your graphs is like putting a spoiler on a Honda Civic -- maybe it looks like it should go faster, but it's really just weighing the car down.&lt;br /&gt;
&lt;br /&gt;
For instance, let's say I get the urge to pretty-up my pie graphs. What was pretty obvious before suddenly becomes a lot less obvious:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_AtlVw4SMQfg/TSbtJcqeJoI/AAAAAAAAARQ/nawvPErz5mA/s1600/Bad+Pie+Graphs.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="115" src="http://4.bp.blogspot.com/_AtlVw4SMQfg/TSbtJcqeJoI/AAAAAAAAARQ/nawvPErz5mA/s320/Bad+Pie+Graphs.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
It's a lot harder to compare the slices in 3D, because 3D graphs distort data. We aren't so good at adjusting perspective in our head, so we lose a lot of ability to analyze the data. And the gradients add absolutely nothing but headaches since we lose the center of the grey slice making it even harder to figure out the angle (and with the angle, the size of the slices).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;hr /&gt;&lt;br /&gt;
These are just the basics, with a really simple example. The above suggestions are just that -- suggestions. The most important thing is to keep the message at the center of the graph. Find what works best with your audience, and displays the data best, and you'll do fine.&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;All of the above examples were created using Microsoft Excel. Graphs from Excel can be copy-pasted into a vector data editing program like Adobe Illustrator or &lt;a href="http://en.wikipedia.org/wiki/Inkscape"&gt;Inkscape&lt;/a&gt; to have better control over how it looks.&lt;/i&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-1463889181191787961?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/NmqV-_8bdnVw0_sPX-f2sNJ7u7s/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/NmqV-_8bdnVw0_sPX-f2sNJ7u7s/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/NmqV-_8bdnVw0_sPX-f2sNJ7u7s/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/NmqV-_8bdnVw0_sPX-f2sNJ7u7s/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/mSDvuwCooJk" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/1463889181191787961/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2011/01/rules-of-thumb-for-visualization.html#comment-form" title="4 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/1463889181191787961?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/1463889181191787961?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/mSDvuwCooJk/rules-of-thumb-for-visualization.html" title="Rules of Thumb for Visualization" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSbrxb8CsGI/AAAAAAAAAQw/DVbfW6BoSY4/s72-c/Bad+Bar+Graph.png" height="72" width="72" /><thr:total>4</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2011/01/rules-of-thumb-for-visualization.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DkcBR34zeSp7ImA9Wx9XEk0.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-8131753193950015739</id><published>2011-01-04T20:58:00.000-08:00</published><updated>2011-01-04T21:07:36.081-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-01-04T21:07:36.081-08:00</app:edited><title>A Glimpse at Pitcher WAR</title><content type="html">Happy New Year!&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
There has been a lot of hubbub by Adam Darowski over at &lt;a href="http://www.beyondtheboxscore.com/"&gt;Beyond the Box Score&lt;/a&gt; on Hall of Fame Wins Above Replacement, and how to better get an idea of peak value and career value for people who weight them differently.&lt;br /&gt;
&lt;br /&gt;
You can see his work &lt;a href="http://darowski.com/ballot/"&gt;here&lt;/a&gt; and &lt;a href="http://darowski.com/wwar/"&gt;here&lt;/a&gt; (it is awesome and interactive).&lt;br /&gt;
&lt;br /&gt;
Personally I believe a lot in the power of the eye and what it sees when it takes a look. Design a graphic using your brain, and let your eyes soak it in and let your gut come to a conclusion about the data. While I love Adam's graphs, I don't know if "Weighted WAR" is the way to go -- it is quite arbitrary, and doesn't really measure peak performance as much as it measures performance over a certain level. A player who has an incredible 10 WAR season at age 22 and a 10 WAR season at age 32 will look like they have the same peak (8 WAR above MVP) as a player who has 7 WAR from 10 straight seasons.&lt;br /&gt;
&lt;br /&gt;
So I thought to myself, "How can we make that distinction?" I diddled with a lot of data, and I came up with this:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSPzQ0N6tUI/AAAAAAAAAQs/P5Jc2NucW8s/s1600/Pitcher-HOF-by-Era-%25282%25237B8AB.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSPzQ0N6tUI/AAAAAAAAAQs/P5Jc2NucW8s/s1600/Pitcher-HOF-by-Era-%25282%25237B8AB.png" width="480" /&gt;&lt;/a&gt;&lt;/div&gt;(Click for a larger version)&lt;br /&gt;
&lt;br /&gt;
The graph includes all the players in the top 50 for career pitching WAR, as well as any player in either the Hall of Fame or the &lt;a href="http://www.baseballthinkfactory.org/files/hall_of_merit/"&gt;Hall of Merit&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
Let your eyes come to your own conclusions, but here is what I noticed:&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Early careers were a lot shorter than they were later on&lt;/li&gt;
&lt;li&gt;The best pitchers jump right out -- Young, Nichols, Johnson&lt;/li&gt;
&lt;li&gt;Pitchers in the modern era are starting their careers a lot later, and ending them earlier than they did in the Expansion Era&lt;/li&gt;
&lt;/ul&gt;If you have any suggestions, improvements, etc., let me know. I will be doing a version for batters on FanGraphs in the near future. &lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;References:&lt;/b&gt;&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Data collected from &lt;a href="http://www.baseball-reference.com/"&gt;Baseball Reference&lt;/a&gt; via &lt;a href="http://www.baseballprojection.com/war/top500p.htm"&gt;Baseball Projection&lt;b&gt; &lt;/b&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Raw Data in a &lt;a href="https://spreadsheets.google.com/ccc?key=0AvhKIaAw27e_dGFxUjlUV21Zd1pJd0tjSVJYYXJsMVE&amp;amp;hl=en&amp;amp;authkey=CIyn5_ML"&gt;Google Spreadsheet&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;Created using Excel for initial formatting, and Adobe Illustrator for prettying up. Licensed under Creative Commons Attribution, Non-Commercial License.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-8131753193950015739?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/1sj-F4dtG4jJWwFG-6caMiKUnXk/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/1sj-F4dtG4jJWwFG-6caMiKUnXk/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/1sj-F4dtG4jJWwFG-6caMiKUnXk/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/1sj-F4dtG4jJWwFG-6caMiKUnXk/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/nIfdNCysgJM" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/8131753193950015739/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2011/01/glimpse-at-pitcher-war.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/8131753193950015739?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/8131753193950015739?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/nIfdNCysgJM/glimpse-at-pitcher-war.html" title="A Glimpse at Pitcher WAR" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/_AtlVw4SMQfg/TSPzQ0N6tUI/AAAAAAAAAQs/P5Jc2NucW8s/s72-c/Pitcher-HOF-by-Era-%25282%25237B8AB.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2011/01/glimpse-at-pitcher-war.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DkMEQ34_fyp7ImA9Wx9QE0w.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-6588379485348251083</id><published>2010-12-25T14:00:00.000-08:00</published><updated>2010-12-25T14:00:02.047-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-12-25T14:00:02.047-08:00</app:edited><title>Merry Christmas</title><content type="html">I spent far more time than I ever intended on a reply to &lt;a href="http://www.beyondtheboxscore.com/2010/12/21/1888629/the-2011-hall-of-fame-ballot-graphically"&gt;Jinaz's Hall of Fame WAR Comparison post over at Beyond the Boxscore&lt;/a&gt;. But what came out was interesting, so I'm sticking it up here and explaining it.&lt;br /&gt;
&lt;br /&gt;
Back in the day, I made my &lt;a href="http://henkakyuu.blogspot.com/2010/12/woba-by-ball-strike-count.html"&gt;Cumulative WAR by Age Graph&lt;/a&gt; in response to the same graphs that Jinaz used. Basically, the goal was to show the same data in a slightly different way, to tell a slightly different story. While the "nth best" season works great for normal progressions, it doesn't work so well when a player has an odd career (like one marred by injuries, or one interrupted by war). In cases like that, the cumulative WAR by age tells a much different story. &lt;a href="http://www.fangraphs.com/graphsw.aspx?playerid2=1109&amp;amp;playerid3=1000001&amp;amp;playerid4=&amp;amp;playerid5="&gt;Look at Bonds vs. Aaron for example&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
Well this year, Adam Darowski introduced a tremendous &lt;a href="http://darowski.com/ballot/#pitchers"&gt;Interactive 2011 Hall of Fame Ballot&lt;/a&gt;. So lots of people want to look at other careers in a similar way -- by tremendous (6+ WAR), good (3+ WAR), and better than nothing (0+ WAR) seasons. Sky Kalkman came up with &lt;a href="http://www.flickr.com/photos/skyking162/5280807477/"&gt;a version combining Jinaz's line graph with Adam's radar graph&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
Something about it struck me as a bit off, and counter-intuitive. I couldn't figure out why exactly, so in trying to explain it, I worked out a solution of my own:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_AtlVw4SMQfg/TRId9v73p5I/AAAAAAAAAQk/0t4aNaGqmxs/s1600/blyleven-v-morris-pyramid.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="237" src="http://3.bp.blogspot.com/_AtlVw4SMQfg/TRId9v73p5I/AAAAAAAAAQk/0t4aNaGqmxs/s320/blyleven-v-morris-pyramid.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
Seasons with negative WAR have been removed. There are some good points and bad points about this graph. The big negative is that you can only compare two players. On the plus side, it shows not only how many 6+ WAR a player had, but also how many seasons with over 6 WAR a player had. That can help you get another dimension in the data -- did the player have a long peak of consecutive seasons over 6 WAR? Did they have a tremendous fluke of a 10 WAR season and sink back to 5+ after that?&lt;br /&gt;
&lt;br /&gt;
I don't know how much use this will actually have to most people, but it is a different way to compare players, so I figured I'd put it up.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;References:&lt;/b&gt;&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;WAR Figures from &lt;a href="http://baseballprojection.com/war/playerindex.htm"&gt;Rally's Baseball Projection Site&lt;/a&gt;&lt;b&gt;&amp;nbsp;&lt;/b&gt;&lt;/li&gt;
&lt;/ul&gt;Image is licensed under a Creative Commons Attribution, Non-Commercial License. Original graph was created in Excel and then prettied up in Adobe Illustrator.&lt;b&gt; &lt;/b&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-6588379485348251083?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/73bM60bwwo_yKwcNmsNg-O7uNTo/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/73bM60bwwo_yKwcNmsNg-O7uNTo/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/73bM60bwwo_yKwcNmsNg-O7uNTo/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/73bM60bwwo_yKwcNmsNg-O7uNTo/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/WMNvE-JurIE" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/6588379485348251083/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2010/12/merry-christmas.html#comment-form" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/6588379485348251083?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/6588379485348251083?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/WMNvE-JurIE/merry-christmas.html" title="Merry Christmas" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://3.bp.blogspot.com/_AtlVw4SMQfg/TRId9v73p5I/AAAAAAAAAQk/0t4aNaGqmxs/s72-c/blyleven-v-morris-pyramid.png" height="72" width="72" /><thr:total>2</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2010/12/merry-christmas.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D0cCQnk_eip7ImA9Wx9QEE0.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-5268471391119772242</id><published>2010-12-22T00:04:00.000-08:00</published><updated>2010-12-22T00:04:23.742-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-12-22T00:04:23.742-08:00</app:edited><title>Free Agent Signings by Year</title><content type="html">In his &lt;a href="http://www.fangraphs.com/blogs/index.php/the-two-markets/"&gt;"The Two Markets" post on Fangraphs&lt;/a&gt;, Dave Cameron talked about how the trade market and the free agent market relate to each other, and if the decrease in trade returns may be related to the increase in free agent spending.&lt;br /&gt;
&lt;br /&gt;
I wondered if there was any indicator of what is going on with the free agent market, decided to grab some data and see if I could figure out anything. To be honest, I didn't know what (if anything) I would find, and just figured I'd dive into the data and see what there is to see. As a quick note, I took away international signings from the pool -- they really muck things up (players from Cuba come over pretty young and throw off averages since they are complete outliers). I also ignored minor league deals.&lt;br /&gt;
&lt;br /&gt;
First thing I looked at was the average age of free agents. My thought was, "Maybe the recent trend of signing players before their arbitration runs out has reduced the pool of young free agents, and increased the free agent age -- that would reduce supply of younger talented players, and increase costs of the remaining free agents."&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_AtlVw4SMQfg/TRGfGU5zhgI/AAAAAAAAAQQ/tO2CiOoiuWA/s1600/Average-Median-FA-Age.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="206" src="http://4.bp.blogspot.com/_AtlVw4SMQfg/TRGfGU5zhgI/AAAAAAAAAQQ/tO2CiOoiuWA/s320/Average-Median-FA-Age.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
Well, the data didn't exactly bear that out. It looks like the average age of free agents has had a consistent downward trend. So I thought, "Well, maybe that's because guys like Julio Franco refused to retire and drove up the average age. Maybe we should be looking at the average age of free agents who were signed!"&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_AtlVw4SMQfg/TRGgNcgowUI/AAAAAAAAAQU/XKOq3BeqUh0/s1600/Average-Signed-FA-Age.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="206" src="http://1.bp.blogspot.com/_AtlVw4SMQfg/TRGgNcgowUI/AAAAAAAAAQU/XKOq3BeqUh0/s320/Average-Signed-FA-Age.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;Again, we see the same trend, there's almost no difference. So it looks like the talent pool, for some reason, is getting younger in the MLB from 2006 to today. It looked to be time to take it from a different angle.&lt;br /&gt;
&lt;br /&gt;
I thought about contract length next. How many contracts of each type are they handing out? And what age are players in each bracket? Are they giving more 1-year contracts to younger players who had a bad year, injury concerns, or something to prove?&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_AtlVw4SMQfg/TRGkXAgBktI/AAAAAAAAAQY/jdF0kzGUEwI/s1600/Average-Age-by-Contract-Len.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="206" src="http://4.bp.blogspot.com/_AtlVw4SMQfg/TRGkXAgBktI/AAAAAAAAAQY/jdF0kzGUEwI/s320/Average-Age-by-Contract-Len.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;The darker the color, the more recent the year (2006-2010). As you can see, for the most part, the contracts are granted to younger players pretty-much across the board, regardless of contract length, with longer contracts being given to progressively younger players. That makes sense. So I looked at the average age that a contract ends, so that we get an idea of when teams want to cut ties, rather than the age they want to make them:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://2.bp.blogspot.com/_AtlVw4SMQfg/TRGnIky6-cI/AAAAAAAAAQc/kcbIEUfqqPg/s1600/Average-Age-at-End-of-Contr.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="206" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TRGnIky6-cI/AAAAAAAAAQc/kcbIEUfqqPg/s320/Average-Age-at-End-of-Contr.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;Lighter colors are shorter contracts (from 1-5+ years). A couple things I noticed here are that the younger the player (and the longer the contract), the older the player seems to be when he ends. I wonder if this is a premium on getting premier talent signed long-term -- you are almost required to keep the player longer than you would otherwise like to. We don't see the same pattern in 2-year or 3-year contracts, but we see it in 4-year contracts for 2008 (Derek Lowe, 41 and Ryan Dempster, 37), and we see it in 5+ year contracts from 2008 through 2010.&lt;br /&gt;
&lt;br /&gt;
But unfortunately, I'm still nowhere near answering my original question. Why the heck are we seeing such a decline in free agent ages?&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_AtlVw4SMQfg/TRGsFZWh0_I/AAAAAAAAAQg/t48vG3-iSkw/s1600/Years-and-Players-Signed.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="206" src="http://1.bp.blogspot.com/_AtlVw4SMQfg/TRGsFZWh0_I/AAAAAAAAAQg/t48vG3-iSkw/s320/Years-and-Players-Signed.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;I looked at how many years of contracts teams were giving to free agents, and how many players they gave them to. The closer the two lines get, the shorter each contract will be. If they're on top of each other, it would mean that they are only handing out 1-year contracts.&lt;br /&gt;
&lt;br /&gt;
So through all this, I never really figured it out. What I can say is that it looks like teams are getting a lot more conservative about big free agent signings, and that something around 2007-2008 seemed to cause it. It could be that the huge amount of contracts signed in 2006 flooded teams with players that are still under contract, and we will see a small rebound as those players go back on the market leaving more openings on each team. Or it could be a new understanding of aging, causing teams to be more conservative about signing aging players. Or it could be something I'm not seeing.&lt;br /&gt;
&lt;br /&gt;
I'm sorry that this is so long and drawn out, but I think there's something hidden in the data trying to jump out. I fiddled around with it a dozen ways over many hours, and spent quite a bit of time thinking of different ways to tackle the data, but whatever is hidden didn't jump out at me.&lt;br /&gt;
&lt;br /&gt;
I think that it may be hidden in data I haven't put in the spreadsheet. Perhaps the poor free agent signings of 2006 (Soriano, Zito, Carlos Lee, Juan Pierre, etc.) made people think twice about spending big bucks on the Free Agent market. Or maybe the Financial Crisis starting from 2007-2008 had an impact on team finances. Or maybe there just weren't good players on the market. Or...&lt;br /&gt;
&lt;br /&gt;
If you can figure it out, please, let me know.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;References:&lt;/b&gt;&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Data from &lt;a href="http://espn.go.com/mlb/freeagents/_/year/2010"&gt;ESPN&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;b&gt; &lt;/b&gt;Raw Data in a &lt;a href="https://spreadsheets.google.com/ccc?key=0AvhKIaAw27e_dFhaX3gwTlc4Nlg0QVcyQ0pPMGRLSVE&amp;amp;hl=en#gid=0"&gt;Google Spreadsheet&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;Note: For reasons unknown, Google doesn't do the formulas right. If you download as an Excel file, then you should be able to see a lot fewer #DIV/0 errors. Graphics were created in Excel and then imported into Adobe Illustrator. There's nothing special about them, but if you do want to use them, feel free under a Creative Commons Attribution, Non-commercial license.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-5268471391119772242?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/v_BCDUrDORDB8Rn1mpCbwRYUb-U/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/v_BCDUrDORDB8Rn1mpCbwRYUb-U/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/v_BCDUrDORDB8Rn1mpCbwRYUb-U/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/v_BCDUrDORDB8Rn1mpCbwRYUb-U/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/aGZXHF7KYRA" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/5268471391119772242/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2010/12/free-agent-signings-by-year.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/5268471391119772242?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/5268471391119772242?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/aGZXHF7KYRA/free-agent-signings-by-year.html" title="Free Agent Signings by Year" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/_AtlVw4SMQfg/TRGfGU5zhgI/AAAAAAAAAQQ/tO2CiOoiuWA/s72-c/Average-Median-FA-Age.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2010/12/free-agent-signings-by-year.html</feedburner:origLink></entry><entry gd:etag="W/&quot;AkQCSHs_eSp7ImA9Wx9RGU0.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-3008665588038402074</id><published>2010-12-20T21:12:00.000-08:00</published><updated>2010-12-20T21:12:49.541-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-12-20T21:12:49.541-08:00</app:edited><title>Writing at Fangraphs</title><content type="html">Sorry for the lack of updates over the past week. I have several things I am working on, but I am intentionally not posting them here because I have been added to the writing staff at Fangraphs. I still plan to post some things here, but I will be focusing a lot more on writing for the bigger audience there.&lt;br /&gt;
&lt;br /&gt;
Some things I am working on:&lt;br /&gt;
&lt;ul&gt;&lt;li&gt; Finishing the graphical game summary &lt;/li&gt;
&lt;li&gt;Using the Japanese NPB stats I posted to do some analysis on the difference with the MLB&lt;/li&gt;
&lt;li&gt;Creating a "Baseball Basics" presentation geared toward non-sabermetric fans explaining about why they should care about how baseball works&lt;/li&gt;
&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-3008665588038402074?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/gbEyos5lWJnVERi606EGaNTukCY/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/gbEyos5lWJnVERi606EGaNTukCY/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/gbEyos5lWJnVERi606EGaNTukCY/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/gbEyos5lWJnVERi606EGaNTukCY/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/R1PlUY9NWPw" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/3008665588038402074/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2010/12/writing-at-fangraphs.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/3008665588038402074?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/3008665588038402074?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/R1PlUY9NWPw/writing-at-fangraphs.html" title="Writing at Fangraphs" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><thr:total>0</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2010/12/writing-at-fangraphs.html</feedburner:origLink></entry><entry gd:etag="W/&quot;A0cCQHw-fCp7ImA9Wx9RE04.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-7713241713105363547</id><published>2010-12-14T07:04:00.000-08:00</published><updated>2010-12-14T07:04:21.254-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-12-14T07:04:21.254-08:00</app:edited><title>A Work in Progress</title><content type="html">I am rethinking the way we look at game summaries. Most sites have a play by play, box score, and even a WPA graph for the game. Isn't there a better way to get an idea of how the game flowed without sifting through the text?&lt;br /&gt;
&lt;br /&gt;
I figure there must be, so I'm trying to come up with something, but it's a process. And it's nowhere near done, but I may as well show what I have to get feedback, or at least to put it out there in case someone else has use for it in its current form.&lt;br /&gt;
&lt;br /&gt;
Pardon the lack of polish, it was created solely in Excel (turned to PNG in Illustrator):&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_AtlVw4SMQfg/TQeFXciSykI/AAAAAAAAAQE/rXbuV6PLnUg/s1600/phillies-reds-%252812-14-2010%2529.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="133" src="http://1.bp.blogspot.com/_AtlVw4SMQfg/TQeFXciSykI/AAAAAAAAAQE/rXbuV6PLnUg/s320/phillies-reds-%252812-14-2010%2529.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;(Click for a larger version)&lt;br /&gt;
&lt;br /&gt;
What I want to do is to give a nice graphical review of what happened on the field, coupled with the run expectancy for each part. Each base-out state at the start of the at-bat is shown by the graphic. Red is no outs, yellow one out, blue two outs. To the left of each base graphic is a white dot showing the expected runs (0 to 3) before the at-bat, and to the right is the expected runs (0 to 3) after the at-bat. If the at-bat ended the inning, the dot is black (at zero). Each player's contribution is the gap between the left and the right.&lt;br /&gt;
&lt;br /&gt;
But there's still a lot of stuff missing:&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Color coding the base-out states is ugly. I want to change the background color to correspond to the number of outs (white for 0, light grey for 1, grey for 2, and black for 3 outs)&lt;/li&gt;
&lt;li&gt;I need to show when a player scored runs in his at-bats. For instance, the 3-run HR by Sardinha in the 4th inning makes it look like he dropped the run expectancy (1st and 2nd with 2 outs to bases empty 2 outs). In reality he scored 3 runs minus that drop in expectancy&lt;/li&gt;
&lt;li&gt;I need to figure out a better way of dealing with stolen bases. There is one in the 9th, but there's no way of knowing that it happened, or who did it. That's going to be a challenge (stupid non-discrete events in an otherwise discrete game!)&lt;/li&gt;
&lt;/ul&gt;My ultimate goal is to be able to give a nice simple way to tally up the expected runs and scored runs in the middle (simple addition across), and a nice simple way to tally up the run expectancy added by each player (simple addition downward). I'd be much happier if it also looked nice, but I'll focus on getting the hard work of making it useful first.&lt;br /&gt;
&lt;br /&gt;
If anyone has any input, please let me know. And if there are any Excel geniuses out there who know how I can use a custom marker for XY charts with a transparent background (transparent turns black when you add it to the chart), I'd be eternally grateful.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;References:&lt;/b&gt;&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Game Log from &lt;a href="http://www.fangraphs.com/plays.aspx?date=2010-06-30&amp;amp;team=Reds&amp;amp;dh=0"&gt;Fangraphs&lt;/a&gt;&lt;b&gt;&amp;nbsp;&lt;/b&gt;&lt;/li&gt;
&lt;/ul&gt;Image is licensed under Creative Commons attribution, non-commercial license. Feel free to use it as you'd like, and if you'd like the file I've used for it, just ask.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-7713241713105363547?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/-tskwmNCPMWJj2q3ULCkCRaC2c8/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/-tskwmNCPMWJj2q3ULCkCRaC2c8/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/-tskwmNCPMWJj2q3ULCkCRaC2c8/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/-tskwmNCPMWJj2q3ULCkCRaC2c8/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/d9hoICtMMmQ" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/7713241713105363547/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2010/12/work-in-progress.html#comment-form" title="4 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/7713241713105363547?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/7713241713105363547?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/d9hoICtMMmQ/work-in-progress.html" title="A Work in Progress" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/_AtlVw4SMQfg/TQeFXciSykI/AAAAAAAAAQE/rXbuV6PLnUg/s72-c/phillies-reds-%252812-14-2010%2529.png" height="72" width="72" /><thr:total>4</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2010/12/work-in-progress.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DE8CSX45fCp7ImA9Wx9REkw.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-891191927501277086</id><published>2010-12-12T21:07:00.000-08:00</published><updated>2010-12-12T21:07:48.024-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-12-12T21:07:48.024-08:00</app:edited><title>Nippon Pro Baseball Data (2005-2010)</title><content type="html">I have spent a bit of time putting together the batting and pitching stats from Japanese pro baseball from 2005-2010. I've uploaded it onto Google Documents so that you can use it as well. The English names are all pretty-much wrong (I guess there are a few correct ones in there), because there are over 1000 different last names alone, and translating them would have taken a long time. However, the names there should be usable for doing stats, and searching for the name of any player on google by copy-pasting the characters should get you a site with the proper English name.&lt;br /&gt;
&lt;br /&gt;
Also, Japan tracks "Hold Points" which are holds + relief wins. Useless stat, but it's in there.&lt;br /&gt;
&lt;br /&gt;
All data was retrieved from the &lt;a href="http://www.npb.or.jp/teams/"&gt;NPB homepage&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
Please feel free to use it however you'd like, and I will start delving into it myself in the near future. You can access the Google spreadsheet &lt;a href="https://spreadsheets.google.com/ccc?key=t94_PguVB1n1T-ZNPmg5sHw&amp;amp;authkey=CLz4gbEJ&amp;amp;hl=en#gid=0"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-891191927501277086?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/JwBc3u56_wT-ZrzGrS81TvLtzQo/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/JwBc3u56_wT-ZrzGrS81TvLtzQo/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/JwBc3u56_wT-ZrzGrS81TvLtzQo/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/JwBc3u56_wT-ZrzGrS81TvLtzQo/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/fnb0KLnOyhE" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/891191927501277086/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2010/12/nippon-pro-baseball-data-2005-2010.html#comment-form" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/891191927501277086?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/891191927501277086?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/fnb0KLnOyhE/nippon-pro-baseball-data-2005-2010.html" title="Nippon Pro Baseball Data (2005-2010)" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><thr:total>2</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2010/12/nippon-pro-baseball-data-2005-2010.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CU4ESXY-eSp7ImA9Wx9REk0.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-1870187095321538860</id><published>2010-12-12T01:43:00.000-08:00</published><updated>2010-12-12T17:31:48.851-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-12-12T17:31:48.851-08:00</app:edited><title>2011 Red Sox Salary Commitments and Projected Wins</title><content type="html">&lt;a href="http://www.beyondtheboxscore.com/2010/12/10/1868341/crowd-sourced-sabergraphic-challenge-team-contracts-projected-war"&gt;Beyond the Box Score&lt;/a&gt; asked for its readers to make graphs showing a team's salary commitments and projected wins in a way that you can understand with a glance. Unfortunately, that's a lot of information with one graph, and I cheated by making an entire sheet covering all the information in several graphs:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_AtlVw4SMQfg/TQSVoCWQHfI/AAAAAAAAAQA/RlnH6maxM-Y/s1600/red-sox-salaries.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="http://1.bp.blogspot.com/_AtlVw4SMQfg/TQSVoCWQHfI/AAAAAAAAAQA/RlnH6maxM-Y/s320/red-sox-salaries.png" width="247" /&gt;&lt;/a&gt;&lt;/div&gt;(Click for a larger image, or &lt;a href="http://www.scribd.com/doc/45174346"&gt;download as a PDF&lt;/a&gt;)&lt;br /&gt;
&lt;br /&gt;
The top graph shows the projected Wins Above Replacement projected for the team. The projections were taken from Fangraphs and fiddled with a bit. I am not passing judgment on the actual projections (in other words, take them with a grain or six of salt), just putting down what was projected. Replacement level was set at 50 wins (which is why the Y-axis starts at 50).&lt;br /&gt;
&lt;br /&gt;
The second graph shows projected salaries for each player, taken from Cot's Contracts. For players who have under 3 years of service time, I had them paid 110% of the league minimum, which was $400,000 in 2010, and I projected to increase by $10,000 a year. For players with more than 3 years, but less than 6 (the 3 arbitration years) I had them paid 40% of their previous year's value the first year, 60% of their value the second, and 80% value the third (estimates taken from tangotiger and/or fangraphs).&lt;br /&gt;
&lt;br /&gt;
The bottom portion shows the legend for the mess of colors, and a table of the data for the players projected to be on the roster for 2011. Since there are only 21 players, 4 more will be added, but since I don't know who they are or how much they will be paid, I just left the spots blank.&lt;br /&gt;
&lt;br /&gt;
There are also thick black lines on the first two graphs. For budget, I took the 2011 salaries, and calculated an 8% increase every year to create a projected budget (thick line in the second graph). Using that projected budget, and the projected cost of a win ($4.75 million in 2011, 8% inflation every year after that), I calculated how many wins the remaining budget minus salary costs could buy on the free agent market, and put it as a thick line on the top half of the graph.&lt;br /&gt;
&lt;br /&gt;
So what?&lt;br /&gt;
&lt;br /&gt;
Well, looking at the graph a bit, a few things popped out at me. The first is the importance of cost-controlled players. Despite the Red Sox having a lot of talent projecting to an unreasonable amount of wins (114 in 2011 -- again, grain of salt), if they don't get more cost controlled players that can make a contribution at low cost, they will drop down to an average team in 5 years.&lt;br /&gt;
&lt;br /&gt;
The second thing I noticed was the Red Sox long-term commitments to homegrown players like Youkilis, Pedroia, and Lester. Their contributions are quite cheap for the amount of wins they are projected for. All three deals were signed before free agency, so the players accepted a discount for job security. That helps a lot.&lt;br /&gt;
&lt;br /&gt;
The third thing I noticed was the amount of starting pitching they have signed. Lester, Beckett, and Lackey all have 4 years left under contract. Buccholz has 5 years left of service time (I think, but I've never been too bright). Even Matsuzaka is still under contract for 2 more years, and Tim Wakefield the knuckle-baller is also there for another year when/if someone gets injured.&lt;br /&gt;
&lt;br /&gt;
Anyway, this is just an exercise, and I would love to do it for all 30 teams, but I'm afraid it would be a full-time job. Feel free to use the format if you want, and I will be happy to support you if you need it.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;References:&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Contract Data from&lt;b&gt; &lt;/b&gt;&lt;a href="http://mlbcontracts.blogspot.com/2004/12/boston-red-sox.html"&gt;Cot's Contracts&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;WAR Data from &lt;a href="http://www.fangraphs.com/"&gt;Fangraphs&lt;/a&gt;&lt;a href="http://www.baseball-reference.com/leagues/split.cgi?t=b&amp;amp;lg=MLB&amp;amp;year=2010#count::none" target="_blank" title="Baseball Reference Split Data (2010 MLB)"&gt;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Player photos from &lt;a href="http://sports.yahoo.com/mlb/teams/bos"&gt;Yahoo Sports&lt;/a&gt; &lt;/li&gt;
&lt;/ul&gt;The graph was initially made in Excel, then imported into Adobe Illustrator. Licensed under Creative  Commons, Attribution, Non-Commercial License (save for the player photos which are copyrighted).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-1870187095321538860?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/v95hT9cWtXJ1voycgmTI2fZIY14/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/v95hT9cWtXJ1voycgmTI2fZIY14/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/v95hT9cWtXJ1voycgmTI2fZIY14/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/v95hT9cWtXJ1voycgmTI2fZIY14/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/n2UFmmwEd6M" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/1870187095321538860/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2010/12/2011-red-sox-salary-commitments-and.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/1870187095321538860?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/1870187095321538860?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/n2UFmmwEd6M/2011-red-sox-salary-commitments-and.html" title="2011 Red Sox Salary Commitments and Projected Wins" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/_AtlVw4SMQfg/TQSVoCWQHfI/AAAAAAAAAQA/RlnH6maxM-Y/s72-c/red-sox-salaries.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2010/12/2011-red-sox-salary-commitments-and.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CEMGQHgzcCp7ImA9Wx9SF0Q.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-6700375314949331704</id><published>2010-12-07T23:13:00.000-08:00</published><updated>2010-12-07T23:13:41.688-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-12-07T23:13:41.688-08:00</app:edited><title>Ted Williams Rolling wOBA per 150 Games</title><content type="html">I wanted to try a new take on seeing the progression of players. Rather than looking at wOBA by year and injuries/playing time separately, I wanted to add them all into one graph so you can see both how they've performed, and how much they've performed.&lt;br /&gt;
&lt;br /&gt;
My first attempt uses Ted Williams. The red line is his career wOBA, the blue line is the league average wOBA over that period, and the black line shows how he performed over the past 150 games. As the line fades further from black, that means he played fewer of the past 150 games.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_AtlVw4SMQfg/TP8t7stPQFI/AAAAAAAAAP8/yBzAPQjLWlE/s1600/tedwilliams.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="236" src="http://4.bp.blogspot.com/_AtlVw4SMQfg/TP8t7stPQFI/AAAAAAAAAP8/yBzAPQjLWlE/s320/tedwilliams.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
&lt;br /&gt;
What does this tell us? Well, the darker the line, the more confident we should be that his performance is based on skill, and not just random fluctuations.&lt;br /&gt;
&lt;br /&gt;
In the future I want to use this type of graph to compare multiple players and see what their performance and injury history looks like to give a quick glance evaluation between similar free agents, or trade candidates.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;References:&lt;/b&gt;&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Raw Data as an &lt;a href="https://docs.google.com/leaf?id=0B_hKIaAw27e_MzA5NGM5MDgtYjE4Ny00NjcxLThhMWUtMmY5OWRmZjEzOWVk&amp;amp;hl=en&amp;amp;authkey=CIaEvL8H"&gt;Excel Spreadsheet&lt;b&gt; &lt;/b&gt;(8.1 MB)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Ted Williams Game Logs from &lt;a href="http://www.baseball-reference.com/players/w/willite01.shtml"&gt;Baseball Reference&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Ted Williams Career wOBA from &lt;a href="http://www.fangraphs.com/statss.aspx?playerid=1014040&amp;amp;position=OF"&gt;Fangraphs&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;This graph was created in Excel and then edited in Adobe Illustrator. It is licensed under a Creative Commons Attribution, Non-Commercial license.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-6700375314949331704?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/g4iYmIOJXrV2bNVWgt8L_hdnAO0/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/g4iYmIOJXrV2bNVWgt8L_hdnAO0/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/g4iYmIOJXrV2bNVWgt8L_hdnAO0/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/g4iYmIOJXrV2bNVWgt8L_hdnAO0/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/FaFJAR0R_V0" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/6700375314949331704/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2010/12/ted-williams-rolling-woba-per-150-games.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/6700375314949331704?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/6700375314949331704?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/FaFJAR0R_V0/ted-williams-rolling-woba-per-150-games.html" title="Ted Williams Rolling wOBA per 150 Games" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/_AtlVw4SMQfg/TP8t7stPQFI/AAAAAAAAAP8/yBzAPQjLWlE/s72-c/tedwilliams.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2010/12/ted-williams-rolling-woba-per-150-games.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CEMERHo7fSp7ImA9Wx9RFkQ.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-8715670923618546774</id><published>2010-12-06T22:34:00.000-08:00</published><updated>2010-12-18T09:13:25.405-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-12-18T09:13:25.405-08:00</app:edited><title>wOBA by Ball-Strike Count</title><content type="html">I am a big fan of graphs and baseball. &lt;a href="http://www.fangraphs.com/"&gt;Fangraphs&lt;/a&gt; made me excited  because putting complex data into reasonably easy to understand graphs  helps open up sabermetrics to more fans. I'm a big fan of statistical  analysis, but after a while, a table full of numbers just starts running  together and stops making sense. That's what makes graphs such an  effective tool.&lt;br /&gt;
&lt;br /&gt;
I've dabbled in graphs myself. When people were  creating the WAR graphs to compare hall of famers, I made a sample graph  showing cumulative WAR by age on &lt;a href="http://insidethebook.com/ee" target="_blank" title="The Book Blog"&gt;Tom Tango's Book Blog&lt;/a&gt;:&lt;br /&gt;
&lt;br /&gt;
&lt;a href="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3Uf20CQfI/AAAAAAAAAPI/EEfUPhuj708/s1600/WAR+graph+by+age.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="210" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3Uf20CQfI/AAAAAAAAAPI/EEfUPhuj708/s320/WAR+graph+by+age.png" width="320" /&gt;&lt;/a&gt;&lt;br /&gt;
(click for a larger image)&lt;br /&gt;
&lt;br /&gt;
Of course, soon after Fangraphs &lt;a href="http://www.fangraphs.com/graphsw.aspx?playerid2=1003865&amp;amp;playerid3=1006905&amp;amp;playerid4=1009517&amp;amp;playerid5=1000137" target="_blank" title="WAR Graphs"&gt;came out with a far better looking one&lt;/a&gt;, saving me the headache of figuring out how to automate it.&lt;br /&gt;
&lt;br /&gt;
Here is my latest foray into the world of graphs, looking at wOBA by count:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/_AtlVw4SMQfg/TQzrp5N1YCI/AAAAAAAAAQI/ZEbOU3NLVZ0/s1600/split-count-%25282010-12-14%2529-BB.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="247" src="http://4.bp.blogspot.com/_AtlVw4SMQfg/TQzrp5N1YCI/AAAAAAAAAQI/ZEbOU3NLVZ0/s320/split-count-%25282010-12-14%2529-BB.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;/div&gt;&lt;br /&gt;
(click for a larger image)&lt;br /&gt;
&lt;br /&gt;
Let  me explain the mess you see above. The horizontal X-axis shows the  amount of pitches. The first pitch is all the way to the left, and a  full count is all the way to the right. The vertical Y-axis shows the  wOBA for all at-bats that go through that count.&lt;br /&gt;
&lt;br /&gt;
Since all at-bats  go through the first pitch, the average wOBA is .330 (league average).  The higher on the graph, the more likely a player is going to do  something good. As you can see, the best count for hitters is 3-0, and  the worst count is 0-2. On 3-0 the average hitter is better than &lt;a href="http://www.fangraphs.com/statss.aspx?playerid=1109&amp;amp;position=OF" target="_blank" title="Barry Bonds"&gt;2001-2002 Barry Bonds&lt;/a&gt;, and on 0-2 they're batting more like &lt;a href="http://www.fangraphs.com/statss.aspx?playerid=2233&amp;amp;position=PB" target="_blank" title="Adam Wainwright"&gt;Adam Wainwright in 2010&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
The  size of the counts (by area) are the amount of times that count has  happened. There were 936,848 PA in my sample, so the first pitch is the  biggest. There were only 47,488 3-0 counts, so that is the smallest.  Each of the counts is a graph in and of itself showing what happened at  that count.&lt;br /&gt;
&lt;br /&gt;
Blue is ball, red is strike, and gray means the play  ended. As you can see, with 2 strikes the play ends with another, so  there are only balls and ended at-bats.&lt;br /&gt;
&lt;br /&gt;
&lt;h1&gt;So What?&lt;/h1&gt;&lt;h1&gt;&amp;nbsp;&lt;/h1&gt;I made this graph for my own use. It is a nice easy-reference tool  to track what's happening each pitch. I can follow and see if a batter's  chances went up or down, and how likely the at-bat is going to end on  each pitch (really roughly).&lt;br /&gt;
&lt;br /&gt;
Ideally I would make one for each team, so  that you can get one for your own team and use it when you're watching  games, or even for each player so that you can compare and contrast  Vladimir Guerrero with Kevin Youkilis, or the Twins and the Yankees,  etc. And there's a good chance that there are things that you can think  of to use this graph for, so please let me know what they are in the  comments.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;References:&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Raw Data in a&lt;b&gt; &lt;/b&gt;&lt;a href="https://spreadsheets.google.com/ccc?key=0AvhKIaAw27e_dHZ0Q1QzVUVMU0ZIc3pVZXR6RWNrSFE&amp;amp;hl=en#gid=0" target="_blank" title="Google Spreadsheet"&gt;Google Spreadsheet&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Data from &lt;a href="http://www.baseball-reference.com/leagues/split.cgi?t=b&amp;amp;lg=MLB&amp;amp;year=2010#count::none" target="_blank" title="Baseball Reference Split Data (2010 MLB)"&gt;Baseball Reference Split Data (2006-2010 MLB)&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;wOBA Numbers Calculated using &lt;a href="http://tangotiger.net/bdb/lwts_woba_for_bdb.txt" target="_blank" title="Tango Tiger's 2008 MLB wOBA Data"&gt;Tango Tiger's wOBA Data (2008 MLB)&lt;/a&gt; &lt;/li&gt;
&lt;/ul&gt;The graph was initially made in Excel to get the bubble positions  and sizes, then imported into Adobe Illustrator to add the pie graphs,  connecting lines, etc. Images are licensed under Creative Commons, Attribution, Non-Commercial License.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-8715670923618546774?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/PPQNl2dky1ZKqYeseCaHvAK2j44/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/PPQNl2dky1ZKqYeseCaHvAK2j44/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/PPQNl2dky1ZKqYeseCaHvAK2j44/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/PPQNl2dky1ZKqYeseCaHvAK2j44/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/3e2MaBVhlDE" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/8715670923618546774/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2010/12/woba-by-ball-strike-count.html#comment-form" title="4 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/8715670923618546774?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/8715670923618546774?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/3e2MaBVhlDE/woba-by-ball-strike-count.html" title="wOBA by Ball-Strike Count" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3Uf20CQfI/AAAAAAAAAPI/EEfUPhuj708/s72-c/WAR+graph+by+age.png" height="72" width="72" /><thr:total>4</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2010/12/woba-by-ball-strike-count.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DUUAQX49eyp7ImA9Wx9SF00.&quot;"><id>tag:blogger.com,1999:blog-4257932056731228876.post-5185153705424717261</id><published>2010-12-06T22:09:00.000-08:00</published><updated>2010-12-06T23:34:00.063-08:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2010-12-06T23:34:00.063-08:00</app:edited><title>Run Expectancy by Base-Out State (2010 MLB)</title><content type="html">How many runs does a team score on average when there is nobody on and no outs?&lt;br /&gt;
&lt;br /&gt;
The answer to that question is available by run expectancy charts, like &lt;a href="http://www.baseballprospectus.com/statistics/sortable/index.php?cid=139594"&gt;this one provided by Baseball Prospectus&lt;/a&gt;. But to most people, that's just a mess of numbers, and it's difficult to follow.&lt;br /&gt;
&lt;br /&gt;
Here is the same information in a chart form:&lt;br /&gt;
&lt;a href="http://1.bp.blogspot.com/_AtlVw4SMQfg/TP3N2JARj2I/AAAAAAAAAPA/mn3cuZSf98M/s1600/baseout-states-%25282010-12-08%2529.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="240" src="http://1.bp.blogspot.com/_AtlVw4SMQfg/TP3N2JARj2I/AAAAAAAAAPA/mn3cuZSf98M/s320/baseout-states-%25282010-12-08%2529.png" width="320" /&gt;&lt;/a&gt;&lt;br /&gt;
(click for a larger version) &lt;br /&gt;
&lt;br /&gt;
The Red base diagrams show the runs expected with zero outs. Orange with one out. Blue with two outs. The first column is with the bases empty. Moving to the right, it shows the runs expected with 1 runner on, 2 runners on, or the bases loaded. As you can see, the more people on base, the more runs you are expected to score, and the fewer outs, the more runs you are expected to score.&lt;br /&gt;
&lt;br /&gt;
References:&lt;br /&gt;
&lt;ul&gt;&lt;li&gt;Spreadsheet with &lt;a href="https://spreadsheets.google.com/ccc?key=0AvhKIaAw27e_dFgteU9ya0hEZ2dIZjNPQ2lZWUdqMHc&amp;amp;hl=en&amp;amp;authkey=CJLr97gI#gid=0"&gt;Raw Data&lt;/a&gt; &lt;/li&gt;
&lt;li&gt; Run Expectancies from &lt;a href="http://www.baseballprospectus.com/statistics/sortable/index.php?cid=139594"&gt;Baseball Prospectus Run Expectancy Chart&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Data was collected using &lt;a href="http://www.baseball-reference.com/play-index/event_finder.cgi"&gt;Baseball Reference Batting Event Finder&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;Image was editing using Adobe Illustrator. Vector data is available on request. This image is available under a Creative Commons Attribution, Non-Commercial License.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4257932056731228876-5185153705424717261?l=henkakyuu.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/Gzaecbr6YHbAFE_uZkcduVP_ACM/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/Gzaecbr6YHbAFE_uZkcduVP_ACM/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/Gzaecbr6YHbAFE_uZkcduVP_ACM/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/Gzaecbr6YHbAFE_uZkcduVP_ACM/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/Henkakyuu/~4/ZjP69EyeotY" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://henkakyuu.blogspot.com/feeds/5185153705424717261/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://henkakyuu.blogspot.com/2010/12/run-expectancy-by-base-out-state-2010.html#comment-form" title="5 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/5185153705424717261?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/4257932056731228876/posts/default/5185153705424717261?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/Henkakyuu/~3/ZjP69EyeotY/run-expectancy-by-base-out-state-2010.html" title="Run Expectancy by Base-Out State (2010 MLB)" /><author><name>Joshua Maciel</name><uri>http://www.blogger.com/profile/17823507869992026737</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://2.bp.blogspot.com/_AtlVw4SMQfg/TP3ZCnDUtnI/AAAAAAAAAPc/ozo9P_XW9rY/S220/twitter.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/_AtlVw4SMQfg/TP3N2JARj2I/AAAAAAAAAPA/mn3cuZSf98M/s72-c/baseout-states-%25282010-12-08%2529.png" height="72" width="72" /><thr:total>5</thr:total><feedburner:origLink>http://henkakyuu.blogspot.com/2010/12/run-expectancy-by-base-out-state-2010.html</feedburner:origLink></entry></feed>

