<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" version="2.0">

<channel>
	<title>Statistical Modeling, Causal Inference, and Social Science</title>
	
	<link>http://andrewgelman.com</link>
	<description />
	<lastBuildDate>Wed, 16 May 2012 20:00:06 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/StatisticalModeling" /><feedburner:info xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" uri="statisticalmodeling" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><feedburner:emailServiceId xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0">StatisticalModeling</feedburner:emailServiceId><feedburner:feedburnerHostname xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0">http://feedburner.google.com</feedburner:feedburnerHostname><item>
		<title>Question 6 of my final exam for Design and Analysis of Sample Surveys</title>
		<link>http://andrewgelman.com/2012/05/question-6-of-my-final-exam-for-design-and-analysis-of-sample-surveys/</link>
		<comments>http://andrewgelman.com/2012/05/question-6-of-my-final-exam-for-design-and-analysis-of-sample-surveys/#comments</comments>
		<pubDate>Wed, 16 May 2012 20:00:06 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Political Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15420</guid>
		<description><![CDATA[6. A survey of New York City residents is performed using cluster sampling. The design effect is 3.0. From the survey, the estimated proportion who prefer the Mets to the Yankees is 0.42 with a standard error of 0.05. How many people were in the sample? Solution to question 5 From yesterday: 5. Which of [...]]]></description>
			<content:encoded><![CDATA[<p>6. A survey of New York City residents is performed using cluster sampling. The design effect is 3.0. From the survey, the estimated proportion who prefer the Mets to the Yankees is 0.42 with a standard error of 0.05. How many people were in the sample?</p>
<p><strong>Solution to question 5</strong></p>
<p>From <a href="http://andrewgelman.com/2012/05/question-5-of-my-final-exam-for-design-and-analysis-of-sample-surveys/">yesterday</a>:</p>
<blockquote><p>5. Which of the following better describes changes in public opinion on most issues? (Choose only one.)</p>
<p>(a) Dynamic stability: On any given issue, average opinion remains stable but liberals and conservatives move back and forth in opposite directions (the “accordion model”)</p>
<p>(b) Uniform swing: Average opinion on an issue can move but the liberals and conservatives don’t move much relative to each other (the disribution of opinions is a “solid block of wood”)</p>
<p>(c) Compensating tradeoffs: When considering multiple survey questions on the same general topic, average opinion can move sharply to the left or right on individual questions while the average over all the questions remains stable (the “rubber band model”)</p></blockquote>
<p>Solution:  b.  You can make an argument for option a over the long term, but if you have to pick just one of the three, you have to go with uniform swing.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/question-6-of-my-final-exam-for-design-and-analysis-of-sample-surveys/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Wikipedia author confronts Ed Wegman</title>
		<link>http://andrewgelman.com/2012/05/wikipedia-author-confronts-ed-wegman/</link>
		<comments>http://andrewgelman.com/2012/05/wikipedia-author-confronts-ed-wegman/#comments</comments>
		<pubDate>Wed, 16 May 2012 13:32:32 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Zombies]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15410</guid>
		<description><![CDATA[Wegman: &#8220;It&#8217;s not reprinted 100 percent like you had it.&#8221; Wikipedia guy: &#8220;No, you added another paragraph at the end and you changed the headline. . . . You even copied the typos that I&#8217;ve corrected on my website. It was taken verbatim and reprinted in your paper.&#8221; The original author got a check for [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://gawker.com/5910485/paper-plagiarizes-blog-post-writer-goes-to-paper-to-confront-publisher-and-get-paid"><iframe width="640" height="360" src="http://www.youtube.com/embed/YAtOs5yzVus?feature=player_embedded" frameborder="0" allowfullscreen></iframe></a></p>
<p>Wegman:  &#8220;It&#8217;s not reprinted 100 percent like you had it.&#8221;</p>
<p>Wikipedia guy:  &#8220;No, you added another paragraph at the end and you changed the headline. . . . You even copied the typos that I&#8217;ve corrected on my website.  It was taken verbatim and reprinted in your paper.&#8221;</p>
<p>The original author got a <a href="http://gawker.com/5910485/paper-plagiarizes-blog-post-writer-goes-to-paper-to-confront-publisher-and-get-paid">check</a> for $500 but, unfortunately, no free subscription to &#8220;Wiley Interdisciplinary Reviews: Computational Statistics&#8221; (a $1400-$2800 <a href="http://andrewgelman.com/2011/09/wiley-wegman-chutzpah-update/">value</a>).</p>
<p>P.S.  To those who think I&#8217;m being mean to Wegman:  I haven&#8217;t yet heard that he&#8217;s apologized to the people whose work he copied without attribution, or to the people who spent their time tracking all this down, or to the U.S. Congress for misrepresenting his expertise in his official report.</p>
<p>Everyone makes mistakes, and just about everyone has ethical lapses at times.  But when you get caught you&#8217;re supposed to make apology and restitution.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/wikipedia-author-confronts-ed-wegman/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Question 5 of my final exam for Design and Analysis of Sample Surveys</title>
		<link>http://andrewgelman.com/2012/05/question-5-of-my-final-exam-for-design-and-analysis-of-sample-surveys/</link>
		<comments>http://andrewgelman.com/2012/05/question-5-of-my-final-exam-for-design-and-analysis-of-sample-surveys/#comments</comments>
		<pubDate>Tue, 15 May 2012 20:00:57 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Political Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15390</guid>
		<description><![CDATA[5. Which of the following better describes changes in public opinion on most issues? (Choose only one.) (a) Dynamic stability: On any given issue, average opinion remains stable but liberals and conservatives move back and forth in opposite directions (the “accordion model”) (b) Uniform swing: Average opinion on an issue can move but the liberals [...]]]></description>
			<content:encoded><![CDATA[<p>5. Which of the following better describes changes in public opinion on most issues? (Choose only one.)</p>
<p>(a) Dynamic stability: On any given issue, average opinion remains stable but liberals and conservatives move back and forth in opposite directions (the “accordion model”)</p>
<p>(b) Uniform swing: Average opinion on an issue can move but the liberals and conservatives don’t move much relative to each other (the disribution of opinions is a “solid block of wood”)</p>
<p>(c) Compensating tradeoffs: When considering multiple survey questions on the same general topic, average opinion can move sharply to the left or right on individual questions while the average over all the questions remains stable (the “rubber band model”)</p>
<p><strong>Solution to question 4</strong></p>
<p>From <a href="http://andrewgelman.com/2012/05/question-4-of-my-final-exam-for-design-and-analysis-of-sample-surveys/">yesterday</a>:</p>
<blockquote><p>4. Researchers have found that survey respondents overreport church attendance. Thus, naive estimates from surveys overstate the percentage of Americans who attend church regularly. Does this have a large impact on estimates of time trends in religious attendance?</p></blockquote>
<p>Solution:  Yes.  See <a href="http://www.stat.columbia.edu/~gelman/surveys/HadawayMarlerChaves1993.pdf">this article</a> by Hadaway, Marler, and Chaves, who write, &#8220;We suspect that the actual attendance rate has declined since World War II, despite the fact that the survey rate remained basically stable.&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/question-5-of-my-final-exam-for-design-and-analysis-of-sample-surveys/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>A statistical research project:  Weeding out the fraudulent citations</title>
		<link>http://andrewgelman.com/2012/05/a-statistical-research-project-weeding-out-the-fraudulent-citations/</link>
		<comments>http://andrewgelman.com/2012/05/a-statistical-research-project-weeding-out-the-fraudulent-citations/#comments</comments>
		<pubDate>Tue, 15 May 2012 13:26:46 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Sociology]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15381</guid>
		<description><![CDATA[John Mashey points me to a blog post by Phil Davis on &#8220;the emergence of a citation cartel.&#8221; Davis tells the story: Cell Transplantation is a medical journal published by the Cognizant Communication Corporation of Putnam Valley, New York. In recent years, its impact factor has been growing rapidly. In 2006, it was 3.482 [I [...]]]></description>
			<content:encoded><![CDATA[<p>John Mashey points me to a <a href="http://scholarlykitchen.sspnet.org/2012/04/10/emergence-of-a-citation-cartel/">blog post</a> by Phil Davis on &#8220;the emergence of a citation cartel.&#8221;  Davis tells the story:<br />
<span id="more-15381"></span></p>
<blockquote><p>Cell Transplantation is a medical journal published by the Cognizant Communication Corporation of Putnam Valley, New York. In recent years, its impact factor has been growing rapidly. In 2006, it was 3.482 [I think he means "3.5"---ed.]. In 2010, it had almost doubled to 6.204.</p>
<p>When you look at which journals cite Cell Transplantation, two journals stand out noticeably: the Medical Science Monitor, and The Scientific World Journal. According to the JCR, neither of these journals cited Cell Transplantation until 2010.</p>
<p>Then, in 2010, a review article was published in the Medical Science Monitor citing 490 articles, 445 of which were to papers published in Cell Transplantation. All 445 citations pointed to papers published in 2008 or 2009 — the citation window from which the journal’s 2010 impact factor was derived. Of the remaining 45 citations, 44 cited the Medical Science Monitor, again, to papers published in 2008 and 2009.</p>
<p>Three of the four authors of this paper sit on the editorial board of Cell Transplantation. Two are associate editors, one is the founding editor. The fourth is the CEO of a medical communications company.</p></blockquote>
<p>Further details follow, but I think you get the picture.</p>
<p>Davis continues:</p>
<blockquote><p>The ease to which members of an editorial board were able to use a cartel of journals to influence their journal’s impact factor concerns me greatly because the cost to do so is very low, the rewards are astonishingly high, it is difficult to detect, and the practice can be facilitated very easily by overlapping editorial boards or through cooperative agreements between them. What’s more, editors can protect these “reviews” from peer review if they are labeled as “editorial material,” as some are. It’s the perfect strategy for gaming the system.</p></blockquote>
<p>I&#8217;d think this sort of thing should be detectable using statistical analysis.  This seems like the sort<br />
of fight-fire-with-fire situation where there will be at least a partial<br />
technological solution.</p>
<p>P.S.  The <a href="http://scholarlykitchen.sspnet.org/2012/04/10/emergence-of-a-citation-cartel/#comments">discussion thread</a> is worth reading too.</p>
<p>P.P.S.  Lots more interesting stuff on that blog, for example <a href="http://scholarlykitchen.sspnet.org/2012/04/12/the-academic-spring-shallo-rhetoric-aimed-at-the-wrong-target/">this post</a> by Kent Anderson disparaging open-access publishing.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/a-statistical-research-project-weeding-out-the-fraudulent-citations/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Question 4 of my final exam for Design and Analysis of Sample Surveys</title>
		<link>http://andrewgelman.com/2012/05/question-4-of-my-final-exam-for-design-and-analysis-of-sample-surveys/</link>
		<comments>http://andrewgelman.com/2012/05/question-4-of-my-final-exam-for-design-and-analysis-of-sample-surveys/#comments</comments>
		<pubDate>Mon, 14 May 2012 21:00:34 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Political Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15385</guid>
		<description><![CDATA[4. Researchers have found that survey respondents overreport church attendance. Thus, naive estimates from surveys overstate the percentage of Americans who attend church regularly. Does this have a large impact on estimates of time trends in religious attendance? Solution to question 3 From yesterday: 3. We discussed in class the best currently available method for [...]]]></description>
			<content:encoded><![CDATA[<p>4. Researchers have found that survey respondents overreport church attendance. Thus, naive estimates from surveys overstate the percentage of Americans who attend church regularly. Does this have a large impact on estimates of time trends in religious attendance?</p>
<p><strong>Solution to question 3</strong></p>
<p>From <a href="http://andrewgelman.com/2012/05/question-3-of-my-final-exam-for-design-and-analysis-of-sample-surveys/">yesterday</a>:</p>
<blockquote><p>3. We discussed in class the best currently available method for estimating the proportion of military servicemembers who are gay. What is that method? (Recall the problems with the direct approach: there is no simple way to survey servicemembers at random, nor is it likely that they would answer such a question honestly.)</p></blockquote>
<p>Solution:  I was talking about the <a href="http://andrewgelman.com/2010/07/gaydar_update_a/">work</a> of Gary Gates, combining an estimate of the percentage of gays in the population with an estimate of the probability that someone is in the military, given that he or she is gay.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/question-4-of-my-final-exam-for-design-and-analysis-of-sample-surveys/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>I hate to get all Gerd Gigerenzer on you here, but . . .</title>
		<link>http://andrewgelman.com/2012/05/i-hate-to-get-all-gerd-gigerenzer-on-you-here-but/</link>
		<comments>http://andrewgelman.com/2012/05/i-hate-to-get-all-gerd-gigerenzer-on-you-here-but/#comments</comments>
		<pubDate>Mon, 14 May 2012 13:22:45 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Causal Inference]]></category>
		<category><![CDATA[Decision Theory]]></category>
		<category><![CDATA[Economics]]></category>
		<category><![CDATA[Literature]]></category>
		<category><![CDATA[Political Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15353</guid>
		<description><![CDATA[Jonathan Cantor points me to an opinion piece by psychologist Reid Hastie, &#8220;Our Gift for Good Stories Blinds Us to the Truth.&#8221; I have mixed feelings about Hastie&#8217;s article. On one hand I do think his point is important. It&#8217;s not new to me, but presumably it&#8217;s new to many readers of bloomberg.com. I like [...]]]></description>
			<content:encoded><![CDATA[<p>Jonathan Cantor points me to an <a href="http://www.bloomberg.com/news/2012-05-09/our-gift-for-good-stories-blinds-us-to-the-truth.html">opinion piece</a> by psychologist Reid Hastie, &#8220;Our Gift for Good Stories Blinds Us to the Truth.&#8221;</p>
<p>I have mixed feelings about Hastie&#8217;s article.  On one hand I do think his point is important.  It&#8217;s not new to me, but presumably it&#8217;s new to many readers of bloomberg.com.  I like Hastie&#8217;s book (with Robyn Dawes), Rational Choice in an Uncertain World, and I&#8217;m predisposed to like anything new that he writes.</p>
<p>On the other hand, there&#8217;s something about Hastie&#8217;s article that bothered me.  It seemed a bit smug, as if he thinks he understands the world and wants to just explain it to the rest of us.  That could be fine&#8212;after all, Hastie is a distinguished psychology researcher&#8212;but I wasn&#8217;t so clear that he&#8217;s so clear on what he&#8217;s saying.  For example:</p>
<blockquote><p>The human brain is designed to support two modes of thought: visual and narrative. These forms of thinking are universal across human societies throughout history, develop reliably early in individuals’ lives, and are associated with specialized regions of the brain.</p></blockquote>
<p>Is that really true?  How does math fit into this picture?  Or music?  Music has a sort of narrative structure but it doesn&#8217;t seem quite like a story, either.</p>
<p>Hastie continues:</p>
<blockquote><p>What isn’t universal or natural is the kind of highly structured cognitive processes that underlie logical and mathematical thinking.</p></blockquote>
<p>Not natural . . . really?  Maybe math is not universal, but certainly it&#8217;s natural.  I was doing it when I was 2 years old.  And music, that does seem to be universal, no?</p>
<p>Later on:</p>
<blockquote><p>The mathematics of causal reasoning has recently experienced a major change, with the widespread acceptance of Bayesian Causal Networks as a normative, rational model for causal induction and reasoning.</p></blockquote>
<p>Ummm . . . maybe Hastie is a bit too accepting of this particular story!  I think Bayesian inference is great&#8212;I wrote two books on the topic!&#8212;but I <a href="http://www.stat.columbia.edu/~gelman/research/published/philosophy_online4.pdf">wouldn&#8217;t go so far</a> as to call it &#8220;a normative, rational model for causal induction and reasoning.&#8221;  But I suppose that if I feel able to opine about psychology, I can&#8217;t object to Hastie expressing his views on statistics.</p>
<p>Hastie continues with a famous example:</p>
<blockquote><p>The legendary theorists of decision-making Amos Tversky and Daniel Kahneman illustrated [our desire for stories] with the following pair of judgment questions: One group of respondents was asked, “What is the probability that a massive flood will occur sometime in the next year and drown more than 1,000 Americans?” The typical estimate was low (less than 20 percent). But, when another comparable sample of respondents was asked, “What is the probability that an earthquake in California will be followed by a flood in the next year that drowns at least 1,000 Americans?” the estimates were significantly higher.</p>
<p>The irrationality is that the second question is about a much more specific event, an earthquake that would be only one of the several reasons for the flood referred to in the first question. It is logically impossible for the second probability to be higher than the first. But, because the second question provides a plausible scenario for the unlikely outcome in the first query, our innate preference for a good story trumps our logical thinking skills.</p></blockquote>
<p>This story is a great example of the availability heuristic, but I don&#8217;t see how it demonstrates a problem with &#8220;our logical thinking skills.&#8221;  When responding to the first question, many people have difficulty visualizing that massive flood.  The second question gives a clue.  But I don&#8217;t see the combination of responses (coming from different sets of people) as indicating irrationality.  Most people are not flood experts.  They answer the questions as best they can, and when you give more information they will use it.</p>
<p>I hate to get all Gerd Gigerenzer on you here, but what&#8217;s the point of saying that this &#8220;trumps our logical thinking skills&#8221;?  I think Kahneman and Tversky did better, decades ago, by writing of &#8220;heuristics and biases.&#8221;</p>
<p><strong>What&#8217;s the political message here?</strong></p>
<p>The article under discussion concludes with:</p>
<blockquote><p>So the next time you hear a good story about why the financial recession, or any other economically significant event, was caused by a single collection of bad actors &#8212; or how a simple linear narrative “explains” an important event &#8212; remember this: Just as we are wired to like a diet rich in fats and sugars, we have an appetite for simple, coherent narratives. Neither habit is good for our long-term health.</p>
<p>(Reid Hastie, a professor of behavioral science at the University of Chicago Booth School of Business, is a contributor to Business Class. The opinions expressed are his own.)</p></blockquote>
<p>Aaahhhh, now I get the message:  The financial crisis is nobody&#8217;s fault!  Let&#8217;s put aside the politics of blame, let&#8217;s all work together etc etc.  OK, fine.  Does this apply to all catastrophes?  If you know someone in a plane that crashed, are we allowed to check if the pilot was stoned before takeoff?  If someone takes $100,000 from you on a fraudulent pretext, and you catch him, are you allowed to try to collect?  Or is it only in the financial crisis that we should set aside all &#8220;good stories&#8221; and &#8220;simple linear narratives&#8221;?</p>
<p>I agree that our financial problems our complex, and I&#8217;m all for warning people about the simplicity of storytelling, but I&#8217;m also a bit suspicious of someone from the University of Chicago School of Business telling me not to think about stories of the financial crisis.</p>
<p><strong>Getting quantitative</strong></p>
<p>Also, I&#8217;m surprised that, when people estimate &#8220;the probability that a massive flood will occur sometime in the next year and drown more than 1,000 Americans&#8221; as less than 20%, Hastie characterizes that estimate as &#8220;low.&#8221;  Even Katrina drowned only 387 people (according to <a href="http://new.dhh.louisiana.gov/assets/docs/katrina/deceasedreports/KatrinaDeaths_082008.pdf">this source</a> which I found by googling Katrina drownings).  If a 20% chance of this &#8220;massive flood&#8221; occurring in a one-year period is &#8220;low,&#8221; I&#8217;d be interested in what Hastie thinks is a more reasonable probability estimate.</p>
<p><strong>Responding</strong></p>
<p>Hastie&#8217;s article bothered me for two reasons.  First, what does it mean to it describe &#8220;the kind of highly structured cognitive processes that underlie logical and mathematical thinking&#8221; as &#8220;unnatural.&#8221;  I don&#8217;t quite get what &#8220;natural&#8221; means here.</p>
<p>Second, I see an implicit political message, which seems to be that we shouldn&#8217;t blame anyone for the financial crisis:</p>
<blockquote><p>We know there was no single cause or event that set in motion the crisis and that the truth is complex and multicausal. So why do we keep seeking the easy answers? It may be that we are hard-wired to do so.</p></blockquote>
<p>Or, as the guy said in Repo Man, &#8220;it&#8217;s society&#8217;s fault.&#8221;</p>
<p>I contacted bloomberg.com, the publishers of the above-linked article, but was told:</p>
<blockquote><p>We typically don&#8217;t publish opeds responding to articles we&#8217;ve published, though we welcome letters to the editor. We also post corrections to pieces containing factual errors and would gladly review any objections you have to Mr. Hastie&#8217;s column.</p></blockquote>
<p>Fair enough, but in this case I don&#8217;t think the problems would be resolved by a correction note.  I&#8217;m more bothered by the totality of the piece.  For example, the claim that logical reasoning is &#8220;unnatural&#8221; is not quite a &#8220;factual error&#8221; but it still seems wrong to me.</p>
<p>P.S.  Someone who knows the judgment and decision making field better than I do writes:</p>
<blockquote><p>I don&#8217;t think that Reid has a political agenda here. (He has only been at Chicago for a few years, and Chicago&#8217;s School of Business is not monolithic.) . . . To say that blame narratives are oversimplified is not the same as saying that nobody should be blamed; you may be reading the latter subtext into his text.</p></blockquote>
<p>So maybe I was being unfair.  Although I&#8217;d feel a little better about Hastie&#8217;s column if he&#8217;d clarified that, even though stories can be oversimplified, the &#8220;life is complicated&#8221; defense shouldn&#8217;t be used to get people off the hook.</p>
<p>Also, I&#8217;m still unhappy about the claim that logical and mathematical reasoning is &#8220;unnatural.&#8221;  But this fits with the innumeracy of thinking there&#8217;s a greater-than-20%-chance of a major flood in any given year.  I feel that, to Hastie, numbers are just words.  Which is consistent with the idea that mathematical reasoning is unnatural <em>to him</em>.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/i-hate-to-get-all-gerd-gigerenzer-on-you-here-but/feed/</wfw:commentRss>
		<slash:comments>35</slash:comments>
		</item>
		<item>
		<title>Question 3 of my final exam for Design and Analysis of Sample Surveys</title>
		<link>http://andrewgelman.com/2012/05/question-3-of-my-final-exam-for-design-and-analysis-of-sample-surveys/</link>
		<comments>http://andrewgelman.com/2012/05/question-3-of-my-final-exam-for-design-and-analysis-of-sample-surveys/#comments</comments>
		<pubDate>Sun, 13 May 2012 20:00:58 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Political Science]]></category>
		<category><![CDATA[Teaching]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15362</guid>
		<description><![CDATA[3. We discussed in class the best currently available method for estimating the proportion of military servicemembers who are gay. What is that method? (Recall the problems with the direct approach: there is no simple way to survey servicemembers at random, nor is it likely that they would answer such a question honestly.) Solution to [...]]]></description>
			<content:encoded><![CDATA[<p>3. We discussed in class the best currently available method for estimating the proportion of military servicemembers who are gay. What is that method? (Recall the problems with the direct approach: there is no simple way to survey servicemembers at random, nor is it likely that they would answer such a question honestly.)</p>
<p><strong>Solution to question 2</strong></p>
<p>From <a href="http://andrewgelman.com/2012/05/question-2-of-my-final-exam-for-design-and-analysis-of-sample-surveys/">yesterday</a>:</p>
<blockquote><p>2. Which of the following are useful goals in a pilot study? (Indicate all that apply.)</p>
<p>(a) You can search for statistical significance, then from that decide what to look for in a confirmatory analysis of your full dataset.</p>
<p>(b) You can see if you find statistical significance in a pre-chosen comparison of interest.</p>
<p>(c) You can examine the direction (positive or negative, even if not statistically significant) of comparisons of interest.</p>
<p>(d) With a small sample size, you cannot hope to learn anything conclusive, but you can get a crude estimate of effect size and standard deviation which will be useful in a power analysis to help you decide how large your full study needs to be.</p>
<p>(e) You can talk with survey respondents and get a sense of how they perceived your questions.</p>
<p>(f) You get a chance to learn about practical difficulties with sampling, nonresponse, and question wording.</p>
<p>(g) You can check if your sample is approximately representative of your population.</p></blockquote>
<p>Solution:  e and f.  The purpose of a pilot study is to test out the data collection.  The sample size will be too small for a, b, c, d, and g.  In some of their earliest work, Kahneman and Tversky documented the common misconception of researchers that data from a small pilot study should closely match the population.</p>
<p>The question would have clearer if I&#8217;d inserted the word &#8220;small&#8221; before &#8220;pilot&#8221; in the preamble.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/question-3-of-my-final-exam-for-design-and-analysis-of-sample-surveys/feed/</wfw:commentRss>
		<slash:comments>21</slash:comments>
		</item>
		<item>
		<title>Stolen jokes</title>
		<link>http://andrewgelman.com/2012/05/stolen-jokes/</link>
		<comments>http://andrewgelman.com/2012/05/stolen-jokes/#comments</comments>
		<pubDate>Sun, 13 May 2012 13:46:56 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Literature]]></category>
		<category><![CDATA[Sociology]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15135</guid>
		<description><![CDATA[Fun stories here (from Kliph Nesteroff, link from Mark Palko).]]></description>
			<content:encoded><![CDATA[<p>Fun stories <a href="http://blog.wfmu.org/freeform/2007/05/would_you_belie.html">here</a> (from Kliph Nesteroff, <a href="http://observationalepidemiology.blogspot.com/2012/04/what-im-currently-blogging-about-in.html">link</a> from Mark Palko).</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/stolen-jokes/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>More on Uncle Woody</title>
		<link>http://andrewgelman.com/2012/05/more-on-uncle-woody/</link>
		<comments>http://andrewgelman.com/2012/05/more-on-uncle-woody/#comments</comments>
		<pubDate>Sun, 13 May 2012 00:31:44 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Art]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15367</guid>
		<description><![CDATA[Here. See also here. He did Wacky Packs!]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.newyorker.com/online/blogs/culture/2012/05/art-spiegelman-discusses-maurice-sendak.html">Here</a>.</p>
<p>See also <a href="http://andrewgelman.com/2006/07/uncle_woody/">here</a>.  He did Wacky Packs!</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/more-on-uncle-woody/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Question 2 of my final exam for Design and Analysis of Sample Surveys</title>
		<link>http://andrewgelman.com/2012/05/question-2-of-my-final-exam-for-design-and-analysis-of-sample-surveys/</link>
		<comments>http://andrewgelman.com/2012/05/question-2-of-my-final-exam-for-design-and-analysis-of-sample-surveys/#comments</comments>
		<pubDate>Sat, 12 May 2012 20:00:08 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Political Science]]></category>
		<category><![CDATA[Teaching]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15349</guid>
		<description><![CDATA[2. Which of the following are useful goals in a pilot study? (Indicate all that apply.) (a) You can search for statistical significance, then from that decide what to look for in a confirmatory analysis of your full dataset. (b) You can see if you find statistical significance in a pre-chosen comparison of interest. (c) [...]]]></description>
			<content:encoded><![CDATA[<p>2. Which of the following are useful goals in a pilot study? (Indicate all that apply.)</p>
<p>(a) You can search for statistical significance, then from that decide what to look for in a confirmatory analysis of your full dataset.</p>
<p>(b) You can see if you find statistical significance in a pre-chosen comparison of interest.</p>
<p>(c) You can examine the direction (positive or negative, even if not statistically significant) of comparisons of interest.</p>
<p>(d) With a small sample size, you cannot hope to learn anything conclusive, but you can get a crude estimate of effect size and standard deviation which will be useful in a power analysis to help you decide how large your full study needs to be.</p>
<p>(e) You can talk with survey respondents and get a sense of how they perceived your questions.</p>
<p>(f) You get a chance to learn about practical difficulties with sampling, nonresponse, and question wording.</p>
<p>(g) You can check if your sample is approximately representative of your population.</p>
<p><strong>Solution to question 1</strong></p>
<p>From <a href="http://andrewgelman.com/2012/05/question-1-of-my-final-exam-for-design-and-analysis-of-sample-surveys/">yesterday</a>:</p>
<blockquote><p>1. Suppose that, in a survey of 1000 people in a state, 400 say they voted in a recent primary election. Actually, though, the voter turnout was only 30%. Give an estimate of the probability that a nonvoter will falsely state that he or she voted. (Assume that all voters honestly report that they voted.)</p></blockquote>
<p>Solution:  Draw the probability tree, you get that the proportion of people who say they voted is .3+.7p.  Solve .3+.7p=.4, you get p=(.4-.3)/.7=.14, or 14%.  I was also going to ask for the standard error (which you&#8217;d obtain by starting with the standard error for the &#8220;.4&#8243; and propagating that through) but I decided to keep it simple.  As it was, only about half the students got this question right.  This is not a knock on the kids&#8212;I just didn&#8217;t teach this material well&#8212;I&#8217;m just letting you know to give a sense that this isn&#8217;t such an easy problem.</p>
<p>P.S.  As some commenters note, Problem 1 isn&#8217;t so realistic.  Commenter awm <a href="http://andrewgelman.com/2012/05/question-1-of-my-final-exam-for-design-and-analysis-of-sample-surveys/#comment-82577">points out</a> that &#8220;for the most part people aren’t lying and that the sorts of people who participate in surveys about elections are disproportionately the sort of people who vote.&#8221;  My problem would&#8217;ve been cleaner if I&#8217;d also said to assume there was no nonresponse, and if I&#8217;d chosen a better example!</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/question-2-of-my-final-exam-for-design-and-analysis-of-sample-surveys/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>black and Black, white and White</title>
		<link>http://andrewgelman.com/2012/05/black-and-black-white-and-white/</link>
		<comments>http://andrewgelman.com/2012/05/black-and-black-white-and-white/#comments</comments>
		<pubDate>Sat, 12 May 2012 13:13:25 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Public Health]]></category>
		<category><![CDATA[Sociology]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15118</guid>
		<description><![CDATA[I&#8217;ve always thought it looked strange to see people referred to in print as Black or White rather than black or white. For example consider this sentence: &#8220;A black guy was walking down the street and he saw a bunch of white guys standing around.&#8221; That looks fine, whereas &#8220;A Black guy was walking down [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve always thought it looked strange to see people referred to in print as Black or White rather than black or white.  For example consider this sentence:  &#8220;A black guy was walking down the street and he saw a bunch of white guys standing around.&#8221;  That looks fine, whereas &#8220;A Black guy was walking down the street and he saw a bunch of White guys standing around&#8221;&#8212;that looks weird to me, as if the encounter was taking place in an Ethnic Studies seminar.</p>
<p>But maybe I&#8217;m wrong on this.  Jay Livingston <a href="http://montclairsoci.blogspot.com/2012/04/white-on-white.html">argues</a> that black and white are colors whereas Black and White are races (or, as I would prefer to say, ethnic categories) and illustrates with this picture of a white person and a White person:</p>
<p><a href="http://andrewgelman.com/wp-content/uploads/2012/04/00-Segal-Heather-2.jpg"><img src="http://andrewgelman.com/wp-content/uploads/2012/04/00-Segal-Heather-2.jpg" alt="" title="00 Segal Heather 2" width="320" height="240" class="alignnone size-full wp-image-15119" /></a></p>
<p>In conversation, I sometimes talk about pink people, brown people, and tan people, but that won&#8217;t work in a research paper.</p>
<p>P.S.  I suspect Carp will argue that I&#8217;m being naive:  meanings of words change across contexts and over time.  To which I reply:  Sure, but I still have to choose how to write these words!</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/black-and-black-white-and-white/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Question 1 of my final exam for Design and Analysis of Sample Surveys</title>
		<link>http://andrewgelman.com/2012/05/question-1-of-my-final-exam-for-design-and-analysis-of-sample-surveys/</link>
		<comments>http://andrewgelman.com/2012/05/question-1-of-my-final-exam-for-design-and-analysis-of-sample-surveys/#comments</comments>
		<pubDate>Fri, 11 May 2012 20:00:22 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Political Science]]></category>
		<category><![CDATA[Teaching]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15346</guid>
		<description><![CDATA[1. Suppose that, in a survey of 1000 people in a state, 400 say they voted in a recent primary election. Actually, though, the voter turnout was only 30%. Give an estimate of the probability that a nonvoter will falsely state that he or she voted. (Assume that all voters honestly report that they voted.) [...]]]></description>
			<content:encoded><![CDATA[<p>1.  Suppose that, in a survey of 1000 people in a state, 400 say they voted in a recent primary election. Actually, though, the voter turnout was only 30%. Give an estimate of the probability that a nonvoter will falsely state that he or she voted. (Assume that all voters honestly report that they voted.)</p>
<p>P.S.  The commenters are picking up some of the unintended &#8220;Hare and pineapple&#8221; ambiguity in my question!</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/question-1-of-my-final-exam-for-design-and-analysis-of-sample-surveys/feed/</wfw:commentRss>
		<slash:comments>25</slash:comments>
		</item>
		<item>
		<title>Are our referencing errors undermining our scholarship and credibility? The case of expatriate failure rates</title>
		<link>http://andrewgelman.com/2012/05/are-our-referencing-errors-undermining-our-scholarship-and-credibility-the-case-of-expatriate-failure-rates/</link>
		<comments>http://andrewgelman.com/2012/05/are-our-referencing-errors-undermining-our-scholarship-and-credibility-the-case-of-expatriate-failure-rates/#comments</comments>
		<pubDate>Fri, 11 May 2012 13:19:58 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Sociology]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15113</guid>
		<description><![CDATA[Thomas Basbøll points to this ten-year-old article from Anne-Wil Harzing on the consequences of sloppy citations. Harzing tells the story of an unsupported claim that is contradicted by published data but has been presented as fact in a particular area of the academic literature. She writes that &#8220;high expatriate failure rates [with "expatriate failure" defined [...]]]></description>
			<content:encoded><![CDATA[<p>Thomas Basbøll points to <a href="http://business.nmsu.edu/~mhyman/M610_Articles/Harzing_Journal_of_Organizational_Behavior_2002.pdf">this</a> ten-year-old article from Anne-Wil Harzing on the consequences of sloppy citations.  Harzing tells the story of an unsupported claim that is contradicted by published data but has been presented as fact in a particular area of the academic literature.  She writes that &#8220;high expatriate failure rates [with "expatriate failure" defined as "the expatriate returning home before his/her contractual period of employment abroad expires"] were in fact a myth created by massive misquotations and careless copying of references.&#8221;  Many papers claimed an expatriate failure rate of 25-40% (according to Harzing, this is much higher than the actual rate as estimated from empirical data), with this overly-high rate supported by a complicated link of references leading to . . . no real data.</p>
<p>Hartzing reports the following published claims:</p>
<blockquote><p>Harvey (1996: 103): `The rate of failure of expatriate managers relocating overseas from United States based MNCs has been estimated to range between 25±40 per cent (Tung, 1982, 1988; Mendenhall and Oddou, 1985; Gray 1991; Wedersphan, 1992; Solomon, 1994; Dowling, Schuler and Welch, 1994; Swaak, 1995).&#8217;<br />
Shay and Bruce (1997: 30): `Cross-industry studies have estimated US expatriate failure, defined as premature return from an overseas assignment, at between 25±40 per cent for developed countries (Baker and Ivancevich, 1971; Tung, 1981).&#8217;</p>
<p>Ashamalla (1998: 54): `According to a number of recent [emphasis added] studies, the rate of failure among American expatriates ranges from 25 to 40 per cent depending on the location of the assignment (Dumaine, Fortune, 1995; McDonald, 1993; Ralston et al., 1995).&#8217;</p></blockquote>
<p>Hartzing writes that, despite the profusion of references which appear to show multiple confirmations, these claims all comes from a single publication from 1979 which itself gives no source for its numbers.</p>
<p>If you believe Hartzing on this (and I see no reason not to), this is not about one or two sloppy researchers; rather, it seems to be general practice for people to thrown in citations without reading the original articles, thus creating a statistical problem of increasing the apparent N by treating multiple instances of the same claim as if they were independent pieces of supporting evidence.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/are-our-referencing-errors-undermining-our-scholarship-and-credibility-the-case-of-expatriate-failure-rates/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>My final exam for Design and Analysis of Sample Surveys</title>
		<link>http://andrewgelman.com/2012/05/my-final-exam-for-design-and-analysis-of-sample-surveys/</link>
		<comments>http://andrewgelman.com/2012/05/my-final-exam-for-design-and-analysis-of-sample-surveys/#comments</comments>
		<pubDate>Thu, 10 May 2012 19:28:11 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Statistics]]></category>
		<category><![CDATA[Political Science]]></category>
		<category><![CDATA[Teaching]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15339</guid>
		<description><![CDATA[We had 28 class periods, so I wrote an exam with an approximate correspondence of one question per class. Rather than dumping the exam in your lap all at once, I&#8217;ll post the questions once per day. Then each day I&#8217;ll post the answer to yesterday&#8217;s questions. So it will be 29 days in all. [...]]]></description>
			<content:encoded><![CDATA[<p>We had 28 class periods, so I wrote an exam with an approximate correspondence of one question per class.  Rather than dumping the exam in your lap all at once, I&#8217;ll post the questions once per day.  Then each day I&#8217;ll post the answer to yesterday&#8217;s questions.  So it will be 29 days in all.  I&#8217;ll post them to appear late in the day so as not to interfere with our main daily posts (which are currently backed up to early June).</p>
<p>The course was offered in the political science department and covered a mix of statistical and political topics.  Followers of our <a href="http://andrewgelman.com/2012/05/the-hare-the-pineapple-and-ed-wegman/">recent discussion</a> on test questions won&#8217;t be surprised to learn that some of the questions are ambiguous.  This wasn&#8217;t on purpose.  I tried my best, but good questions are hard to write.</p>
<p>Question 1 will appear tomorrow.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/my-final-exam-for-design-and-analysis-of-sample-surveys/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Now that’s what I call a lag!</title>
		<link>http://andrewgelman.com/2012/05/now-thats-what-i-call-a-lag/</link>
		<comments>http://andrewgelman.com/2012/05/now-thats-what-i-call-a-lag/#comments</comments>
		<pubDate>Thu, 10 May 2012 13:44:11 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Economics]]></category>
		<category><![CDATA[Sociology]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=14835</guid>
		<description><![CDATA[I received the following email the other day: Dear Dr. Gelman, I am emailing to let you know that your accepted article for Economic Inquiry will be published in print in the forthcoming April 2012 Issue. You will be receiving hard copies of the journal from Wiley-Blackwell for distribution to yourself and the Co Authors. [...]]]></description>
			<content:encoded><![CDATA[<p>I received the following email the other day:</p>
<blockquote><p>Dear Dr. Gelman,</p>
<p>I am emailing to let you know that your accepted article for Economic Inquiry will be published in print in the forthcoming April 2012 Issue.  You will be receiving hard copies of the journal from Wiley-Blackwell for distribution to yourself and the Co Authors.</p></blockquote>
<p>Hmmm . . . Economic Inquiry . . . didn&#8217;t I publish something there once?  A quick <a href="http://www.stat.columbia.edu/~gelman/research/published/">check</a> turned up <a href="http://www.stat.columbia.edu/~gelman/research/published/probdecisive2.pdf">this paper</a> from 2010.</p>
<p>I wonder what this new paper is.  Did someone submit something with my name on it?  I remember my surprise when, many years ago, I received a postcard asking for a reprint of my article in the Journal of Cerebral Blood Flow and Metabolism.  I was sure they were looking for the wrong Andrew Gelman, but, no, it turned out that my coauthors had submitted <a href="http://www.stat.columbia.edu/~gelman/research/published/jcbfm199133a.pdf">that article</a> all on their own.</p>
<p>In this case, though, there was no new article.  Economic Inquiry was indeed talking about my 2010 paper, which appeared online two years ago but is coming out in print only this month.</p>
<p>P.S.  To add insult to injury:  I wrote this post in March but it&#8217;s not appearing until May because I have a long lead time for non-topical entries on this blog.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/now-thats-what-i-call-a-lag/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Varying treatment effects, again</title>
		<link>http://andrewgelman.com/2012/05/varying-treatment-effects-again/</link>
		<comments>http://andrewgelman.com/2012/05/varying-treatment-effects-again/#comments</comments>
		<pubDate>Thu, 10 May 2012 03:02:02 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Causal Inference]]></category>
		<category><![CDATA[Political Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15090</guid>
		<description><![CDATA[This time from Bernard Fraga and Eitan Hersh. Once you think about it, it&#8217;s hard to imagine any nonzero treatment effects that don&#8217;t vary. I&#8217;m glad to see this area of research becoming more prominent. (Here&#8216;s a discussion of another political science example, also of voter turnout, from a few years ago, from Avi Feller [...]]]></description>
			<content:encoded><![CDATA[<p>This time <a href="http://scholar.harvard.edu/bfraga/files/fragahersh_2011.pdf">from</a> Bernard Fraga and Eitan Hersh.  Once you think about it, it&#8217;s hard to imagine any nonzero treatment effects that don&#8217;t vary.  I&#8217;m glad to see this area of research becoming more prominent.  (<a href="http://andrewgelman.com/2009/07/estimating_trea/">Here</a>&#8216;s a discussion of another political science example, also of voter turnout, from a few years ago, from Avi Feller and Chris Holmes.)</p>
<p>Some of my fragmentary work on varying treatment effects is <a href="http://www.stat.columbia.edu/~gelman/research/published/gelman.pdf">here</a> (Treatment Effects in Before-After Data) and <a href="http://www.stat.columbia.edu/~gelman/research/published/JASAAP03026R1.pdf">here</a> (Estimating Incumbency Advantage and Its Variation, as an Example of a Before–After Study).</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/varying-treatment-effects-again/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>The first version of my “inference from iterative simulation using parallel sequences” paper!</title>
		<link>http://andrewgelman.com/2012/05/the-first-version-of-my-inference-from-iterative-simulation-using-parallel-sequences-paper/</link>
		<comments>http://andrewgelman.com/2012/05/the-first-version-of-my-inference-from-iterative-simulation-using-parallel-sequences-paper/#comments</comments>
		<pubDate>Wed, 09 May 2012 13:29:57 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Bayesian Statistics]]></category>
		<category><![CDATA[Statistical computing]]></category>
		<category><![CDATA[R]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15291</guid>
		<description><![CDATA[From August 1990. It was in the form of a note sent to all the people in the statistics group of Bell Labs, where I&#8217;d worked that summer. To all: Here&#8217;s the abstract of the work I&#8217;ve done this summer. It&#8217;s stored in the file, /fs5/gelman/abstract.bell, and copies of the Figures 1-3 are on Trevor&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p>From August 1990.  It was in the form of a note sent to all the people in the statistics group of Bell Labs, where I&#8217;d worked that summer.</p>
<blockquote><p>To all:</p>
<p>Here&#8217;s the abstract of the work I&#8217;ve done this summer.  It&#8217;s stored in the file,<br />
/fs5/gelman/abstract.bell, and copies of the Figures 1-3 are on Trevor&#8217;s desk.<br />
Any comments are of course appreciated; I&#8217;m at gelman@stat.berkeley.edu.</p>
<p>On the Routine Use of Markov Chains for Simulation</p>
<p>Andrew Gelman and Donald Rubin, 6 August 1990</p>
<p>corrected version:  8 August 1990</p></blockquote>
<p><span id="more-15291"></span></p>
<blockquote><p>1.  Simulation</p>
<p>In probability and statistics we can often specify multivariate distributions<br />
many of whose properties we do not fully understand&#8211;perhaps, as in the<br />
Ising model of statistical physics, we can write the joint density function, up<br />
to a multiplicative constant that cannot be expressed in closed form.<br />
For an example in statistics, consider the Normal random<br />
effects model in the analysis of variance, which can be<br />
easily placed in a Bayesian framework with a conjugate prior distribution.<br />
All the conditional densities of the resulting posterior distribution<br />
are simple, but marginal densities can only be written in integral form and<br />
can only be calculated approximately.  (For details, see Kinderman and Snell<br />
(1980) or Pickard (1987) for the Ising model, and Lindley and Smith (1972)<br />
for the Bayesian random effects model.)</p>
<p>In such cases, we may not even be able to compute marginal moments of the<br />
difficult distribution, let alone more complicated and interesting summaries<br />
that would help us understand a probability model or posterior inference.<br />
When direct methods such as analytic or numerical integration of &#8220;nuisance&#8221;<br />
parameters are not computationally feasible, we might try Monte Carlo simulation;<br />
in the simplest form, we draw a finite set of independent random samples from our<br />
distribution, and then calculate desired distributional summaries as functions of<br />
the sampled points.  The Monte Carlo method is quite general and powerful; it is<br />
easy to calculate arbitrary quantities of interest<br />
such as the expected long-distance correlation in the Ising model or a posterior<br />
95% confidence region for the largest block effect in a random effects model.<br />
Any aspect of the distribution can be approximated to any desired accuracy if<br />
the number of independently sampled points is large enough.<br />
Simulation also has the advantage of flexiblility:  once a sample is drawn, it<br />
can be used to learn about any number of different distributional summaries.</p>
<p>2.  Markov chain methods</p>
<p>Drawing independent random samples is a wonderful tool that is unfortunately not<br />
available for every distribution; in particular, the Ising model and random<br />
effects posterior distributions mentioned above do not permit direct<br />
simulation.  Fortunately, a form of indirect simulation method exists for<br />
any multivariate distribution if we can calculate its joint density<br />
(up to a multiplicative constant) or if we can sample from all its univariate<br />
conditional densities.  The first of these methods was introduced by<br />
Metropolis et al. (1953) in the Journal of Chemical Physics.  Our work focuses<br />
on a similar and slightly simpler method called the Gibbs sampler by Geman and<br />
Geman (1984) in an article for the IEEE.</p>
<p>Let F(x) be our distribution; the Metropolis algorithm takes a starting (vector)<br />
point x0 and constructs a series x1, x2, . . ., that is a sample from an<br />
ergodic Markov chain whose stationary distribution is F(x).  Computer<br />
simulation of the series requires calculation of the density f(x) (up to a<br />
constant).  These samples xj are not independent; however, the stationary<br />
distribution of the Markov chain is<br />
correct, so if we take a long enough series, the set of values {x1, . . ., xn}<br />
takes the place of the distribution just as an<br />
independent random sample does (although of course an independent sample<br />
carries more information than a Markov chain sample of the<br />
same length).</p>
<p>The Gibbs sampler is a similar algorithm, which produces a Markov chain that<br />
converges to the desired distribution, this time requiring draws from all the<br />
univariate conditional densities at each iteration.</p>
<p>3.  Have we converged yet?</p>
<p>Markov chain simulation methods are attractive for many problems because they<br />
enable us to flexibly summarize intractable multivariate distributions by making<br />
full use of the mathematical structure we do know, using a tool we think we<br />
understand&#8211;Monte Carlo simulation.  Unfortunately, using a sample of a Markov<br />
chain to estimate a distribution raises an immediate question:  how long a series<br />
is needed?  After one or two steps, we are almost certainly still too close to<br />
the starting point to hope for unbiased summaries.  Asymptotically, the chain is<br />
stationary, and all is OK (with some loss of efficiency compared to independent<br />
samples, as mentioned above).</p>
<p>To obtain a feeling for the practical difficulties, we ran the Gibbs sampler for<br />
2000 steps to simulate a case of the Ising model.  To give the minimum of details:<br />
x is a vector of binary variables defined on a 100 by 100 lattice; each step of<br />
the Gibbs sampler took on the order of 10,000 computations; and we summarize<br />
each iterate xj by the sample correlation r on the lattice&#8211;a function r(x) that<br />
lies between -1 and 1.  Theoretical calculations (Pickard, 1987) show that<br />
under our model&#8211;the Ising model with beta = 0.5&#8211;the marginal distribution<br />
of r is approximately Gaussian with mean around 0.85 or 0.9 and standard<br />
deviation around 0.01.  We&#8217;d like to know whether the set {r(x1), . . .,<br />
r(x2000)} from the simulated Markov chain can serve as a substitute for the<br />
marginal distribution of r.</p>
<p>Figure 1 shows the values of r(xj), for j=1 to 2000.  (r(x0) = 0, and the first<br />
few values are cut off to improve resolution on the graph.)  The Markov chain<br />
seems to have &#8220;converged to stationarity&#8221; after the thousand or so steps required<br />
to shake off its initial state.  How do we know it has converged, though?  Figure<br />
2 zooms in of the first 500 steps of the series, whose apparent convergence we<br />
know to be illusory.  For comparison we ran the Gibbs<br />
sampler again for 2000 steps, but this time starting at a point x0 for which<br />
r(x0) = 1; Figure 3 displays the series r(xj), which again seems to have<br />
converged nicely.  To destroy all illusions about convergence, hold<br />
Figures 1 and 3 up to the light.  The two Markov chains have &#8220;converged&#8221; to<br />
different distributions!  We are, of course, still observing transient<br />
behavior.  </p>
<p>Interestingly, the means of the series in Figures 1 and 3 differ, but<br />
the variances are roughly equal.  We&#8217;re not sure why, but it seems to be a<br />
general feature in these Markov chain simulations that the variance converges<br />
before the mean.</p>
<p>All simulations and plots were done using the New S Language:<br />
A Programming Environment for Data Analysis and Graphics.</p>
<p>4.  The answer:  parallel Markov chains</p>
<p>To restate the general problem:  we wish to summarize an intractable<br />
distribution F(x) by running the Gibbs sampler (or a similar method such as the<br />
Metropolis algorithm) until the distribution of the set of Markov chain<br />
iterates is close to F.  As shown in the previous section, convergence seems<br />
impossible to monitor from a single finite realization of the Markov chain;<br />
consequently, we follow the implicit suggestion of Figures 1 and 3 and track<br />
several parallel sample paths.</p>
<p>Consider m independent runs of the Gibbs sampler, each of length n, starting<br />
from m different initial states x10, . . ., xm0:</p>
<p>series 1:  x11, . . ., x1n<br />
. . .<br />
series m:  xm1, . . ., xmn.</p>
<p>Again, we focus attention on a univariate summary, say r(x); we want to<br />
use the observed simulations rij to determine whether the series of r&#8217;s are<br />
close to convergence after n steps.<br />
To understand our method, consider the set of series as m blocks, each with<br />
n observations, in the one-way analysis of variance layout (that is, ignore the<br />
time ordering in the series).  We will work with the total sum of squares<br />
(with (mn-1) degrees of freedom) and the &#8220;within&#8221; sum of squares (with<br />
(m-1)(n-1) degrees of freedom).</p>
<p>First assume for simplicity that the starting points of the simulated series<br />
are themselves independent random samples from F(x).  (Of course, if this<br />
condition could be obtained in practice,<br />
a Markov chain simulation method would not be<br />
needed.)  With independent starting points, all values of any series<br />
are independent of all the values of any other series, and the unconditional<br />
variance of any point rij is just the marginal variance var r under the<br />
distribution F.  We can then estimate var r, given the &#8220;data matrix&#8221; (rij),<br />
by [total SS - (within SS)/m] / ((m-1)n).  (Algebraic derivations appear<br />
in the longer version of this article.)  Given the assumption of initial<br />
independence, this &#8220;between&#8221; estimate of variance<br />
(not the same as the usual &#8220;between&#8221; estimate in ANOVA) is unbiased for finite<br />
series of any length.</p>
<p>In contrast, the estimated variance within the series, (within SS) / ((m-1)(n-1)),<br />
has expectation var r only in the limit as n -> infinity.<br />
For finite series, the expected within mean square increases with n, assuming,<br />
as is likely, that the random variables r(x1), r(x2), . . ., from the Markov<br />
chain are positively correlated.  The discrepancy between the two estimates<br />
of var r suggests a test:  declare the Markov chain to have converged when<br />
the within mean square is close to the variance estimate between series, with<br />
confidence intervals derived from classical ANOVA theory.  Because of the<br />
dependence within blocks, the degrees of freedom of the between and within<br />
estimates are less than (m-1)n and m(n-1), respectively.  We can<br />
approximately correct for this information loss (once again, details will be<br />
provided in the longer article).</p>
<p>Once we are close enough to convergence to be satisfied, the variance estimates<br />
and degrees of freedom corrections alluded to above allow us to estimate the<br />
marginal summaries E r, var r, and Normal-theory confidence intervals for our<br />
Monte Carlo approximations.  We can run the series longer if more precision<br />
is desired, and can repeat the process to study the marginal distributions of<br />
other parameters (without, of course, having to simulate any new series of x&#8217;s).</p>
<p>In practice, the starting points of the parallel series can never be sampled<br />
independently with distribution F(x); the simulated series are thus no longer<br />
stationary for any finite n, formally invalidating the above analysis.  We<br />
currently have two strategies designed to make the independence assumption<br />
approximately true.  First, we try to pick starting values that are far apart and,<br />
if anything, more dispersed than independent random samples. The m parallel series<br />
should then start far apart and grow closer as they approach stationarity, as in<br />
Figures 1 and 3; since the variance between series declines with n, the<br />
comparison-of-variances test should be conservative.  Second, we reduce the<br />
effect of the starting values by crudely throwing away the the first half<br />
of each simulated series until approximate convergence has been reached.<br />
Once again, Figures 1 and 3 illustrate how a few early steps<br />
of the Markov chain can greatly distort estimates of means and variances<br />
within series.  We hope that the conservative strategies of starting with<br />
dispersed points and throwing away early simulations will yield confidence<br />
regions that are wider than those obtained by the ideal method, but that<br />
still have good coverage properties.</p>
<p>The idea of comparing parallel simulations is not new; for<br />
example, Fosdick (1959) applied the Metropolis algorithm to the Ising model<br />
by simulating four series independently, from each of two different starting<br />
points.  Approximate convergence was declared when the two groups of series<br />
became indistinguishable on the scale of a prechosen error bound.</p>
<p>5.  Some references</p>
<p>     Ehrman, J. R., Fosdick, L. D., and Handscomb, D. C. (1960).<br />
Computation of order parameters in an Ising lattice by the Monte Carlo method.<br />
{\em Journal of Mathematical Physics} {\bf 1} 547&#8211;558.</p>
<p>     Fosdick, L. D. (1959).  Calculation of order parameters in a binary<br />
alloy by the Monte Carlo method.  {\em Physical Review} {\bf 116}, 565&#8211;573.</p>
<p>     Geman, S., and Geman, D. (1984).  Stochastic relaxation, Gibbs<br />
distributions, and the Bayesian restoration of images.  {\em IEEE Transactions<br />
on Pattern Analysis and Machine Intelligence} {\bf 6}, 721&#8211;741.</p>
<p>     Hammersley, J. M., and Handscomb, D. C. (1964), chapter 9.  {\em Monte Carlo<br />
Methods}.  London:  Chapman and Hall.</p>
<p>     Kinderman, R., and Snell, J. L. (1980).  {\em Markov Random Fields and<br />
their Applications}.  Providence, R.I.:  American Mathematical Society.</p>
<p>     Lindley, D. V., and Smith, A. F. M. (1972).  Bayes estimates for the linear<br />
model.  {\em Journal of the Royal Statistical Society B} {\bf 34}, 1&#8211;41.</p>
<p>     Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H., and<br />
Teller, E. (1953).  Equation of state calculations by fast computing machines.<br />
{\em Journal of Chemical Physics} {\bf 21}, 1087&#8211;1092.</p>
<p>     Pickard, D. K. (1987).  Inference for discrete Markov fields:  the<br />
simplest nontrivial case.  {\em Journal of the American Statistical Association}<br />
{\bf 82} 90&#8211;96.</p>
<p>     Ripley, B. D. (1981).  {\em Spatial Statistics}, p. 16ff.  New York:  Wiley.</p>
<p>     Tanner, M. A., and Wong, W. H. (1987).  The calculation of posterior<br />
distributions by data augmentation.  {\em Journal of the American Statistical<br />
Association} {\bf 82}, 528&#8211;550.</p></blockquote>
<p>I wrote the article but properly listed Rubin as coauthor, as the idea came about after many long phone conversations.  I encountered the idea of between-within comparison in the 1959 paper by Fosdick (see above citations); I can&#8217;t remember how I found that paper but it must have been from a literature search, going backward from more recent sources.  Anyway, when I brought up this idea, Rubin picked up on it right away, as it was close to methods he had developed for inference from multiple imputations.  Once we had that connection, the idea was there.  And I&#8217;d credit Rubin&#8217;s influence for my goal of estimating a potential scale reduction factor&#8212;that is, a numerical measure of lack of mixing&#8212;rather than formulating the problem as a hypothesis test.</p>
<p>The <a href="http://www.stat.columbia.edu/~gelman/research/published/itsim.pdf">published article</a> appeared over two years later in the journal Statistical Science, in a much expanded version.</p>
<p>In some ways, I prefer this short paper to the full version.  I like the snappy style, and I like the clarity about what we believe and what we don&#8217;t know.  I regret not submitting some version of the above article to a journal immediately, right then in Aug 1990.  On the other hand, editors and reviewers for statistics journals can be very stuffy, and an article such as above with a concept but no theoretical derivations probably would&#8217;ve been shot down over and over and over.  Maybe it just took two years to put in enough blah blah blah to make it publishable.</p>
<p>The above is more like a blog post than a journal article.  It contains the key idea with no messing around.</p>
<p>P.S.  You&#8217;ll notice above that I wrote, &#8220;Any comments are of course appreciated.&#8221;  And you probably won&#8217;t be surprised to hear that I got no comments.  It took me a long time to realize that most people don&#8217;t want to comment on things.  When we were getting close to finishing the first edition of Bayesian Data Analysis back in 1994, I printed out copies and gave them to lots of prominent statisticians I knew, but very few gave any comments at all.  It&#8217;s not about me; people just don&#8217;t like to read and make comments.  We get some comments on the blog, but when you consider the number of comments and the number of readers, you&#8217;ll realize that most people don&#8217;t comment here either.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/the-first-version-of-my-inference-from-iterative-simulation-using-parallel-sequences-paper/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>chartsnthings !</title>
		<link>http://andrewgelman.com/2012/05/chartsnthings/</link>
		<comments>http://andrewgelman.com/2012/05/chartsnthings/#comments</comments>
		<pubDate>Tue, 08 May 2012 13:42:01 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Statistical graphics]]></category>
		<category><![CDATA[R]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15310</guid>
		<description><![CDATA[Yair pointed me to this awesome blog of how the NYT people make their graphs. This blows away all other stat graphics blogs (including this one). Lots of examples from mockup to first tries to final version. I recognize a lot of what they&#8217;re doing from my own experience. Also from my experience it&#8217;s hard [...]]]></description>
			<content:encoded><![CDATA[<p>Yair pointed me to <a href="http://chartsnthings.tumblr.com/">this awesome blog</a> of how the NYT people make their graphs.  This blows away all other stat graphics blogs (including this one).  Lots of examples from mockup to first tries to final version.  I recognize a lot of what they&#8217;re doing from my own experience.  Also from my experience it&#8217;s hard to get all these details down:  once you have the final graph, it&#8217;s easy to forget how you go there.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/chartsnthings/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Happy news on happiness; what can we believe?</title>
		<link>http://andrewgelman.com/2012/05/happy-news-on-happiness-what-can-we-believe/</link>
		<comments>http://andrewgelman.com/2012/05/happy-news-on-happiness-what-can-we-believe/#comments</comments>
		<pubDate>Mon, 07 May 2012 15:50:10 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Decision Theory]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15303</guid>
		<description><![CDATA[Sharon Jayson writes: The conventional wisdom that&#8217;s developed over the past few decades — based on early research — has said parents are less happy, more depressed and have less-satisfying marriages than their childless counterparts. But now, two new studies presented as part of the Population Association of America&#8217;s annual meeting suggest that earlier findings [...]]]></description>
			<content:encoded><![CDATA[<p>Sharon Jayson <a href="http://www.usatoday.com/news/health/wellness/story/2012-05-03/parents-happiness-population-survey/54767508/1?">writes</a>:</p>
<blockquote><p>The conventional wisdom that&#8217;s developed over the past few decades — based on early research — has said parents are less happy, more depressed and have less-satisfying marriages than their childless counterparts.</p>
<p>But now, two new studies presented as part of the Population Association of America&#8217;s annual meeting suggest that earlier findings in several studies weren&#8217;t so clear-cut and may, in fact, be flawed. The newer analyses presented this week use analytical methods based on data from almost 130,000 adults around the globe — including more than 52,000 parents — and the conclusions aren&#8217;t so grim. They say that parents today may indeed be happier than non-parents and that parental happiness levels — while they do drop — don&#8217;t dip below the levels they were before having children. . . .</p>
<p>The other study, of some 120,000 adults from two nationally representative surveys between 1972-2008, finds that parents were indeed less happy than non-parents in the decade 1985-95, but parents from 1995 to 2008 were happier. . . .</p></blockquote>
<p>This is consistent with <a href="http://andrewgelman.com/2010/12/age_and_happine/">my observation</a> that happiness research is a mess.  Don&#8217;t get me wrong:  I think happiness is very much worth studying.  But various seemingly well-known results don&#8217;t seem so clear when they are studied more carefully.  This is not the first time that Happiness Scholar #2 comes to an opposite conclusion as Happiness Scholar #1.</p>
<p>The larger point, perhaps, is that it that &#8220;stylized facts&#8221; (the social-science term for generally-accepted findings) can mislead.  Sometimes the interaction is bigger than the main effect.</p>
<p>Here&#8217;s a <a href="http://www.demogr.mpg.de/en/projects_publications/publications_1904/journal_articles/a_global_perspective_on_happiness_and_fertility_4061.htm">recent</a> paper by Mikko Myrskylä, one of the researchers mentioned in the above article:</p>
<blockquote><p>The literature on fertility and happiness has neglected comparative analysis. We investigate the fertility/happiness association using data from the World Values Surveys for 86 countries. We find that globally, happiness decreases with the number of children. This association, however, is strongly modified by individual and contextual factors. Most importantly, we find that the association between happiness and fertility evolves from negative to neutral to positive above age 40, and is strongest among those who are likely to benefit most from upward intergenerational transfers. In addition, analyses by welfare regime show that the negative fertility/ happiness association for younger adults is weakest in countries with high public support for families, and the positive association above age 40 is strongest in countries where old-age support depends mostly on the family.</p></blockquote>
<p>The news article ends with the following comment:</p>
<blockquote><p>&#8220;The first child increases happiness quite a lot. The second child a little. The third not at all,&#8221; says Myrskylä.</p></blockquote>
<p>As a statistician, I hate hate hate hate hate when people ignore variability and present results deterministically.  The above statement might be an accurate summary of average patterns but is certainly not true in every case!</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/happy-news-on-happiness-what-can-we-believe/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>The hare, the pineapple, and Ed Wegman</title>
		<link>http://andrewgelman.com/2012/05/the-hare-the-pineapple-and-ed-wegman/</link>
		<comments>http://andrewgelman.com/2012/05/the-hare-the-pineapple-and-ed-wegman/#comments</comments>
		<pubDate>Mon, 07 May 2012 15:27:32 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15280</guid>
		<description><![CDATA[Commenters here are occasionally bothered that I spend so much time attacking frauds and plagiarists. See, for example, here and here. Why go on and on about these losers, given that there are more important problems in the world such as war, pestilence, hunger, and graphs where the y-axis doesn&#8217;t go all the way down [...]]]></description>
			<content:encoded><![CDATA[<p>Commenters here are occasionally bothered that I spend so much time attacking frauds and plagiarists.  See, for example, <a href="http://andrewgelman.com/2012/04/another-day-another-plagiarist/#comment-79050">here</a> and <a href="http://andrewgelman.com/2012/04/another-day-another-plagiarist/#comment-79155">here</a>.  Why go on and on about these losers, given that there are more important problems in the world such as war, pestilence, hunger, and graphs where the y-axis doesn&#8217;t go all the way down to zero?</p>
<p>Part of the story is that I do research for a living so I resent people who devalue research through misattribution or fraud, in the same way that rich people don&#8217;t like counterfeiters.</p>
<p>What really bugs me, though, is when cheaters get caught and still don&#8217;t admit it.  People like Hauser, Wegman, Fischer, and Weick get under my skin because they have the chutzpah to just deny deny deny.  The grainy time-stamped videotape with their hand in the cookie jar is right there, and they&#8217;ll still talk around the problem.  Makes me want to scream.</p>
<p>This happens <em>all the time</em>.  All. Over. The. Place.</p>
<p>Everybody makes mistakes, and just about everybody does things they shouldn&#8217;t, every now and then.  But to not apologize when you&#8217;re caught, that to me just seems evil, showing a disrespect not just for the people involved but for the very concept of truth.  (As you can see, I wouldn&#8217;t make a good criminal defense lawyer.)</p>
<p>Anyway, here&#8217;s the latest story.  There&#8217;s always an outrage-of-the-week, and I don&#8217;t mean to make a big deal about this particular scandal.  It&#8217;s just another example of what I consider the disgraceful pattern of people refusing to admit error.</p>
<p><strong>Part 1:  The mistake</strong><br />
<span id="more-15280"></span><br />
The following question was on a New York State <a href="http://www.nydailynews.com/new-york/talking-pineapple-question-state-exam-stumps-article-1.1064657">eighth grade reading exam</a>:</p>
<blockquote><p>The Hare and the Pineapple</p>
<p>by Daniel Pinkwater</p>
<p>In olden times, the animals of the forest could speak English just like you and me. One day, a pineapple challenged a hare to a race.</p>
<p>(I forgot to mention, fruits and vegetables were able to speak too.)</p>
<p>A hare is like a rabbit, only skinnier and faster. This particular hare was known to be the fastest animal in the forest.</p>
<p>“You, a pineapple have the nerve to challenge me, a hare, to a race,” the hare asked the pineapple. “This must be some sort of joke.”</p>
<p>“No,” said the pineapple. “I want to race you. Twenty-six miles, and may the best animal win.&#8221;</p>
<p>&#8220;You aren&#8217;t even an animal!&#8221; the hare said. “You&#8217;re a tropical fruit!&#8221;</p>
<p>“Well, you know what I mean,” the pineapple said.</p>
<p>The animals of the forest thought it was very strange that tropical fruit should want to race a very fast animal.</p>
<p>&#8220;The pineapple has some trick up its sleeve,&#8221; a moose said.</p>
<p>Pineapples don&#8217;t have sleeves, an owl said</p>
<p>&#8220;Well, you know what I mean,” the moose said. &#8220;If a pineapple challenges a hare to a race, it must be that the pineapple knows some secret trick that will allow it to win.”</p>
<p>“The pineapple probably expects us to root for the hare and then look like fools when it loses,” said a crow. “Then the pineapple will win the race because the hare is overconfident and takes a nap, or gets lost, or something.”</p>
<p>The animals agreed that this made sense. There was no reason a pineapple should challenge a hare unless it had a clever plan of some sort. So the animals, wanting to back a winner, all cheered for the pineapple.</p>
<p>When the race began, the hare sprinted forward and was out of sight in less than a minute. The pineapple just sat there, never moving an inch.</p>
<p>The animals crowded around watching to see how the pineapple was going to cleverly beat the hare. Two hours later when the hare cross the finish line, the pineapple was still sitting still and hadn&#8217;t moved an inch.</p>
<p>The animals ate the pineapple.</p>
<p>MORAL: Pineapples don&#8217;t have sleeves</p></blockquote>
<p>Several questions follow, including:</p>
<blockquote><p>The animals ate the pineapple most likely because they were</p>
<p>A Hungry<br />
B Excited<br />
C Annoyed<br />
D Amused</p>
<p>Which animal spoke the wisest words?</p>
<p>A The hare<br />
B The moose<br />
C The crow<br />
D The owl</p></blockquote>
<p>Nope, I don&#8217;t know how to answer these either.</p>
<p><strong>Part 2:  Exposure</strong></p>
<p>The story hits the Daily News and other media outlets.  To its credit, the state education commission takes the question <a href="http://articles.nydailynews.com/2012-04-20/news/31375924_1_pineapple-state-tests-new-exams">off the test</a>.  More on the story <a href="http://ideas.time.com/2012/05/04/what-everyone-missed-on-the-pineapple-question/">here</a>.</p>
<p><strong>Part 3:  No admission of error</strong></p>
<p>From Time magazine, a <a href="http://ideas.time.com/2012/05/04/pineapplegate-exclusive-memo-detailing-the-hare-and-the-pineapple-passage/">memo</a> from Jon S. Twing of Pearson, the company that made the test with the silly questions:</p>
<blockquote><p>Pearson is confident that the NYS Grades 3-8 English Language Arts (ELA) and Mathematics assessments have been developed to support valid and reliable interpretations of scores for their intended uses. The “Hare and the Pineapple” passage and associated items were placed on the Grade 8 ELA test after the NYSfield test data associated with the multiple choice items and the feedback from the “final eyes” committee determined that this was an appropriate passage and set of items to include on the test. . . .</p>
<p>“The Hare and the Pineapple” passage is intended to measure NYS Standard “interpretation of character traits, motivations, and behavior” and “eliciting supporting detail”. . . .</p></blockquote>
<p>Blah blah blah . . . But now the good part.  Jon S. Twing writes:</p>
<blockquote><p>There have been two items of the set of six that have been challenged by NY teachers and students as the test was under way April 17-19, 2012 -Item 7 and Item 8. The correct answers and rationales to Item 7 and Item 8 are as follows:</p>
<p>• Item 7: The correct answer is C. The question regarding the animals’ possible motivation for eating the pineapple requires a reader to infer the correct answer from clues conveyed in the text. While all of the options are plausible motivations, the most likely answer is that the animals were annoyed. Paragraph 13 indicates that the animals support the pineapple to win the race because they assume the pineapple has a clever plan. However, the pineapple never moves during the race. From these clues and events, a reader can infer that the animals are annoyed. The text does not support the inference that the animals are motivated by hunger, excitement, or amusement.</p>
<p>• Item 8: The correct answer is D. The question regarding the wisest animal requires the reader to apply close analytic reading skills to determine which of the choices represents the wisest animal based on clues given in the text. The moose and the crow are the two animals that present the incorrect idea that the pineapple has a clever plan to win the race. This idea is proven false when the hare wins the race. The hare is presented as incredulous that a pineapple would challenge him to a race, but overconfidently agrees to race a pineapple.</p></blockquote>
<p>Huh?  The hare &#8220;<em>overconfidently</em> agrees to race a pineapple&#8221;???  What&#8217;s overconfident about that?  A pineapple can&#8217;t run at all!</p>
<p>Twing continues:</p>
<blockquote><p>Finally, the owl declares that “Pineapples don’t have sleeves,” which is a factually accurate statement. This statement is also presented as the moral of the story, allowing a careful reader to infer that the owl is the wisest animal.</p></blockquote>
<p>Sorry, Jon S. Twing, but that doesn&#8217;t make sense at all.  The moose makes it perfectly clear that &#8220;a trick up the sleeve&#8221; is just a figure of speech.  The owl is <em>wise</em> for taking a figure of speech literally?  In that case, why not kill the joke entirely with &#8220;hares can&#8217;t talk&#8221;?  Grrrrrr.</p>
<p>You gotta remember that the people taking this exam are 8th graders who are trying to <em>not</em> get tricked themselves.</p>
<p>And, by the way, I have a feeling &#8220;the moral of the story&#8221; is a joke, in the style of James Thurber&#8217;s Fables for Our Time.</p>
<p>Twing wraps things up with a bunch of words and numbers suggesting that these questions have good psychometric properties, i.e. that students who got other questions right tended to get these right also.  That&#8217;s interesting:  even an impossible-to-answer question has some patterns which some students are able to match.  But that doesn&#8217;t make these good exam questions.</p>
<p>The psychometrics are relevant but I just think it&#8217;s embarrassing that this poor guy is in the position of claiming, first, that the animals are annoyed, when there is <em>no evidence at all</em> of that in the quoted passage, and, second, all that business of pineapples and sleeves.</p>
<p>Why not just say:  Hey, it&#8217;s not easy to write a test!  We made a mistake!  Sorry!</p>
<p>But nooooooo, he can&#8217;t do that, he&#8217;s gotta defend defend defend defend defend.  OK, I&#8217;m not calling him a Wegman&#8212;there&#8217;s no reason to suspect that any botched wikipedia copying went down&#8212;but that makes the whole thing even worse, in a way.  These people didn&#8217;t even do anything intentionally wrong.  They just made a mistake.  But they still won&#8217;t admit it.  &#8220;The correct answer is C,&#8221; indeed.</p>
<p>Here I am, railing against universal human nature again.  That&#8217;s what blogging is for, I suppose.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/the-hare-the-pineapple-and-ed-wegman/feed/</wfw:commentRss>
		<slash:comments>88</slash:comments>
		</item>
		<item>
		<title>Lists of Note and Letters of Note</title>
		<link>http://andrewgelman.com/2012/05/lists-of-note-and-letters-of-note/</link>
		<comments>http://andrewgelman.com/2012/05/lists-of-note-and-letters-of-note/#comments</comments>
		<pubDate>Mon, 07 May 2012 13:11:00 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Literature]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=14719</guid>
		<description><![CDATA[These (from Shaun Usher) are surprisingly good, especially since he appears to come up with new lists and letters pretty regularly. I suppose a lot of them get sent in from readers, but still. Here&#8217;s my favorite recent item, a letter sent to the Seattle Bureau of Prohibition in 1931: Dear Sir: My husband is [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.listsofnote.com/">These</a> (from Shaun Usher) are surprisingly good, especially since he appears to come up with new lists and letters pretty regularly.  I suppose a lot of them get sent in from readers, but still.</p>
<p>Here&#8217;s my favorite recent item, a <a href="http://www.lettersofnote.com/2012/02/will-you-please-have-his-place-raided.html">letter</a> sent to the Seattle Bureau of Prohibition in 1931:</p>
<blockquote><p>Dear Sir:</p>
<p>My husband is in the habit of buying a quart of wiskey every other day from a Chinese bootlegger named Chin Waugh living at 317-16th near Alder street.</p>
<p>We need this money for household expenses. Will you please have his place raided? He keeps a supply planted in the garden and a smaller quantity under the back steps for quick delivery. If you make the raid at 9:30 any morning you will be sure to get the goods and Chin also as he leaves the house at 10 o&#8217;clock and may clean up before he goes.</p>
<p>Thanking you in advance, </p>
<p>I remain<br />
yours truly,</p>
<p>Mrs. Hillyer</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/lists-of-note-and-letters-of-note/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Picking on Stephen Wolfram</title>
		<link>http://andrewgelman.com/2012/05/picking-on-stephen-wolfram/</link>
		<comments>http://andrewgelman.com/2012/05/picking-on-stephen-wolfram/#comments</comments>
		<pubDate>Mon, 07 May 2012 01:17:33 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Zombies]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15288</guid>
		<description><![CDATA[Shalizi. But this one is still my favorite.]]></description>
			<content:encoded><![CDATA[<p><a href="http://bactra.org/weblog/915.html">Shalizi</a>.</p>
<p>But <a href="http://andrewgelman.com/2010/05/alert_incompete/">this one</a> is still my favorite.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/picking-on-stephen-wolfram/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Fun with google autocomplete</title>
		<link>http://andrewgelman.com/2012/05/fun-with-google-autocomplete/</link>
		<comments>http://andrewgelman.com/2012/05/fun-with-google-autocomplete/#comments</comments>
		<pubDate>Sun, 06 May 2012 23:52:16 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Political Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15063</guid>
		<description><![CDATA[Aleks points us to this idea of labeling for news.]]></description>
			<content:encoded><![CDATA[<p>Aleks points us to <a href="http://www.niemanlab.org/2012/02/eat-your-vegetables-ideas-from-news-nutritionists-to-reform-our-media-diets/">this idea</a> of labeling for news.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/fun-with-google-autocomplete/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>I’m skeptical about this skeptical article about left-handedness</title>
		<link>http://andrewgelman.com/2012/05/im-skeptical-about-this-skeptical-article-about-left-handedness/</link>
		<comments>http://andrewgelman.com/2012/05/im-skeptical-about-this-skeptical-article-about-left-handedness/#comments</comments>
		<pubDate>Sun, 06 May 2012 13:33:41 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Public Health]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15067</guid>
		<description><![CDATA[I was flipping through the paper and noticed an opinion piece by linguist and science writer Rik Smits, &#8220;Lefties aren&#8217;t special after all&#8221;: Few truly insignificant traits receive as much attention as left-handedness. In just the last couple of generations, an orientation once associated with menace has become associated with leadership, creativity, even athletic prowess. [...]]]></description>
			<content:encoded><![CDATA[<p>I was flipping through the paper and noticed an <a href="http://www.nytimes.com/2012/04/15/opinion/sunday/lefties-arent-special-after-all.html">opinion piece</a> by linguist and science writer Rik Smits, &#8220;Lefties aren&#8217;t special after all&#8221;:</p>
<blockquote><p>Few truly insignificant traits receive as much attention as left-handedness. In just the last couple of generations, an orientation once associated with menace has become associated with leadership, creativity, even athletic prowess. Presidents Gerald R. Ford, George Bush, Bill Clinton and Barack Obama were born left-handed (as was Ronald Reagan, though he learned to write with his right hand). Folklore has it that southpaws are unusually common in art and architecture schools. Left-handed athletes like Tim Tebow and Randy Johnson are celebrated.</p></blockquote>
<p>Sounds interesting so far.  Then we get several paragraphs of history of how people got things wrong (authoritarians of past generations who forced lefties to use their right hands, silly &#8220;blank slate&#8221; ideologues, etc.).</p>
<p>What about the science?  Smits writes:</p>
<blockquote><p>Left-handers have been redefined as creative, broad-minded, natural leaders. Meanwhile, other studies continue to identify all sorts of negative associations with left-handers — clumsiness, propensity to die prematurely, higher breast cancer rates and greater vulnerability to suicide.</p>
<p>After reviewing hundreds of such studies for a book on left-handers, I [Smits] found that the evidence of positive qualities associated with left-handedness was anecdotal at best, while the scores of studies associating left-handedness with all manner of afflictions were generally too unreliable to have any practical consequence.</p></blockquote>
<p>I&#8217;d have to see Smits&#8217;s book to judge (and, before you get on my case for commenting on a book that I haven&#8217;t read, please reflect that Smits <em>chose</em> to publish his article in the Times, and I think it&#8217;s expected that many people will write a newspaper article without reading the corresponding book.  Certainly, when I wrote op-eds about Red State Blue State, I wanted these to stand on their own for the benefit of the vast majority of newspaper readers who were not reading the book), but I&#8217;m skeptical of his skepticism.  The studies I&#8217;ve seen of handedness do have potential problems, so I wouldn&#8217;t object to labeling as &#8220;speculative&#8221; such claims such as &#8220;mathematicians are more likely to be left-handed&#8221; or &#8220;left-handers live shorter lives than right-handers.&#8221;  At the same time, such claims are not scientifically implausible, and they do seem supported to some extent by the data.</p>
<p>I have not looked at the research on handedness recently, so I&#8217;m not sure whether Smits&#8217;s skepticism reflects new information or whether it is just a statement that the claimed findings about left-handers have not been proved.  If the latter, I think it would be better to say that it&#8217;s not clear to what extent left-handers are different from the majority, rather than to say with such certainty that &#8220;lefties aren&#8217;t special at all.&#8221;</p>
<p>P.S.  I&#8217;m right-handed.  I don&#8217;t have any personal stake in all this; it&#8217;s a topic I got interested in awhile ago when Seth and I taught our class on left-handedness.  At the time we recognized the weakness of the studies on the topic, but we also recognized weaknesses in sweeping arguments that attempted to dismiss the findings by arguing for selection effects etc.  My impression at the time was that there was some evidence of interesting and important systematic differences between lefties and righties, but that it was also possible we were seeing nothing more than a bunch of statistical artifacts.  My conclusion was that skepticism was warranted but it would be going too far to be certain that nothing was going on.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/im-skeptical-about-this-skeptical-article-about-left-handedness/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Recently in the sister blog</title>
		<link>http://andrewgelman.com/2012/05/recently-in-the-sister-blog-2/</link>
		<comments>http://andrewgelman.com/2012/05/recently-in-the-sister-blog-2/#comments</comments>
		<pubDate>Sat, 05 May 2012 19:19:50 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Political Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15278</guid>
		<description><![CDATA[Culture war: The rules You can only accept capital punishment if you’re willing to have innocent people executed every now and then The politics of America’s increasing economic inequality]]></description>
			<content:encoded><![CDATA[<p><a href="http://themonkeycage.org/blog/2012/04/26/culture-war-the-rules/">Culture war: The rules</a></p>
<p><a href="http://themonkeycage.org/blog/2012/04/25/death-death-death-death-death/">You can only accept capital punishment if you’re willing to have innocent people executed every now and then</a></p>
<p><a href="http://themonkeycage.org/blog/2012/04/22/the-politics-of-americas-increasing-economic-inequality/">The politics of America’s increasing economic inequality</a></p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/recently-in-the-sister-blog-2/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Related to z-statistics</title>
		<link>http://andrewgelman.com/2012/05/related-to-z-statistics/</link>
		<comments>http://andrewgelman.com/2012/05/related-to-z-statistics/#comments</comments>
		<pubDate>Sat, 05 May 2012 13:27:07 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Zombies]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15065</guid>
		<description><![CDATA[Pawel Sobkowicz writes: How many zombies do you know?&#8217; Using indirect survey methods to measure alien attacks and outbreaks of the undead, Arxiv preprint arXiv:1003.6087, 2010 I hope you would find interesting the following paper, recently posted on arXiv: Aliens on Earth. Are reports of close encounters correct?, arXiv:1203.6805 This is soooooo much better than [...]]]></description>
			<content:encoded><![CDATA[<p>Pawel Sobkowicz writes:</p>
<blockquote><p>How many zombies do you know?&#8217; Using indirect survey methods to measure alien attacks and outbreaks of the undead, Arxiv preprint arXiv:1003.6087, 2010<br />
I hope you would find interesting the following paper, recently posted on arXiv:<br />
Aliens on Earth. Are reports of close encounters correct?, arXiv:1203.6805</p></blockquote>
<p>This is soooooo much better than getting links to bad graphs or to papers on sex ratios!</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/related-to-z-statistics/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Models, assumptions, and data summaries</title>
		<link>http://andrewgelman.com/2012/05/models-assumptions-and-data-summaries/</link>
		<comments>http://andrewgelman.com/2012/05/models-assumptions-and-data-summaries/#comments</comments>
		<pubDate>Fri, 04 May 2012 13:55:41 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Science]]></category>
		<category><![CDATA[Public Health]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=14890</guid>
		<description><![CDATA[I saw an analysis recently that I didn&#8217;t like. I won&#8217;t go into the details, but basically it was a dose-response inference, where a continuous exposure was binned into three broad categories (terciles of the data) and the probability of an adverse event was computed for each tercile. The effect and the sample size was [...]]]></description>
			<content:encoded><![CDATA[<p>I saw an analysis recently that I didn&#8217;t like.  I won&#8217;t go into the details, but basically it was a dose-response inference, where a continuous exposure was binned into three broad categories (terciles of the data) and the probability of an adverse event was computed for each tercile.  The effect and the sample size was large enough that the terciles were statistically-significantly different from each other in probability of adverse event, with the probabilities increasing from low to mid to high exposure, as one would predict.</p>
<p>I didn&#8217;t like this analysis because it is equivalent to fitting a step function.  There is a tendency for people to interpret the (arbitrary) tercile boundaries as being meaningful thresholds even though the underlying dose-response relation has to be continuous.  I&#8217;d prefer to start with a linear model and then add nonlinearity from there with a spline or whatever.</p>
<p>At this point I stepped back and thought:  Hey, the divide-into-three analysis does not literally assume a step function.  It doesn&#8217;t assume anything at all; it&#8217;s just a data summary!  People discretize input variables all the time!  So why am I complaining?</p>
<p>I justify my complaints on two levels.  First on the grounds of interpretation:  my applied colleagues really were interpreting the three-category model in terms of thresholds.  The three categories were:  &#8220;0 to A&#8221;, &#8220;A to B&#8221;, and &#8220;B to infinity&#8221;.  And somebody really was saying something about the effect of exposure A or exposure B.  Which just ain&#8217;t right.</p>
<p>My second issue is statistical efficiency.  You can say that the categorical-input model is nothing but a summary, an estimate of averages&#8212;but by binning like this, you lose statistical efficiency.  And you become the slave to &#8220;statistical significance&#8221;; there&#8217;s the temptation to butcher your analysis and throw away tons of information, just so you can get a single clean, statistically significant result.</p>
<p>P.S.  The more categories you have, the less of a concern it is to discretize.  And sometimes your data come in discrete form (see <a href="http://andrewgelman.com/2012/03/voting-patterns-of-americas-whites-from-the-masses-to-the-elites/">here</a>, for example).</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/models-assumptions-and-data-summaries/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>New New York data research organizations</title>
		<link>http://andrewgelman.com/2012/05/new-new-york-data-research-organizations/</link>
		<comments>http://andrewgelman.com/2012/05/new-new-york-data-research-organizations/#comments</comments>
		<pubDate>Thu, 03 May 2012 22:55:33 +0000</pubDate>
		<dc:creator>Aleks Jakulin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15261</guid>
		<description><![CDATA[In a single day, New York City obtained two data analysis/statistics/machine learning organizations: Microsoft Research New York City with John Langford (machine learning), Duncan Watts (networks), and Dave Pennock (algorithmic economics). eBay technology center focusing on data &#8211; led by Chris Dixon, the co-founder of the recommendation engine company Hunch, which has recently been acquired [...]]]></description>
			<content:encoded><![CDATA[<p>In a single day, New York City obtained two data analysis/statistics/machine learning organizations:</p>
<ul>
<li><a href="http://www.microsoft.com/en-us/news/features/2012/may12/05-02MSRNYC.aspx" target="_blank">Microsoft Research</a> New York City with <a href="http://hunch.net/">John Langford</a> (machine learning), <a href="http://en.wikipedia.org/wiki/Duncan_J._Watts">Duncan Watts</a> (networks), and <a href="http://blog.oddhead.com/">Dave Pennock</a> (algorithmic economics).</li>
<li><a href="http://gigaom.com/2012/05/03/ebay-puts-down-roots-in-nyc-with-data-focused-tech-center/">eBay technology center</a> focusing on data &#8211; led by <a href="http://cdixon.org/">Chris Dixon</a>, the co-founder of the recommendation engine company Hunch, which has recently been acquired by eBay.</li>
</ul>
<p>New York already has Facebook&#8217;s engineering <a href="https://www.facebook.com/note.php?note_id=249404228453921">unit</a>, Twitter&#8217;s East Coast <a href="http://www.nyc.gov/portal/site/nycgov/menuitem.c0935b9a57bb4ef3daf2f1c701c789a0/index.jsp?pageID=mayor_press_release&amp;catID=1194&amp;doc_name=http%3A%2F%2Fwww.nyc.gov%2Fhtml%2Fom%2Fhtml%2F2011b%2Fpr352-11.html&amp;cc=unused1978&amp;rc=1194&amp;ndi=1" target="_blank">headquarters</a>, and Google&#8217;s <a href="http://www.google.com/about/jobs/locations/new-york/">second-largest</a> engineering office.</p>
<p>The data community here is on an upswing, and it might be one of the best places to be if you&#8217;re into applied statistics, machine learning or data analysis.</p>
<p>Post by <a href="http://twitter.com/aleksj" target="_blank">Aleks Jakulin</a>.</p>
<p>P.S. (from Andrew):  The formerly-Yahoo-now-Microsoft researchers have a more-or-less formal connection to Columbia, through the Applied Statistics Center, where some of them will be organizing occasional mini-conferences and workshops!</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/new-new-york-data-research-organizations/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>News from the sister blog!</title>
		<link>http://andrewgelman.com/2012/05/news-from-the-sister-blog/</link>
		<comments>http://andrewgelman.com/2012/05/news-from-the-sister-blog/#comments</comments>
		<pubDate>Thu, 03 May 2012 18:35:55 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15258</guid>
		<description><![CDATA[US National Academy of Sciences elects 84 new members (Please click through and read the whole thing.)]]></description>
			<content:encoded><![CDATA[<p><a href="http://blogs.nature.com/news/2012/05/us-national-academy-of-sciences-elects-84-new-members.html">US National Academy of Sciences elects 84 new members</a></p>
<p>(Please click through and read the whole thing.)</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/news-from-the-sister-blog/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Google Translate for code, and an R help-list bot</title>
		<link>http://andrewgelman.com/2012/05/google-translate-for-code-and-an-r-help-list-bot/</link>
		<comments>http://andrewgelman.com/2012/05/google-translate-for-code-and-an-r-help-list-bot/#comments</comments>
		<pubDate>Thu, 03 May 2012 13:28:46 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Statistical computing]]></category>
		<category><![CDATA[R]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15251</guid>
		<description><![CDATA[What we did in our Stan meeting yesterday: Some discussion of revision of the Nuts paper, some conversations about parameterizations of categorical-data models, plans for the R interface, blah blah blah. But also, I had two exciting new ideas! Google Translate for code Wouldn&#8217;t it be great if Google Translate could work on computer languages? [...]]]></description>
			<content:encoded><![CDATA[<p>What we did in our Stan meeting yesterday:</p>
<p>Some discussion of revision of the Nuts paper, some conversations about parameterizations of categorical-data models, plans for the R interface, blah blah blah.</p>
<p>But also, I had two exciting new ideas!</p>
<p><strong>Google Translate for code</strong></p>
<p>Wouldn&#8217;t it be great if Google Translate could work on computer languages?  I suggested this and somebody said that it might be a problem because code isn&#8217;t always translatable.  But that doesn&#8217;t worry so much.  Google Translate for human languages isn&#8217;t perfect either but it&#8217;s a useful guide.  If I want to write a message to someone in French or Spanish or Dutch, I wouldn&#8217;t just write it in English and run it through Translate.  What I do is try my best to write it in the desired language, but I can try out some tricky words or phrases in the translator.  Or, if I start by translating, I go back and forth to make sure it all makes sense.</p>
<p><strong>An R help-list bot</strong></p>
<p>We were talking about how to build a Stan community that will be helpful to a diverse range of users without taking up too much of our time, and that&#8217;s when I came up with a brilliant idea.  Let&#8217;s take a successful existing help group&#8212;for example, the R-help mailing list&#8212;then make a database of the helpful bits of advice of a distinguished and frequent contributor to the list.  The bot would be easy:  whatever the question is that comes in, just send back a random tip.  I have a feeling that advice such as &#8220;PLEASE do, and not send HTML&#8221; and &#8220;My guess is that this is a Mac-specific question (e.g. you are using the R.app GUI), so please consider if this is the appropriate list&#8221; and &#8220;The posting guide was not followed&#8221; and &#8220;Please use the R-devel list to comment on current development versions&#8221; would work pretty well for almost any question (maybe after some global sub of Stan for R).</p>
<p>It&#8217;s sort of like when we were kids and had this book, Bennett Cerf&#8217;s Book of Riddles&#8212;it was a gray with a picture of a big red rock-eater on the cover.  We started out by just reading through the riddles one at a time but we had more fun after inventing a game where we&#8217;d open the book and read a riddle, then open again at random to give the answer.  For example:<br />
Q:  What&#8217;s big, red and eats rocks?<br />
A:  Two cats stuck in a tree!</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/google-translate-for-code-and-an-r-help-list-bot/feed/</wfw:commentRss>
		<slash:comments>16</slash:comments>
		</item>
		<item>
		<title>Selection bias, or, How you can think the experts don’t check their models, if you simply don’t look at what the experts actually are doing</title>
		<link>http://andrewgelman.com/2012/05/selection-bias/</link>
		<comments>http://andrewgelman.com/2012/05/selection-bias/#comments</comments>
		<pubDate>Wed, 02 May 2012 13:23:08 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Zombies]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15007</guid>
		<description><![CDATA[My friend Seth, whom I know from Berkeley (we taught a course together on left-handedness), has a blog on topics ranging from thoughtful discussions of scientific evidence, to experiences with his unconventional weight-loss scheme, offbeat self-experimentation, and advocacy of fringe scientific theories, leavened with occasional dollops of cynicism and political extremism. I agree with Seth [...]]]></description>
			<content:encoded><![CDATA[<p>My friend Seth, whom I know from Berkeley (we taught a course together on left-handedness), has a blog on topics ranging from thoughtful <a href="http://blog.sethroberts.net/2012/04/12/interview-with-daniel-wolfson-of-choosing-wisely/">discussions</a> of scientific evidence, to <a href="http://blog.sethroberts.net/2012/03/13/ten-years-of-weights-including-two-years-on-the-shangri-la-diet/">experiences</a> with his unconventional weight-loss scheme, <a href="http://blog.sethroberts.net/2012/04/09/unexplained-change-in-brain-speed/">offbeat</a> self-experimentation, and advocacy of <a href="http://blog.sethroberts.net/2012/04/06/do-sonograms-cause-autism-new-evidence/">fringe</a> scientific theories, leavened with occasional dollops of <a href="http://blog.sethroberts.net/2007/06/19/can-dish-it-out-but-cant-take-it/">cynicism</a> and political <a href="http://blog.sethroberts.net/2007/04/03/my-holocaust/">extremism</a>.  I agree with Seth on <a href="http://andrewgelman.com/2010/03/clippin_it/">some</a> things but not others.  (<a href="http://andrewgelman.com/2010/03/clippin_it/#comment-53303">Here&#8217;s</a> Seth&#8217;s reason for not attempting a clinical trial of his diet.)</p>
<p>Recently I was disturbed (but, I&#8217;m sorry to say, not surprised) to see Seth <a href="http://blog.sethroberts.net/2012/04/10/assorted-links-169/">post</a> the following:</p>
<blockquote><p><a href="http://mises.org/daily/5892/The-Skeptics-Case">Predictions of climate models versus reality</a>. I [Seth] have only seen careful prediction-vs-reality comparisons made by AGW [anthropogenic global warming] skeptics. Those who believe humans are dangerously warming the planet appear to be silent on this subject.</p></blockquote>
<p>In response, Phil <a href="http://blog.sethroberts.net/2012/04/10/assorted-links-169/#comment-962944">commented</a>:</p>
<blockquote><p>Funny, on the day you [Seth] made your post saying that you haven’t seen comparisons between models and predictions except by skeptics, the <a href="http://www.realclimate.org/index.php/archives/2012/04/evaluating-a-1981-temperature-projection/">top entry</a> on RealClimate, the single most prominent global-warming-related blog that is not run by skeptics, was “Evaluating a 1981 temperature projection.”</p></blockquote>
<p>Pretty amazing, huh?  On its face it would seem surprising to claim that the majority of leading climate scientists don&#8217;t do &#8220;careful prediction-vs-reality comparisons,&#8221; and indeed on the very day of Seth&#8217;s post, there is such a comparison right there on the first place you might look for what the climate scientists are doing!</p>
<p>How did Seth miss it?</p>
<p>A clue comes from the sources on which Seth relies.  His link above points to the webpage of the Von Mises Institute, a political advocacy organization.  Other Seth links to global warming stories have come from an climate change skeptic blog, a right-leaning politics blog, another climate change skeptic blog, another advocacy organization, one more climate change skeptic blog, a letter to the Wall Street Journal, yet another climate change skeptic blog, an op-ed in Forbes magazine, a lecture by science writer and political activist Matt Ridley, another blog that seems to specialize in climate change skepticism, conservative political columnist Jeff Jacoby, a Wall Street Journal op-ed, still one more climate change skeptic blog, a conservative religious magazine, . . . ummm, you get the idea.</p>
<p>If these are your sources, you can get a distorted view of the opinions and arguments of &#8220;those who believe humans are dangerously warming the planet&#8221;!  Any one or two or three of the above sources might be informative, but it doesn&#8217;t make sense to <em>only</em> look there.  (Quick:  glance again at the list of sources in the above paragraph.)<br />
<span id="more-15007"></span><br />
I&#8217;m not saying that Seth has any sort of duty to read the scientific literature&#8212;he&#8217;s trained as a psychologist, not as a physicist&#8212;nor does he need to read RealClimate (even if only as a supplement to the Von Mises Institute page, Wall Street journal op-eds, and so on), but it seems pretty silly for him to be so sure of himself on the science, given that his selection of politically-loaded sources.</p>
<p>I&#8217;m <a href="http://andrewgelman.com/2009/12/how_do_i_form_m/">not an expert</a> on climate science either (I&#8217;m currently involved in a research project using tree-ring data to estimate historical climate, but my expertise in this project is in the statistics, not the biology or physics), so I asked Phil what his thoughts were on the particular article that Seth linked to (by David M. W. Evans, described as &#8220;a mathematician and engineer, with six university degrees including a PhD from Stanford University in electrical engineering&#8221;).</p>
<p>Phil responded:</p>
<blockquote><p>If you look at http://en.wikipedia.org/wiki/File:Satellite_Temperatures.png the temperature anomaly was higher in 1988 than in any other year until 1997&#8230;and was lower in the past couple of years than at most times in the past decade. The paper Seth blogged about looks at the change in temperatures since 1988 and says hey, there hasn&#8217;t been any increase whereas this modeler guy said the temperature would be way higher by now.  If he had chosen 1985 or 1989 as the starting point instead, it would look different. Of course he&#8217;d say he chose 1988 because that&#8217;s when Hansen testified in Congress, so fair enough in a way&#8230;but this is a bit like the housing data that you blogged about, why not show the earlier data too?  And the reason is the same as in the housing data: it&#8217;s because if you show the earlier data it undermines the story you are trying to tell. 1988 was an unusually hot year, by a lot.</p></blockquote>
<p>Phil then turns to the RealClimate post (by Geert Jan van Oldenborgh and Rein Haarsma) mentioned earlier:</p>
<blockquote><p>They compare a forecast from 1981 to data. This uses a land-ocean index rather than tropospheric temperatures&#8230;perhaps that&#8217;s an example of this guy cherry-picking too (the land-ocean temperature increase has been more rapid than the satellite-derived tropospheric temperature increase for reasons someone else might know but I don&#8217;t).  Anyway this one shows a totally different story of course.  Actually this guy cheats a little by starting his baseline too high I think, though that&#8217;s just based on a visual impression.</p>
<p>One thing that people who do these things don&#8217;t seem to understand is that the uncertainties in climate models (like low/medium/high curves) are supposed to represent the uncertainty in the trend, not the uncertainty for an individual year. It&#8217;s sort of like showing uncertainties in a regression model with one of those bow-tie curves: that doesn&#8217;t mean that you&#8217;re predicting that the individual data points will fall within the upper and lower curve, it means that the true line is probably somewhere in there. Similarly, people say &#8220;the past five years (or whatever) have been below the predicted trend line so the trend line must be too high,&#8221; apparently not realizing (or deliberately ignoring) that there is a lot of year-to-year variation, plus autocorrelation on the scale of a decade or so. It&#8217;s a true statement that global mean temperature was as high in 1988 as it was last year; it&#8217;s false to say that there&#8217;s not an upward trend in temperatures.</p></blockquote>
<p>I think, though, that if Seth continues to search for information from his usual sources, he will remain convinced that there is no man-made global warming and that vast majority of scientists who study these problems are not interested in checking their models, have not ever thought about the ice age, etc etc.</p>
<p>I think this is important not only for the followers of Seth Roberts but more generally in that it illustrates the traps that people can fall into when seeking out confirmation of their beliefs.  Seth is better-equipped than most people to read about scientific evidence, yet he is stuck, not only in holding a scientific view which I find implausible (after all, I might be wrong) but in not understanding that Geert Jan van Oldenborgh, Rein Haarsma, etc etc etc are doing serious science.  It&#8217;s sad, and it&#8217;s scary.</p>
<p>P.S.  Phil provides a good summary in <a href="http://andrewgelman.com/2012/05/selection-bias/#comment-81592">this</a> comment.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/selection-bias/feed/</wfw:commentRss>
		<slash:comments>54</slash:comments>
		</item>
		<item>
		<title>Huff the Magic Dragon</title>
		<link>http://andrewgelman.com/2012/05/huff-the-magic-dragon/</link>
		<comments>http://andrewgelman.com/2012/05/huff-the-magic-dragon/#comments</comments>
		<pubDate>Tue, 01 May 2012 23:44:33 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Public Health]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15235</guid>
		<description><![CDATA[Upon reading this, Susan remarked, &#8220;Don&#8217;t you think it&#8217;s interesting that a guy who promotes smoking has a last name of &#8216;Huff&#8217;? Reminds me of the Dennis/Dentist studies.&#8221; Good point. P.S. As discussed in the linked thread, the great statistician R. A. Fisher was notorious for minimizing the risks of smoking. How does this connect [...]]]></description>
			<content:encoded><![CDATA[<p>Upon reading <a href="http://andrewgelman.com/2012/04/how-to-mislead-with-how-to-lie-with-statistics/">this</a>, Susan remarked, &#8220;Don&#8217;t you think it&#8217;s interesting that a guy who promotes smoking has a last name of &#8216;Huff&#8217;?  Reminds me of the Dennis/Dentist studies.&#8221;</p>
<p>Good point.</p>
<p>P.S.  As discussed in the linked thread, the great statistician R. A. Fisher was notorious for minimizing the risks of smoking.  How does this connect to Fisher&#8217;s name, one might ask?</p>
<p><a href="http://andrewgelman.com/wp-content/uploads/2012/05/poisson.jpg"><img src="http://andrewgelman.com/wp-content/uploads/2012/05/poisson.jpg" alt="" title="poisson" width="240" height="240" class="alignnone size-full wp-image-15246" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/huff-the-magic-dragon/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Colorless green facts asserted resolutely</title>
		<link>http://andrewgelman.com/2012/05/colorless-green-facts-asserted-resolutely/</link>
		<comments>http://andrewgelman.com/2012/05/colorless-green-facts-asserted-resolutely/#comments</comments>
		<pubDate>Tue, 01 May 2012 14:10:26 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Science]]></category>
		<category><![CDATA[Sociology]]></category>
		<category><![CDATA[Sports]]></category>
		<category><![CDATA[Teaching]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15230</guid>
		<description><![CDATA[Thomas Basbøll [yes, I've learned how to smoothly do this using alt-o] gives some writing advice: What gives a text presence is our commitment to asserting facts. We have to face the possibility that we may be wrong about them resolutely, and we do this by writing about them as though we are right. This [...]]]></description>
			<content:encoded><![CDATA[<p>Thomas Basbøll [yes, I've learned how to smoothly do this using alt-o] gives <a href="http://secondlanguage.blogspot.com/2012/05/there.html">some writing advice</a>:</p>
<blockquote><p>What gives a text presence is our commitment to asserting facts. We have to face the possibility that we may be wrong about them resolutely, and we do this by writing about them as though we are right.</p></blockquote>
<p>This and an earlier <a href="http://andrewgelman.com/2012/04/believe-your-models-up-to-the-point-that-you-abandon-them/">remark</a> by Basbøll are closely related in my mind to predictive model checking and to <a href="http://www.stat.columbia.edu/~gelman/research/published/philosophy.pdf">Bayesian statistics</a>:  we make strong assumptions and then engage the data and the assumptions in a dialogue:  assumptions + data -> inference, and we can then compare the inference to the data which can reveal problems with our model (or problems with the data, but that&#8217;s really problems with the model too, in this case problems with the model for the data).</p>
<p>I like the idea that a condition for a story to be useful is that we put some belief into it.  (One doesn&#8217;t put belief into a joke.)  And also the converse, that thnking hard about a story and believing it can be the precondition to ultimately rejecting it because its implications don&#8217;t make sense.  It&#8217;s like in chess:  the way to refute a move is to consider making the move (which is as irrevocable in a chess context as believing a story is, in the context of basing a social-scientific theory on it.)</p>
<p>I&#8217;m also reminded of the advice from Pólya (or somebody like that) about solving math problems.  If the question is, &#8220;Is statement A true?&#8221;, you can try to prove A or find a counterexample.  But it&#8217;s hard to do both at the same time!  Better to take a guess and go from there:  if you try and try to prove it and fail, this may give insight into where to find a counterexample (in those folds of the problem that make A so hard to prove); conversely, if you can&#8217;t find a counterexample no matter how hard you look, you can try to systematize that search, thus perhaps leading to a proof that no counterexample exists.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/colorless-green-facts-asserted-resolutely/feed/</wfw:commentRss>
		<slash:comments>23</slash:comments>
		</item>
		<item>
		<title>Modeling y = a + b + c</title>
		<link>http://andrewgelman.com/2012/05/modeling-y-a-b-c/</link>
		<comments>http://andrewgelman.com/2012/05/modeling-y-a-b-c/#comments</comments>
		<pubDate>Tue, 01 May 2012 13:33:11 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Statistics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=14626</guid>
		<description><![CDATA[Brandon Behlendorf writes: I [Behlendorf] am replicating some previous research using OLS [he's talking about what we call "linear regression"---ed.] to regress a logged rate (to reduce skew) of Y on a number of predictors (Xs). Y is the count of a phenomena divided by the population of the unit of the analysis. The problem [...]]]></description>
			<content:encoded><![CDATA[<p>Brandon Behlendorf writes:<br />
<span id="more-14626"></span></p>
<blockquote><p>I [Behlendorf] am replicating some previous research using OLS [he's talking about what we call "linear regression"---ed.] to regress a logged rate (to reduce skew) of Y on a number of predictors (Xs).  Y is the count of a phenomena divided by the population of the unit of the analysis.  The problem that I am encountering is that Y is composite count of a number of distinct phenomena [A+B+C], and these phenomena are not uniformly distributed across the sample.   Most of the research in this area has conducted regressions either with Y or with individual phenomena [A or B or C] as the dependent variable.  Yet it seems that if [A, B, C] are not uniformly distributed across the sample of units in the same proportion, then the use of Y would be biased, since as a count of [A+B+C] divided by the population, it would treat as equivalent units both [2+0.5+1.5] and [4+0+0]. </p>
<p>My goal is trying to find a methodology which allows a researcher to regress Y on a number of Xs, but which accounts for the uneven variation in the distributions of the individual phenomena [A+B+C] that constitute Y.  I have thought that it could be treated within a Structural Equation Model as multiple dependent variables, or through a process of joint estimation, but in essence I know the latent factor (Y) that one usually does not know when trying to measure through some sort of SEM or Rasch Model.  I have also considered weighting [A,B,C] by converting them into percentages of the total count of each phenomena within the sample (i.e. (A1/sum A(1-100)) + (B1/sum B(1-100)) + (C1/sum C(1-100))), but the result lacks interpretational quality as to the overall relationship between Xs and Y.</p></blockquote>
<p>My reply:</p>
<p>First off, the reason for logging is to model a multiplicative relationship using an additive model.  Skewness is typically irrelevant (see the discussion of regression assumptions in chapter 3 or 4 of ARM).  No big deal here, I just wanted to get that out of the way.  Also, if y is a count, you might want to use an overdispersed Poisson regression as discussed in chapter 6.</p>
<p>My main question is, if you have a, b, and c, why not just model them separately?  Is it a sample size issue, that by combining a,b,c into y, you get more stable estimates?  If so, that&#8217;s ok, and you could always try weighted averages if that makes sense in your application.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/05/modeling-y-a-b-c/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Systematic review of publication bias in studies on publication bias</title>
		<link>http://andrewgelman.com/2012/04/systematic-review-of-publication-bias-in-studies-on-publication-bias/</link>
		<comments>http://andrewgelman.com/2012/04/systematic-review-of-publication-bias-in-studies-on-publication-bias/#comments</comments>
		<pubDate>Mon, 30 Apr 2012 20:23:11 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Statistics]]></category>
		<category><![CDATA[Public Health]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15222</guid>
		<description><![CDATA[Via Yalda Afshar, a 2005 paper by Hans-Hermann Dubben and Hans-Peter Beck-Bornholdt: Publication bias is a well known phenomenon in clinical literature, in which positive results have a better chance of being published, are published earlier, and are published in journals with higher impact factors. Conclusions exclusively based on published studies, therefore, can be misleading. [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://andrewgelman.com/wp-content/uploads/2012/04/stephen-colbert-portrait-at-smithsonian.jpg"><img src="http://andrewgelman.com/wp-content/uploads/2012/04/stephen-colbert-portrait-at-smithsonian.jpg" alt="" title="Stephen Colbert potrait donated to NMAH" width="404" height="520" class="alignnone size-full wp-image-15223" /></a></p>
<p>Via <a href="http://twitter.com/#!/yafshar">Yalda Afshar</a>, <a href="http://www.bmj.com/highwire/filestream/379739/field_highwire_article_pdf/0.pdf">a 2005 paper</a> by Hans-Hermann Dubben and Hans-Peter Beck-Bornholdt:</p>
<blockquote><p>Publication bias is a well known phenomenon in clinical literature, in which positive results have a better chance of being published, are published earlier, and are published in journals with higher impact factors. Conclusions exclusively based on published studies, therefore, can be misleading. Selective under-reporting of research might be more widespread and more likely to have adverse consequences for patients than publication of deliberately falsified data. We investigated whether there is preferential publication of positive papers on publication bias.</p></blockquote>
<p>They conclude, &#8220;We found no evidence of publication bias in reports on publication bias.&#8221;  But of course that&#8217;s the sort of finding regarding publication bias of findings on publication bias that you&#8217;d expect would get published.  What we really need is a careful meta-analysis to estimate the level of publication bias in studies of publication bias of studies of publication bias.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/systematic-review-of-publication-bias-in-studies-on-publication-bias/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>I suppose it’s too late to add Turing’s run-around-the-house-chess to the 2012 London Olympics?</title>
		<link>http://andrewgelman.com/2012/04/i-suppose-its-too-late-to-add-turings-run-around-the-house-chess-to-the-2012-london-olympics/</link>
		<comments>http://andrewgelman.com/2012/04/i-suppose-its-too-late-to-add-turings-run-around-the-house-chess-to-the-2012-london-olympics/#comments</comments>
		<pubDate>Mon, 30 Apr 2012 14:20:18 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Sports]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15214</guid>
		<description><![CDATA[Daniel Murrell writes: I see you have a blog post about turing chess . . . I&#8217;ve seen another reference to it but am unable to find a definitive source. Do you know of a source where I could find out about the history of the idea? My reply: You mean the run-around-the-house thing? I [...]]]></description>
			<content:encoded><![CDATA[<p>Daniel Murrell writes:</p>
<blockquote><p>I see you have a blog post about turing chess . . . I&#8217;ve seen another reference to it but am unable to find a definitive source. Do you know of a source where I could find out about the history of the idea?</p></blockquote>
<p>My reply:</p>
<p>You mean the run-around-the-house thing?  I don&#8217;t know where it comes from.  It&#8217;s a well known story, if you google Turing chess run around the house you can find lots of references but I don&#8217;t know the definitive source.  I can blog and see if anything comes up!</p>
<p>I&#8217;ve never actually played the game.  I&#8217;ll try it outdoors sometime, perhaps.  When I last <a href="http://andrewgelman.com/2010/01/a_problem_with/">posted</a> on the topic, we had a fun discussion, revealing that the rules are not as clear as one might think.  It makes me wonder if anyone&#8217;s thought hard about it and come up with a good set of &#8220;official rules.&#8221;</p>
<p>Any thoughts?</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/i-suppose-its-too-late-to-add-turings-run-around-the-house-chess-to-the-2012-london-olympics/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>We go to war with the data we have, not the data we want</title>
		<link>http://andrewgelman.com/2012/04/we-go-to-war-with-the-data-we-have-not-the-data-we-want/</link>
		<comments>http://andrewgelman.com/2012/04/we-go-to-war-with-the-data-we-have-not-the-data-we-want/#comments</comments>
		<pubDate>Mon, 30 Apr 2012 00:14:30 +0000</pubDate>
		<dc:creator>Phil</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15205</guid>
		<description><![CDATA[This post is by Phil. Psychologists perform experiments on Canadian undergraduate psychology students and draws conclusions that (they believe) apply to humans in general; they publish in Science. A drug company decides to embark on additional trials that will cost tens of millions of dollars based on the results of a careful double-blind study&#8230;.whose patients are [...]]]></description>
			<content:encoded><![CDATA[<p>This post is by Phil.</p>
<p>Psychologists perform experiments on Canadian undergraduate psychology students and draws conclusions that (they believe) apply to humans in general; they publish in Science. A drug company decides to embark on additional trials that will cost tens of millions of dollars based on the results of a careful double-blind study&#8230;.whose patients are all volunteers from two hospitals. A movie studio holds 9 screenings of a new movie for volunteer viewers and, based on their survey responses, decides to spend another $8 million to re-shoot the ending.  A researcher interested in the effect of ventilation on worker performance conducts a months-long study in which ventilation levels are varied and worker performance is monitored&#8230;in a single building.</p>
<p>In almost all fields of research, most studies are based on convenience samples, or on random samples from a larger population that is itself a convenience sample. The paragraph above gives just a few examples.  The benefits of carefully conducted randomized trials are well known, but so are the costs and impediments. Lucky people studying some natural phenomena like solar output or earthquakes can deal with complete datasets, but for most data analysts and applied statisticians the fact that your data are not a random sample from your population of interest is so commonplace that it usually goes without saying. This does not mean that it goes without thinking, of course: most researchers, and all good ones, think about the extent to which their results might or might not be applicable to a wider population and try to frame their conclusions accordingly. But most or all researchers are willing to extrapolate their results to wider populations to some degree. The movie studio reshoots their ending not because they want to please their 9 test audiences, but because they think that the response of those 9 test audiences tells them something about millions of other viewers, even though those 9 audiences were not selected according to a careful, randomized sampling scheme.</p>
<p>If you think everything I&#8217;ve said so far is so obvious as to be boring, so did I, but I was proven wrong. Read on.</p>
<p><span id="more-15205"></span></p>
<p>I&#8217;m currently working with time series data on building electricity consumption. People want to be able to answer questions like &#8220;I changed something about my large commercial building on March 1; how much energy am I saving,&#8221; and one way to do that is to fit a statistical model using data from before March 1, use it to predict the energy use after March 1, and compare the predictions to what actually happened. There are other uses for these models too.</p>
<p>There are quite a few companies that offer energy modeling programs. Typically, a company develops its own proprietary tool. Everyone thinks their package is better than anyone else&#8217;s, or at least they say so, but there&#8217;s usually no evidence.  A few months ago a company approached us to ask if we will compare the accuracy of their tool to some standard methods and allow them to publicize the results if they want to. (I work at a government research lab and they correctly see us as a disinterested party.)  We said sure. Our chosen approach is cross-validation. We compiled electricity data from a few dozen large commercial buildings &#8212; the population of interest &#8212; blanked out big chunks of data, and gave the resulting dataset to the company. Their task is to make predictions for the missing time periods and give them to us, and we will compare their predictions to reality. We&#8217;re doing the same with some standard prediction models. The data we&#8217;ve given them are a convenience sample. There&#8217;s no practical way to get data from a random sample of buildings in the country or even from a single electric utility, in part because of privacy issues (the utilities can&#8217;t give out the data without permission of the building owners). Our data are a grab bag from different sources, and certainly not representative of the broad population of commercial buildings in many ways. Still, what else can one do? We want to find out if these guys have a model that outperforms standard models, and this is a way to find the answer.</p>
<p>To my surprise, our postdoc strongly disagrees. He asserts that since we don&#8217;t have a sampling plan, just a haphazard collection of building data, we can&#8217;t say anything at all. He says &#8220;sure, of course you can say which method performs better for these 52 buildings, but you cannot say a single thing about a 53rd building. Nothing.&#8221;  At first I thought he meant that we should be cautious about making firm claims, and I certainly agree with that, but that&#8217;s not it: he really thinks that it is wrong to draw any conclusions whatsoever from our results. He thinks it&#8217;s wrong (incorrect, and borderline immoral) to say that our results are even suggestive. I offered a wager: if we find that one method performs substantially better than the others on average, I will bet you dinner that it will perform better for a 53rd building. He said sure, fine for wagering over dinner, but it is scientifically indefensible to make any statement at all about which of the methods performs better in general on the basis of anything we might find using our dataset.</p>
<p>To me this has some parallels to a <a href="http://andrewgelman.com/2012/04/bad-news-about-some-statisticians/">recent post </a>about a theoretical statistician whose work is useless in practice.  I am as aware of the problems of biased datasets as anyone &#8212; I once looked into the issue of bias in indoor radon measurement datasets, and the bias turned out to be absolutely enormous &#8212; but in most fields of research if you think you can&#8217;t learn anything about the wider world unless that wider world is inside your sampling frame, you may as well quit now because you are never going to get the world into your sampling frame.</p>
<p>This is an interesting problem because it is sort of outside the realm of statistics, and into some sort of meta-statistical area. How can you judge whether your results can be extrapolated to the &#8220;real world,&#8221; if you can&#8217;t get a real-world sample to compare to? (And if you could get a sample to compare to, you would, and then this problem wouldn&#8217;t come up).</p>
<p>I would welcome thoughtful commentary on this subject.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/we-go-to-war-with-the-data-we-have-not-the-data-we-want/feed/</wfw:commentRss>
		<slash:comments>53</slash:comments>
		</item>
		<item>
		<title>Clueless Americans think they’ll never get sick</title>
		<link>http://andrewgelman.com/2012/04/clueless-americans-think-theyll-never-get-sick/</link>
		<comments>http://andrewgelman.com/2012/04/clueless-americans-think-theyll-never-get-sick/#comments</comments>
		<pubDate>Sun, 29 Apr 2012 13:48:10 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Public Health]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15017</guid>
		<description><![CDATA[Cassie Murdoch points to a report from a corporate survey: Sixty-two percent of U.S. employees say it&#8217;s not likely they or a family member will be diagnosed with a serious illness like cancer, a survey indicates. The Aflac WorkForces Report, a survey of nearly 1,900 benefits decision-makers and more than 6,100 U.S. workers, also indicated [...]]]></description>
			<content:encoded><![CDATA[<p>Cassie Murdoch <a href="http://jezebel.com/5900205/majority-of-americans-are-pretty-sure-theyre-never-going-to-die">points</a> to a <a href="http://www.upi.com/Health_News/2012/04/08/Most-say-theyll-never-get-seriously-ill/UPI-74351333933813/">report</a> from a corporate survey:</p>
<blockquote><p>Sixty-two percent of U.S. employees say it&#8217;s not likely they or a family member will be diagnosed with a serious illness like cancer, a survey indicates.</p>
<p>The Aflac WorkForces Report, a survey of nearly 1,900 benefits decision-makers and more than 6,100 U.S. workers, also indicated 55 percent said they were not very or not at all likely to be diagnosed with a chronic illness, such as heart disease or diabetes.</p></blockquote>
<p>Here are some actual statistics:</p>
<blockquote><p>The American Cancer Society, Cancer Facts &#038; Figures 2012, said 1-in-3 women and 1-in-2 men will be diagnosed with cancer at some point in their lives, and the National Safety Council, Injury Facts 2011 edition, says more than 38.9 million injuries occur in a year requiring medical treatment.</p>
<p>The American Heart Association, Heart Disease &#038; Stroke Statistics 2012, said 1-in-6 U.S. deaths were caused by coronary heart disease, Tillman said.</p></blockquote>
<p>And some details on the survey:</p>
<blockquote><p>The survey conducted in January and February by Research Now. The first 3,151 worker interviews were nationally representative, while the remaining 3,000 interviews were conducted among the Top 30 designated market areas.</p></blockquote>
<p>Did these people really say they that neither they nor a family member will have a serious illness?  Is this for real?  What were they thinking?  I&#8217;m used to seeing wacky survey findings, but this one is ridiculous.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/clueless-americans-think-theyll-never-get-sick/feed/</wfw:commentRss>
		<slash:comments>19</slash:comments>
		</item>
		<item>
		<title>Agreement Groups in US Senate and Dynamic Clustering</title>
		<link>http://andrewgelman.com/2012/04/agreement-groups-in-us-senate-and-dynamic-clustering/</link>
		<comments>http://andrewgelman.com/2012/04/agreement-groups-in-us-senate-and-dynamic-clustering/#comments</comments>
		<pubDate>Sat, 28 Apr 2012 15:55:15 +0000</pubDate>
		<dc:creator>Aleks Jakulin</dc:creator>
				<category><![CDATA[Political Science]]></category>
		<category><![CDATA[Statistical graphics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15194</guid>
		<description><![CDATA[Adrien Friggeri has a lovely visualization of US Senators movement between clusters: You have to click the image and play with it to appreciate it. The methodology isn&#8217;t yet published &#8211; but I can see how this could be very illuminating. The dynamic clustering aspect hasn&#8217;t been researched much &#8211; one of the notable pieces [...]]]></description>
			<content:encoded><![CDATA[<p>Adrien Friggeri has a lovely <a href="http://friggeri.net/senate/#visualization">visualization</a> of US Senators movement between clusters:</p>
<p><a href="http://andrewgelman.com/wp-content/uploads/2012/04/friggeri.png"><img src="http://andrewgelman.com/wp-content/uploads/2012/04/friggeri.png" alt="" width="500" class="aligncenter size-full wp-image-15195" /></a></p>
<p>You have to click the image and play with it to appreciate it. The methodology isn&#8217;t yet published &#8211; but I can see how this could be very illuminating. The dynamic clustering aspect hasn&#8217;t been researched much &#8211; one of the notable pieces is the Blei and Lafferty <a href="http://topics.cs.princeton.edu/Science/" target="_blank">dynamic topic model of Science</a>.</p>
<p>I did a static <a href="http://pan.oxfordjournals.org/content/17/3/291.abstract" target="_blank">analysis of the US Senate</a> back in 2005 with Wray Buntine and coauthors. Some additional visualizations and the source code are <a href="http://www.stat.columbia.edu/~jakulin/Politics/" target="_blank">here</a>. We did a dynamic analysis of US Supreme Court on <a href="http://andrewgelman.com/2005/11/why_do_supreme/" target="_blank">this blog</a> but there&#8217;s also a <a href="http://scholar.google.com/citations?view_op=view_citation&amp;hl=en&amp;user=1Rq5MfEAAAAJ&amp;cstart=20&amp;citation_for_view=1Rq5MfEAAAAJ:QIV2ME_5wuYC" target="_blank">paper</a>. </p>
<p>My knowledge on this topic is out of date, however. Who has been doing good work in this area? I&#8217;ll organize the links.</p>
<p>[added 4/29/12, via <a href="http://www.people.fas.harvard.edu/~airoldi/">Edo Airoldi</a>]: Visualizing the Evolution of Community Structures in<br />
Dynamic Social Networks by Khairi Reda et al (2011) [<a href="http://compbio.cs.uic.edu/~chayant/papers/KhairiEtal-DynCommEurovis11.pdf">PDF</a>].</p>
<p>[added 4/29/12, via <a href="http://ariddell.org/">Allen Riddell</a>] Joint Analysis of Time-Evolving Binary Matrices and Associated Documents by Eric Wang et al (2010) [<a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.185.3179">PDF</a>] [<a href="https://sites.google.com/site/matrixtopics/">Video</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/agreement-groups-in-us-senate-and-dynamic-clustering/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Understanding simulations in terms of predictive inference?</title>
		<link>http://andrewgelman.com/2012/04/understanding-simulations-in-terms-of-predictive-inference/</link>
		<comments>http://andrewgelman.com/2012/04/understanding-simulations-in-terms-of-predictive-inference/#comments</comments>
		<pubDate>Sat, 28 Apr 2012 13:36:13 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Bayesian Statistics]]></category>
		<category><![CDATA[Miscellaneous Statistics]]></category>
		<category><![CDATA[Statistical computing]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=14912</guid>
		<description><![CDATA[David Hogg writes: My (now deceased) collaborator and guru in all things inference, Sam Roweis, used to emphasize to me that we should evaluate models in the data space &#8212; not the parameter space &#8212; because models are always effectively &#8220;effective&#8221; and not really, fundamentally true. Or, in other words, models should be compared in [...]]]></description>
			<content:encoded><![CDATA[<p>David Hogg writes:</p>
<blockquote><p>My (now deceased) collaborator and guru in all things inference, Sam Roweis, used to emphasize to me that we should evaluate models in the data space &#8212; not the parameter space &#8212; because models are always effectively &#8220;effective&#8221; and not really, fundamentally true. Or, in other words, models should be compared in the space of their predictions, not in the space of their parameters (the  parameters didn&#8217;t really &#8220;exist&#8221; at all for Sam).  In that spirit, when we estimate the effectiveness of a MCMC method or tuning &#8212; by autocorrelation time or ESJD or anything else &#8212; shouldn&#8217;t we be looking at the changes in the model predictions over time, rather than the changes in the parameters over time?  That is, the autocorrelation time should be the autocorrelation time in what the model (at the walker position) predicts for the data, and the ESJD should be the expected squared jump distance in what the model predicts for the data?  This might resolve the concern I expressed a few months ago to you that the ESJD is not affine-invariant, and etc.  Thoughts?</p></blockquote>
<p>Hogg continues with an example:</p>
<blockquote><p>Imagine you have a three-planet model for some radial velocity data. In the naivest implementation, you have a three-factorial exact degeneracy from swapping planets, but the modes are very well separated in parameter space:  Your autocorrelation time in the parameters is essentially infinite (because you will never switch from one permutation of the planets to another, realistically), but in the predictions the autocorrelation time is finite and fine.</p></blockquote>
<p>My reply:</p>
<p>It depends on the context.  Sometimes we have a redundant parameterization in which the individual parameters are not identified, but predictions are well-identified.  For a simple example, suppose you have a model, y ~ N (a+b, 1), with a uniform prior distribution on (a,b).  Then your data don&#8217;t tell you anything about a or b, but you can get good inference for a+b and good predictions for new data from the same model.  On the other hand, if you want to make a prediction for new data z ~ N(a,1), you&#8217;re out of luck.</p>
<p>More generally, one problem I have with the hard-line predictivist stance&#8212;the idea that models and parameters are mere fictions whereas predictions are real&#8212;is that models and parameters can be thought of as bridges between the data of yesterday and the data of tomorrow.  Consider the speed of light.  It&#8217;s not just part of a prediction for some particular measurement.  It&#8217;s also a universal constant.  For a more humble example, consider our discussion of physiologically-based pharmacokinetics models in Section 4.3 of my <a href="http://www.stat.columbia.edu/~gelman/research/published/bois2.pdf">article</a> with Bois and Jiang.  In a Bayesian model, good parameterization can be important, as it is typically through the parameters that we put in prior information.  In many ways, the parameterization represents a key source of prior information.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/understanding-simulations-in-terms-of-predictive-inference/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>“How to Lie with Statistics” guy worked for the tobacco industry to mock studies of the risks of smoking statistics</title>
		<link>http://andrewgelman.com/2012/04/how-to-mislead-with-how-to-lie-with-statistics/</link>
		<comments>http://andrewgelman.com/2012/04/how-to-mislead-with-how-to-lie-with-statistics/#comments</comments>
		<pubDate>Fri, 27 Apr 2012 14:23:47 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Public Health]]></category>
		<category><![CDATA[Sociology]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15153</guid>
		<description><![CDATA[Remember How to Lie With Statistics? It turns out that the author worked for the cigarette companies. John Mashey points to this, from Robert Proctor’s book, &#8220;Golden Holocaust: Origins of the Cigarette Catastrophe and the Case for Abolition&#8221;: Darrell Huff, author of the wildly popular (and aptly named) How to Lie With Statistics, was paid [...]]]></description>
			<content:encoded><![CDATA[<p>Remember How to Lie With Statistics?  It turns out that the author worked for the cigarette companies.  John Mashey <a href="http://andrewgelman.com/2012/04/gross-misuse-of-statistics-can-be-a-good-thing-if-it-indicates-the-acceptance-of-the-importance-of-statistical-reasoning/#comment-79923">points</a> to this, from Robert Proctor’s book, &#8220;Golden Holocaust: Origins of the Cigarette Catastrophe and the Case for Abolition&#8221;:</p>
<blockquote><p>Darrell Huff, author of the wildly popular (and aptly named) How to Lie With Statistics, was paid to testify before Congress in the 1950s and then again in the 1960s, with the assigned task of ridiculing any notion of a cigarette-disease link. On March 22, 1965, Huff testified at hearings on cigarette labeling and advertising, accusing the recent Surgeon General’s report of myriad failures and “fallacies.” Huff peppered his attack with with amusing asides and anecdotes, lampooning spurious correlations like that between the size of Dutch families and the number of storks nesting on rooftops–which proves not that storks bring babies but rather that people with large families tend to have larger houses (which therefore attract more storks).</p></blockquote>
<p>This was all a surprise to me, and I suspect to other statisticians as well.  For example, Huff&#8217;s activities with the cigarette companies are not mentioned on his Wikipedia page (as of 17 Apr 2012), nor are they mentioned in <a href="http://www-stat.wharton.upenn.edu/~steele/Publications/PDF/TN148.pdf">an article</a> on Huff by probabilist J. Michael Steele from 2005.</p>
<p><strong>&#8220;How to Lie with Smoking Statistics&#8221;</strong></p>
<p>Darrell Huff is best known for How to Lie with Statistics but he wrote or cowrote <a href="http://legacy.library.ucsf.edu/action/document/page?tid=ufa99d00&#038;page=2">several other books</a>, including Pictures by Pete (1944), The Dog that Came True (1946), How to Take a Chance (1959), Score: The Strategy of Taking Tests (1961), Cycles in Your Life (1964), How to be the Parent of a Successful Creative Child (1968), Twenty Careers of Tomorrow (1945), and How to Lower Your Food Bills (1963).</p>
<p>It appears that in the late 1960s he was also working on <a href="http://legacy.library.ucsf.edu/tid/mod15b00/pdf">a book</a> called &#8220;How to Lie with Smoking Statistics,&#8221; which <a href="http://legacy.library.ucsf.edu/tid/neb76b00/pdf">the publisher</a> saw &#8220;high likelihood of proceeding into print.&#8221;</p>
<p>In November 1965, <a href="http://legacy.library.ucsf.edu/tid/rfx92f00/pdf">a letter</a> was sent to Huff as follows:</p>
<p><a href="http://andrewgelman.com/wp-content/uploads/2012/05/Screen-shot-2012-04-24-at-11.23.49-AM.png"><img src="http://andrewgelman.com/wp-content/uploads/2012/05/Screen-shot-2012-04-24-at-11.23.49-AM.png" alt="" title="Screen shot 2012-04-24 at 11.23.49 AM" width="376" height="208" class="alignnone size-full wp-image-15156" /></a></p>
<p><a href="http://legacy.library.ucsf.edu/tid/xre35e00/pdf">Here</a>&#8216;s a letter from 1967 where Huff asks the tobacco dudes for another $1500 to keep writing.</p>
<p>And <a href="http://legacy.library.ucsf.edu/tid/civ92f00/pdf">here</a>&#8216;s a letter from mid-1968 from Huff&#8217;s publisher, Macmillan:</p>
<p><a href="http://andrewgelman.com/wp-content/uploads/2012/05/Screen-shot-2012-04-24-at-11.19.18-AM.png"><img src="http://andrewgelman.com/wp-content/uploads/2012/05/Screen-shot-2012-04-24-at-11.19.18-AM.png" alt="" title="Screen shot 2012-04-24 at 11.19.18 AM" width="425" height="227" class="alignnone size-full wp-image-15155" /></a></p>
<p>But publication &#8220;as soon as possible&#8221; never seems to have occurred. </p>
<p>What happened?  In one of these documents, William Kloepfer, vice president for public relations for the Tobacco Institute, wrote of the manuscript, &#8220;Frankly, this mass of verbiage needs drastic editing before it will directly address itself to the needs of our industry.&#8221;</p>
<p>After glancing at <a href="http://legacy.library.ucsf.edu/tid/gzr66b00/pdf">a couple of sections</a> from the draft, I gotta say that William Kloepfer had a point:  &#8220;mass of verbiage&#8221; is a pretty good description!  Huff&#8217;s book chapter reads like a bad sitcom where the writers were too lazy to put together enough material and they just milk the same couple of jokes over and over again.</p>
<p>In retrospect, I think Huff really dodged a bullet on this one.  If &#8220;How to Lie with Smoking Statistics&#8221; had come out, I expect it would&#8217;ve destroyed his reputation&#8212;remember, we&#8217;re talking 1969 here, that&#8217;s five years <em>after</em> the Surgeon General&#8217;s report&#8212;and taken a big bite out of the later sales and reputation of his 1954 bestseller.</p>
<p><strong>How sincere was Huff?  Did he tank his book for strategic reasons?</strong></p>
<p>I wanted to call this post, How to Lie With &#8220;How to Lie With Statistics,&#8221; but to be fair I have no reason to believe that Huff was lying or intentionally deceiving in his testimony.  He may well have simply been misleading himself in analogizing research on the effects of smoking to silly things like studies of storks and babies.  And if he was sincere in his views, I can hardly fault him for <a href="http://legacy.library.ucsf.edu/tid/izr66b00/pdf">collecting</a> some money for his efforts.</p>
<p>On the other hand, <a href="http://legacy.library.ucsf.edu/tid/vvr87e00/pdf">this document</a> makes me think that Huff may have seen his role as producing talking points in support of a predetermined conclusion.  I guess we&#8217;ll never know if he really wanted to publish How to Lie with Smoking Statistics.  Maybe he intentionally sabotaged it because he sensed it would ruin his reputation, whereas it was possible for him to keep the consulting and testimony under the radar.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/how-to-mislead-with-how-to-lie-with-statistics/feed/</wfw:commentRss>
		<slash:comments>15</slash:comments>
		</item>
		<item>
		<title>Bad news about (some) statisticians</title>
		<link>http://andrewgelman.com/2012/04/bad-news-about-some-statisticians/</link>
		<comments>http://andrewgelman.com/2012/04/bad-news-about-some-statisticians/#comments</comments>
		<pubDate>Fri, 27 Apr 2012 00:31:00 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Statistics]]></category>
		<category><![CDATA[Sociology]]></category>
		<category><![CDATA[Teaching]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15178</guid>
		<description><![CDATA[Sociologist Fabio Rojas reports on &#8220;a conversation I [Rojas] have had a few times with statisticians&#8221;: Rojas: “What does your research tell us about a sample of, say, a few hundred cases?” Statistician: “That’s not important. My result works as n–> 00.” Rojas: “Sure, that’s a fine mathematical result, but I have to estimate the [...]]]></description>
			<content:encoded><![CDATA[<p>Sociologist Fabio Rojas <a href="http://orgtheory.wordpress.com/2012/04/27/stupid-statistician-tricks/">reports</a> on &#8220;a conversation I [Rojas] have had a few times with statisticians&#8221;:</p>
<blockquote><p>Rojas:  “What does your research tell us about a sample of, say, a few hundred cases?”</p>
<p>Statistician: “That’s not important. My result works as n–> 00.”</p>
<p>Rojas: “Sure, that’s a fine mathematical result, but I have to estimate the model with, like, totally finite data. I need inference, not limits. Maybe the estimate doesn’t work out so well for small n.”</p>
<p>Statistician: “Sure, but if you have a few million cases, it’ll work in the limit.”</p>
<p>Rojas: “Whoa. Have you ever collected, like, real world network data? A million cases is hard to get.”</p></blockquote>
<p>The conversation continues in this frustrating vein.  Rojas writes:</p>
<blockquote><p>This illustrates a fundamental issue in statistics (and other sciences). One you formalize a model and work mathematically, you are tempted to focus on what is mathematically interesting instead of the underlying problem motivating the science. . . .</p>
<p>We have the same issue in statistics. “Statistics” can mean “the mathematics of distributions and other functions arising in statistical models.” Or it can mean the traditional problems of statistics like inference, measurement, model estimation, sampling, data collection/management, forecasting, and description. The problem for a guy like me (a social scientist with real data) is that the label “statistician” often denotes someone who is actually a mathematician who happens to be interested in distributions. . . . What I really want is a nuts and bolts person to help me solve problems.</p></blockquote>
<p>My first reaction&#8212;actually, my main reaction&#8212;is that Rojas hangs out with the wrong sort of statistician.  Following the links, I see that Rojas works at Indiana University, which features a large statistics department.  I suspect he had the misfortune to encounter &#8220;a mathematician who happens to be interested in distributions&#8221; and he didn&#8217;t realize he could shop around among the many statisticians in that department who work on applied social research.</p>
<p>On the other hand, it&#8217;s a bad sign that Rojas reports having this conversation multiple times.  I thought that statisticians nowadays know they&#8217;re supposed to be helpful on real problems.  That &#8220;n -> infinity&#8221; thing seems so old-fashioned!  I&#8217;d like to believe that Rojas was just having some bad luck, but maybe there&#8217;s more of this bad stuff going on than I realized.  Or maybe it was just a communication problem?</p>
<p>It&#8217;s hard for me to imagine a statistician in 2012 telling a sociologist, &#8220;if you have a few million cases, it’ll work in the limit,” except as a joke, as an ironic comment on the limitations of some of our theory.  But perhaps that just reflects the poverty of my imagination.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/bad-news-about-some-statisticians/feed/</wfw:commentRss>
		<slash:comments>22</slash:comments>
		</item>
		<item>
		<title>Let’s play “Guess the smoother”!</title>
		<link>http://andrewgelman.com/2012/04/lets-play-guess-the-smoother/</link>
		<comments>http://andrewgelman.com/2012/04/lets-play-guess-the-smoother/#comments</comments>
		<pubDate>Thu, 26 Apr 2012 14:19:36 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Statistics]]></category>
		<category><![CDATA[Statistical graphics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15173</guid>
		<description><![CDATA[Andre de Boer writes: In my profession as a risk manager I encountered this graph: I can&#8217;t figure out what kind of regression this is, would you be so kind to enlighten me? The points represent (maturity,yield) of bonds. My reply: That&#8217;s a fun problem, reverse-engineering a curve fit! My first guess is lowess, although [...]]]></description>
			<content:encoded><![CDATA[<p>Andre de Boer writes:</p>
<blockquote><p>In my profession as a risk manager I encountered this graph:</p>
<p><a href="http://andrewgelman.com/wp-content/uploads/2012/04/collAAA.gif"><img src="http://andrewgelman.com/wp-content/uploads/2012/04/collAAA.gif" alt="" title="collAAA" width="553" height="369" class="alignnone size-full wp-image-15174" /></a></p>
<p>I can&#8217;t figure out what kind of regression this is, would you be so kind to enlighten me?<br />
The points represent (maturity,yield) of bonds.</p></blockquote>
<p>My reply:  That&#8217;s a fun problem, reverse-engineering a curve fit!  My first guess is lowess, although it seems too flat and asympoty on the right side of the graph to be lowess.  Maybe a Gaussian process?  Looks too smooth to be a spline.  I guess I&#8217;ll go with my original guess, on the theory that lowess is the most accessible smoother out there, and if someone fit something much more complicated they&#8217;d make more of a big deal about it.  On the other hand, if the curve is an automatic output of some software (Excel? Stata?) then it could be just about anything.</p>
<p>Does anyone have any ideas?</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/lets-play-guess-the-smoother/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>Modeling probability data</title>
		<link>http://andrewgelman.com/2012/04/modeling-probability-data/</link>
		<comments>http://andrewgelman.com/2012/04/modeling-probability-data/#comments</comments>
		<pubDate>Thu, 26 Apr 2012 13:21:41 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Statistics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=14790</guid>
		<description><![CDATA[Rafael Huber writes: I conducted an experiment in which subjects where asked to estimate the probability of a certain event given a number of information (like a wheater forecaster or a stockmarket trader). These probability estimates are the dependent variable of my experiment. My goal is to model the data with a (hierarchical) Bayesian regression. [...]]]></description>
			<content:encoded><![CDATA[<p>Rafael Huber writes:<br />
<span id="more-14790"></span></p>
<blockquote><p>I conducted an experiment in which subjects where asked to estimate the probability of a certain event given a number of information (like a wheater forecaster or a stockmarket trader). These probability estimates are the dependent variable of my experiment. My goal is to model the data with a (hierarchical) Bayesian regression. A linear equation with all the presented information (quantified as log odds) defines the mu of a normal likelihood. The tau as precision is another free parameter.</p>
<p>y[r] ~ dnorm( mu[r] , tau[ subj[r] ] )<br />
mu[r] <- b0[ subj[r] ] + b1[ subj[r] ] * x1[r] + b2[ subj[r] ] * x2[r] + b3[ subj[r] ] * x3[r]</p>
<p>My problem is that I do not believe that the normal is the correct probability distribution to model probability data (&#8230; because the error is limited). However, until now nobody was able to tell me how I can correctly model probability data.</p></blockquote>
<p>My reply:  You can take the logit of the data before analyzing them.  That is assuming there are no stated probabilities of 0 and 1.  You could round these to 0.01 and 0.99.  In any case you should graph the data and fitted model to see if there are problems.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/modeling-probability-data/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Dyson’s baffling love of crackpots</title>
		<link>http://andrewgelman.com/2012/04/dysons-baffling-love-of-crackpots/</link>
		<comments>http://andrewgelman.com/2012/04/dysons-baffling-love-of-crackpots/#comments</comments>
		<pubDate>Wed, 25 Apr 2012 14:39:51 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Literature]]></category>
		<category><![CDATA[Miscellaneous Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=14906</guid>
		<description><![CDATA[Peter Woit reports on the sympathy that well-known physicist Freeman Dyson has with crackpot theorists. The interesting part is that Dyson has positive feelings for these cranks, even while believing that their theories are completely wrong: In my [Dyson's] career as a scientist, I twice had the good fortune to be a personal friend of [...]]]></description>
			<content:encoded><![CDATA[<p>Peter Woit <a href="http://www.math.columbia.edu/~woit/wordpress/?p=4488">reports</a> on the sympathy that well-known physicist Freeman Dyson has with crackpot theorists.  The interesting part is that Dyson has positive feelings for these cranks, <em>even while believing that their theories are completely wrong</em>:</p>
<blockquote><p>In my [Dyson's] career as a scientist, I twice had the good fortune to be a personal friend of a famous dissident. One dissident, Sir Arthur Eddington, was an insider like Thomson and Tait. The other, Immanuel Velikovsky, was an outsider like Carter. Both of them were tragic figures, intellectually brilliant and morally courageous, with the same fatal flaw as Carter. Both of them were possessed by fantasies that people with ordinary common sense could recognize as nonsense. I made it clear to both that I did not believe their fantasies, but I admired them as human beings and as imaginative artists. I admired them most of all for their stubborn refusal to remain silent. With the whole world against them, they remained true to their beliefs. I could not pretend to agree with them, but I could give them my moral support.</p></blockquote>
<p>Dyson first writs about Eddington.  I agree with Peter Woit that &#8220;this sympathy for a great physicist who headed down a wrong path in his later years is easy to understand, but the case of Velikovsky is less so. Velikovsky was a well-known author of crackpot best-sellers starting in the 1950s . . . and a neighbor of Dyson’s in Princeton.&#8221;</p>
<p>Woit quotes what Dyson &#8220;wrote as a proposed blurb for Velikovsky in 1977&#8243;:</p>
<blockquote><p>First, as a scientist, I [Dyson] disagree profoundly with many of the statements in your books. Second, as your friend, I disagree even more profoundly with those scientists who have tried to silence your voice. To me, you are no reincarnation of Copernicus or Galileo. You are a prophet in the tradition of William Blake, a man reviled and ridiculed by his contemporaries but now recognized as one of the greatest of English poets. A hundred and seventy years ago, Blake wrote: “The Enquiry in England is not whether a Man has Talents and Genius, but whether he is Passive and Polite and a Virtuous Ass and obedient to Noblemen’s Opinions in Art and Science. If he is, he is a Good Man. If not, he must be starved.” So you stand in good company. Blake, a buffoon to his enemies and an embarrassment to his friends, saw Earth and Heaven more clearly than any of them. Your poetic visions are as large as his and as deeply rooted in human experience. I am proud to be numbered among your friends.</p></blockquote>
<p>Now back in 2012, Dyson writes:</p>
<blockquote><p>Science is a creative interaction of observation with imagination. “Physics at the Fringe” is what happens when imagination loses touch with observation. Imagination by itself can still enlarge our vision when observation fails. The mythologies of Carter and Velikovsky fail to be science, but they are works of art and high imagining. As William Blake told us long ago, “You never know what is enough unless you know what is more than enough.”</p></blockquote>
<p>I&#8217;m with Woit (I think) here.  I don&#8217;t see the appeal of bad science.  I don&#8217;t think such voices should be silenced (as Dyson puts it), but it&#8217;s probably a good thing to keep these theories off the nonfiction shelves.  I see the appeal of poetry and literature and philosophy and all sorts of things that aren&#8217;t science, and I recognize that poets, philosophers, etc., can motivate themselves by all sorts of wacked-out theories.  Look at Philip K. Dick.  His visions are part of who he was, and I wouldn&#8217;t trade Valis for anything, but without the art the visions aren&#8217;t so exciting.  My impression is that the problem with crackpot scientific theories is not that they are not beautiful but that they lead to no scientific progress or understanding.  To put it another way:  as a poet, Velikovsky does not have much to offer. It is only if his theories point toward scientific understanding that they have value.  In contrast, Blake was an artists whose visions are appealing without any necessity for them to correspond to scientific reality.</p>
<p>To me, a good analogy would be with the fascinating &#8220;<a href="http://www.avam.org/">outsider art</a>&#8221; done by schizophrenics, where an entire canvas is covered with tiny scribbles relating to the nature of the universe.  These artworks can be just amazing and I don&#8217;t think anyone should try to silence their voices.  But I don&#8217;t think it does anybody any favors to call this science.</p>
<p>I think my view is shared by most scientists.  Dyson gets some attention here partly from his eminence and partly because of his contrary views.  That&#8217;s fine&#8212;he&#8217;s done enough good work that he&#8217;s earned the right to have his ideas broadcast&#8212;but it all seems a bit odd to me.  Whatever people think of William Blake&#8217;s scientific ideas now, he is admired as a poet and artist.  Velikovsky is more of a historical footnote in the annals of past bestsellers, a pop-culture artifact who belongs with the Chariots of the Gods guy, the Jupiter Effect guy, the Bible Code guy, the people who made the Search for Noah&#8217;s Ark movie, etc etc.  I don&#8217;t think anyone will be reading his books for the pleasure of his prose.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/dysons-baffling-love-of-crackpots/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		</item>
		<item>
		<title>ESPN is looking to hire a research analyst</title>
		<link>http://andrewgelman.com/2012/04/espn-is-looking-to-hire-a-research-analyst/</link>
		<comments>http://andrewgelman.com/2012/04/espn-is-looking-to-hire-a-research-analyst/#comments</comments>
		<pubDate>Tue, 24 Apr 2012 21:45:16 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Sports]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15160</guid>
		<description><![CDATA[This is somebody&#8217;s dream job, I&#8217;m sure . . . ESPN is looking for a statistician to join the HR department as a Research Analyst. The job will consist of analytical research and producing statistics about the people that work at ESPN. Topics of interest will include productivity, efficiency, and retention of employees, among other [...]]]></description>
			<content:encoded><![CDATA[<p>This is somebody&#8217;s dream job, I&#8217;m sure . . .</p>
<blockquote><p>ESPN is looking for a statistician to join the HR department as a <a href="http://jobs.espncareers.com/connecticut/research/jobid2157116-hr-research-analyst-jobs">Research Analyst</a>. The job will consist of analytical research and producing statistics about the people that work at ESPN. Topics of interest will include productivity, efficiency, and retention of employees, among other items. In addition to data mining and producing reports, we also field surveys and analyze results.</p>
<p>The position is located at the headquarters in Bristol, Connecticut, the same campus where nearly all ESPN shows are produced. ESPN is a Disney company, so discounts and free admission to Disney parks are available for employees. Flexible work arrangements are available, along with working in the New York City office part-time if desired.</p>
<p>The role is a relatively new function and will have a high impact very quickly on helping the business function. Statistical software, text books, and any other resource needed to get the job done will be provided.</p>
<p>The link for the application is below. Any interested candidates with questions can contact Michael.J.Springer@espn.com or the recruiter Amy.McManus@espn.com.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/espn-is-looking-to-hire-a-research-analyst/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Non-Bayesian analysis of Bayesian agents?</title>
		<link>http://andrewgelman.com/2012/04/non-bayesian-analysis-of-bayesian-agents/</link>
		<comments>http://andrewgelman.com/2012/04/non-bayesian-analysis-of-bayesian-agents/#comments</comments>
		<pubDate>Tue, 24 Apr 2012 15:21:44 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Bayesian Statistics]]></category>
		<category><![CDATA[Decision Theory]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=14362</guid>
		<description><![CDATA[Econometrician and statistician Dale Poirier writes: 24 years ago (1988, Journal of Economics Perspectives) I [Poirier] noted cognitive dissonance among some economists who treat the agents in their theoretical framework as Bayesians, but then analyze the data (even in the same paper!) as a frequentist. Recently, I have found similar cases in cognitive science. I [...]]]></description>
			<content:encoded><![CDATA[<p>Econometrician and statistician Dale Poirier writes:</p>
<blockquote><p>24 years ago (1988, Journal of Economics Perspectives) I [Poirier] noted cognitive dissonance among some economists who treat the agents in their theoretical framework as Bayesians, but then analyze the data (even in the same paper!) as a frequentist. Recently, I have found similar cases in cognitive science. I suspect other disciplines exhibit such behavior. Do you know of any examples in political science?</p></blockquote>
<p>My reply:</p>
<p>I don&#8217;t know of any such examples in political science.  Game theoretic models are popular in poli sci, but I haven&#8217;t seen much in the way of models of Bayesian decision making.</p>
<p>Here are two references (not in political science) that might be helpful.</p>
<p>1.  I have <a href="http://andrewgelman.com/2011/09/economists-dont-think-like-accountants-but-maybe-they-should/">argued</a> that the utility model (popular in economics and political science as a way of providing &#8220;microfoundations&#8221; for analyses of aggregate behavior) is actually more of a bit of folk-psychology that should not be taken seriously.  To me, it is silly that many economists and political scientists give this model such prominence.  Utility theory can be a helpful normative model in many situations, but I don&#8217;t think it should be anything close to foundational as </p>
<p>2.  Are you familiar with the work of Josh Tenenbaum?  He is a cognitive scientist at MIT who has been working on Bayesian models for human reasoning and also Bayesian methods for fitting such models given data from psychological experiments.  See <a href="http://andrewgelman.com/2008/03/the_great_chain/">here</a> and <a href="http://andrewgelman.com/2008/02/real_statistics/">here</a>.</p>
<p>Getting back to Poirier&#8217;s original point, it could make complete sense to me to use non-Bayesian inference to learn about Bayesian agents, as long as you believe that (a) people&#8217;s behavior can be reasonably approximated by Bayesian decision rules, and (b) from a normative standpoint, non-Bayesian inference is to be preferred.  It seems that many economists believe both a and b, so I don&#8217;t necessarily see any cognitive dissonance in using non-Bayesian statistical inference while modeling behavior as Bayesian.</p>
<p>The funny thing is, I believe not-A and not-B, so my preference would be to use Bayesian inference for non-Bayesian models of behavior.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/non-bayesian-analysis-of-bayesian-agents/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Infographic of the year</title>
		<link>http://andrewgelman.com/2012/04/infographic-of-the-year/</link>
		<comments>http://andrewgelman.com/2012/04/infographic-of-the-year/#comments</comments>
		<pubDate>Mon, 23 Apr 2012 21:08:23 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Science]]></category>
		<category><![CDATA[Statistical graphics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15150</guid>
		<description><![CDATA[This (by Frans Hofmeester) is excellent. What really makes it work, I think, is that it goes slowly enough. 2 minutes and 45 seconds is enough time for me, as a viewer, to feel like I&#8217;m living through each stage of development. If the video were sped up to go from 0 to 12 in [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://gawker.com/5904260/filmmaker-films-daughter-every-week-for-twelve-years-puts-together-time+lapse-video">This</a> (by Frans Hofmeester) is excellent.</p>
<p><iframe src="http://player.vimeo.com/video/40448182" width="400" height="300" frameborder="0" webkitAllowFullScreen mozallowfullscreen allowFullScreen></iframe></p>
<p>What really makes it work, I think, is that it goes slowly enough.  2 minutes and 45 seconds is enough time for me, as a viewer, to feel like I&#8217;m living through each stage of development.  If the video were sped up to go from 0 to 12 in only 30 seconds, that would be cool in its own way but would give up the sense of local stability that is characteristic of development.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/infographic-of-the-year/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>“Any old map will do” meets “God is in every leaf of every tree”</title>
		<link>http://andrewgelman.com/2012/04/any-old-map-will-do-meets-god-is-in-every-leaf-of-every-tree/</link>
		<comments>http://andrewgelman.com/2012/04/any-old-map-will-do-meets-god-is-in-every-leaf-of-every-tree/#comments</comments>
		<pubDate>Mon, 23 Apr 2012 13:23:37 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Sociology]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15099</guid>
		<description><![CDATA[As a statistician I am particularly worried about the rhetorical power of anecdotes (even though I use them in my own reasoning; see discussion below). But much can be learned from a true anecdote. The rough edges&#8212;the places where the anecdote doesn&#8217;t fit your thesis&#8212;these are where you learn. We have recently had a discussion [...]]]></description>
			<content:encoded><![CDATA[<p>As a statistician I am particularly worried about the rhetorical power of anecdotes (even though I use them in my own reasoning; see discussion below).  But much can be learned from a <em>true</em> anecdote.  The rough edges&#8212;the places where the anecdote doesn&#8217;t fit your thesis&#8212;these are where you learn.</p>
<p>We have recently had a discussion (<a href="http://andrewgelman.com/2012/04/another-day-another-plagiarist/">here</a> and <a href="http://andrewgelman.com/2012/04/believe-your-models-up-to-the-point-that-you-abandon-them/">here</a>) of Karl Weick, a prominent scholar of business management who plagiarized a story and then went on to draw different lessons from the pilfered anecdote in several different publications published over many years.</p>
<p>Setting aside an issues of plagiarism and rulebreaking, I argue that, by hiding the source of the story and changing its form, Weick and his management-science audience are losing their ability to get anything out of it beyond empty confirmation.</p>
<p>A full discussion follows.</p>
<p><strong>1.  The lost Hungarian soldiers</strong></p>
<p>Thomas Basbøll (who has the unusual (to me) job of &#8220;writing consultant&#8221; at the Copenhagen Business School) has been writing in different places about the a story that has been making the rounds over the past few decades among organizational sociologists and management consultants.  The story started with a discovery that of plagiarism by the eminent scholar Karl Weick but then moved toward a more general exploration of storytelling and belief.  (I learned about this example via an email from Basbøll (who had become aware of my interest in plagiarism); it turns out we also have a common interest in the bases of scientific and scholarly ideas.)</p>
<p>From Basbøll&#8217;s latest and most historical telling (linked from <a href="http://www.cbs.dk/Forskning/Institutter-centre/Projekter/What-Makes-Organization/Menu/Working-papers">here</a>, via Basbøll&#8217;s <a href="http://secondlanguage.blogspot.com/2012/04/belief-and-orthodoxy-part-2.html">blog</a>), supplemented by Wikipedia, I summarize what happened in time order (which is somewhat ahistorical in that it does not represent the order in which Basbøll, and perhaps Weick, learned about these events):<br />
<span id="more-15099"></span><br />
1916:  Albert Szent-Györgyi, a medical student in Budapest, serves in World War 1.</p>
<p>1930:  Working in Szeged, Hungary, Szent-Györgyi  and his colleagues discover vitamin C.  In the next several decades, he continues to make research contributions and becomes a prominent scientist, eventually moving to the U.S. after World War 2.  He dies in 1986.</p>
<p>1972:  Medical researcher Oscar Hechter reports the following in the proceedings of a &#8220;an international conference on cell membrane structure,&#8221; published in 1972:</p>
<blockquote><p>Let me close by sharing with you a story told me by Albert Szent-Györgyi. A small group of Hungarian troops were camped in the Alps during the First World War. Their commander, a young lieutenant, decided to send out a small group of men on a scouting mission. Shortly after the scouting group left it began to snow, and it snowed steadily for two days. The scouting squad did not return, and the young officer, something of an intellectual and an idealist, suffered a paroxysm of guilt over having sent his men to their death. In his torment he questioned not only his decision to send out the scouting mission, but also the war itself and his own role in it. He was a man tormented.</p>
<p>Suddenly, unexpectedly, on the third day the long-overdue scouting squad returned. There was great joy, great relief in the camp, and the young commander questioned his men eagerly. &#8220;Where were you?&#8221; he asked. &#8220;How did you survive, how did you find your way back?&#8221; The sergeant who had led the scouts replied, &#8220;We were lost in the snow and we had given up hope, had resigned ourselves to die. Then one of the men found a map in his pocket. With its help we knew we could find our way back. We made camp, waited for the snow to stop, and then as soon as we could travel we returned here.&#8221; The young commander asked to see this wonderful map. It was a map not of the Alps but of the Pyrenees!</p></blockquote>
<p>The moral of the story, as given by Hechter and by Bernard Pullman at another symposium a year later, is that the map gave the soldiers the confidence to make good decisions.  Basbøll writes:</p>
<blockquote><p>The map of the Pyrenees is like a controversial paper or a tentative model study in science. Whether or not it turns out to be an accurate representation of the &#8220;territory&#8221; is less important than the stimulus it may provide to further research, argue both Hechter and Pullman.</p></blockquote>
<p>1977:  Immunologist Miroslav Holub publishes a poem (of the prosy, non-rhyming sort) telling the lost-soldiers story (again, crediting Szent-Györgyi) in the Times Literary Supplement, translated from the Czech.  Holub may have actually attended the meeting reported on by Hechter. </p>
<p>1982:  Robert Swieringa and Karl Weick publish an article including a nearly word-for-word transcription of Holub&#8217;s poem, but not using quotation marks or acknowledging Holub at all, presenting the story as &#8220;an incident that happened,&#8221; and placing the event (implausibly) in Switzerland.</p>
<p>&#8220;Sometime in the mid-1980s&#8221;:  Weick tells the story to Bob Engel, a &#8220;top Wall Street executive&#8221; who &#8220;was about to take leadership of a new strategic planning group at Morgan Guaranty.&#8221;  The moral of the story:  &#8220;When you are lost, any old map will do.&#8221; </p>
<p>1995:  Weick writes:</p>
<blockquote><p>What is interesting about Engel&#8217;s twist to the story is that he has described a situation that most leaders face. Followers are often lost and even the leader is not sure where to go. All the leaders know is that the plan or the map they have in front of them are not sufficient to get them out. What the leader has to do, when faced with this situation, is instill some confidence in people, get them moving some general direction, and be sure they look closely at cues created by their actions so that they learn where they were and get some better idea of where they are and where they want to be.</p></blockquote>
<p>2005:  Barbara Czarniawska reports on a 1998 talk:</p>
<blockquote><p>&#8220;If any old map will do to help you find your way out of the Alps,&#8221; Weick had said, &#8220;then surely any old story will do to help you find your way out of puzzles in the human condition.&#8221;</p></blockquote>
<p>As Basbøll notes, the moral of the story keeps changing!  In the earliest known tellings (by Hechter and Pullman), the only clear role of the map was to calm the soldiers os they could find their way back to camp on their own.  By 1998 (or 2005), the map has an actual instrumental use.  What is creepy (to me) is that the story has changed from a story about people using an external device (the map) as a way to calm themselves and make a reasoned course of action, to a parable about savvy managers (&#8220;leaders&#8221;) who can manipulate their underlings, to the conclusion that &#8220;any old story will do,&#8221; a claim that makes the original events (or non-events) irrelevant and would seem to me to encourage a form of scholarship that does nothing but confirm people&#8217;s preconceptions.</p>
<p>This is where we come to the point about anecdotes that I discussed in the three paragraphs at the start of the post.</p>
<p>2006:  Basbøll and Henrik Graham uncover the Weick plagiarism.  Various scholars engage in discussion of the case over the next several years.</p>
<p><strong>Full circle</strong></p>
<p>Going beyond questions of plagiarism and scholarly ethics, the Szent-Györgyi/Holub/Weick story is relevant for our discussion because it sheds (anecdotal) light on the relevance of anecdotal reasoning and the nature of evidence.  Weick&#8217;s misrepresentation of Holub&#8217;s texts suggests how such flexible uses of stories can lend themselves to flexible conclusions.  Reliance on anecdotes is already iffy because of selection problems, but when the stories can be altered, the concepts of confirmation and falsification are turned on their head.  (Weick and his ilk may very well argue that his stories are not evidence but merely entry points for involving the audience to think about his deeper principles, but then this just pushes the question back one step:  What, then, is the evidence for those principles, and what is the role of anecdotes in the translation of principles to the outside world?)</p>
<p><strong>2.  Which leaf on which tree are you talking about?</strong></p>
<p>I was giving a talk the other day to a group of statistics graduate students, on the subject of connections between teaching and research, and I mentioned one of my favorite <a href="http://andrewgelman.com/2009/05/handy_statistic/">sayings</a>, &#8220;God is in every leaf of every tree.&#8221;  What this means is that if you study any problem carefully and seriously enough, you will come to interesting statistical research problems.</p>
<p>If you&#8217;ve been reading this far, the above paragraph should sound familiar.  The meta-statistical principle that &#8220;God is in every leaf of every tree&#8221; is very close to Weick&#8217;s &#8220;any old story will do to help you find your way out of puzzles in the human condition.&#8221;</p>
<p>Now, though, let me explore the differences as well as the similarities between the two quotes.  In the leaf/tree scenario, it is important&#8212;crucial&#8212;that the statistician or scientist look carefully at the particular leaf in question.  By carefully trying to resolve the contradictions involved in a single sampling problem, or a single causal inference, or a single dataset, or (in Holub&#8217;s field of immunology), a single medical patient, we might take ourselves along the path to a general solution&#8212;or at least a recognition and better understanding of a problem with the current scientific formulation.  But you have to study the actual leaf!  Studying an abstract leaf, or a secondhand story about a leaf, won&#8217;t do the trick.  Constructing the leaf from a research hypothesis, or altering the leaf to fit the hypotheses, won&#8217;t give you as much (although you can still learn something, for example if you take care to note how many changes you needed to make to keep your audience happy).</p>
<p>Similarly, the history of the various misrepresentations and changes to Szent-Györgyi&#8217;s story, as related and interpreted by Basbøll, give some sense of the various uses that the story has been put.  Each telling comes with its own &#8220;moral&#8221; or message.</p>
<p><strong>3.  The methodological attribution problem</strong></p>
<p>One of my meta-principle of statistics, which in my published <a href="http://www.stat.columbia.edu/~gelman/research/published/gelman_discussion_of_efron.pdf">discussion</a> of an article of Brad Efron I call the &#8220;methodological attribution problem&#8221;:</p>
<blockquote><p>The many useful contributions of a good statistical consultant, or collaborator, will often be attributed to the statistician&#8217;s methods or philosophy rather than to the artful efforts of the statistician himself or herself. Don Rubin has told me that scientists are fundamentally Bayesian (even if they do not realize it), in that they interpret uncertainty intervals Bayesianly. Brad Efron has talked vividly about how his scientific collaborators find permutation tests and p-values to be the most convincing form of evidence. Judea Pearl assures me that graphical models describe how people really think about causality. And so on. I am sure that all these accomplished researchers, and many more, are describing their experiences accurately. Rubin wielding a posterior distribution is a powerful thing, as is Efron with a permutation test or Pearl with a graphical model, and I believe that (a) all three can be helping people solve real scientific problems, and (b) it is natural for their collaborators to attribute some of these researchers&#8217; creativity to their methods.<br />
The result is that each of us tends to come away from a collaboration or consulting experience with the warm feeling that our methods really work, and that they represent how scientists really think. In stating this, I am not trying to espouse some sort of empty pluralism&#8212;the claim that, for example, we would be doing just as well if we were all using fuzzy sets, or correspondence analysis, or some other obscure statistical method. There is certainly a reason that methodological advances are made, and this reason is typically that existing methods have their failings. Nonetheless, I think we all have to be careful about attributing too much from our collaborators&#8217; and clients&#8217; satisfaction with our methods.</p></blockquote>
<p>As suggested in my discussion, I came to this meta-principle through my indirect experiences of hearing various researchers talk about the efficacy of their statistical methods.  Theoreticians and methodologists often have extreme confidence in their approaches, and applied researchers often have what seems to me to be oddly strong opinions about statistical methods.</p>
<p><strong>Playing tennis without a map</strong></p>
<p>As the saying goes, research is when you don&#8217;t know what you&#8217;re doing.  We don&#8217;t have maps; much of research can&#8217;t be automated.  In the spirit of Weick&#8217;s writings, let me say that statistical models and scientific theories can play useful roles even when far off from the truth.</p>
<p>I <a href="http://andrewgelman.com/2012/04/believe-your-models-up-to-the-point-that-you-abandon-them/">discussed</a> this a few days ago in the context of the two-edged nature of belief. On the one hand, belief is powerful. By conditioning on assumptions, we can rule out alternatives and move quickly and surely. But belief is risky, especially since all of our beliefs, if stated precisely enough, our false. The resolution is that we can use the strength and power of beliefs to better study their limitations.</p>
<p>From a statistical (and philosophy-of-science) perspective, strong assuptions play two roles: First, with strong assumps we can (often) make strong and precise inferences. The likelihood function is a powerful thing. Second, strong assumptions are strongly checkable and falsifiable. We take our models seriously, work with them as if we believe them unquestioningly, then use the leverage from this simulation of belief to check model fit and explore discrepancies between inferences and data.</p>
<p><strong>4.  Building up a store of anecdotes</strong></p>
<p>One difference between academic statistics and the academic study of management (as embodied by Karl Weick) is that applied statisticians such as myself live in the world of &#8220;tabletop experiments&#8221; (as they say in physics) whereas Weick (and, I assume, other scholars of his field) do &#8220;big science.&#8221;  What I mean is that I work on lots of little problems and a few big ones, whereas Weick has a couple of main areas of focus.  I have built up a huge store of <em>personal</em> anecdotes about statistics, whereas Weick must largely rely on the anecdotes of others.  I&#8217;m David Sedaris, he&#8217;s Jay Leno (not the best analogy here; what would be a better example of a storyteller who uses recycled stories but tells them well?).  Weick represents storytelling as an important part of his style, which puts him at a particular dependence on interpreting events that did not happen to him.</p>
<p>I&#8217;m not saying that I&#8217;m better than Weick in this dimension; we&#8217;re just different.  A historian of the Middle Ages, for example, would have no directly relevant personal experience at all.  That&#8217;s just the way it is!</p>
<p>Getting back to the comparison:  I really do use anecdotes as evidence, as well as to illustrate existing principles.  My hundreds of statistical  experiences have been important in the development of my ideas.  This shows up even in my published papers and in how I evaluate the methodological contributions of others:  I typically trust an idea to the extent that it helps solve what seems to me to be a real applied problem.</p>
<p>Weick is in a different situation:  he uses stories to grab his audience and perhaps to inspire him to come up with new theories or modification of existing theories.  For Weick, anecdotes play the role of the map in his some of his interpretations of Szent-Györgyi&#8217;s story:  It&#8217;s ok if you get the details and sourcing wrong, all that matters that it motivates you to move forward.  As noted, that approach would <em>not</em> work in statistical research (or, I suspect, in medical research) because it would deprive us of the opportunity to learn from anecdotes&#8217; specific features.  Maybe it&#8217;s ok in Weick&#8217;s area of management research, as long as the stories and their alterations are accurately sourced.</p>
<p><strong>5.  Putting it all together</strong></p>
<p>I have discussed several themes relating to the use of anecdotes in scientific learning.  It&#8217;s easy to laugh at anecdotes, but I recognize that they are crucial in my meta-statistical understanding.  That is, much of my judgment about what methods to use, comes from my own personal experiences.  I suspect that many people without my breadth of statistical experience are still relying on it indirectly through my books and articles.</p>
<p>In other fields, though, anecdotes play a different role, and their truth or falsity, even their sources, do not seem to be so important.  But I am make this judgment based on anecdotal evidence.  All of this is also related, I believe, to out recent discussions of the problems of scientific publications, statistical significance, and so forth.</p>
<p>In particular, all of this is related to model checking.  Anecdotes with accurate sourcing can reveal problems in a model, whereas when you start altering an anecdote and hiding its source, you lose a key opportunity for learning.  This relates to the much-discussed selection problems in quantitative research.</p>
<p><strong>6.  The mysterious move to Switzerland</strong></p>
<p>One of the few things that Weick adds along with his plagiarism of the Holub poem is to place the action, identified only as Hungarian soldiers in the Alps, in Switzerland.  What were Hungarian troops doing in Switzerland, one might ask?  My guess, following Nick Cox&#8217;s <a href="http://andrewgelman.com/2012/04/another-day-another-plagiarist/#comment-79155">comment</a> on our earlier discussion, is simple ignorance:  Weick is American, and when we hear about the Alps, we automatically think &#8220;the Swiss Alps.&#8221;  When we think about World War 1, we think about western Europe.  Perhaps Weick in 1982 did not realize that not all the Alps are in Switzerland, nor was he primed to reflect upon the location of the theater of the war involving Austria-Hungary and Italy.  This small error is irrelevant to the use of the story as a management parable&#8212;Weick could just as well have used a clearly fictional example such as that of Winnie the Pooh or the Sneeches or Yoruga la Tortuga (I imagine that the sort of managers who would hire a sociologist in the first place would love the morals of those stories!)&#8212;but it demonstrates the risks of copying a story without attribution.  The deadly combination of word-for-word quotation and gratuitous error (as in the notorious <a href="http://andrewgelman.com/2011/09/another-wegman-plagiarism-copying-without-attribution-and-further-discussion-of-why-scientists-cheat/">case</a> where Ed Wegman copied 2^n and it came out as 2n) reveals an ignorance of the underlying material, leading outsiders such as myself to question the scholar&#8217;s competence as well as his integrity.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/any-old-map-will-do-meets-god-is-in-every-leaf-of-every-tree/feed/</wfw:commentRss>
		<slash:comments>28</slash:comments>
		</item>
		<item>
		<title>Please stop me before I barf again</title>
		<link>http://andrewgelman.com/2012/04/please-stop-me-before-i-barf-again/</link>
		<comments>http://andrewgelman.com/2012/04/please-stop-me-before-i-barf-again/#comments</comments>
		<pubDate>Sun, 22 Apr 2012 20:15:51 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Statistical graphics]]></category>
		<category><![CDATA[Zombies]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=15122</guid>
		<description><![CDATA[Pointing to some horrible graphs, Kaiser writes, &#8220;The Earth Institute needs a graphics adviser.&#8221; I agree. The graphs are corporate standard, neither pretty or innovative enough to qualify as infographics, not informational enough to be good statistical data displays. Some examples include the above exploding pie chart, which, as Kaiser notes, is not merely ugly [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://junkcharts.typepad.com/junk_charts/2012/04/the-earth-institute-needs-a-graphics-advisor.html">Pointing</a> to some horrible graphs, Kaiser writes, &#8220;The Earth Institute needs a graphics adviser.&#8221;</p>
<p><a href="http://andrewgelman.com/wp-content/uploads/2012/04/earthpie.png"><img src="http://andrewgelman.com/wp-content/uploads/2012/04/earthpie.png" alt="" title="earthpie" width="542" height="342" class="alignnone size-full wp-image-15125" /></a></p>
<p>I agree.  The graphs are corporate standard, neither pretty or innovative enough to qualify as infographics, not informational enough to be good statistical data displays.</p>
<p>Some examples include the above exploding pie chart, which, as Kaiser notes, is not merely ugly and ridiculously difficult to read (given that it is conveying only nine data points) but also invites suspicion of its numbers, and pages and pages of graphs that could be better compressed into a compact displays (see pages 25-65 of the report).  Yes, this is all better than tables of numbers, but I don&#8217;t see that much thought went into displaying patterns of information or telling a story.  It&#8217;s more graph-as-data-dump.</p>
<p>To be fair, the report does have some a clean scatterplot (on page 65).  But, overall, the graphs are not well-integrated with the messages in the text.</p>
<p>I feel a little bit bad about this, because I&#8217;m involved with the Earth Institute.  I should be their graphics adviser!  I&#8217;m actually surprised they didn&#8217;t ask me for advice on this.  I gave a talk to the Earth Institute postdocs a year or so ago, so they should know I like graphs!</p>
<p>P.S.  Nathan Yau <a href="http://flowingdata.com/2012/04/25/world-happiness-report-makes-statisticians-unhappy">reproduces</a> the report&#8217;s Figure 11, which is just as bad as the exploding pie chart shown above.</p>
]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2012/04/please-stop-me-before-i-barf-again/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
	</channel>
</rss>

