<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" version="2.0">

<channel>
	<title>Statistical Modeling, Causal Inference, and Social Science</title>
	
	<link>http://andrewgelman.com</link>
	<description />
	<lastBuildDate>Wed, 22 May 2013 13:27:18 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/StatisticalModeling" /><feedburner:info xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" uri="statisticalmodeling" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><feedburner:emailServiceId xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0">StatisticalModeling</feedburner:emailServiceId><feedburner:feedburnerHostname xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0">http://feedburner.google.com</feedburner:feedburnerHostname><item>
		<title>To Throw Away Data: Plagiarism as a Statistical Crime</title>
		<link>http://andrewgelman.com/2013/05/22/to-throw-away-data-plagiarism-as-a-statistical-crime/</link>
		<comments>http://andrewgelman.com/2013/05/22/to-throw-away-data-plagiarism-as-a-statistical-crime/#comments</comments>
		<pubDate>Wed, 22 May 2013 13:27:18 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Statistics]]></category>
		<category><![CDATA[Sociology]]></category>
		<category><![CDATA[Zombies]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18783</guid>
		<description><![CDATA[<p>I&#8217;ve been blogging a lot lately about plagiarism (sorry, Bob!), and one thing that&#8217;s been bugging me is, why does it bother me so much. Part of the story is simple: much of my reputation comes from the words I write, so I bristle at any attempt to devalue words. I feel the same way [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/22/to-throw-away-data-plagiarism-as-a-statistical-crime/">To Throw Away Data: Plagiarism as a Statistical Crime</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2013/04/plaig1-150x150.jpg?8b0eec" alt="plaig1" width="150" height="150" class="alignnone size-thumbnail wp-image-18784" /><img src="/wp-content/uploads/2013/04/plaig2-150x150.jpg?8b0eec" alt="plaig2" width="150" height="150" class="alignnone size-thumbnail wp-image-18785" /></p>
<p>I&#8217;ve been blogging a lot lately about plagiarism (sorry, Bob!), and one thing that&#8217;s been bugging me is, why does it bother me so much.  Part of the story is simple:  much of my reputation comes from the words I write, so I bristle at any attempt to devalue words.  I feel the same way about plagiarism that a rich person would feel about counterfeiting:  Don&#8217;t debase my currency!</p>
<p>But it&#8217;s more than that.  After discussing this a bit with Thomas Basbøll, I realized that I&#8217;m bothered by the way that plagiarism interferes with the transmission of information:</p>
<blockquote><p>Much has been written on the ethics of plagiarism. One aspect that has received less notice is plagiarism’s role in corrupting our ability to learn from data: We propose that plagiarism is a statistical crime. It involves the hiding of important information regarding the source and context of the copied work in its original form. Such information can dramatically alter the statistical inferences made about the work.</p>
<p>In statistics, throwing away data is a no-no. From a classical perspective, inferences are determined by the sampling process: point estimates, confidence intervals and hypothesis tests all require knowledge of (or assumptions about) the probability distribution of the observed data. In a Bayesian analysis, it is necessary to include in the model all variables that are relevant to the data-collection process. In either case, we are generally led to faulty inferences if we are given data from urn A and told they came from urn B.</p>
<p>A statistical perspective on plagiarism might seem relevant only to cases in which raw data are unceremoniously and secretively transferred from one urn to another. But statistical consequences also result from plagiarism of a very different kind of material: stories. To underestimate the importance of contextual information, even when it does not concern numbers, is dangerous.</p></blockquote>
<p><a href="http://www.americanscientist.org/issues/id.16038,y.0,no.,content.true,page.1,css.print/issue.aspx">Here&#8217;s our full article</a> (which has just appeared in the American Scientist).  It features two of the recurring characters from this blog.  Here&#8217;s our conclusion:</p>
<blockquote><p>Scholars in fields ranging from psychology to history to computer science have recognized that stories are part of how people understand the world. As statisticians, we can consider reasoning from stories as a form of approximate inference. From this perspective, statistical principles should provide some approximate guidance about the potential biases and precision of such inferences. One key principle is not to throw away information and, if discarding data is for some reason necessary, to describe as clearly as possible the mechanism by which the relevant information was excluded. Plagiarism violates both these rules and, as such, is a violation of statistical ethics, beyond any other considerations of moral behavior.</p></blockquote>
<p>P.S.  I&#8217;m more interested in scientific plagiarism than the legal or literary variety, but <a href="http://www.thecrimson.com/article/2004/9/27/prof-admits-to-misusing-source-harvard/">this</a> 2004 news article by Daniel Hemel and Lauren Schuker (which I found by googling *laurence tribe plagiarism*) is full of good quotes.  Here&#8217;s my favorite part:<br />
<span id="more-18783"></span></p>
<blockquote><p>Tribe’s mea culpa comes just three weeks after another prominent Harvard faculty member—Climenko Professor of Law Charles J. Ogletree—publicly apologized for copying six paragraphs almost word-for-word from a Yale scholar in a recent book, All Deliberate Speed.</p>
<p>Last fall, Frankfurter Professor of Law Alan M. Dershowitz also battled plagiarism charges. And in 2002, Harvard Overseer Doris Kearns Goodwin admitted that she had accidently copied passages from another scholar in her bestseller The Fitzgeralds and the Kennedys.</p>
<p>University President Lawrence H. Summers told The Crimson in an interview last week—before the allegations against Tribe surfaced—that he did not see “a big trend” of plagiarism problems at the Law School as a result of the charges against Ogletree and Dershowitz, but indicated that a third case would change his mind.</p>
<p>“If you had a third one, then I would have said, okay, you get to say this is a special thing, a focused problem at the Law School,” Summers said of the recent academic dishonesty cases.</p>
<p>He declined comment last night.</p></blockquote>
<p>The post <a href="http://andrewgelman.com/2013/05/22/to-throw-away-data-plagiarism-as-a-statistical-crime/">To Throw Away Data: Plagiarism as a Statistical Crime</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/22/to-throw-away-data-plagiarism-as-a-statistical-crime/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Recently in the sister blog</title>
		<link>http://andrewgelman.com/2013/05/21/recently-in-the-sister-blog-6/</link>
		<comments>http://andrewgelman.com/2013/05/21/recently-in-the-sister-blog-6/#comments</comments>
		<pubDate>Tue, 21 May 2013 13:53:56 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Political Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=19038</guid>
		<description><![CDATA[<p>The end of Michelle Rhee. The relevance of statisticians to researchers in different fields of social science. Regression discontinuity. Free expression vs. not wanting to make anyone personally uncomfortable. Political coalitions are diverse (and there’s no use pretending otherwise). According to David Brooks, staying out of jail is a conservative value. I’ve heard of the [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/21/recently-in-the-sister-blog-6/">Recently in the sister blog</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><a href="http://themonkeycage.org/2013/05/20/the-end-of-michelle-rhee/">The end of Michelle Rhee</a>.</p>
<p><a href="http://themonkeycage.org/2013/05/17/the-relevance-of-statisticians-to-researchers-in-different-fields-of-social-science/">The relevance of statisticians to researchers in different fields of social science</a>.</p>
<p><a href="http://themonkeycage.org/2013/05/15/elections-expert-jim-snyder-sez-rd-is-a-ok/">Regression discontinuity</a>.</p>
<p><a href="http://themonkeycage.org/2013/05/13/free-expression-vs-not-wanting-to-make-anyone-personally-uncomfortable/">Free expression vs. not wanting to make anyone personally uncomfortable</a>.</p>
<p><a href="http://themonkeycage.org/2013/05/10/political-coalitions-are-diverse/">Political coalitions are diverse (and there’s no use pretending otherwise)</a>.</p>
<p><a href="http://themonkeycage.org/2013/05/07/according-to-david-brooks-staying-out-of-jail-is-a-conservative-value/">According to David Brooks, staying out of jail is a conservative value</a>.</p>
<p><a href="http://themonkeycage.org/2013/05/06/ive-heard-of-the-irb-but-this-is-ridiculous/">I’ve heard of the IRB, but this is ridiculous</a>.</p>
<p><a href="http://themonkeycage.org/2013/04/30/this-will-make-richard-florida-very-happy/">This will make Richard Florida very happy</a>.</p>
<p><a href="http://themonkeycage.org/2013/04/26/i-dont-know-whether-to-call-it-communism-or-crony-capitalism/">I don’t know whether to call it communism or crony capitalism . . .</a>.</p>
<p><a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3579644/">Concepts and folk theories</a>.</p>
<p>This should keep youall busy for awhile.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/21/recently-in-the-sister-blog-6/">Recently in the sister blog</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/21/recently-in-the-sister-blog-6/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Evaluating Columbia University’s Frontiers of Science course</title>
		<link>http://andrewgelman.com/2013/05/20/evaluating-columbia-universitys-frontiers-of-science-course/</link>
		<comments>http://andrewgelman.com/2013/05/20/evaluating-columbia-universitys-frontiers-of-science-course/#comments</comments>
		<pubDate>Mon, 20 May 2013 13:18:01 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Teaching]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=19000</guid>
		<description><![CDATA[<p>Frontiers of Science is a course offered as part of Columbia University&#8217;s Core Curriculum. The course is controversial, with some people praising its overview of several areas of science, and others feeling that a more traditional set of introductory science courses would do the job better. Last month, the faculty in charge of the course [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/20/evaluating-columbia-universitys-frontiers-of-science-course/">Evaluating Columbia University&#8217;s Frontiers of Science course</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Frontiers of Science is a course offered as part of Columbia University&#8217;s Core Curriculum.  The course is controversial, with some people praising its overview of several areas of science, and others feeling that a more traditional set of introductory science courses would do the job better.</p>
<p>Last month, the faculty in charge of the course wrote the following public <a href="http://spectrum.columbiaspectator.com/spectrum/survey-frontiers-of-science-actually-pretty-effective">letter</a>:<br />
<span id="more-19000"></span></p>
<blockquote><p>The United States is in the midst of a debate over the value of a traditional college education. Why enroll in a place like Columbia College when you can obtain an undergraduate degree for $10,000 or learn everything from Massive Open Online Courses? In more parochial terms, what is the value added by approaches such as Columbia’s Core Curriculum? Recently students in our Core Course, Frontiers of Science (FoS), provided a partial answer. The FoS faculty designed a survey to gauge the scientific skills and knowledge of the Class of 2016 both before and after taking FoS. In an assembly held during orientation week last August, 966 first-year Columbia College students answered questions covering basic skills such as statistics, probability, and the reading and analysis of graphs, as well as content to be taught during the fall session of FoS. For the 519 College students taking FoS in the fall semester, the same survey was administered again as part of the final exam. The mean score for the initial orientation survey was less than 28%. The mean score for the same questions at the end of the semester was 76%! (The margin of error in both cases was ±1.0%.) To control for gains independent of FoS, 167 first-year College students, who did not take FoS in the fall, answered the same survey questions again at the start of the spring semester. The mean score of those students was 31%, not substantially greater than they and their classmates had scored on the survey during orientation. This type of research always has its limitations. However, the results with and without FoS are so different that the conclusion is inescapable. The scientific habits and knowledge that FoS imparts are new to, and effectively learned by, first-year students after one semester of intense study . . .</p></blockquote>
<p>This sounds pretty good, especially the part about &#8220;basic skills such as statistics, probability, and the reading and analysis of graphs.&#8221;  My first thought is that, if these skills are so important, maybe all the students at Columbia should be taking a course in probability and statistics!  But then I remembered that our intro stat course isn&#8217;t so great (I know, I&#8217;ve taught it several times), so maybe it&#8217;s just as well if some biologists, physicists, etc., create a new statistics module from scratch.  Seriously, I have lots of ideas of how we could teach intro prob/stat better, but when I actually try to do it, I get all tangled in the details.  So I can&#8217;t very well object to outsiders taking a shot at it.  As users of statistics, they might have a better idea than I do of how to teach the subject.</p>
<p>Similarly, I wouldn&#8217;t be surprised if the engineers could teach a better intro physics class than the physicists could.  And it could well make sense to have biologists teach first-year chemistry, and psychologists teach first-year biology.</p>
<p>What, then, would the statistician teach?  First year math, of course.  It would make the mathematicians cringe, but it might be closer to the students&#8217; own level.  Statisticians are users of math, and we could teach the subject from a user&#8217;s perspective.  Just as, arguably, the collection of scientists who run Frontiers of Science might be teaching probability and statistics in a way more useful to freshman than whatever my colleagues and I in the stat department could come up with.</p>
<p>OK, now that I&#8217;ve established my complete acceptance of the idea that this team can, and perhaps should, be teaching probability and statistics to Columbia&#8217;s freshmen, let me say that there are a bunch of things about the above-quoted letter that bother me.</p>
<p>First, the economic basis for their argument:  &#8220;Why enroll in a place like Columbia College when you can obtain an undergraduate degree for $10,000 or learn everything from Massive Open Online Courses?&#8221;</p>
<p>It&#8217;s not new that you can obtain an undergraduate degree for $10,000!  Until relatively recently, in-state tuition at state schools was cheap, pretty much everywhere.  At Columbia you get the top professors teaching what they know best.  But that argument doesn&#8217;t work for Frontiers of Science, which features non-statisticians teaching statistics, and a particle physicist who strips off his clothes to get students to recognize that nuclear weapons proliferation is scary.</p>
<p>As for online education, it&#8217;s great, but if you can &#8220;learn everything&#8221; from a mook, you can learn everything by buying a stack of textbooks.  Or maybe not.  I just don&#8217;t see that the results from this study have anything to do with mooks, one way or another.</p>
<p>Second, the pre-test and post-test.  This actually does seem like a good idea, and I <a href="http://www.stat.columbia.edu/~gelman/research/published/ChanceEthics2.pdf">remain</a> embarrassed that my statistics colleagues and I don&#8217;t do this in our own classes.  That said, &#8220;The margin of error in both cases was ±1.0%.&#8221;  Are they kidding?  And &#8220;the conclusion is inescapable&#8221;??  We&#8217;d need to know more about the post-test conditions of those 167 freshmen who were not in the course.  One student <a href="http://columbialion.com/blog/frosci-department-claims-their-study-proves-frosci-isnt-worthless/#comment-1208">writes</a> that the students in Frontiers of Science were given &#8220;premade cheat sheets during the final&#8221; with formulas.  Maybe the 167 controls were given the cheat sheets too, but such sheets might not mean much without some prior coaching.</p>
<p>What was on the quiz?  Nicholas Christie-Blick, one of the authors of the letter quoted above, writes, &#8220;The questions and answers will be released. . . . Nor does it matter much what the questions are. What matters is that they are representative of the range of skills in scientific thinking that we call Scientific Habits.&#8221;</p>
<p>I&#8217;d say something snarky here, but then again I remember that my own intro stat course hasn&#8217;t always gone so well, nor is it clear how much my students ever learned.</p>
<p>So, in quick summary, I&#8217;m suspicious of the claims of efficacy of the Frontiers of Science course.  But I&#8217;m sympathetic with their efforts to measure anything at all.  Measurement in education is tough.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/20/evaluating-columbia-universitys-frontiers-of-science-course/">Evaluating Columbia University&#8217;s Frontiers of Science course</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/20/evaluating-columbia-universitys-frontiers-of-science-course/feed/</wfw:commentRss>
		<slash:comments>26</slash:comments>
		</item>
		<item>
		<title>What happened that the journal Psychological Science published a paper with no identifiable strengths?</title>
		<link>http://andrewgelman.com/2013/05/20/what-happened-that-the-journal-psychological-science-published-a-paper-with-no-identifiable-strengths/</link>
		<comments>http://andrewgelman.com/2013/05/20/what-happened-that-the-journal-psychological-science-published-a-paper-with-no-identifiable-strengths/#comments</comments>
		<pubDate>Mon, 20 May 2013 13:04:37 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Zombies]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=19035</guid>
		<description><![CDATA[<p>The other day we discussed that paper on ovulation and voting (you may recall that the authors reported a scattered bunch of comparisons, significance tests, and p-values, and I recommended that they would&#8217;ve done better to simply report complete summaries of their data, so that readers could see the comparisons of interest in full context), [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/20/what-happened-that-the-journal-psychological-science-published-a-paper-with-no-identifiable-strengths/">What happened that the journal Psychological Science published a paper with no identifiable strengths?</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>The other day we <a href="http://andrewgelman.com/2013/05/17/how-can-statisticians-help-psychologists-do-their-research-better/">discussed</a> that paper on ovulation and voting (you may recall that the authors reported a scattered bunch of comparisons, significance tests, and p-values, and I recommended that they would&#8217;ve done better to simply report complete summaries of their data, so that readers could see the comparisons of interest in full context), and I was thinking a bit more about why I was so bothered that it was published in Psychological Science, which I&#8217;d thought of as a serious research journal.<br />
<span id="more-19035"></span><br />
My concern isn&#8217;t just that that the paper is bad&#8212;after all, lots of bad papers get published&#8212;but rather that it had nothing really going for it, <em>except</em> that it was headline bait.  It was a survey done on Mechanical Turk, that&#8217;s it.  No clever design, no clever questions, no care in dealing with nonresponse problems, no innovative data analysis, no nothing.  The paper had nothing to offer, except that it had no obvious flaws.  Psychology is a huge field full of brilliant researchers.  Its top journal can choose among so many papers. To pick this one, a paper that had nothing to offer, that seems to me like a sign of a serious problem.</p>
<p>A good study does not need to be methodologically original, but it should be methodologically sound.  When we do surveys we worry about nonresponse.  To take a few hundred people off MTurk and not even <em>look</em> for possible nonresponse bias, this is not serious.</p>
<p>But, again, it&#8217;s not so much that this paper was flawed, as that it had nothing much positive to offer.</p>
<p>Just to be clear:  I&#8217;m really really really really not trying to censor such work, and I&#8217;m really really really really not saying this work should not be published.  What I&#8217;m saying is that the <em>top</em> journal in a field should not be publishing such routine work.  They should be publishing the best quality research, not just random things that happen to slip through the cracks.</p>
<p>I mean, sure, the referees should&#8217;ve caught the problems with that paper.  But I blame the editors for even considering publication.  Even without noticing the paper&#8217;s methodological flaws, it was nothing special.</p>
<p>And, once you decide to start publishing mediocre papers in your top journal, you&#8217;re asking for trouble.  You&#8217;re encouraging more of the same.</p>
<p>To clarify, let me compare this with some other high-profile examples:</p>
<p>- Christakis and Fowler&#8217;s study of the contagion of obesity.  This was published in top journals and was later found to have some serious methodological issues.  But it&#8217;s not a mediocre work.  They had a unique dataset, a new idea, and some new methods of data analysis.  OK, they made some mistakes, but I can&#8217;t fault a leading journal for publishing this work.  It has a lot of special strengths.</p>
<p>- Bem&#8217;s paper claiming to demonstrate ESP.  OK, I wouldn&#8217;t have published this one.  But I can see where the journal was coming from on this.  If the results had held up, it would&#8217;ve been the scientific story of the decade, and the journal didn&#8217;t want to miss out.  The editors didn&#8217;t show the best judgment here, but their decision was understandable.</p>
<p>- Kanazawa&#8217;s papers of schoolyard evolutionary biology.  I&#8217;ve written about the mistakes here, and this work has a lot of similarities to the ovulation-and-voting study.  The difference is that Kanazawa&#8217;s papers were published in a middling place&#8212;the Journal of Theoretical Biology&#8212;not in a top journal of their field.  Don&#8217;t get me wrong, JTB is respectable, but it&#8217;s in the middle of the pack.  It&#8217;s not expected that they are publishing the best of the best.</p>
<p>- Hamilton&#8217;s paper in the American Sociological Review, claiming that college students get worse grades if their parents pay.  This paper had a gaping hole (not adjusting for the selection effect arising from less well-funded students dropping out) and I think it was a mistake for it to be published as is&#8212;but that&#8217;s just something the reviewers didn&#8217;t catch.  On the plus side, Hamilton&#8217;s paper was thoughtful and had some in-depth analysis of large datasets.  It had mistakes, but it had strengths too.  It was not a turn-the-crank, run-an-online-survey-and-spit-out-p-values job.</p>
<p>My point is that, in all these cases of the publication of flawed work (and one could add the work of Mark Hauser and Bruno Frey as well), the published papers either had clear strengths or else were not published in top journals.  When an interesting, exciting, but flawed paper (such as those by Bem, Hauser, etc) is published in a top journal, that&#8217;s too bad, but it&#8217;s understandable.  When a possibly interesting paper (such as those by Kanazawa) is published in an OK journal, that makes sense too. It would not make sense to demand perfection.  But when a mediocre paper (which also happens to have serious methodological flaws) is published in a top journal, there&#8217;s something seriously wrong going on.  There are lots of things that can make a research paper special, and this paper had none of those things (unless anything combining voting and sex in an election year is considered special).</p>
<p>P.S.  Let me emphasize that my goal here is not to pile on and slam the ovulation and voting paper.  In fact, I&#8217;ve refrained linking to the paper here, just to give the authors a break.  They did a little study that happened to be flawed.  That&#8217;s no big deal.  I&#8217;ve done lots of little studies that happened to be flawed, and sometimes my flawed work gets published.  I&#8217;m not criticizing the authors for making some mistakes.  I hope they can do better next time.  I&#8217;m criticizing the journal for publishing a mediocre paper with little to offer.  That&#8217;s not just a retrospective mistake; it seems like a problem with their policies that they would think that such an unremarkable paper could even be seriously considered for publication in the top journal of their field.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/20/what-happened-that-the-journal-psychological-science-published-a-paper-with-no-identifiable-strengths/">What happened that the journal Psychological Science published a paper with no identifiable strengths?</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/20/what-happened-that-the-journal-psychological-science-published-a-paper-with-no-identifiable-strengths/feed/</wfw:commentRss>
		<slash:comments>33</slash:comments>
		</item>
		<item>
		<title>Prose is paragraphs, prose is sentences</title>
		<link>http://andrewgelman.com/2013/05/19/prose-is-paragraphs-prose-is-sentences/</link>
		<comments>http://andrewgelman.com/2013/05/19/prose-is-paragraphs-prose-is-sentences/#comments</comments>
		<pubDate>Sun, 19 May 2013 13:42:07 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Literature]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=19029</guid>
		<description><![CDATA[<p>This isn&#8217;t quite right&#8212;poetry, too, can be in paragraph form (see Auden, for example, or Frost, or lots of other examples)&#8212;but Basbøll is on to something here. I&#8217;m reminded of Nicholson Baker&#8217;s hilarious &#8220;From the Index of First Lines,&#8221; which is truly the poetic counterpart to Basbøll&#8217;s argument in prose:</p><p>The post <a href="http://andrewgelman.com/2013/05/19/prose-is-paragraphs-prose-is-sentences/">Prose is paragraphs, prose is sentences</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>This isn&#8217;t quite right&#8212;poetry, too, can be in paragraph form (see Auden, for example, or Frost, or lots of other examples)&#8212;but Basbøll is on to <a href="http://secondlanguage.blogspot.com/2013/05/poetry-and-prose.html">something</a> here.</p>
<p>I&#8217;m reminded of Nicholson Baker&#8217;s hilarious &#8220;From the Index of First Lines,&#8221; which is truly the poetic counterpart to Basbøll&#8217;s argument in prose:</p>
<p><img src="/wp-content/uploads/2013/05/Screen-Shot-2013-05-17-at-9.51.07-PM.png?8b0eec" alt="Screen Shot 2013-05-17 at 9.51.07 PM" width="460" height="628" class="alignnone size-full wp-image-19030" /></p>
<p>The post <a href="http://andrewgelman.com/2013/05/19/prose-is-paragraphs-prose-is-sentences/">Prose is paragraphs, prose is sentences</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/19/prose-is-paragraphs-prose-is-sentences/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>uuuuuuuuuuuuugly</title>
		<link>http://andrewgelman.com/2013/05/18/uuuuuuuuuuuuugly/</link>
		<comments>http://andrewgelman.com/2013/05/18/uuuuuuuuuuuuugly/#comments</comments>
		<pubDate>Sat, 18 May 2013 13:15:44 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Statistical graphics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18702</guid>
		<description><![CDATA[<p>Hamdan Azhar writes: I came across this graphic of vaccine-attributed decreases in mortality and was curious if you found it as unattractive and unintuitive as I did. Hope all is well with you! My reply: All&#8217;s well with me. And yes, that&#8217;s one horrible graph. It has all the problems with a bad infographic with [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/18/uuuuuuuuuuuuugly/">uuuuuuuuuuuuugly</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Hamdan Azhar writes:</p>
<blockquote><p>I came across this graphic of vaccine-attributed decreases in mortality and was curious if you found it as unattractive and unintuitive as I did. Hope all is well with you!</p></blockquote>
<p>My reply:  All&#8217;s well with me.  And yes, that&#8217;s one horrible graph.  It has all the problems with a bad infographic with none of the virtues.  Compared to this monstrosity, the typical USA Today graph is a stunning, beautiful masterpiece.  I don&#8217;t think I want to soil this webpage with the image.  In fact, I don&#8217;t even want to link to it.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/18/uuuuuuuuuuuuugly/">uuuuuuuuuuuuugly</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/18/uuuuuuuuuuuuugly/feed/</wfw:commentRss>
		<slash:comments>16</slash:comments>
		</item>
		<item>
		<title>Where do theories come from?</title>
		<link>http://andrewgelman.com/2013/05/17/where-do-theories-come-from/</link>
		<comments>http://andrewgelman.com/2013/05/17/where-do-theories-come-from/#comments</comments>
		<pubDate>Fri, 17 May 2013 17:49:53 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Bayesian Statistics]]></category>
		<category><![CDATA[Miscellaneous Statistics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18688</guid>
		<description><![CDATA[<p>Lee Sechrest sends along this article by Brian Haig and writes that it &#8220;presents what seems to me a useful perspective on much of what scientists/statisticians do and how science works, at least in the fields in which I work.&#8221; Here&#8217;s Haig&#8217;s abstract: A broad theory of scientific method is sketched that has particular relevance [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/17/where-do-theories-come-from/">Where do theories come from?</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="http://i1.wp.com/themonkeycage.org/wp-content/uploads/2013/04/abduction.jpg?resize=300%2C187" alt="abduction" class="alignnone size-medium wp-image-29471" data-recalc-dims="1" /></p>
<p>Lee Sechrest sends along <a href="http://themonkeycage.org/wp-content/uploads/2013/04/Abductive-theory-of-method-Haig.pdf">this article</a> by Brian Haig and writes that it &#8220;presents what seems to me a useful perspective on much of what scientists/statisticians do and how science works, at least in the fields in which I work.&#8221;  Here&#8217;s Haig&#8217;s abstract:<br />
<span id="more-18688"></span></p>
<blockquote><p>A broad theory of scientific method is sketched that has particular relevance for the behavioral sciences. This theory of method assembles a complex of specific strategies and methods that are used in the detection of empirical phenomena and the subsequent construction of explanatory theories. A characterization of the nature of phenomena is given, and the process of their detection is briefly described in terms of a multistage model of data analysis. The construction of explanatory theories is shown to involve their generation through abductive, or explanatory, reasoning, their development through analogical modeling, and their fuller appraisal in terms of judgments of the best of competing explanations. The nature and limits of this theory of method are discussed in the light of relevant developments in scientific methodology.</p></blockquote>
<p>I found this very difficult to read and forwarded it to Cosma Shalizi, who writes:</p>
<blockquote><p>Like a lot of what I read about abduction, it seems much more a theory (or sketch of a theory) of how scientists think, than of scientific method. Put another way, the H-D account of scientific method has always tended to &#8220;black-box&#8221; the issue of where hypotheses come from, in favor of what to do with them once you have them.  I think this is usually helpful, but there&#8217;s no reason not to try to open up the black box, and study the origin of hypotheses; if there&#8217;s a role for abduction, it&#8217;s there, in explicating the &#8220;generate&#8221; part of generate-and-test.  (In fact, if memory serves, Peirce later repented of his term &#8220;abduction&#8221;, and just called it &#8220;hypothesis&#8221; or &#8220;hypothesizing&#8221;.)  If one could show, or even plausibly suggest, that certain modes of hypothesizing are systematically more reliable or fruitful than others, that would be extremely valuable. This paper in particular seems to have some odd confusions of levels between what are presumably fairly permanent parts of how scientists think (analogy), and current technological artifacts &#8212; I love bootstrapping, but it hardly belongs in the same category as a component of scientific method.  (And as for stem-and-leaf plots&#8230;)</p></blockquote>
<p>Sechrest wrote:</p>
<blockquote><p>Too much research seems to be addressed to determining whether &#8220;it is,&#8221; or &#8220;it isn&#8217;t.&#8221; The more important question very often is &#8220;Why is (or isn&#8217;t) it?&#8221; To me, abduction seems more likely to occur in the aftermath of having seen something. Isaac Asimov once said The most exciting phrase to hear in science, the one that heralds new discoveries, is not &#8216;Eureka!&#8217; but &#8216;That&#8217;s funny&#8230;&#8217; </p>
<p>And that is when abduction begins, the attempt to identify explanations and reason toward the best one. For example, at least some drug trials begin with seemingly sensible expectations that a drug will work; those expectations are often wrong. Usually that is the end of the matter. But, to me, an important question may well be &#8220;Why didn&#8217;t the drug work as expected?&#8221; (A &#8220;why isn&#8217;t it? question) The abductive process cannot lead directly to a clear-cut answer, but it can get us closer. And that is what, I think, good scientists do. Ineffective scientists (and I have seen them many times) say, &#8220;Well, that didn&#8217;t work. Anybody got another idea?&#8221;</p></blockquote>
<p>I have nothing to add to the above discussion, except to point to our <a href="http://andrewgelman.com/2013/02/11/why-waste-time-philosophizing/">recent discussion</a> of the challenges of systematizing model building.  As I see it, new ideas arise from anomalies in data with respect to existing theories. </p>
<p>The post <a href="http://andrewgelman.com/2013/05/17/where-do-theories-come-from/">Where do theories come from?</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/17/where-do-theories-come-from/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>How can statisticians help psychologists do their research better?</title>
		<link>http://andrewgelman.com/2013/05/17/how-can-statisticians-help-psychologists-do-their-research-better/</link>
		<comments>http://andrewgelman.com/2013/05/17/how-can-statisticians-help-psychologists-do-their-research-better/#comments</comments>
		<pubDate>Fri, 17 May 2013 13:03:36 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Science]]></category>
		<category><![CDATA[Miscellaneous Statistics]]></category>
		<category><![CDATA[Zombies]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=19016</guid>
		<description><![CDATA[<p>I received two emails yesterday on related topics. First, Stephen Olivier pointed me to this post by Daniel Lakens, who wrote the following open call to statisticians: You would think that if you are passionate about statistics, then you want to help people to calculate them correctly in any way you can. . . . [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/17/how-can-statisticians-help-psychologists-do-their-research-better/">How can statisticians help psychologists do their research better?</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>I received two emails yesterday on related topics.</p>
<p>First, Stephen Olivier pointed me to <a href="https://sites.google.com/site/lakens2/blog/thelimitedpracticalsignificanceofstatisticians">this post</a> by Daniel Lakens, who wrote the following open call to statisticians:</p>
<blockquote><p>You would think that if you are passionate about statistics, then you want to help people to calculate them correctly in any way you can. . . . you’d think some statisticians would be interested in helping a poor mathematically challenged psychologist out by offering some practical advice.</p></blockquote>
<p>I&#8217;m the right person to ask this question, since I actually have written a lot of material that helps psychologists (and others) with their data analysis. But there clearly are communication difficulties, in that my work and that of other statisticians hasn&#8217;t reached Lakens. Sometimes the contributions of statisticians are made indirectly. For example, I wrote Bayesian Data Analysis, and then Kruschke wrote Doing Bayesian Data Analysis. Our statistics book made it possible for Kruschke to write his excellent book for psychologists. This is a reasonable division of labor.</p>
<p>That said, I&#8217;d like to do even more. So I will make some specific suggestions for data analysis in psychology right here in this post, in the context of my next story:<br />
<span id="more-19016"></span><br />
Dan Kahan sent me this note:</p>
<blockquote><p>The <a href="http://www.culturalcognition.net/blog/2013/1/10/an-interesting-story-on-whether-strengthening-self-defense-l.html">most egregious instance </a>of totally bogus methods I had the misfortune to feel obliged to call foul on involved an econometrics study that purported to find that changes in law that <i>never happened</i> increased homicides by &#8220;lowering the cost&#8221; of committing them&#8230;)</p>
<p>Actually, as you know, often times investigation of a &#8220;wtf?!&#8221; report like this discloses that the problem is in the news report &amp; not in the study.</p>
<p>I think<a href="http://andrewgelman.com/2013/05/05/cleaning-up-science/"> you agree</a> that many of the &#8220;bad statistics/methods&#8221; problems &amp; even the &#8220;nonreplicability&#8221; problem are rooted in the perpetuation of a set of mindless statistical protocols associated with ossified conception of NHT (one from which all the thought that it might have reflected was drained away &amp; discarded decades ago).</p>
<p>But certainly another problem is the &#8220;wtf?!!!!!!&#8221; conception of psychology.  Its distinguishing feature is its supposed discovery of <a href="http://www.amazon.com/Drunk-Tank-Pink-Unexpected-Forces/dp/1594204543">phenomena that are shocking bizarre &amp; lack any coherent theory</a>.</p>
<p>The alternative conception of psychology is the &#8220;<a href="http://andrewgelman.com/2011/05/19/everything_is_o/">everything is obvious &#8211; once you know the answer</a>.&#8221;  The main point of empirical research isn&#8217;t to shock people. It&#8217;s to adjudicate disputes between competing plausible conjectures about what causes what we see.  More accounts of what is going are plausible than are true; without valid inference from observation, we will never separate the former from the sea of the latter &amp; will drown in a sea of &#8220;just so&#8221; story telling.</p>
<p>I have zero confidence in &#8220;wtf?!!!&#8221; &amp; am convinced that it is a steady stream of bogus, nonreplicable studies that hurt the reputation of psychology.</p>
<p>I have <span style="text-decoration: underline;">lots</span> of confidence in EIO&#8211;OYKTA. It&#8217;s not nearly so sexy &#8212; which is good, b/c it removes the temptation to cut corners in all the familiar, petty ways that researchers do (usually by coaxing out a shy &#8220;p &lt; 0.05&#8243; to emerge  w/ one or another data-manipulative come-on line). But it is dealing with matters that reflect real, theorized, validated mechanisms of psychology (the issue in each case is &#8212; which one?!), and ones that are important enough for researchers to keep at it essentially forever, revising, correcting, improving our evolving understanding of what&#8217;s going on.</p></blockquote>
<p>Kahan points to a much-<a href="http://blogs.discovermagazine.com/seriouslyscience/2013/05/10/ovulation-makes-single-women-more-likely-to-vote-for-barack-obama/#.UZVLVJVXDGA">mocked</a> and <a href="http://www.popsci.com/science/article/2012-10/co-author-and-science-journal-respond-cnn-pulling-women-vote-their-hormones-article">criticized</a> study by Kristina Durante, Ashley Arsena, Vladas Griskevicius, &#8220;The Fluctuating Female Vote: Politics, Religion, and the Ovulatory Cycle,&#8221; which was reported then retracted from CNN under the title, &#8220;Study looks at voting and hormones: Hormones may influence female voting choices.&#8221;</p>
<p>The relevance for the present discussion is that this paper was published in Psychological Science, a top journal in psychology.  Here&#8217;s the abstract:</p>
<blockquote><p>Each month many women experience an ovulatory cycle that regulates fertility. Whereas research finds that this cycle influences women’s mating preferences, we propose that it might also change women’s political and religious views. Building on theory suggesting that political and religious orientation are linked to reproductive goals, we tested how fertility influenced women’s politics, religiosity, and voting in the 2012 U.S. presidential election. In two studies with large and diverse samples, ovulation had drastically different effects on single versus married women. Ovulation led single women to become more liberal, less religious, and more likely to vote for Barack Obama. In contrast, ovulation led married women to become more conservative, more religious, and more likely to vote for Mitt Romney. In addition, ovulatory-induced changes in political orientation mediated women’s voting behavior. Overall, the ovulatory cycle not only influences women’s politics, but appears to do so differently for single versus married women.</p></blockquote>
<p>I took a look at the <a href="http://business.utsa.edu/faculty/kdurante/files/Durante_PresidentialElection_Hormones.pdf">paper</a>, and what I found was a bunch of comparisons and p-values, some of which were statistically significant, and then lots of stories.  The problem is that there are so many different things that could be compared, and all we see is some subset of the comparisons.  Many of the reported effects seem much too large to be plausible.  And there&#8217;s a casual use of causal language (for example, the words &#8220;influenced,&#8221; &#8220;effects,&#8221; and &#8220;induced&#8221; in the above abstract) to describe correlations.</p>
<p>Beyond all that, I found the claimed effects implausibly large.  For example, they report that, among women in relationships, 40% in the ovulation period supported Romney, compared to 23% in the non-fertile part of their cycle.  Given that surveys find vary few people switching their vote preferences during the campaign for <em>any</em> reason, I just don&#8217;t buy it.  The authors might respond that they don&#8217;t care about the magnitude of the difference, just the sign, but (a) with a magnitude of this size, we&#8217;re talking noise noise noise, and (b) one could just as easily explain this as a differential nonresponse pattern:  maybe liberal or conservative women in different parts of their cycle are more or less likely to participate in a survey.  It would easy enough to come up with a story about that!</p>
<p>Anyway, my point is not to slam the work of Durante et al.  They did a little study, wrote it up, and submitted it to one of the leading journals in their field.  It&#8217;s not their fault the journal chose to publish it.</p>
<p>Also, let me emphasize that I&#8217;m not saying that their claims (regarding the effects of ovulation) are false.  I&#8217;m just saying that the evidence from their paper isn&#8217;t as strong as they make it out to be.</p>
<p><strong>A statistician offers helpful advice for psychology researchers</strong></p>
<p>My real goal here is to address the question that was brought up at the beginning of this post:  What recommendations can I, as a statistician, give to psychology researchers?  Here are a few, presented in the context of the paper on ovulation and political attitudes:</p>
<p>1.  Analyze <em>all</em> your data.  For most of their analyses, the authors threw out all the data from participants who were PMS-ing or having their period.  (&#8220;We also did not include women at the beginning of the ovulatory cycle (cycle days 1–6) or at the very end of the ovulatory cycle (cycle days 26–28) to avoid potential confounds due to premenstrual or menstrual symptoms.&#8221;)  That&#8217;s a mistake.  Instead of throwing out one-third of their data, they should&#8217;ve just included that other category in their analysis.</p>
<p>2.  Present <em>all</em> your comparisons.  The paper leads us through a hopscotch of comparisons and p-values.  Better just to present everything.  I have no idea if the researchers combed through everything and selected the best results, or if they simply made a bunch of somewhat arbitrary decisions throughout of what to look for.</p>
<p>For example, I would&#8217;ve liked to see a comparison of respondents in different parts of their cycle on variables such as birth year, party identification, marital status, etc etc.  Just a whole damn table (even better would be a <a href="http://www.stat.columbia.edu/~gelman/research/published/dodhia.pdf">graph</a> but, hey, I won&#8217;t get greedy here) showing these differences for every possible variable.</p>
<p>Instead, what do we get?  Several pages full of averages, percentages, F tests, chi-squared tests, and p-values, all presented in paragraph form.  Better to have all possible comparisons in one convenient table.</p>
<p>3.  Make your data public.  If the topic is worth studying, you should want others to be able to make rapid progress.  If there&#8217;s some confidentiality restrictions, remove the respondents&#8217; identifying information.  Then post the data online.</p>
<p>4.  And now some advice for journal editors.  What&#8217;s the purpose of a top journal in a field such as psychology?  I think they should be publishing the top work.  This paper is not top work, by any standard.  The researchers asked a few survey questions to a bunch of people on Mechanical Turk, then did who knows how many comparisons and significance tests, reported some subset of the results, then went on to story time.  It&#8217;s not innovative data collection, it&#8217;s not great theory, it&#8217;s not great data analysis, it&#8217;s not a definitive data source, it&#8217;s nothing.  What it is, is headline-bait that&#8217;s not obviously wrong.  But is that the appropriate standard?  It&#8217;s not obviously wrong to three referees, so publish it?  Psychological Science <a href="http://www.psychologicalscience.org/index.php/publications">is</a> &#8220;the flagship journal of the Association for Psychological Science . . . the highest ranked empirical journal in psychology.&#8221;</p>
<p>As a statistician, my advice is:  if a paper is nothing special, you don&#8217;t have to publish it in your flagship journal.  In this as with the notorious Daryl Bem article, I feel that the journal almost seemed to feel an obligation to publish a dubious claim, just because some referees didn&#8217;t happen to find any flaws in the data collection or analysis.  But if Psychological Science does not publish this article, it&#8217;s not censorship or suppression; the authors can feel free to submit it to a lesser journal.  For the leading journal to have such low standards, this is bad news for the entire field.  For one thing, it encourages future researchers to focus on this sort of sloppy work.</p>
<p>I&#8217;m hoping the above advice will make Stephen Olivier happy.  I&#8217;m not just sitting there criticizing something, or telling someone to use R instead of SPSS, or lecturing psychologists about their lack of mathematical skills.  I&#8217;m giving some very specific suggestions that you, the psychology researcher, can use in your next research project (or, if you&#8217;re a journal editor, in your next publication decision).</p>
<p>There is, of course, lots and lots of additional advice that I and other statisticians could give.  The above is just a start.  But I wanted to start <em>somewhere</em>, just to demonstrate to Olivier (and others) that this is indeed possible.</p>
<p>P.S.  Blogger Echidne raised <a href="http://echidneofthesnakes.blogspot.com/2012/10/women-vote-their-hormones-study-itself.html">similar</a> points last year.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/17/how-can-statisticians-help-psychologists-do-their-research-better/">How can statisticians help psychologists do their research better?</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/17/how-can-statisticians-help-psychologists-do-their-research-better/feed/</wfw:commentRss>
		<slash:comments>39</slash:comments>
		</item>
		<item>
		<title>How do we choose our default methods?</title>
		<link>http://andrewgelman.com/2013/05/16/how-do-we-choose-our-default-methods/</link>
		<comments>http://andrewgelman.com/2013/05/16/how-do-we-choose-our-default-methods/#comments</comments>
		<pubDate>Thu, 16 May 2013 13:59:12 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Statistics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=19011</guid>
		<description><![CDATA[<p>I was asked to write an article for the Committee of Presidents of Statistical Societies (COPSS) 50th anniversary volume. Here it is (it&#8217;s labeled as &#8220;Chapter 1,&#8221; which isn&#8217;t right; that&#8217;s just what came out when I used the template that was supplied). The article begins as follows: The field of statistics continues to be [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/16/how-do-we-choose-our-default-methods/">How do we choose our default methods?</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>I was asked to write an article for the Committee of Presidents of Statistical Societies (COPSS) 50th anniversary volume.  <a href="http://www.stat.columbia.edu/~gelman/research/published/copss.pdf">Here it is</a> (it&#8217;s labeled as &#8220;Chapter 1,&#8221; which isn&#8217;t right; that&#8217;s just what came out when I used the template that was supplied).  The article begins as follows:</p>
<blockquote><p>The field of statistics continues to be divided into competing schools of thought. In theory one might imagine choosing the uniquely best method for each problem as it arises, but in practice we choose for ourselves (and recom- mend to others) default principles, models, and methods to be used in a wide variety of settings. This article briefly considers the informal criteria we use to decide what methods to use and what principles to apply in statistics problems.</p></blockquote>
<p>And then I follow up with these sections:</p>
<blockquote><p>Statistics: the science of defaults</p>
<p>Ways of knowing</p>
<p>The pluralist’s dilemma</p></blockquote>
<p>And here&#8217;s the concluding paragraph:</p>
<blockquote><p>Statistics is a young science in which progress is being made in many areas. Some methods in common use are many decades or even centuries old, but recent and current developments in nonparametric modeling, regularization, and multivariate analysis are central to state-of-the-art practice in many areas of applied statistics, ranging from psychometrics to genetics to predictive modeling in business and social science. Practitioners have a wide variety of statistical approaches to choose from, and researchers have many potential directions to study. A casual and introspective review suggests that there are many different criteria we use to decide that a statistical method is worthy of routine use. Those of us who lean on particular ways of knowing (which might include: performance on benchmark problems, success in new applications, insight into toy problems, optimality as shown by simulation studies or mathematical proofs, or success in the marketplace) should remain aware of the relevance of all these dimensions in the spread of default procedures.</p></blockquote>
<p>Regular blog readers will recognize many of these themes, but I hope this particular presentation has some added value.  And this is as good a place as any to thank my many correspondents who&#8217;ve helped contribute to the development and expression of these ideas.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/16/how-do-we-choose-our-default-methods/">How do we choose our default methods?</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/16/how-do-we-choose-our-default-methods/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Does quantum uncertainty have a place in everyday applied statistics?</title>
		<link>http://andrewgelman.com/2013/05/15/does-quantum-uncertainty-have-a-place-in-everyday-applied-statistics/</link>
		<comments>http://andrewgelman.com/2013/05/15/does-quantum-uncertainty-have-a-place-in-everyday-applied-statistics/#comments</comments>
		<pubDate>Wed, 15 May 2013 19:13:14 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Statistics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=19008</guid>
		<description><![CDATA[<p>Several months ago, Mike Betancourt and I wrote a discussion for the article, Can quantum probability provide a new direction for cognitive modeling?, by Emmanuel Pothos and Jerome Busemeyer, in Behavioral and Brain Sciences. We didn&#8217;t say much, but it was a milestone for me because, with this article, BBS became the 100th journal I&#8217;d [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/15/does-quantum-uncertainty-have-a-place-in-everyday-applied-statistics/">Does quantum uncertainty have a place in everyday applied statistics?</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Several months ago, Mike Betancourt and I wrote a discussion for the article, Can quantum probability provide a new direction for cognitive modeling?, by Emmanuel Pothos and Jerome Busemeyer, in Behavioral and Brain Sciences.  We didn&#8217;t say much, but it was a milestone for me because, with this article, BBS became the 100th journal I&#8217;d published in.</p>
<p>Anyway, the full article with its 34 discussions just appeared <a href="http://journals.cambridge.org/action/displayIssue?jid=BBS&#038;volumeId=36&#038;seriesId=0&#038;issueId=03&#038;etoc=Y">in the journal</a>.  <a href="http://journals.cambridge.org/repo_A89VVypV">Here</a> it is.</p>
<p>What surprised me, in reading the full discussion, was how supportive the commentary was.  Given the topic of Pothos and Busemeyer&#8217;s article, I was expecting the discussions to range from gentle mockery to outright abuse.  The discussion that Mike and I wrote was moderately encouraging, and I was expecting this to fall on the extreme positive end of the spectrum.</p>
<p>Actually, though, most of the discussions were positive, and only a couple were purely negative (those would be &#8220;Quantum models of cognition as Orwellian newspeak&#8221; by Michael Lee and Wolf Vanpaemel, and &#8220;Physics envy: Trying to fit a square peg into a round hole,&#8221; by James Shenteau and David Weiss).  We expressed some vague skepticism but it&#8217;s hard for me to be really negative about the idea, given that classical probability theory is not actually correct, and we do indeed live in a quantum world (otherwise all our tables and chairs would fall apart, for one thing).  I certainly see no <em>logical</em> reason why our models of probability and uncertainty should be restricted to the &#8220;Boltzmannian&#8221; simplification.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/15/does-quantum-uncertainty-have-a-place-in-everyday-applied-statistics/">Does quantum uncertainty have a place in everyday applied statistics?</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/15/does-quantum-uncertainty-have-a-place-in-everyday-applied-statistics/feed/</wfw:commentRss>
		<slash:comments>77</slash:comments>
		</item>
		<item>
		<title>Reputations changeable, situations tolerable</title>
		<link>http://andrewgelman.com/2013/05/15/reputations-changeable-situations-tolerable/</link>
		<comments>http://andrewgelman.com/2013/05/15/reputations-changeable-situations-tolerable/#comments</comments>
		<pubDate>Wed, 15 May 2013 13:48:17 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Bayesian Statistics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18540</guid>
		<description><![CDATA[<p>David Kessler, Peter Hoff, and David Dunson write: Marginally specified priors for nonparametric Bayesian estimation Prior specification for nonparametric Bayesian inference involves the difficult task of quantifying prior knowledge about a parameter of high, often infinite, dimension. Realistically, a statistician is unlikely to have informed opinions about all aspects of such a parameter, but may [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/15/reputations-changeable-situations-tolerable/">Reputations changeable, situations tolerable</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>David Kessler, Peter Hoff, and David Dunson <a href="http://arxiv.org/abs/1204.6505">write</a>:</p>
<blockquote><p>Marginally specified priors for nonparametric Bayesian estimation</p>
<p>Prior specification for nonparametric Bayesian inference involves the difficult task of quantifying prior knowledge about a parameter of high, often infinite, dimension. Realistically, a statistician is unlikely to have informed opinions about all aspects of such a parameter, but may have real information about functionals of the parameter, such the population mean or variance. This article proposes a new framework for nonparametric Bayes inference in which the prior distribution for a possibly infinite-dimensional parameter is decomposed into two parts: an informative prior on a finite set of functionals, and a nonparametric conditional prior for the parameter given the functionals. Such priors can be easily constructed from standard nonparametric prior distributions in common use, and inherit the large support of the standard priors upon which they are based. Additionally, posterior approximations under these informative priors can generally be made via minor adjustments to existing Markov chain approximation algorithms for standard nonparametric prior distributions. We illustrate the use of such priors in the context of multivariate density estimation using Dirichlet process mixture models, and in the modeling of high-dimensional sparse contingency tables.</p></blockquote>
<p>This seems very important to me, also I love the idea of Hoff and Dunson on the same paper, sort of like one of those 70&#8242;s supergroups.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/15/reputations-changeable-situations-tolerable/">Reputations changeable, situations tolerable</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/15/reputations-changeable-situations-tolerable/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>GPstuff: Bayesian Modeling with Gaussian Processes</title>
		<link>http://andrewgelman.com/2013/05/14/gpstuff-bayesian-modeling-with-gaussian-processes/</link>
		<comments>http://andrewgelman.com/2013/05/14/gpstuff-bayesian-modeling-with-gaussian-processes/#comments</comments>
		<pubDate>Tue, 14 May 2013 13:34:46 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Bayesian Statistics]]></category>
		<category><![CDATA[Stan]]></category>
		<category><![CDATA[Statistical computing]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18997</guid>
		<description><![CDATA[<p>I think it&#8217;s part of my duty as a blogger to intersperse, along with the steady flow of jokes, rants, and literary criticism, some material that will actually be useful to you. So here goes. Jarno Vanhatalo, Jaakko Riihimäki, Jouni Hartikainen, Pasi Jylänki, Ville Tolvanen, and Aki Vehtari write: The GPstuff toolbox is a versatile [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/14/gpstuff-bayesian-modeling-with-gaussian-processes/">GPstuff: Bayesian Modeling with Gaussian Processes</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>I think it&#8217;s part of my duty as a blogger to intersperse, along with the steady flow of jokes, rants, and literary criticism, some material that will actually be useful to you.</p>
<p>So here goes.</p>
<p>Jarno Vanhatalo, Jaakko Riihimäki, Jouni Hartikainen, Pasi Jylänki, Ville Tolvanen, and Aki Vehtari <a href="http://jmlr.csail.mit.edu/papers/v14/vanhatalo13a.html">write</a>:</p>
<blockquote><p>The <a href="http://mloss.org/software/view/451/">GPstuff</a> toolbox is a versatile collection of Gaussian process models and computational tools required for Bayesian inference. The tools include, among others, various inference methods, sparse approximations and model assessment methods.</p></blockquote>
<p>We can actually now fit Gaussian processes in <a href="http://mc-stan.org">Stan</a>.  But for big problems (or even moderately-sized problems), full Bayes can be slow.  GPstuff uses EP, which is faster.  At some point we&#8217;d like to implement EP in Stan.  (Right now we&#8217;re working with Dave Blei to implement VB.)</p>
<p>GPstuff really works.  I saw Aki use it to fit a nonparametric version of the Bangladesh well-switching example in ARM.  He was sitting in his office and just whipped up the model and fit it.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/14/gpstuff-bayesian-modeling-with-gaussian-processes/">GPstuff: Bayesian Modeling with Gaussian Processes</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/14/gpstuff-bayesian-modeling-with-gaussian-processes/feed/</wfw:commentRss>
		<slash:comments>19</slash:comments>
		</item>
		<item>
		<title>Stan!</title>
		<link>http://andrewgelman.com/2013/05/13/stan-2/</link>
		<comments>http://andrewgelman.com/2013/05/13/stan-2/#comments</comments>
		<pubDate>Mon, 13 May 2013 23:50:36 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Bayesian Statistics]]></category>
		<category><![CDATA[Public Health]]></category>
		<category><![CDATA[Stan]]></category>
		<category><![CDATA[Statistical computing]]></category>
		<category><![CDATA[R]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=19001</guid>
		<description><![CDATA[<p>Guy Freeman writes: I thought you&#8217;d all like to know that Stan was used and referenced in a peer-reviewed Rapid Communications paper on influenza. Thank you for this excellent modelling language and sampler, which made it possible to carry out this work quickly! I haven&#8217;t actually read the paper, but I&#8217;m happy to see Stan [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/13/stan-2/">Stan!</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Guy Freeman writes:</p>
<blockquote><p>I thought you&#8217;d all like to know that Stan was used and referenced in a peer-reviewed Rapid Communications <a href="http://www.eurosurveillance.org/ViewArticle.aspx?ArticleId=20475">paper</a> on influenza.  Thank you for this excellent modelling language and sampler, which made it possible to carry out this work quickly!</p></blockquote>
<p>I haven&#8217;t actually read the paper, but I&#8217;m happy to see Stan getting around like that. </p>
<p>The post <a href="http://andrewgelman.com/2013/05/13/stan-2/">Stan!</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/13/stan-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A Structural Comparison of Conspicuous Consumption in China and the United States</title>
		<link>http://andrewgelman.com/2013/05/13/a-structural-comparison-of-conspicuous-consumption-in-china-and-the-united-states/</link>
		<comments>http://andrewgelman.com/2013/05/13/a-structural-comparison-of-conspicuous-consumption-in-china-and-the-united-states/#comments</comments>
		<pubDate>Mon, 13 May 2013 13:53:17 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Economics]]></category>
		<category><![CDATA[Literature]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=17795</guid>
		<description><![CDATA[<p>David Jinkins writes: The objective of this paper is to measure the relative importance of conspicous consumption to Americans and Chinese. To this end, I estimate the parameters of a utility function borrowed from recent theoretical work using American and Chinese data. The main parameter of interest governs the amount that individuals care about peer [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/13/a-structural-comparison-of-conspicuous-consumption-in-china-and-the-united-states/">A Structural Comparison of Conspicuous Consumption in China and the United States</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2013/05/bmw.jpg?8b0eec" alt="bmw" width="259" height="194" class="alignnone size-full wp-image-18995" /></p>
<p>David Jinkins <a href="http://veryshuai.no-ip.org/research/research.html">writes</a>:</p>
<blockquote><p>The objective of this paper is to measure the relative importance of conspicous consumption to Americans and Chinese. To this end, I estimate the parameters of a utility function borrowed from recent theoretical work using American and Chinese data. The main parameter of interest governs the amount that individuals care about peer group beliefs regarding their welfare. Using survey data on the visibility of different good categories along with household budget surveys, I find that Chinese consumers care twice as much as American consumers about the beliefs of their peer group.</p></blockquote>
<p>I came across this draft research manuscript by following the links back after Jinkins commented on our blog.  The framing of the paper is a bit more foundation-y and a bit less statistic-y than I&#8217;d prefer, but I guess that&#8217;s just the way they do things in economics, compared to statistics or (some) political science.  In any case, I wanted to point you to this paper, partly to let you know that I do read the blog comments and even sometimes follow the links, and also because it&#8217;s unusually well-written, not just in its first paragraph but all the way through.  He&#8217;s got to work a bit on his presentation of results&#8212;I see some ugly tables there&#8212;but I think that&#8217;s much easier to learn than it is to learn how to write.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/13/a-structural-comparison-of-conspicuous-consumption-in-china-and-the-united-states/">A Structural Comparison of Conspicuous Consumption in China and the United States</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/13/a-structural-comparison-of-conspicuous-consumption-in-china-and-the-united-states/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>OpenData Latinoamerica</title>
		<link>http://andrewgelman.com/2013/05/12/opendata-latinoamerica/</link>
		<comments>http://andrewgelman.com/2013/05/12/opendata-latinoamerica/#comments</comments>
		<pubDate>Sun, 12 May 2013 22:31:28 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Statistics]]></category>
		<category><![CDATA[Political Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18992</guid>
		<description><![CDATA[<p>Miguel Paz writes: Poderomedia Foundation and PinLatam are launching OpenDataLatinoamerica.org, a regional data repository to free data and use it on Hackathons and other activities by HacksHackers chapters and other organizations. We are doing this because the road to the future of news has been littered with lost datasets. A day or so after every [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/12/opendata-latinoamerica/">OpenData Latinoamerica</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Miguel Paz <a href="http://ijnet.org/blog/opendata-latinoamerica-will-bring-together-data-across-region">writes</a>:</p>
<blockquote><p>Poderomedia Foundation and PinLatam are launching OpenDataLatinoamerica.org, a regional data repository to free data and use it on Hackathons and other activities by HacksHackers chapters and other organizations.</p>
<p>We are doing this because the road to the future of news has been littered with lost datasets. A day or so after every hackathon and meeting where a group has come together to analyze, compare and understand a particular set of data, someone tries to remember where the successful files were stored. Too often, no one is certain. Therefore with Mariano Blejman we realized that we need a central repository where you can share the data that you have proved to be reliable: OpenData Latinoamerica, which we are leading as ICFJ Knight International Journalism Fellows.</p>
<p>If you work in Latin America or Central America your organization can take part in OpenDataLatinoamerica.org. To apply, go to the website and answer a simple form agreeing to meet the standard criteria for open data. Once the application is approved, you will receive an account to start running and managing open data, becoming part of the community.</p></blockquote>
<p>The post <a href="http://andrewgelman.com/2013/05/12/opendata-latinoamerica/">OpenData Latinoamerica</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/12/opendata-latinoamerica/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Crime novels for economists</title>
		<link>http://andrewgelman.com/2013/05/12/crime-novels-for-economists/</link>
		<comments>http://andrewgelman.com/2013/05/12/crime-novels-for-economists/#comments</comments>
		<pubDate>Sun, 12 May 2013 13:05:38 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Economics]]></category>
		<category><![CDATA[Literature]]></category>
		<category><![CDATA[Political Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18989</guid>
		<description><![CDATA[<p>Following up on this post by Noah Smith on economics in science fiction, Mark Palko writes on economics in crime fiction. Just as almost all science fiction is ultimately about politics, one could say that just about all crime fiction is about economics. But if I had to pick one crime novelist with an economics [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/12/crime-novels-for-economists/">Crime novels for economists</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2013/05/eddie1.jpg?8b0eec" alt="eddie" width="256" height="192" class="alignnone size-full wp-image-18990" /></p>
<p>Following up on <a href="http://noahpinionblog.blogspot.com/2013/05/science-fiction-for-economists.html">this post</a> by Noah Smith on economics in science fiction, Mark Palko <a href="http://observationalepidemiology.blogspot.com/2013/05/thrillers-on-economics-updated.html#comment-form">writes</a> on economics in crime fiction.</p>
<p>Just as almost all science fiction is ultimately about politics, one could say that just about all crime fiction is about economics.</p>
<p>But if I had to pick one crime novelist with an economics focus, I&#8217;d pick George V. Higgins.  In one of his novels, his character Jerry Kennedy had a riff on the difference between guys who get a salary and guys who have to work for every dollar.  But, really, almost all his novels are full of economics.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/12/crime-novels-for-economists/">Crime novels for economists</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/12/crime-novels-for-economists/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Actually, I have no problem with this graph</title>
		<link>http://andrewgelman.com/2013/05/11/actually-i-have-no-problem-with-this-graph/</link>
		<comments>http://andrewgelman.com/2013/05/11/actually-i-have-no-problem-with-this-graph/#comments</comments>
		<pubDate>Sat, 11 May 2013 13:02:58 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Statistical graphics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18523</guid>
		<description><![CDATA[<p>Tom Salvesen asks, is this the worst info-graphic of the year? I say, no. Nobody really cares about these numbers. It&#8217;s an amusing feature. The alternative would not be a better display of these data, the alternative would be some photo or cartoon. They&#8217;re just having fun. I wouldn&#8217;t give it any design awards but [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/11/actually-i-have-no-problem-with-this-graph/">Actually, I have no problem with this graph</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2013/04/infovis-636x1024.jpg?8b0eec" alt="infovis" width="318" height="512" class="alignnone size-large wp-image-18524" /></p>
<p>Tom Salvesen asks, is this the worst info-graphic of the year?</p>
<p>I say, no.  Nobody really cares about these numbers.  It&#8217;s an amusing feature.  The alternative would not be a better display of these data, the alternative would be some photo or cartoon.  They&#8217;re just having fun.  I wouldn&#8217;t give it any design awards but it&#8217;s fine, it is what it is.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/11/actually-i-have-no-problem-with-this-graph/">Actually, I have no problem with this graph</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/11/actually-i-have-no-problem-with-this-graph/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>The recursion of pop-econ</title>
		<link>http://andrewgelman.com/2013/05/10/the-recursion-of-pop-econ-or-of-trolling/</link>
		<comments>http://andrewgelman.com/2013/05/10/the-recursion-of-pop-econ-or-of-trolling/#comments</comments>
		<pubDate>Fri, 10 May 2013 13:13:02 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Economics]]></category>
		<category><![CDATA[Sociology]]></category>
		<category><![CDATA[Zombies]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18121</guid>
		<description><![CDATA[<p>Dave Berri posted the following at the Freakonomics blog: The “best” picture of 2012 was Argo. At least that’s the film that won the Oscar for best picture. According to the Oscars, the decision to give this award to Argo was made by the nearly 6,000 voting members of the Academy of Motion Picture Arts [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/10/the-recursion-of-pop-econ-or-of-trolling/">The recursion of pop-econ</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Dave Berri <a href="http://www.freakonomics.com/2013/02/26/an-economist-on-the-oscars/">posted</a> the following at the Freakonomics blog:</p>
<blockquote><p>The “best” picture of 2012 was Argo. At least that’s the film that won the Oscar for best picture.  According to the Oscars, the decision to give this award to Argo was made by the nearly 6,000 voting members of the Academy of Motion Picture Arts and Sciences. . . . In other words, this choice is made by the “experts.”  There is, though, another group that we could have listened to on Sunday night.  That group would be the people who actually spend money to go to the movies. . . . According to that group, Marvel’s the Avengers was the “best” picture in 2012. With domestic revenues in excess of $600 million, this filmed earned nearly $200 million more than any other picture. And when we look at world-wide revenues, this film brought in more than $1.5 billion. . . . Despite what seems like a clear endorsement by the customers of this industry, the Avengers was ignored by the Oscars.  Perhaps this is just because I am an economist, but this strikes me as odd. Movies are not a product made just for the members the academy.  These ventures are primarily made for the general public.  And yet, when it comes time to decide which picture is “best,” the opinion of the general public seems to be ignored.  Essentially the Oscars are an industry statement to their customers that says: “We don’t think our customers are smart enough to tell us which of our products are good. So we created a ceremony to correct our customers.”</p></blockquote>
<p>He keeps going along those lines for awhile and concludes:</p>
<blockquote><p>One would hope the Academy would at least pay a bit more attention to the people paying the bills.   Not only does it seem wrong (at least to this economist) to argue that movies many people like are simply not that good, focusing on the box office would seem to make good financial sense for the Oscars as well.  A recent Slate article argued that the Oscars’ telecast tends to have higher ratings when more commercially successful films are nominated for best picture.  So in the future, maybe voters for the Oscars will pay a bit more attention to their customers.  These customers may not be thought of as “movie experts.”  But these are the people who pay the bills, and therefore, ultimately it is their opinion that should matter to this industry.</p></blockquote>
<p>What strikes me about this discussion is the mix of descriptive and normative that seems so characteristic of pop-microeconomics.  (I should emphasize here that I&#8217;m not using &#8220;pop&#8221; in any sort of derogatory way.  I&#8217;m speaking of serious economic writing that is intended for a popular audience.)</p>
<p>1.  On one hand, you have the purely descriptive perspective:  economist as person-from-Mars, looking at human society objectively, the way a scientist studies cell cultures in a test tube.  Consumer sovereignty is what it&#8217;s all about, with a slightly offended tone that anyone could think otherwise.  Who are you, smartypants, to think you know better than the average ticket-buyer, etc.  I&#8217;m reminded of the perhaps-apocryphal <a href="http://andrewgelman.com/2012/12/20/who-exactly-are-those-silly-academics-who-arent-as-smart-as-a-vegas-bookie/">story</a> of the &#8220;some academics&#8221; who &#8220;conclude that bookmakers simply aren’t very smart.&#8221;</p>
<p>2.  At the same time, we&#8217;re given a moral lesson.  The Avengers is the best movie because it made more money.  It is &#8220;the people who pay the bills&#8221; whose &#8220;opinion that should matter to this industry.&#8221;</p>
<p>The difficulty, of course, is that lesson 2 gets blurred if it is folded into lesson 1.</p>
<p>Berri&#8217;s argument is that moviemakers should not be paternalistically ignoring the attitudes of their customers in giving awards.  But this argument dissolves if you take one step back and consider moviemakers as independent business operators.  In that case, their business decisions (to do the Oscars however they want) should be given as much respect as that of moviegoers to choose which movies to watch.</p>
<p>As far as I&#8217;m concerned, the Academy can do whatever they want.  What&#8217;s interesting to me here is to see how the economist&#8217;s explicitly non-normative ideology (his implication that the &#8220;best&#8221; picture must be the one with most revenue, and that any other criteria would be disrespectful of moviegoers) so quickly becomes normative (that it&#8217;s &#8220;wrong . . . to argue that movies many people like are simply not that good&#8221;).  To me, it&#8217;s a strange mixture of idealism and cynicism.  The man from Mars has become a scold.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/10/the-recursion-of-pop-econ-or-of-trolling/">The recursion of pop-econ</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/10/the-recursion-of-pop-econ-or-of-trolling/feed/</wfw:commentRss>
		<slash:comments>52</slash:comments>
		</item>
		<item>
		<title>Same old same old</title>
		<link>http://andrewgelman.com/2013/05/09/same-old-same-old/</link>
		<comments>http://andrewgelman.com/2013/05/09/same-old-same-old/#comments</comments>
		<pubDate>Thu, 09 May 2013 21:15:52 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Bayesian Statistics]]></category>
		<category><![CDATA[Decision Theory]]></category>
		<category><![CDATA[Miscellaneous Statistics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18982</guid>
		<description><![CDATA[<p>In an email I sent to a colleague who&#8217;s writing about lasso and Bayesian regression for R users: The one thing you might want to add, to fit with your pragmatic perspective, is to point out that these different methods are optimal under different assumptions about the data. However, these assumptions are never true (even [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/09/same-old-same-old/">Same old same old</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>In an email I sent to a colleague who&#8217;s writing about lasso and Bayesian regression for R users:</p>
<blockquote><p>The one thing you might want to add, to fit with your pragmatic perspective, is to point out that these different methods are optimal under different assumptions about the data.  However, these assumptions are never true (even in the rare cases where you have a believable prior, it won&#8217;t really follow the functional form assumed by <a href="http://www.stat.columbia.edu/~gelman/research/published/priors11.pdf">bayesglm</a>; even in the rare cases where you have a real loss function, it won&#8217;t really follow the mathematical form assumed by <a href="http://andrewgelman.com/2013/03/18/tibshirani-announces-new-research-result-a-significance-test-for-the-lasso/">lasso</a> etc), but these methods can still be useful and be given the interpretation of regularized estimates.</p>
<p>Another thing that someone might naively think is that regularization is fine but &#8220;<a href="http://andrewgelman.com/2013/03/14/everyones-trading-bias-for-variance-at-some-point-its-just-done-at-different-places-in-the-analyses/">unbiased</a>&#8221; is somehow the most honest.  In practice, if you stick to &#8220;unbiased&#8221; methods such as least squares, you&#8217;ll restrict the number of variables you can include in your model.  So in reality you suffer from omitted-variable bias.  So there is not safe home base.  It&#8217;s not like the user can simply do unregularized regression and then think of regularization as a frill. The practitioner who uses unregularized regression has already essentially made a compromise with the devil by restricting the number of predictors in the model to a &#8220;manageable&#8221; level (whatever that means).</p></blockquote>
<p>The post <a href="http://andrewgelman.com/2013/05/09/same-old-same-old/">Same old same old</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/09/same-old-same-old/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A tale of two discussion papers</title>
		<link>http://andrewgelman.com/2013/05/09/a-tale-of-two-discussion-papers/</link>
		<comments>http://andrewgelman.com/2013/05/09/a-tale-of-two-discussion-papers/#comments</comments>
		<pubDate>Thu, 09 May 2013 13:27:04 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Sociology]]></category>
		<category><![CDATA[Statistical computing]]></category>
		<category><![CDATA[Statistical graphics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=16976</guid>
		<description><![CDATA[<p>Over the years I&#8217;ve written a dozen or so journal articles that have appeared with discussions, and I&#8217;ve participated in many published discussions of others&#8217; articles as well. I get a lot out of these article-discussion-rejoinder packages, in all three of my roles as reader, writer, and discussant. Part 1: The story of an unsuccessful [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/09/a-tale-of-two-discussion-papers/">A tale of two discussion papers</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Over the years I&#8217;ve written a dozen or so journal articles that have appeared with discussions, and I&#8217;ve participated in many published discussions of others&#8217; articles as well.  I get a lot out of these article-discussion-rejoinder packages, in all three of my roles as reader, writer, and discussant.</p>
<p><strong>Part 1:  The story of an unsuccessful discussion</strong></p>
<p>The first time I had a discussion article was the result of an unfortunate circumstance.  I had a research idea that resulted in an <a href="http://www.stat.columbia.edu/~gelman/research/published/itsim.pdf">article</a> with Don Rubin on monitoring the mixing of Markov chain simulations. I new the idea was great, but back then we worked pretty slowly so it was awhile before we had a final version to submit to a journal.  (In retrospect I wish I&#8217;d just submitted the <a href="http://andrewgelman.com/2012/05/the-first-version-of-my-inference-from-iterative-simulation-using-parallel-sequences-paper/">draft version</a> as it was.)  In the meantime I presented the paper at a conference.  Our idea was very well received (I had a sheet of paper so people could write their names and addresses to get preprints, and we got either 50 or 150 (I can&#8217;t remember which, I guess it must have been 50) requests), but there was one person who came up later and tried to shoot down our idea.  The shooter-down, Charlie Geyer, has done some <a href="http://andrewgelman.com/2010/07/the_greates_wor/">great work</a> but in this case he was confused, I think in retrospect because we did not have a clear discussion of the different inferential goals that arose in the sorts of calculations he was doing (inference for normalizing constants of distributions) and which I was doing (inference for parameters in fitted models).  In any case, the result was that our new and exciting method was surrounded by an air of controversy.  In some ways that was a good thing:  I became well known in the field right away, perhaps more than I deserved at the time (in the sense that most of my papers up to then and for the next few years were on applied topics; it was awhile before I published other major papers on statistical theory, methods, and computation).  But overall I&#8217;d rather have been moderately known for an excellent piece of research than very well known for being part of a controversy.  I didn&#8217;t seek out controversy; it arose because someone else criticized our work without seeing the big picture, and at the time neither he nor I nor my collaborator had the correct synthesis of my work and his criticism.</p>
<p>(Again, the synthesis was that he was trying to get precise answers for hard problems and was in a position where he needed to have a good understanding of the complex distributions he was simulating from, whereas I was working on a method to apply routinely in relatively easy (but nontrivial!) settings.  For Charlie&#8217;s problems, my method would not suffice because he wouldn&#8217;t be satisfied until he was directly convinced that the Markov chain was exploring all the space.  For my problems, Charlie&#8217;s approach (to run a million simulations and work really hard to understand the computation for a particular model) wasn&#8217;t a practical solution.  His approach to applied statistics was to handcraft big battleships to solve large problems, one at a time.  I wanted to fit lots of small and medium-sized models (along with the occasional <a href="http://www.stat.columbia.edu/~gelman/research/published/bois2.pdf">big one</a>), fast.)</p>
<p>Anyway, this &#8220;different methods for different goals&#8221; conversation never occurred, hence I left that meeting with an unpleasant feeling that our method was controversial, not fully accepted, and not fully understood.  So I got it into my head that our article should be published as a discussion, so that Geyer and others could comment and we could respond.</p>
<p>But we never had that discussion, not in those words.  Neither Charlie nor I nor Don Rubin was aware enough of the sociological context, as it were, so we ended up talking past each other.</p>
<p>In retrospect, that particular discussion did not work so well.</p>
<p>Here&#8217;s another example from about the same time, the Ising model.  Here&#8217;s one chain from the Gibbs sampler.  After 2000 iterations, it looks like it&#8217;s settled down to convergence (here we&#8217;re plotting the log probability density, which is a commonly used summary for this sort of distribution).</p>
<p>But then look at the second plot:  the first 500 iterations.  If we&#8217;d only seen these, we might have been tempted to declare victory too early!</p>
<p><img src="http://i0.wp.com/andrewgelman.com/wp-content/uploads/2012/12/Screen-Shot-2012-12-06-at-8.16.49-PM.png?resize=280%2C481" alt="" title="Screen Shot 2012-12-06 at 8.16.49 PM" class="alignnone size-full wp-image-17079" data-recalc-dims="1" /></p>
<p>At this point, the naive take-home point might be that 500 iterations was not enough but we&#8217;re safe with 2000.  But no!  Even that the last bit of those 2000 looks as stationary and clean as can be, if we start from a different point and run for 2000, we get something different:</p>
<p><img src="http://i2.wp.com/andrewgelman.com/wp-content/uploads/2012/12/Screen-Shot-2012-12-06-at-8.17.07-PM.png?resize=283%2C228" alt="" title="Screen Shot 2012-12-06 at 8.17.07 PM" class="alignnone size-full wp-image-17080" data-recalc-dims="1" /></p>
<p>This one looks stationary too!  But a careful comparison with the graphs above (even clearer when I displayed these on transparency sheets and overlaid them on the projector) reveals that the two &#8220;stationary&#8221; distributions are different.  The chains haven&#8217;t mixed, the process hasn&#8217;t converged.  R-hat reveals this right away (without even having to look at the graphs, but you can look at the graphs if you want).</p>
<p>As I wrote in our <a href="http://digitalassets.lib.berkeley.edu/sdtr/ucb/text/305.pdf">article</a> in Bayesian Statistics 4,</p>
<blockquote><p>This example shows that the Gibbs sampler can stay in a small subset of its space for a long time, without any evidence of this problematic behavior being provided by one simulated series of finite length. The simplest way to run into trouble is with a two-chambered space, in which the probability of switching chambers is very low, but the above graphs are especially disturbing because the probability density in the Ising model has a unimodal (in the sense that this means anything in a discrete distribution) and approximately Gaussian marginal distribution on the gross scale of interest. That is, the example is not pathological; the Gibbs sampler is just very slow. Rather than being a worst-case example, the Ising model is typical of the probability distributions for which iterative simulation methods were designed, and may be typical of many posterior distributions to which the Gibbs sampler is being applied.</p></blockquote>
<p>So that was my perspective:  start from one point and the chain looks fine; start from two points and you see the problem.  But Charlie had a different attitude toward the Ising example.  His take on it was:  the Ising model is known to be difficult, no one but a fool would try to simulate it with 2000 iterations of a Gibbs sampler.  There&#8217;s a huge literature on the Ising model already!</p>
<p>Charlie was interested in methods for solving large, well-understood problems one at a time.  I was interested in methods that would be used for all sorts of problems by statisticians such as myself who, for applied reasons, bite off more in model than we can chew in computation and understanding.  For Charlie with the Ising model, multiple sequences missed the point entirely, as he knew already that 2000 iterations of Gibbs wouldn&#8217;t do it.  For me, though . . . as an applied guy I was just the kind of knucklehead who might apply Gibbs to this sort of problem (in my defense, Geman and Geman made a similar mistake in 1984, I&#8217;ve been told), so it was good to have a practical convergence check.</p>
<p>Again, I think that in our discussion and rejoinder, Don and I presented our method well, in the context of our applied purposes.  But I think it would&#8217;ve worked better as a straight statistics article.  Nothing much useful came out of the discussion because none of us cut through to the key difference in the sorts of problems we were working on.</p>
<p><strong>Part 2:  A successful discussion</strong></p>
<p>In the years since then, I&#8217;ve realized that communication is more than being right (or, should I say, thinking that one is right).  Statistical ideas (and, for that matter, mathematical and scientific ideas in general) are sometimes best understood through their limitations.  It&#8217;s Lakatos&#8217;s old &#8220;proofs and refutations&#8221; story all over again.</p>
<p>Recently I was involved in a discussion that worked out well.  It started a few years ago with <a href="http://andrewgelman.com/2009/04/more_on_data_vi/">a post</a> of mine on the differences between the sorts of data visualizations that go viral on the web (using some examples that were <a href="http://flowingdata.com/2008/12/19/5-best-data-visualization-projects-of-the-year/">celebrated</a> by statistician/designer Nathan Yau), as compared to statistical graphics of the sort that we are trained to make.  It seemed to me that many visualizations that are successful with general audiences feature unique or striking designs and puzzle-like configurations, whereas the most successful statistical graphics have more transparent formats that foreground data comparisons.  Somewhere in between are the visualizations created by lab scientists, who generally try to follow statistical principles but usually (in my view) try too hard to display too much information on a single plot.</p>
<p>My posts, and various follow-ups, were disliked by many in the visualization community.  They didn&#8217;t ever quite disagree with my claim that many successful visualizations involve puzzles, but they didn&#8217;t like what they perceived as my negative tone.</p>
<p>In attempt to engage the fields of statistics and visualization more directly, I wrote an article (with Antony Unwin) on the different goals and different looks of these two sorts of graphics.  Like many of my favorite papers, this one took a lot of effort to get into a journal.  But finally it was accepted in the Journal of Computational and Graphical Statistics, with discussion.</p>
<p>The discussants (<a href="http://www.perceptualedge.com/files/are_infovis_and_statistical_graphics_really_all_that_different.pdf">Stephen Few</a>, <a href="http://kosara.net/publications/Kosara_JCGS_2013.html">Robert Kosara</a>, <a href="http://www.stat.auckland.ac.nz/~paul/MurrellCommentGelmanUnwin.pdf">Paul Murrell</a>, and <a href="http://vita.had.co.nz/papers/stat-graph-hist.html">Hadley Wickham</a>; links to all four discussions are <a href="http://eagereyes.org/blog/2012/responses-gelman-unwin-convenient-posting">here</a> on Kosara&#8217;s blog) politely agreed with us on some points and disagreed with us on others.  And then it was time for us to write our rejoinder.</p>
<p>In composing the rejoinder I finally came upon a good framing of the problem.  Before we&#8217;d spoken of statistical graphs and information visualization as having different goals and looking different.  But that didn&#8217;t work.  No matter how often I said that it could be a good thing that an infovis is puzzle-like, or no matter how often I said that as a statistician I would prefer graphing the data like This but I can understand how graphing it like That could attract more viewers . . . no matter how much I said this sort of thing, it was interpreted as a value judgment (and it didn&#8217;t help when I said that something &#8220;sucked,&#8221; even if I later modified that statement).</p>
<p>Anyway, my new framing, that I really like, is in terms of <em>tradeoffs</em>.  Not &#8220;two cultures,&#8221; not &#8220;different goals, different looks,&#8221; but <a href="http://andrewgelman.com/2012/11/tradeoffs-in-information-graphics/">tradeoffs</a>.  So it&#8217;s not stat <em>versus</em> infographics; instead it&#8217;s any of us trying to construct a graph (or, better still, a grid of graphs) and recognizing that it&#8217;s not generally possible to satisfy all goals at once, so we have to think about what goals are most important in any given situation:</p>
<blockquote><p>In the internet age, we should not have to choose between attractive graphs and informational graphs: it should be possible to display both, via interactive displays. But to follow this suggestion, one must first accept that not every beautiful graph is informative, and not every informative graph is beautiful.</p>
<p>Yes, it can sometimes be possible for a graph to be both beautiful and informative, as in Minard’s famous Napoleon-in-Russia map, or more recently the Baby Name Wizard, which we featured in our article. But such synergy is not always possible, and we believe that an approach to data graphics that focuses on celebrating such wonderful examples can mislead people by obscuring the tradeoffs between the goals of visual appeal to outsiders and statistical communication to experts.</p></blockquote>
<p>So it&#8217;s not Us versus Them, it&#8217;s each of us choosing a different point along the efficient frontier for each problem we care about.</p>
<p>And I think the framing worked well.  At least, it helped us communicate with Robert Kosara, one of our discussants.  <a href="http://eagereyes.org/blog/2012/scholarly-discussion-gelman-unwin">Here&#8217;s</a> what Kosara wrote, after seeing our article, the discussions (including his), and our rejoinder:</p>
<blockquote><p>There are many, many statements in that article [by Gelman and Unwin] that just ask to be debunked . . . I [Kosara] ended up writing a short response that mostly points to the big picture of what InfoVis really is, and that gives some examples of the many things they missed.</p>
<p>While the original article is rather infuriating, the rejoinder is a great example of why this kind of conversation is so valuable. Gelman and Unwin respond very thoughtfully to the comments, seem to have a much more accurate view of information visualization than they used to, and make some good points in response.</p></blockquote>
<p>Great!  A discussion that worked!  This is how it&#8217;s supposed to go:  not a point-scoring debate, not people talking past each other, but an honest and open discussion.</p>
<p><strong>Reflections</strong></p>
<p>Perhaps my extremely, extremely frustrating experience early in my career (detailed in Part 1 above) motivated me to think seriously about the Lakatosian attitude toward understanding and explaining ideas.  If you compare Bayesian Data Analysis to other statistics books of that era, for example, I think we did a pretty good job (although maybe not good enough) of understanding the methods through their limitations.  But even with all my experience and all my efforts, this can be difficult, as revealed by the years it took for us to finally process our ideas on graphics and visualization to the extent that we could communicate with experts in these fields.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/09/a-tale-of-two-discussion-papers/">A tale of two discussion papers</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/09/a-tale-of-two-discussion-papers/feed/</wfw:commentRss>
		<slash:comments>20</slash:comments>
		</item>
		<item>
		<title>Of parsing and chess</title>
		<link>http://andrewgelman.com/2013/05/08/of-parsing-and-chess/</link>
		<comments>http://andrewgelman.com/2013/05/08/of-parsing-and-chess/#comments</comments>
		<pubDate>Wed, 08 May 2013 13:57:34 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Sports]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18958</guid>
		<description><![CDATA[<p>Gary Marcus writes, An algorithm that is good at chess won’t help parsing sentences, and one that parses sentences likely won’t be much help playing chess. That is soooo true. I&#8217;m excellent at parsing sentences but I&#8217;m not so great at chess. And, worse than that, my chess ability seems to be declining from year [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/08/of-parsing-and-chess/">Of parsing and chess</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/The_Game_of_the_Century_(chess)"><img src="http://i0.wp.com/andrewgelman.com/wp-content/uploads/2013/05/Screen-Shot-2013-05-07-at-9.12.42-PM.png?resize=512%2C219" alt="Screen Shot 2013-05-07 at 9.12.42 PM" class="alignnone size-large wp-image-18959" data-recalc-dims="1" /></a></p>
<p>Gary Marcus <a href="http://www.newyorker.com/online/blogs/elements/2013/05/a-grand-unified-theory-of-everything.html">writes</a>,</p>
<blockquote><p>An algorithm that is good at chess won’t help parsing sentences, and one that parses sentences likely won’t be much help playing chess.</p></blockquote>
<p>That is soooo true.  I&#8217;m excellent at parsing sentences but I&#8217;m not so great at chess.  And, worse than that, my chess ability seems to be declining from year to year.</p>
<p>Which reminds me:  I recently read Frank Brady&#8217;s much lauded <em>Endgame</em>, a biography of Bobby Fischer.  The first few chapters were great, not just the Cinderella story of his steps to the world championship, but also the background on his childhood and the stories of the games and tournaments that he lost along the way.</p>
<p>But after Fischer beats Spassky in 1972, the book just dies.  Brady has chapter after chapter on Fisher&#8217;s life, his paranoia, his girlfriends, his travels.  But, really, after the chess is over, it&#8217;s just sad and kind of boring.  I&#8217;d much rather have had twice as much detail on the first part of the life and then had the post-1972 era compressed into a single chapter.  I mean, sure, I respect that Brady wanted to tell the full life story, and I&#8217;m not telling him how he should&#8217;ve written his book, I&#8217;m just giving my reactions.</p>
<p>Also, I would&#8217;ve liked more information on the games:  what was the amazing set of moves that Fischer did in the so-called Game of the Century, what happened in some of the games he lost, and so on. In an afterword, Brady writes that he decided not to include any games so as to make the book more accessible.  What I wonder is, how many readers are there like me, who enjoy chess, could understand a diagram and some discussion of what these amazing plays were, even if we couldn&#8217;t follow an entire game written on the page or have the patience to play one out on the board.  I wouldn&#8217;t have gotten much out of transcripts of chess games, but a few diagrams and discussions of key moments, that would&#8217;ve made the book a lot more interesting to me.</p>
<p>P.S.  After Kasparov beat Karpov in the final game of their tournament&#8212;the game where both players knew that Kasparov had to <em>win</em>, that a draw wouldn&#8217;t be enough&#8212;I clipped the game out of the newspaper and later played it out with my dad.  <em>That</em> was a game.  To my ignorant eyes, there was no single point where I could spot a mistake by Karpov.  Kasparov just gradually and imperceptibly got to a winning position.  Amazing.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/08/of-parsing-and-chess/">Of parsing and chess</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/08/of-parsing-and-chess/feed/</wfw:commentRss>
		<slash:comments>19</slash:comments>
		</item>
		<item>
		<title>Like Casper the ghost, Niall Ferguson is not only white.  He is also very, very adorable.</title>
		<link>http://andrewgelman.com/2013/05/07/niall-ferguson-updat/</link>
		<comments>http://andrewgelman.com/2013/05/07/niall-ferguson-updat/#comments</comments>
		<pubDate>Wed, 08 May 2013 03:00:35 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Zombies]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18962</guid>
		<description><![CDATA[<p>I don&#8217;t want this to be a regular feature but I wanted to briefly comment on Ferguson&#8217;s open letter regarding the Keynes-was-a-ballet-and-poetry-loving-poof remarks he made the other day at that conference of financial advisors. (I&#8217;m posting this one at night, and a new post on an unrelated topic is coming in the morning, so I&#8217;m [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/07/niall-ferguson-updat/">Like Casper the ghost, Niall Ferguson is not only white.  He is also very, very adorable.</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2013/05/poof.jpg?8b0eec" alt="poof" width="312" height="162" class="alignnone size-full wp-image-18963" /><br />
<span id="more-18962"></span><br />
I don&#8217;t want this to be a regular feature but I wanted to briefly comment on Ferguson&#8217;s <a href="http://www.thecrimson.com/article/2013/5/7/Ferguson-Apology-Keynes/#">open letter</a> regarding the Keynes-was-a-ballet-and-poetry-loving-poof remarks he made the other day at that conference of financial advisors.  (I&#8217;m posting this one at night, and a new post on an unrelated topic is coming in the morning, so I&#8217;m burying it as much as possible.)</p>
<p>Ferguson reiterates that his remarks were &#8220;stupid.&#8221;  The question then arises:  He&#8217;s a smart guy, how did he end up saying such stupid things?  Ferguson has a history of saying high-profile stupid things, and they always seem to be when he&#8217;s trying to make some sort of political point.</p>
<p>I&#8217;m still going with <a href="http://andrewgelman.com/2013/05/04/one-more-thought-on-hoover-historian-niall-fergusons-thing-about-keynes-being-gay-and-marrying-a-ballerina-and-talking-about-poetry/">my theory</a> that Ferguson misjudged his audience; he thought they’d appreciate an anti-Keynes remark, maybe he even thought they were the kind of crowd that would enjoy cracks about gay people who like ballet and poetry. No go.</p>
<p>Again, I&#8217;m not trying to nail the guy to the cross for this.  We all make mistakes; in fact, we all make mistakes, of some sort, repeatedly.  It&#8217;s just interesting to think about what made him say this stuff in the first place.  He&#8217;s <em>not</em> any kind of &#8220;gay-basher&#8221; (in his words), but he still thought this sort of thing would work.  So maybe he was trying a bit too hard to please the crowd.  He should remember that, however entertaining he is as a speaker, however good he looks on TV, his ultimate qualifications come from his historical research.</p>
<p>Perhaps the case interests me so much because Ferguson, like me, is an academic researcher who likes to speak to popular audiences.</p>
<p>P.S.  <a href="http://neweconomicperspectives.org/2013/05/niall-fergusons-latest-gay-bashing-is-the-least-of-his-problems.html">More here</a> from Willam Black, including this amazing bit that Ferguson wrote about Paul Krugman:</p>
<blockquote><p>His inability to debate a question without insulting his opponent suggests some kind of deep insecurity perhaps the result of a childhood trauma.</p></blockquote>
<p>Black makes another good point:</p>
<blockquote><p>Ferguson loves counterfactuals, so let’s try this counterfactual.  What if Ferguson had made the obvious, stupid claim that people who are childless do not carry about future generations?  We know that he despises Keynes and Krugman (in the same talk he archly referred to Krugman as his “arch-enemy”) and that both are childless.  Ferguson could have claimed that Keynes and Krugman were indifferent to the lives of future generations because they were childless.  How would the audience have reacted to such a claim?</p>
<p>The perhaps 25% of the audience who were childless would have stared at him like he was an idiot who had gone out of his way to insult them.  The Americans in the audience would have thought first of themselves if they were childless, then of their relatives and close friends who were childless, then of George Washington, and finally of Jesus. . . .</p></blockquote>
<p>P.P.S.  For those who don&#8217;t get the title of this post, see <a href="http://www.theatlantic.com/technology/archive/2009/08/ferguson-obama-felix-the-cat-and-pluto/23412/">here</a>.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/07/niall-ferguson-updat/">Like Casper the ghost, Niall Ferguson is not only white.  He is also very, very adorable.</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/07/niall-ferguson-updat/feed/</wfw:commentRss>
		<slash:comments>26</slash:comments>
		</item>
		<item>
		<title>Is Felix Salmon wrong on free TV?</title>
		<link>http://andrewgelman.com/2013/05/07/is-felix-salmon-wrong-on-free-tv/</link>
		<comments>http://andrewgelman.com/2013/05/07/is-felix-salmon-wrong-on-free-tv/#comments</comments>
		<pubDate>Tue, 07 May 2013 13:28:11 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Economics]]></category>
		<category><![CDATA[Political Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18911</guid>
		<description><![CDATA[<p>Mark Palko writes: Salmon is dismissive of the claim that there are fifty million over-the-air television viewers: The 50 million number, by the way, should not be considered particularly reliable: it’s Aereo’s guess as to the number of people who ever watch free-to-air TV, even if they mainly watch cable or satellite. (Maybe they have [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/07/is-felix-salmon-wrong-on-free-tv/">Is Felix Salmon wrong on free TV?</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2013/05/rabbit-ears.jpg?8b0eec" alt="rabbit-ears" width="333" height="446" class="alignnone size-full wp-image-18912" /></p>
<p>Mark Palko <a href="http://observationalepidemiology.blogspot.com/2013/04/free-tv-blogging-betting-against-felix.html">writes</a>:</p>
<blockquote><p>Salmon is dismissive of the claim that there are fifty million over-the-air television viewers:</p>
<blockquote><p>The 50 million number, by the way, should not be considered particularly reliable: it’s Aereo’s guess as to the number of people who ever watch free-to-air TV, even if they mainly watch cable or satellite. (Maybe they have a hut somewhere with an old rabbit-ear TV in it.)</p></blockquote>
<p>And he strongly suggests the number is not only smaller but shrinking. By comparison, here&#8217;s a story from the broadcasting news site TV News Check from June of last year (if anyone has more recent numbers please let me know):</p>
<blockquote><p>According to new research by GfK Media, the number of Americans now relying solely on over-the-air (OTA) television reception increased to almost 54 million, up from 46 million just a year ago. The recently completed survey also found that the demographics of broadcast-only households skew towards younger adults, minorities and lower-income families.</p></blockquote>
</blockquote>
<p>As Palko says, Salmon is usually a pretty careful reporter.  And this one should be right up his alley.  Here&#8217;s Palko again:</p>
<blockquote><p>We&#8217;ve talked about how well over-the-air television compares to cable (for some people), how new and apparently successful businesses are springing up around OTA, and how the number of viewers getting their television through antennas appears to have been growing substantially since the introduction of digital. What we haven&#8217;t covered so far is the potential social impact of killing broadcast television.</p>
<p>It is almost axiomatic that, if you have a resource that is used in one way by people at the top of the economic ladder and in another way by people on the bottom and you &#8220;let the market decide&#8221; what to do with the resource, it will go with the people who have the money. . . .</p>
<p>This becomes particularly troubling when we&#8217;re talking about a publicly held resource. . . . What groups rely heavily on broadcast television? What groups would have the most difficulty finding alternatives?</p>
<p>People in the bottom one or two deciles are going to be in trouble. Even the lowest tier of cable would represent a significant monthly expense.  People with limited residential security will be even worse off. People with limited income security will face a difficult choice: sign up for exorbitant no-contract plans or commit to a financial obligation they may not be able to fulfill. People with poor credit histories will have to come up with large deposits every time they move. . . .</p></blockquote>
<p>Palko summarizes:</p>
<blockquote><p>OTA [over-the-air television] is a promising technology supporting an innovative and growing industry, serving important economic and social roles. </p>
<p>The technology is doing fine in the marketplace. It&#8217;s lobbyists who are likely to kill it.</p></blockquote>
<p>I wonder what Salmon&#8217;s take is on this.  Is Palko missing something, or does he just happen to be sharing a perspective that is different from that of NYC-based financial journalists?</p>
<p>P.S.  Let me emphasize that this post is not some sort of trolling of Felix Salmon.  I&#8217;m a big fan of his quantitatively sophisticated reporting, which is why it&#8217;s interesting if he&#8217;s getting something wrong.</p>
<p>P.P.S.  There&#8217;s some dispute about that 54 million number.  Salmon points to <a href="http://broadcastengineering.com/transmitters/over-air-tv-market-continues-shrink">this</a> news article by Michael Grotticelli:</p>
<blockquote><p>Free, over-the-air television viewing of broadcast TV signals are now watched by only 9 percent of the U.S. population — down from 16 percent in 2003, according to Nielsen, the major TV and radio rating service. . . .</p>
<p>The Nielsen numbers are certain to cause a dispute with the NAB, which has insisted the amount of over-the-air viewing is increasing in an era of cord-cutting. Last summer, the NAB produced a survey by Knowledge Networks citing about 18 percent as “broadcast exclusive” households. That total was 54 million Americans — up from 46 million in 2011.</p></blockquote>
<p>So, one claim is that 9% watch any over-the-air TV, the other is that 18% only watch over-the-air TV.  That&#8217;s a big gap.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/07/is-felix-salmon-wrong-on-free-tv/">Is Felix Salmon wrong on free TV?</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/07/is-felix-salmon-wrong-on-free-tv/feed/</wfw:commentRss>
		<slash:comments>24</slash:comments>
		</item>
		<item>
		<title>Against optimism about social science</title>
		<link>http://andrewgelman.com/2013/05/06/against-optimism-about-social-science/</link>
		<comments>http://andrewgelman.com/2013/05/06/against-optimism-about-social-science/#comments</comments>
		<pubDate>Mon, 06 May 2013 13:00:12 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Decision Theory]]></category>
		<category><![CDATA[Economics]]></category>
		<category><![CDATA[Sociology]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18943</guid>
		<description><![CDATA[<p>Social science research has been getting pretty bad press recently, what with the Excel buccaneers who didn&#8217;t know how to handle data with different numbers of observations per country, and the psychologist who published dozens of papers based on fabricated data, and the Evilicious guy who wouldn&#8217;t let people review his data tapes, etc etc. [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/06/against-optimism-about-social-science/">Against optimism about social science</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2013/05/tip_of_the_iceberg.jpg?8b0eec" alt="tip_of_the_iceberg" width="400" height="226" class="alignnone size-full wp-image-18945" /></a></p>
<p>Social science research has been getting pretty bad press recently, what with the Excel buccaneers who didn&#8217;t know how to handle data with different numbers of observations per country, and the psychologist who published dozens of papers based on fabricated data, and the Evilicious guy who wouldn&#8217;t let people review his data tapes, etc etc.  And that&#8217;s not even considering Dr. Anil Potti.</p>
<p>On the other hand, the revelation of all these problems can be taken as evidence that things are getting better.  Psychology researcher Gary Marcus <a href="http://andrewgelman.com/2013/05/05/cleaning-up-science/">writes</a>:</p>
<blockquote><p>There is something positive that has come out of the crisis of replicability—something vitally important for all experimental sciences. For years, it was extremely difficult to publish a direct replication, or a failure to replicate an experiment, in a good journal. . . . Now, happily, the scientific culture has changed. . . . The Reproducibility Project, from the Center for Open Science is now underway . . .</p></blockquote>
<p>And sociologist Fabio Rojas <a href="http://orgtheory.wordpress.com/2013/05/03/yknow-im-kind-of-proud-of-science-right-now-even-social-science/">writes</a>:</p>
<blockquote><p>People may sneer at the social sciences, but they hold up as well.  Recently, a well known study in economics was found to be in error. People may laugh because it was an Excel error, but there’s a deeper point. There was data, it could be obtained, and it could be replicated. Fixing errors and looking for mistakes is the hallmark of science. . . .</p></blockquote>
<p>I agree with Marcus and Rojas that attention to problems of replication is a good thing.  It&#8217;s bad that people are running incompetent analysis or faking data all over the place, but it&#8217;s good that they&#8217;re getting caught.  And, to the extent that scientific practices are improving to help detect error and fraud, and to reduce the incentives for publishing erroneous and fradulent results in the first place, that&#8217;s good too.</p>
<p>But I worry about a sense of complacency. I think we should be careful not to overstate the importance of our first steps.  We may be going in the right direction but we have a lot further to go.  Here are some examples:</p>
<p>1.  Marcus writes of the new culture of publishing replications.  I assume he&#8217;d support the ready publications of corrections, too.  But we&#8217;re not there yet, as this story indicates:</p>
<p>Recently I sent a letter to the editor to a major social science journal pointing out a problem in an article they’d published, they refused to publish my letter, not because of any argument that I was incorrect, but because they judged my letter to not be in the top 10% of submissions to the journal. I’m sure my letter was indeed not in the top 10% of submissions, but the journal’s attitude presents a serious problem, if the bar to publication of a correction is so high. That’s a disincentive for the journal to publish corrections, a disincentive for outsiders such as myself to write corrections, and a disincentive for researchers to be careful in the first place. Just to be clear: I’m not complaining how I was treated here; rather, I’m griping about the system in which a known error can stand uncorrected in a top journal, just because nobody managed to send in a correction that’s in the top 10% of journal submissions.</p>
<p>2.  Rojas writes of the notorious Reinhardt and Rogoff <a href="http://andrewgelman.com/2013/04/16/memo-to-reinhart-and-rogoff-i-think-its-best-to-admit-your-errors-and-go-on-from-there/">study</a> that, &#8220;There was data, it could be obtained, and it could be replicated.&#8221;  Not so fast:</p>
<p>It was <a href="http://andrewgelman.com/2013/05/02/7-ways-to-separate-errors-from-statistics/">over two years</a> before those economists shared the data that allowed people to find the problems in their study. If the system really worked, people wouldn’t have had to struggle for years to try to replicate an unreplicable analysis.</p>
<p>And, remember, the problem with that paper was not just a silly computer error.  Reinhardt and Rogoff also made serious mistakes handling their time-series cross-sectional data.</p>
<p>3.  Marcus writes in a confident tone about progress in methodology:  &#8220;just last week, Uri Simonsohn [and Leif Nelson and Joseph Simmons] released a paper on coping with the famous file-drawer problem, in which failed studies have historically been underreported.&#8221;  I think Uri Simonsohn is <a href="http://andrewgelman.com/2012/12/03/somebody-listened-to-me/">great</a>, but I agree with the recent paper by <a href="http://andrewgelman.com/2013/04/26/a-vast-graveyard-of-undead-theories-publication-bias-and-psychological-sciences-aversion-to-the-null-2/">Christopher Ferguson and Moritz Heene</a> that the so-called file-drawer problem is not a little technical issue that can be easily cleaned up; rather, it&#8217;s fundamental to our current practice of statistically-based science. </p>
<p>And there&#8217;s pushback.  Biostatisticians Leah Jager and Jeffrey Leek wrote a <a href="http://andrewgelman.com/2013/01/24/i-dont-believe-the-paper-empirical-estimates-suggest-most-published-medical-research-is-true-that-is-the-claim-may-very-well-be-true-but-im-not-at-all-convinced-by-the-analysis-being-used/">paper</a>, which I strongly disagree with, called &#8220;Empirical estimates suggest most published medical research is true.&#8221;  I won&#8217;t go into the details here&#8212;my take on their work is that they&#8217;re applying a method that can make sense in the context of a single large study but which won&#8217;t generally work with meta-analysis&#8212;my point is that there remains a constituency for arguments that science is basically OK already.</p>
<p>I respect the view of Marcus, Rojas, Jager, Leek, and others that the current environment of criticism has in some ways gone too far.  All those people do serious, respected research, and those of us who do serious research know how difficult it can be to publish in good journals, how hard we work&#8212;out of necessity&#8212;to consider all possible alternative explanations for any results we find, how carefully we document the steps of our data collection and analysis, and so forth.  But many problems still remain.</p>
<p>Thomas Basbøll <a href="http://secondlanguage.blogspot.dk/2011/08/faith-knowledge-and-storytelling.html">analogizes</a> the difficulties of publishing scientific criticism to problems with the subprime mortgage market before the crash.  He quotes Michael Lewis:</p>
<blockquote><p>To sell a stock or bond short you need to borrow it, and [the bonds they were interested in] were tiny and impossible to find. You could buy them or not buy them but you couldn&#8217;t bet explicitly against them; the market for subprime mortages simply had no place for people in it who took a dim view of them. You might know with certainty that the entire mortgage bond market was doomed, but you could do nothing about it.</p></blockquote>
<p>And now here&#8217;s Basbøll:</p>
<blockquote><p>I had a shock of recognition when I read that. I&#8217;ve been trying to &#8220;bet against&#8221; a number of stories that have been told in the organization studies literature for years now, and the thing I&#8217;m learning is that there&#8217;s no place in the literature for people who take a dim view of them. There isn&#8217;t really a genre (in the area of management studies) of papers that only points out errors in other people&#8217;s work. You have to make a &#8220;contribution&#8221; too. In a sense, you can buy the stories people are telling you or not buy them but you can&#8217;t criticize them. </p>
<p>This got me thinking about the difference between faith and knowledge. Knowledge, it seems to me, is a belief held in a critical environment. Faith, we might say, is a belief held in an &#8220;evangelical&#8221; environment. The mortgage bond market was an evangelical environment in which to hold beliefs about housing prices, default rates, and credit ratings on CDOs. There was no simple way to critique the &#8220;good news&#8221; . . .</p></blockquote>
<p>Eventually, as Lewis reports, people were able to bet against the subprime mortgage market, but it wasn&#8217;t easy.  And the fact that some investors, with great difficulty, were able to do it, doesn&#8217;t mean the financial system is A-OK.</p>
<p>Basbøll&#8217;s analogy may be going too far, but I agree with his general point that the existence of a few cases of exposure should not make us complacent.  Marcus&#8217;s <a href="http://andrewgelman.com/2013/05/05/cleaning-up-science/">suggestions</a> on cleaning up science are good ones, and we have a ways to go before they are generally implemented.<br />
<span id="more-18943"></span><br />
<strong>P.S.</strong>  Coincidentally, Jeff Leek <a href="http://simplystatistics.org/2013/05/06/why-the-current-over-pessimism-about-science-is-the-perfect-confirmation-bias-vehicle-and-we-should-proceed-rationally/">posted</a> something today on the same topic, but with a slightly different perspective (he refers to &#8220;the current over-pessimism about science&#8221;).  Leek argues, reasonably enough, &#8220;that people are using a few high-profile cases to hyperventilate about the real, solvable, and recognized problems in the scientific process&#8221; and he worries that &#8220;the rational reasonable problems we have, with enough hyperbole, will make it look like the scientific process &#8216;sky is falling&#8217;&#8221; and lend support to political attacks on science more generally.  I think Jeff and I should be able to agree to the following:</p>
<p>- Science is hard, we all make mistakes, the system has problems but all human systems have problems, in working to fix these problems we shouldn&#8217;t thrown the research baby out with the bathwater that is the changing rules of scientific communication.</p>
<p>- We&#8217;re not there yet, we still live in a world in which it&#8217;s easier to publish and hype a elaborate flawed claim than to report a simple correction, a world in which data sharing is far from the norm, and where social and statistical biases lead to systematic overreporting of dramatic claims and systematic overestimation of effect sizes.</p>
<p>Leek is making the valid point that the sort of doomsaying that has been needed to draw attention to problems in scientific communication and to motivate improvements, can also be used, in guilt-by-association sense, to disparage good science.  And, even in popular culture, my impression is that things aren&#8217;t as bad as they used to be.  Sure, vaccine deniers and global warming deniers and all the other deniers are out there, but it&#8217;s not like the 70s when people were buying millions of copies of Chariots of the Gods, The Jupiter Effect, and The Bermuda Triangle, right?</p>
<p>The post <a href="http://andrewgelman.com/2013/05/06/against-optimism-about-social-science/">Against optimism about social science</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/06/against-optimism-about-social-science/feed/</wfw:commentRss>
		<slash:comments>28</slash:comments>
		</item>
		<item>
		<title>Cleaning up science</title>
		<link>http://andrewgelman.com/2013/05/05/cleaning-up-science/</link>
		<comments>http://andrewgelman.com/2013/05/05/cleaning-up-science/#comments</comments>
		<pubDate>Sun, 05 May 2013 19:42:24 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Decision Theory]]></category>
		<category><![CDATA[Miscellaneous Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18708</guid>
		<description><![CDATA[<p>David Hogg pointed me to this post by Gary Marcus, reviewing this skeptics&#8217; all-star issue of Perspectives on Psychological Science that features replication culture heroes Jelte Wicherts, Hal Pashler, Arina Bones, E. J. Wagenmakers, Gregory Francis, Hal Pashler, John Ioannidis, and Uri Simonsohn. I agree with pretty much everything Marcus has to say. In addition [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/05/cleaning-up-science/">Cleaning up science</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>David Hogg pointed me to <a href="http://www.newyorker.com/online/blogs/newsdesk/2012/12/cleaning-up-science.html">this post</a> by Gary Marcus, reviewing <a href="http://pps.sagepub.com/content/7/6.toc">this skeptics&#8217; all-star issue</a> of Perspectives on Psychological Science that features replication culture heroes Jelte Wicherts, Hal Pashler, Arina Bones, E. J. Wagenmakers, Gregory Francis, Hal Pashler, John Ioannidis, and Uri Simonsohn.  I agree with pretty much everything Marcus has to say.  In addition to Marcus&#8217;s suggestions, which might be called cultural or psychological, I also have various statistical ideas that might help move the field forward.  Most notably I think we <a href="http://www.stat.columbia.edu/~gelman/research/published/pvalues3.pdf">need</a> to go beyond uniform priors and null-hypothesis testing to a more realistic set of models for effects and variation.  I&#8217;ll discuss more at some other time, but in the meantime I thought I&#8217;d share these links.</p>
<p>P.S.  Marcus <a href="http://www.newyorker.com/online/blogs/elements/2013/05/the-crisis-in-social-psychology-that-isnt.html">updates</a> with a glass-is-half-full take.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/05/cleaning-up-science/">Cleaning up science</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/05/cleaning-up-science/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>The New York Times Book of Mathematics</title>
		<link>http://andrewgelman.com/2013/05/05/the-new-york-times-book-of-mathematics/</link>
		<comments>http://andrewgelman.com/2013/05/05/the-new-york-times-book-of-mathematics/#comments</comments>
		<pubDate>Sun, 05 May 2013 12:07:25 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18932</guid>
		<description><![CDATA[<p>This was an good idea: take a bunch of old (and some recent) news articles on developments in mathematics and related ares from the past hundred years. Fun for the math content and historical/nostalgia value. Relive the four-color theorem, Fermat, fractals, and early computing. I have too much of a technical bent to be the [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/05/the-new-york-times-book-of-mathematics/">The New York Times Book of Mathematics</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>This was an good idea:  take a bunch of old (and some recent) news articles on developments in mathematics and related ares from the past hundred years.  Fun for the math content and historical/nostalgia value.  Relive the four-color theorem, Fermat, fractals, and early computing.</p>
<p>I have too much of a technical bent to be the ideal reader for this sort of book, but it seems like an excellent gift for a non-technical reader who nonetheless enjoys math.  (I assume that such people are out there, just as there are people like me who can&#8217;t read music but still enjoy reading about the subject.)</p>
<p>The book is organized by topic.  My own preference would have been chronological and with more old stuff.  I particularly enjoyed the material from many decades ago, such as the news report on one of the early computers.  This must have been a fun book to compile.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/05/the-new-york-times-book-of-mathematics/">The New York Times Book of Mathematics</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/05/the-new-york-times-book-of-mathematics/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>One more thought on Hoover historian Niall Ferguson’s thing about Keynes being gay and marrying a ballerina and talking about poetry</title>
		<link>http://andrewgelman.com/2013/05/04/one-more-thought-on-hoover-historian-niall-fergusons-thing-about-keynes-being-gay-and-marrying-a-ballerina-and-talking-about-poetry/</link>
		<comments>http://andrewgelman.com/2013/05/04/one-more-thought-on-hoover-historian-niall-fergusons-thing-about-keynes-being-gay-and-marrying-a-ballerina-and-talking-about-poetry/#comments</comments>
		<pubDate>Sun, 05 May 2013 02:02:19 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Decision Theory]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18924</guid>
		<description><![CDATA[<p>We had some interesting comments on our recent reflections on Niall Ferguson&#8217;s ill-chosen remarks in which he attributed Keynes&#8217;s economic views (I don&#8217;t actually know exactly what Keyesianism is, but I think a key part is for the government to run surpluses during economic booms and deficits during recessions) to the Keynes being gay and [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/04/one-more-thought-on-hoover-historian-niall-fergusons-thing-about-keynes-being-gay-and-marrying-a-ballerina-and-talking-about-poetry/">One more thought on Hoover historian Niall Ferguson&#8217;s thing about Keynes being gay and marrying a ballerina and talking about poetry</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2013/05/eddie.jpg?8b0eec" alt="eddie" width="257" height="196" class="alignnone size-full wp-image-18925" /></p>
<p>We had some interesting comments on our recent <a href="http://andrewgelman.com/2013/05/04/jesus-historian-niall-ferguson-and-the-improving-standards-of-public-discourse/">reflections</a> on Niall Ferguson&#8217;s ill-chosen remarks in which he attributed Keynes&#8217;s economic views (I don&#8217;t actually know exactly what Keyesianism is, but I think a key part is for the government to run surpluses during economic booms and deficits during recessions) to the Keynes being gay and marrying a ballerina and talking about poetry.  The general idea, I think, is that people without kids don&#8217;t care so much about the future, and this motivated Keynes&#8217;s party-all-the-time attitude, which might have worked just fine for Eddie Murphy&#8217;s girlfriend in the 1980s and in San Francisco bathhouses of the 1970s but, according to Ferguson, is not the ticket for preserving today&#8217;s American empire.</p>
<p>Some of the more robust defenders of Ferguson may have been disappointed by his followup remarks:  &#8220;I should not have suggested . . . that Keynes was indifferent to the long run because he had no children, nor that he had no children because he was gay. This was doubly stupid. . . . My disagreements with Keynes’s economic philosophy have never had anything to do with his sexual orientation. It is simply false to suggest, as I did, that his approach to economic policy was inspired by any aspect of his personal life.&#8221;  It&#8217;s tough to try to defend a statement that was disowned by the person saying it.</p>
<p>But the question then arises:  What&#8217;s so horrible about what Ferguson said?  After all, it&#8217;s not unreasonable to think that someone&#8217;s personal circumstances will affect their political attitudes and their views on economic policy.  And certainly no one doubts that Keynes&#8217;s background as an upper-class British backgrounds was relevant for understanding his views.</p>
<p>So what was up?<br />
<span id="more-18924"></span><br />
I think the problem was that Ferguson did not seem to be engaging with an open mind but rather seemed to be just trying to throw mud at Keynes as if he (Ferguson) were operating a political campaign rather than engaging in academic inquiry. If Ferguson were to give a talk all about the connections between the personal circumstances and political beliefs of historical economists, I don’t think people would have a problem with it. He could mention Keynes’s sex life, Adam Smith’s pets, and anything else that might be relevant. Ferguson is a historian (or, as I assume he would say, an historian), and he’d be highly qualified to do that sort of thing. But to just throw culture-war words at his audience in an attempt to vaguely disparage Keynes, that’s just stupid. Again, nobody&#8217;s perfect, and Ferguson himself recognized the stupidity of his remarks.  I don&#8217;t think this will, or should, deter historians from connecting the lives and ideas of famous thinkers.  But it doesn&#8217;t look like that&#8217;s what was being done here; rather, Ferguson was doin some insinsuatin as a way to discredit a political opponent, which works better on the campaign trail than when coming from a scholar.</p>
<p>Regarding Ferguson himself, I hold with my <a href="http://themonkeycage.org/2012/09/12/niall-ferguson-crosses-the-john-yoo-line-the-paradox-of-influence/">view</a> from last year that he is the victim of the paradox of influence:</p>
<blockquote><p>No, I think what Ferguson is looking for (as am I, in my scholarly domain) is influence. He wants to make a difference. And one thing about being paid $50K for a lecture is that you can assume that whoever is paying you really wants to hear what you have to say.</p>
<p>The paradox, though, is that Ferguson gets and keeps the big-money audience is by telling them not what he (Ferguson) wants to say&#8212;not by giving them his unique insights and understanding&#8212;but rather by telling his audience what they want to hear. . . .</p>
<p>The paradox is that the anticipated influence becomes valueless if you end up saying whatever it takes to keep it.</p></blockquote>
<p>In this case, I think Ferguson went too far when he threw in some crowd-pleasing anti-Keynes remarks that didn&#8217;t please his crowd so much.  Ferguson took his bad-boy stance a bit too far.</p>
<p>As usual in such cases, I see a gap between what this guy is doing and what he has the potential to do. The Kenyes-is-a-poof remarks would’ve been unremarkable had they been made by a comedian or a politician or talk-show host. A historian can and should do better, and Ferguson himself recognized this by characterizing his remarks as “stupid.”</p>
<p>P.S.  Yes, I know I&#8217;m violating the title of <a href="http://andrewgelman.com/2011/12/03/absolutely-last-niall-ferguson-post-ever/">this post</a> from a couple years ago.  On the plus side, I&#8217;m posting this on a weekend, which is pretty much the time of minimum readership.  So, with luck, nobody will notice what I&#8217;m doing here.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/04/one-more-thought-on-hoover-historian-niall-fergusons-thing-about-keynes-being-gay-and-marrying-a-ballerina-and-talking-about-poetry/">One more thought on Hoover historian Niall Ferguson&#8217;s thing about Keynes being gay and marrying a ballerina and talking about poetry</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/04/one-more-thought-on-hoover-historian-niall-fergusons-thing-about-keynes-being-gay-and-marrying-a-ballerina-and-talking-about-poetry/feed/</wfw:commentRss>
		<slash:comments>37</slash:comments>
		</item>
		<item>
		<title>The Folk Theorem of Statistical Computing</title>
		<link>http://andrewgelman.com/2013/05/04/the-folk-theorem-of-statistical-computing/</link>
		<comments>http://andrewgelman.com/2013/05/04/the-folk-theorem-of-statistical-computing/#comments</comments>
		<pubDate>Sat, 04 May 2013 17:07:40 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Statistical computing]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18804</guid>
		<description><![CDATA[<p>From an email I received the other day: Things are going much better now &#8212; it&#8217;s interesting, it feels like with both of my models, parameters are slow to converge or get &#8220;stuck&#8221; and have trouble mixing when the model is somehow misspecified. See here for a statement of the folk theorem.</p><p>The post <a href="http://andrewgelman.com/2013/05/04/the-folk-theorem-of-statistical-computing/">The Folk Theorem of Statistical Computing</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>From an email I received the other day:</p>
<blockquote><p>Things are going much better now &#8212; it&#8217;s interesting, it feels like with both of my models, parameters are slow to converge or get &#8220;stuck&#8221; and have trouble mixing when the model is somehow misspecified.</p></blockquote>
<p>See <a href="http://andrewgelman.com/2009/05/24/handy_statistic/">here</a> for a statement of the folk theorem.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/04/the-folk-theorem-of-statistical-computing/">The Folk Theorem of Statistical Computing</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/04/the-folk-theorem-of-statistical-computing/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Jesus historian Niall Ferguson and the improving standards of public discourse</title>
		<link>http://andrewgelman.com/2013/05/04/jesus-historian-niall-ferguson-and-the-improving-standards-of-public-discourse/</link>
		<comments>http://andrewgelman.com/2013/05/04/jesus-historian-niall-ferguson-and-the-improving-standards-of-public-discourse/#comments</comments>
		<pubDate>Sat, 04 May 2013 14:05:11 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Decision Theory]]></category>
		<category><![CDATA[Sociology]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18917</guid>
		<description><![CDATA[<p>History professor (or, as the news reports call him, &#8220;Harvard historian&#8221;) Niall Ferguson got in trouble when speaking at a conference of financial advisors. Tom Kostigen reports: Ferguson responded to a question about Keynes&#8217; famous philosophy of self-interest versus the economic philosophy of Edmund Burke, who believed there was a social contract among the living, [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/04/jesus-historian-niall-ferguson-and-the-improving-standards-of-public-discourse/">Jesus historian Niall Ferguson and the improving standards of public discourse</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="http://i2.wp.com/andrewgelman.com/wp-content/uploads/2012/09/great-white-shark-jumping.jpg?resize=300%2C210" alt="" title="great-white-shark-jumping" class="alignnone size-medium wp-image-16364" data-recalc-dims="1" /></p>
<p>History professor (or, as the news reports call him, &#8220;Harvard historian&#8221;) Niall Ferguson got in trouble when speaking at a conference of financial advisors.  Tom Kostigen <a href="http://www.fa-mag.com/news/harvard-professor-gay-bashes-keynes-14173.html">reports</a>:<br />
<span id="more-18917"></span></p>
<blockquote><p>Ferguson responded to a question about Keynes&#8217; famous philosophy of self-interest versus the economic philosophy of Edmund Burke, who believed there was a social contract among the living, as well as the dead. Ferguson asked the audience how many children Keynes had. He explained that Keynes had none because he was a homosexual and was married to a ballerina, with whom he likely talked of &#8220;poetry&#8221; rather than procreated. . . . Ferguson . . . says it&#8217;s only logical that Keynes would take this selfish worldview because he was an &#8220;effete&#8221; member of society. . . .</p>
<p>Throughout his remarks, Ferguson referred to his &#8220;friends&#8221; in high places. They should all be embarrassed and ashamed of such a connection to such small-minded thinking. Ferguson says U.S. laws and institutions have become degenerate.</p></blockquote>
<p><a href="http://www.businessinsider.com/harvard-niall-ferguson-remarks-about-keynes-2013-5">According to</a> Henry Blodget, &#8220;Dan Jamieson at Investment News, also reported the remarks.&#8221;</p>
<p>It will be interesting to see what Ferguson&#8217;s next steps are.  I&#8217;m guessing he&#8217;ll eventually get to the full groveling apology, but other options include affirmation (&#8220;Yes, Keynes was a poof who didn&#8217;t know jack about econ!&#8221;), explanation (&#8220;Statistics show that gay and childless people invest less in the future&#8221;), flat-out denial (&#8220;I never said such a thing&#8221;), a mumbling quasi-denial (&#8220;I don&#8217;t recall saying such a thing . . . must have been misquoted&#8221;), an appeal to academic freedom (&#8220;No, I&#8217;m not embarrassed to speculate in a politically incorrect fashion&#8221;), or, of course, the non-apology apology (&#8220;I&#8217;m sorry that people chose to be offended over my remarks&#8221;).</p>
<p>In short, he can go in two directions:  (1) cut his losses or (2) aim to be a cult hero of political correctness in the Larry Summers mold (although I doubt that Larry Summers himself is going to endorse this particular anti-gay move).</p>
<p>In some ways, this is a higher-profile version of the choice that Ferguson had to make awhile ago, whether to follow his counterparts such as <a href="http://andrewgelman.com/2012/09/20/could-someone-please-lock-this-guy-and-niall-ferguson-in-a-room-together/">Jeffrey Frankel</a> and <a href="http://andrewgelman.com/2012/09/12/niall-ferguson-the-john-yoo-line-and-the-paradox-of-influence/">John Yoo</a> into embarrassing hackery, or stay on the academic straight and narrow (so to speak).  A couple years ago, the choice was to keep doing academic research/writing/learning or to make provocative speeches saying silly things to rich people for big bucks.  Now that Ferguson is upping the ante by antagonizing entirely new groups of people, he&#8217;ll have to decide where to go next.</p>
<p>The good news, though, is I think this whole story is a sign of improving standards of public discourse, at least when it comes to outspoken professors.  Back in the old days, there were respected academics who were Stalinists, Fascists, you name it.  Nowadays when you take an extreme position, you&#8217;re expected to defend it, otherwise you don&#8217;t get taken seriously any more.  (For example, academic policy advocates such as Krugman and Mankiw piss off lots of people, but they defend their controversial statements.)</p>
<p>P.S.  From his webpage:</p>
<blockquote><p>Niall Ferguson, MA, D.Phil., is the Laurence A. Tisch Professor of History at Harvard University. He is a resident faculty member of the Minda de Gunzburg Center for European Studies. He is also a Senior Research Fellow of Jesus College, Oxford University, and a Senior Fellow of the Hoover Institution, Stanford University.</p></blockquote>
<p>But he&#8217;s always referred to as &#8220;Harvard historian&#8221; or &#8220;Harvard professor&#8221; Niall Ferguson.  Doesn&#8217;t seem so fair to Oxford and Stanford.  I propose that future news articles refer to Ferguson as &#8220;Jesus historian&#8221; or &#8220;Hoover historian.&#8221;  These have a good ring to them, no?</p>
<p>P.P.S.  Ferguson chose the first option, <a href="http://www.niallferguson.com/blog/an-unqualified-apology">a complete apology</a>:</p>
<blockquote><p>I should not have suggested – in an off-the-cuff response that was not part of my presentation – that Keynes was indifferent to the long run because he had no children, nor that he had no children because he was gay. This was doubly stupid. First, it is obvious that people who do not have children also care about future generations. Second, I had forgotten that Keynes’s wife Lydia miscarried.</p>
<p>My disagreements with Keynes’s economic philosophy have never had anything to do with his sexual orientation. It is simply false to suggest, as I did, that his approach to economic policy was inspired by any aspect of his personal life. As those who know me and my work are well aware, I detest all prejudice, sexual or otherwise.</p></blockquote>
<p>This retraction seems like a reasonable move to me.  After all, everybody makes mistakes.</p>
<p>As I wrote in comments, I&#8217;m guessing that, in the heat of the moment, Ferguson thought that some Keynes-bashing would be popular with a business crowd.  What happened is, he misjudged his audience. He threw them the wrong flavor of red meat.</p>
<p>P.P.P.S.  More <a href="http://andrewgelman.com/2013/05/04/one-more-thought-on-hoover-historian-niall-fergusons-thing-about-keynes-being-gay-and-marrying-a-ballerina-and-talking-about-poetry/">here</a>.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/04/jesus-historian-niall-ferguson-and-the-improving-standards-of-public-discourse/">Jesus historian Niall Ferguson and the improving standards of public discourse</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/04/jesus-historian-niall-ferguson-and-the-improving-standards-of-public-discourse/feed/</wfw:commentRss>
		<slash:comments>52</slash:comments>
		</item>
		<item>
		<title>NYC Data Skeptics Meetup</title>
		<link>http://andrewgelman.com/2013/05/03/nyc-data-skeptics-meetup/</link>
		<comments>http://andrewgelman.com/2013/05/03/nyc-data-skeptics-meetup/#comments</comments>
		<pubDate>Fri, 03 May 2013 22:48:32 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Statistics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18914</guid>
		<description><![CDATA[<p>Rachel Schutt writes: The hype surrounding Big Data and Data Science is at a fever pitch with promises to solve the world&#8217;s business and social problems, large and small. How accurate or misleading is this message? How is it helping or damaging people, and which people? What opportunities exist for data nerds and entrepreneurs that [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/03/nyc-data-skeptics-meetup/">NYC Data Skeptics Meetup</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Rachel Schutt writes:</p>
<blockquote><p>The hype surrounding Big Data and Data Science is at a fever pitch with promises to solve the world&#8217;s business and social problems, large and small. How accurate or misleading is this message? How is it helping or damaging people, and which people? What opportunities exist for data nerds and entrepreneurs that examine the larger issues with a skeptical view?</p>
<p><a href="http://www.meetup.com/The-NYC-Data-Skeptics-Meetup/">This Meetup</a> focuses on mathematical, ethical, and business aspects of data from a skeptical perspective. Guest speakers will discuss the misuse of and best practices with data, common mistakes people make with data and ways to avoid them, how to deal with intentional gaming and politics surrounding mathematical modeling, and taking into account the feedback loops and wider consequences of modeling. We will take deep dives into models in the fields of Data Science, statistics, financial engineering, and economics.</p>
<p>This is an independent forum and open to anyone sharing an interest in the larger use of data. Technical aspects will be discussed, but attendees do not need to have a technical background.</p></blockquote>
<p>The post <a href="http://andrewgelman.com/2013/05/03/nyc-data-skeptics-meetup/">NYC Data Skeptics Meetup</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/03/nyc-data-skeptics-meetup/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Setting aside the politics, the debate over the new health-care study reveals that we’re moving to a new high standard of statistical journalism</title>
		<link>http://andrewgelman.com/2013/05/03/setting-aside-the-politics-the-debate-over-the-new-health-care-study-reveals-that-were-moving-to-a-new-high-standard-of-statistical-journalism/</link>
		<comments>http://andrewgelman.com/2013/05/03/setting-aside-the-politics-the-debate-over-the-new-health-care-study-reveals-that-were-moving-to-a-new-high-standard-of-statistical-journalism/#comments</comments>
		<pubDate>Fri, 03 May 2013 13:53:13 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Public Health]]></category>
		<category><![CDATA[Teaching]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18896</guid>
		<description><![CDATA[<p>Pointing to this news article by Megan McArdle discussing a recent study of Medicaid recipients, Jonathan Falk writes: Forget the interpretation for a moment, and the political spin, but haven’t we reached an interesting point when a journalist says things like: When you do an RCT with more than 12,000 people in it, and your [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/03/setting-aside-the-politics-the-debate-over-the-new-health-care-study-reveals-that-were-moving-to-a-new-high-standard-of-statistical-journalism/">Setting aside the politics, the debate over the new health-care study reveals that we&#8217;re moving to a new high standard of statistical journalism</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Pointing to this <a href="http://www.thedailybeast.com/articles/2013/05/01/shocker-oregon-health-study-shows-no-significant-health-impacts-from-joining-medicaid.html">news article</a> by Megan McArdle discussing a recent study of Medicaid recipients, Jonathan Falk writes:</p>
<blockquote><p>Forget the interpretation for a moment, and the political spin, but haven’t we reached an interesting point when a journalist says things like:</p>
<blockquote><p>When you do an RCT with more than 12,000 people in it, and your defense of your hypothesis is that maybe the study just didn&#8217;t have enough power, what you&#8217;re actually saying is &#8220;the beneficial effects are probably pretty small&#8221;.</p></blockquote>
<p>and</p>
<blockquote><p>A good Bayesian&#8212;and aren&#8217;t most of us are supposed to be good Bayesians these days?&#8212;should be updating in light of this new information. Given this result, what is the likelihood that Obamacare will have a positive impact on the average health of Americans? Every one of us, for or against, should be revising that probability downwards. I&#8217;m not saying that you have to revise it to zero; I certainly haven&#8217;t. But however high it was yesterday, it should be somewhat lower today.</p></blockquote>
</blockquote>
<p>This is indeed an excellent news article.  Also this sensible understanding of statistical significance and effect sizes:</p>
<blockquote><p>But that doesn&#8217;t mean Medicaid has no effect on health.  It means that Medicaid had no statistically significant effect on three major health markers during a two-year study.  Those are related, but not the same.  And in fact, all three markers moved in the right direction.  They just weren&#8217;t big enough to rule out the possibility that this was just random noise in the underlying data.  I&#8217;d say this suggests that it&#8217;s more likely than not that there is some effect&#8211;but also, more likely than not that this effect is small.</p></blockquote>
<p><span id="more-18896"></span><br />
The only flaw is this bit:</p>
<blockquote><p>There was, on the other hand, a substantial decrease in reported depression.  But this result is kind of weird, because it&#8217;s not coupled with a statistically significant increase in the use of anti-depressants.  So it&#8217;s not clear exactly what effect Medicaid is having.  I&#8217;m not throwing this out: depression&#8217;s a big problem, and this seems to be a big effect.  I&#8217;m just not sure what to make of it.  Does the mere fact of knowing you have Medicaid make you less depressed?</p></blockquote>
<p>McArdle is forgetting that <a href="http://www.stat.columbia.edu/~gelman/research/published/signif4.pdf">the difference between &#8220;significant&#8221; and &#8220;not significant&#8221; is not itself statistically significant</a>.  I have no idea whether the result is actually puzzling.  I just think that she was leaping too quickly from &#8220;A is significant and B is not&#8221; to &#8220;A and B contradict.&#8221;</p>
<p>Also I&#8217;d prefer she&#8217;d talk with some public health experts rather than relying on sources such as, &#8220;as Josh Barro pointed out on Twitter.&#8221;  I have nothing against Josh Barro, I just think it&#8217;s good if a journalist can go out and talk with people rather than just grabbing things off the twitter feed.</p>
<p>But these are minor points.  Overall the article is excellent.</p>
<p>With regard to the larger questions, I agree with McArdle that ultimately the goals are health and economic security, not health insurance or even health care.  She proposes replacing Medicaid with &#8220;free mental health clinics, or cash.&#8221;  The challenge is that we seem to have worked ourselves into an expensive, paperwork-soaked health-care system, and it&#8217;s not clear to me that free mental health clinics or even cash would do the trick.</p>
<p><strong>Other perspectives</strong></p>
<p>I did some searching and found <a href="http://theincidentaleconomist.com/wordpress/additional-thoughts-on-the-new-oregon-medicaid-results/">this post</a> by Aaron Carroll.  I agree with what Carroll wrote, except for the part where he says that he would not say that &#8220;p=0.07 is close to significant.&#8221;  I have no problem with saying p=0.07 is close to significant.  I think p-values are often more of a <a href="http://www.stat.columbia.edu/~gelman/research/published/pvalues3.pdf">hindrance</a> than a help, but if you&#8217;re going to use p=0.05 as a summary of evidence and call it &#8220;significant,&#8221; then, indded, 0.001 is &#8220;very significant,&#8221; 0.07 is &#8220;close to significant,&#8221; and so forth.  McArdle was confused on some of these issues too, most notably by mixing statistical significance with a Bayesian attitude.  I wouldn&#8217;t be so hard on either of these writers, though, as the field of statistics is itself in flux on these points.  Every time I write a new article on the topic, my own thinking changes a bit.</p>
<p>I see some specific disagreements between McArdle and Carroll:</p>
<p>1.  McArdle writes:</p>
<blockquote><p>Katherine Baicker, a lead researcher on the Oregon study, noted back in 2011, &#8220;people who signed up are pretty sick&#8221;.</p></blockquote>
<p>Carroll writes:</p>
<blockquote><p>Most people who get health insurance are healthy. They’re not going to get “healthier”.</p></blockquote>
<p>This seems like a factual (or, at least, a definitional) disagreement.</p>
<p>2.  McArdle:</p>
<blockquote><p>We heard that 150,000 uninsured people had died between 2000 and 2006.  Or maybe more.  With the implication that if we just passed this new law, we&#8217;d save a similar number of lives in the future.  Which is one reason why the reaction to this study from Obamacare&#8217;s supporters has frankly been a bit disappointing.</p></blockquote>
<p>Carroll:</p>
<blockquote><p>This was Medicaid for something like 10,000 people in Oregon. The ACA was supposed to be a Medicaid expansion for 16,000,000 across the country. If 8 people’s lives in the study were saved in some way by the coverage, the total statistic holds.</p></blockquote>
<p>(16,000,000/10,000)*8*7 = 90,000, so that&#8217;s in the ballpark of the claimed 150,000 in seven years.  I&#8217;m guessing that McArdle&#8217;s would reply that there&#8217;s no evidence that 8 people&#8217;s lives were saved in the Oregon study.  Thus, numbers such as 100,000 lives saved are <em>possible</em>, but other things are possible too.</p>
<p><strong>The bottom line</strong></p>
<p>What does this all mean in policy terms?  McArdle describes Obamacare as &#8220;a $1 trillion program to treat mild depression.&#8221;  I&#8217;m not sure where the trillion dollars comes from.  A famous graph <a href="http://andrewgelman.com/2009/12/30/healthcare_spen/">shows</a> U.S. health care spending at $7000 per person per year, that&#8217;s a total of 2.1 trillion dollars a year.  I&#8217;m assuming that the Obama plan would not increase this to 3.1 trillion!  Maybe it is projected to increase annual spending to 2.3 trillion, which would correspond to an additional trillion over a five-year period?  In any case, that sounds pretty expensive.  Given that other countries with better outcomes spend half as much as we do, I&#8217;d hope a new health-care plan would reduce costs, not increase them.  But that&#8217;s politics:  the people who are currently getting these 2.1 trillion dollars don&#8217;t want to give up any of their share!  The other half of McArdle&#8217;s quote (&#8220;mild depression&#8221;) sounds to me like a bit of rhetoric.  If a policy will reduce mild depression, I assume it would have some eventual effect on severe depression too, no?</p>
<p>Beyond this, I can&#8217;t comment.  I&#8217;m like many (I suspect, most) Americans who already have health insurance in that I don&#8217;t actually know what&#8217;s in that famous health-care bill.  I mean, sure, I know there&#8217;s something about every American getting coverage, but I don&#8217;t know anything beyond that.  So I&#8217;m in no position to say anything more on the topic.  I&#8217;ll just link to <a href="http://marginalrevolution.com/marginalrevolution/2013/05/a-few-remarks-on-the-oregon-medicaid-study.html">Tyler Cowen</a>, who, I assume, actually knows what&#8217;s in the law and has further comments on the issue.</p>
<p>Let me conclude where I began, with an appreciation of the high quality of statistical journalism today.   In her news article, McArdle shows the sort of nuanced understanding of statistics and evidence that I don&#8217;t think was out there, twenty years ago.  And she&#8217;s not the only one.  Journalists as varied as Felix Salmon, Nate Silver, and Sharon Begley are all doing the good work, writing about newsworthy topics in a way that acknowledges uncertainty.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/03/setting-aside-the-politics-the-debate-over-the-new-health-care-study-reveals-that-were-moving-to-a-new-high-standard-of-statistical-journalism/">Setting aside the politics, the debate over the new health-care study reveals that we&#8217;re moving to a new high standard of statistical journalism</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/03/setting-aside-the-politics-the-debate-over-the-new-health-care-study-reveals-that-were-moving-to-a-new-high-standard-of-statistical-journalism/feed/</wfw:commentRss>
		<slash:comments>39</slash:comments>
		</item>
		<item>
		<title>Culture clash</title>
		<link>http://andrewgelman.com/2013/05/02/culture-clash/</link>
		<comments>http://andrewgelman.com/2013/05/02/culture-clash/#comments</comments>
		<pubDate>Fri, 03 May 2013 02:26:36 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Political Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18897</guid>
		<description><![CDATA[<p>I had no idea this sort of thing even existed: I&#8217;m reminded of our discussion of Charles Murray&#8217;s recent book on social divisions among Americans. Murray talked about differences between upper and lower class, but I thought he was really talking more about differences between liberals and conservatives among the elite. (More discussion here.) In [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/02/culture-clash/">Culture clash</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.nytimes.com/2013/05/02/us/kentucky-boy-5-kills-sister-2.html"><img src="http://i1.wp.com/themonkeycage.org/wp-content/uploads/2013/05/Screen-Shot-2013-05-02-at-10.13.04-PM.png?resize=467%2C414" alt="Screen Shot 2013-05-02 at 10.13.04 PM" class="alignnone size-full wp-image-29872" data-recalc-dims="1" /></a></p>
<p>I had no idea this sort of thing even existed:</p>
<p><a href="http://web.archive.org/web/20111216002457/http://www.crickett.com/pdf/CrickettNewsletter.pdf"><img src="http://i1.wp.com/themonkeycage.org/wp-content/uploads/2013/05/Screen-Shot-2013-05-02-at-10.14.43-PM.png?resize=512%2C503" alt="Screen Shot 2013-05-02 at 10.14.43 PM" class="alignnone size-large wp-image-29874" data-recalc-dims="1" /></a></p>
<p>I&#8217;m reminded of our <a href="http://www.stat.columbia.edu/~gelman/research/published/murray1.pdf">discussion</a> of Charles Murray&#8217;s recent book on social divisions among Americans.  Murray talked about differences between upper and lower class, but I thought he was really talking more about differences between liberals and conservatives among the elite.  (More discussion <a href="http://andrewgelman.com/2013/04/23/charles-murrays-coming-apart-and-the-measurement-of-social-and-political-divisions/">here</a>.)</p>
<p>In this particular case, Murray&#8217;s story about irresponsible elites seems to fit pretty well.  At the elite level, you have well-connected D.C. gun lobbyists opposing any restrictions on personal weapons.  As Murray might put it, the elites (Phil Spector aside) may be able to handle their guns, but some lower-class Americans cannot&#8212;they do things like give real rifles to 5-year-olds (!).  As Murray writes, it&#8217;s a combination of cultural ignorance and a permissive ideology:  I assume the senators who voted against the recent gun control bill wouldn&#8217;t give live weapons to <em>their</em> kids (or live in neighborhoods in which kids have access to guns at home), but they don&#8217;t feel right about restricting the rights of others to do so.</p>
<p><strong>P.S.</strong>  After reading some comments, I thought it might help to clarify two points.<br />
<span id="more-18897"></span><br />
First, my main point here was, as is noted in the title of the post, this is a culture clash.  In some parts of the country, people are giving real live guns to their 5-year-olds and teaching them the four rules of gun safety.  Here where I live, I had no idea that there was such a thing as a company that sold children&#8217;s guns with cute little cartoon characters.</p>
<p>Second, I don&#8217;t mean to overrate Charles Murray&#8217;s ideas, which are more of a set of interesting speculations than an all-encompassing theory.  (This is fine, speculations are important, I&#8217;m not knocking Murray here.)  For example, consider elites&#8217; reactions to proposed laws restricting cigarette smoking.  If a group of elite Americans support such a law, Murray can argue that this represents their lack of understanding of ordinary American culture:  elites mostly don&#8217;t smoke, and they can&#8217;t understand people who do.  Conversely, if a group of elites oppose an anti-smoking law, Murray can argue that these elites are supporting a permissive, dangerous ideology that they do not themselves believe for their own lives.  This is the tension that I noted in my review of Murray&#8217;s book, where elites are alternately urged to show understanding for the unfamiliar lifestyles of lower-class Americans, and to show moral leadership by criticizing the mistaken lifestyle choices of such people.  Murray&#8217;s reasoning goes in these two opposite directions, so when I say that Murray&#8217;s story fits pretty well in this example, I can only really be referring to part of Murray&#8217;s argument, not all of it.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/02/culture-clash/">Culture clash</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/02/culture-clash/feed/</wfw:commentRss>
		<slash:comments>27</slash:comments>
		</item>
		<item>
		<title>7 ways to separate errors from statistics</title>
		<link>http://andrewgelman.com/2013/05/02/7-ways-to-separate-errors-from-statistics/</link>
		<comments>http://andrewgelman.com/2013/05/02/7-ways-to-separate-errors-from-statistics/#comments</comments>
		<pubDate>Thu, 02 May 2013 13:15:42 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Decision Theory]]></category>
		<category><![CDATA[Economics]]></category>
		<category><![CDATA[Miscellaneous Statistics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18889</guid>
		<description><![CDATA[<p>Betsey Stevenson and Justin Wolfers have been inspired by the recent Reinhardt and Rogoff debacle to list &#8220;six ways to separate lies from statistics&#8221; in economics research: 1. &#8220;Focus on how robust a finding is, meaning that different ways of looking at the evidence point to the same conclusion.&#8221; 2. Don&#8217;t confuse statistical with practical [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/02/7-ways-to-separate-errors-from-statistics/">7 ways to separate errors from statistics</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2013/05/sharing.jpg?8b0eec" alt="sharing" width="275" height="183" class="alignnone size-full wp-image-18892" /></p>
<p>Betsey Stevenson and Justin Wolfers have been inspired by the recent Reinhardt and Rogoff <a href="http://andrewgelman.com/2013/04/16/memo-to-reinhart-and-rogoff-i-think-its-best-to-admit-your-errors-and-go-on-from-there/">debacle</a> to <a href="http://www.bloomberg.com/news/2013-05-01/six-ways-to-separate-lies-from-statistics.html">list</a> &#8220;six ways to separate lies from statistics&#8221; in economics research:</p>
<blockquote><p>1. &#8220;Focus on how robust a finding is, meaning that different ways of looking at the evidence point to the same conclusion.&#8221;</p>
<p>2.  Don&#8217;t confuse statistical with practical significance.</p>
<p>3.  &#8220;Be wary of scholars using high-powered statistical techniques as a bludgeon to silence critics who are not specialists.&#8221;</p>
<p>4.  &#8220;Don’t fall into the trap of thinking about an empirical finding as &#8216;right&#8217; or &#8216;wrong.&#8217; At best, data provide an imperfect guide.&#8221;</p>
<p>5.  &#8220;Don’t mistake correlation for causation.&#8221;</p>
<p>6.  &#8220;Always ask &#8216;so what?&#8217;&#8221;</p></blockquote>
<p>I like all these points, especially #4, which I think doesn&#8217;t get said enough.  As I <a href="http://andrewgelman.com/2013/01/10/that-controversial-claim-that-high-genetic-diversity-or-low-genetic-diversity-is-bad-for-the-economy/">wrote</a> a few months ago, high-profile social science research aims for proof, not for understanding—and that’s a problem.</p>
<p><strong>My addition to the list</strong></p>
<p>If you compare my title above to that of Stevenson and Wolfers, you&#8217;ll find two differences.  First, I changed &#8220;lies&#8221; to &#8220;errors.&#8221;  I have no idea who&#8217;s lying, and I&#8217;m much more comfortable talking about errors.  Second, I think they missed an even better, more general way to find mistakes:</p>
<blockquote><p>7.  Make your data and analysis public.</p></blockquote>
<p>This is the best approach, because now you can have lots of strangers checking your work for free!  This advice is also particularly appropriate for Reinhardt and Rogoff because, according to various reports (see <a href="http://bilbo.economicoutlook.net/blog/?p=23467">here</a> and <a href="http://www.cepr.net/index.php/blogs/beat-the-press/not-following-professional-ethics-matters-also">here</a>), it was years before they made their data available to outsiders.  Nearly three years ago (!), Dean Baker wrote a column entitled, &#8220;It Would Be Helpful if Rogoff and Reinhart Made Their Data Available.&#8221; </p>
<p>Perhaps &#8220;the risk of forced disclosure&#8221; (as Keith O&#8217;Rourke puts it) will motivate researchers to be more careful in the future.</p>
<p><strong>Your additions?</strong></p>
<p>I told Wolfers I was going to link to his list and add my own #7.  He replied that we&#8217;re probably missing #8, 9, and 10.  In the comments, feel free to add <em>your</em> favorite ways to separate errors from statistics.  Phil already gave some <a href="http://andrewgelman.com/2013/04/17/data-problems-coding-errors-what-can-be-done/">here</a>.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/02/7-ways-to-separate-errors-from-statistics/">7 ways to separate errors from statistics</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/02/7-ways-to-separate-errors-from-statistics/feed/</wfw:commentRss>
		<slash:comments>35</slash:comments>
		</item>
		<item>
		<title>A graph at war with its caption.  Also, how to visualize the same numbers without giving the display a misleading causal feel?</title>
		<link>http://andrewgelman.com/2013/05/01/a-graph-at-war-with-its-caption-also-how-to-visualize-the-same-numbers-without-giving-the-display-a-misleading-causal-feel/</link>
		<comments>http://andrewgelman.com/2013/05/01/a-graph-at-war-with-its-caption-also-how-to-visualize-the-same-numbers-without-giving-the-display-a-misleading-causal-feel/#comments</comments>
		<pubDate>Wed, 01 May 2013 13:51:10 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Political Science]]></category>
		<category><![CDATA[Statistical graphics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18799</guid>
		<description><![CDATA[<p>Kaiser Fung discusses the following graph that is captioned, &#8220;A study of 54 nations&#8211;ranked below&#8211;found that those with more progressive tax rates had happier citizens, on average.&#8221; As Kaiser writes, &#8220;from a purely graphical perspective, the chart is well executed . . . they have 54 points, and the chart still doesn&#8217;t look too crammed [...]</p><p>The post <a href="http://andrewgelman.com/2013/05/01/a-graph-at-war-with-its-caption-also-how-to-visualize-the-same-numbers-without-giving-the-display-a-misleading-causal-feel/">A graph at war with its caption.  Also, how to visualize the same numbers without giving the display a misleading causal feel?</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Kaiser Fung <a href="http://junkcharts.typepad.com/junk_charts/2013/04/spin-spin-spin-away.html">discusses</a> the following <a href="http://www.nytimes.com/interactive/2013/04/14/sunday-review/mirth-and-taxes.html?_r=0">graph</a> that is captioned, &#8220;A study of 54 nations&#8211;ranked below&#8211;found that those with more progressive tax rates had happier citizens, on average.&#8221;</p>
<p><a href="http://i1.wp.com/themonkeycage.org/wp-content/uploads/2013/04/6a00d8341e992c53ef017d43093cb8970c.png"><img src="http://i1.wp.com/themonkeycage.org/wp-content/uploads/2013/04/6a00d8341e992c53ef017d43093cb8970c.png?resize=313%2C457" alt="6a00d8341e992c53ef017d43093cb8970c" class="alignnone size-full wp-image-29674" data-recalc-dims="1" /></a></p>
<p>As Kaiser writes, &#8220;from a purely graphical perspective, the chart is well executed . . . they have 54 points, and the chart still doesn&#8217;t look too crammed . . .&#8221;  But he also points out that the graph&#8217;s implicit claims (that tax rates can explain happiness or cause more happiness) are not supported.</p>
<p>Kaiser and I are not being picky-picky-picky here.  Taken literally, the graph title says nothing about causation, but I think the phrasing implies it.  Also, from a purely descriptive perspective, the graph is somewhat at war with its caption.  The caption announces a relationship, but in the graph, the x and y variables have only a very weak correlation.  The caption says that happiness and progressive tax rates go together, but the graph uses the U.S. as a baseline, and when you move from the U.S. point on the graph to the right-hand side (more progressive taxes), you see a lot more points below the line than above the line.  Thus the visual impression of the graph is that more progressive taxes will lead to <em>lower</em> happiness&#8212;the opposite of the message from the caption.</p>
<p>What can be done here?</p>
<p>I don&#8217;t exactly think the graph is &#8220;bad data,&#8221; and, although the graph says little directly about causation, the data have <em>some</em> relevance to our understanding of policy debates over taxes.  If nothing else, we learn that tax progressivity and average happiness some variation among countries.  I think a start would be to reframe and put happiness on the x-axis and the tax system on the y-axis, which would allow us to see that, at any happiness level, there is a range of tax systems. with none of the very happiest countries having flat taxes.</p>
<p>Better still might be to make a line plot with three columns:  First, a list of country names, in decreasing order from richest to poorest (using, for example, per-capita GDP (yes, I know, such data aren&#8217;t <a href="http://andrewgelman.com/2009/03/16/what_is_russias/">perfect</a>!)), then a column showing tax progressivity (if that&#8217;s the measure they want to use), then a column showing average happiness.</p>
<p>The advantage of this pair of dotplots is that you get to see the spread in each of these variables with respect to a natural measure (how rich the country is), and there&#8217;s no implicit causal story getting in the way.</p>
<p>The post <a href="http://andrewgelman.com/2013/05/01/a-graph-at-war-with-its-caption-also-how-to-visualize-the-same-numbers-without-giving-the-display-a-misleading-causal-feel/">A graph at war with its caption.  Also, how to visualize the same numbers without giving the display a misleading causal feel?</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/05/01/a-graph-at-war-with-its-caption-also-how-to-visualize-the-same-numbers-without-giving-the-display-a-misleading-causal-feel/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>“Tragedy of the science-communication commons”</title>
		<link>http://andrewgelman.com/2013/04/30/tragedy-of-the-science-communication-commons/</link>
		<comments>http://andrewgelman.com/2013/04/30/tragedy-of-the-science-communication-commons/#comments</comments>
		<pubDate>Tue, 30 Apr 2013 13:36:53 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Statistics]]></category>
		<category><![CDATA[Sociology]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18716</guid>
		<description><![CDATA[<p>I&#8217;ve earlier written that science is science communication&#8212;that is, the act of communicating scientific ideas and findings to ourselves and others is itself a central part of science. My point was to push against a conventional separation between the act of science and the act of communication, the idea that science is done by scientists [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/30/tragedy-of-the-science-communication-commons/">&#8220;Tragedy of the science-communication commons&#8221;</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>I&#8217;ve earlier written that <a href="http://andrewgelman.com/2013/02/22/science-is-science-communication/">science is science communication</a>&#8212;that is, the act of communicating scientific ideas and findings to ourselves and others is itself a central part of science.  My point was to push against a conventional separation between the act of science and the act of communication, the idea that science is done by scientists and communication is done by communicators.  It&#8217;s a rare bit of science that does not include communication as part of it.  As a scientist and science communicator myself, I&#8217;m particularly sensitive to devaluing of communication.  (For example, Bayesian Data Analysis is full of original research that was done in order to communicate; or, to put it another way, we often think we understand a scientific idea, but once we try to communicate it, we recognize gaps in our understanding that motivate further research.)</p>
<p>I once saw the following on one of those inspirational-sayings-for-every-day desk calendars: “To have ideas is to gather flowers. To think is to weave them into garlands.”  Similarly, writing&#8212;more generally, communication to oneself or others&#8212;forces logic and structure, which are central to science.</p>
<p>Dan Kahan <a href="http://www.culturalcognition.net/blog/2013/2/22/is-a-gelman-trying-to-provoke-me-or-is-that-just-my-narcissi.html">saw</a> what I wrote and responded by flipping it around:  He pointed out that there is a science of science communication.  As scientists, we should move beyond the naive view of communication as the direct imparting of facts and ideas.  We should think more systematically about how communications are produced and how they are understood by their immediate and secondary recipients.</p>
<p>The science of science communication is still in its early stages, and I&#8217;m glad that people such as Kahan are working on it.  <a href="http://www.culturalcognition.net/blog/2013/3/9/tragedy-of-the-science-communication-commons-lecture-summary.html">Here&#8217;s something he wrote recently</a> explicating his theory of cultural cognition:</p>
<blockquote><p>The motivation behind this research has been to understand the science communication problem. The “science communication problem” (as I use this phrase) refers to the failure of valid, compelling, widely available science to quiet public controversy over risk and other policy relevant facts to which it directly speaks. The climate change debate is a conspicuous example, but there are many others, including (historically) the conflict over nuclear power safety, the continuing debate over the risks of HPV vaccine, and the never-ending dispute over the efficacy of gun control. . . . The research I will describe reflects the premise that making sense of these peculiar packages of types of people and sets of factual beliefs is the key to understanding—and solving—the science communication problem. The cultural cognition thesis posits that people’s group commitments are integral to the mental processes through which they apprehend risk. . . .</p></blockquote>
<p>I think of Kahan as part of a loose network of constructive skeptics, along with various people including Thomas Basbøll, John Ioannidis, the guys at Retraction Watch, pissed-off scholars such as Stan Liebowitz, bloggers such as Felix Salmon, and a whole bunch of psychology researchers such as Wicherts, Wagenmakers, Simonsohn, Nosek, etc.  This is not to represent a complete list but rather is intended to give a sense of the different aspect of this movement-without-a-name.  10 or 20 or 30 years ago, I don&#8217;t think such a movement existed.  There were concerns about individual studies or research programs but not such a sense of a statistics-centered crisis in science as a whole.</p>
<p>The post <a href="http://andrewgelman.com/2013/04/30/tragedy-of-the-science-communication-commons/">&#8220;Tragedy of the science-communication commons&#8221;</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/30/tragedy-of-the-science-communication-commons/feed/</wfw:commentRss>
		<slash:comments>44</slash:comments>
		</item>
		<item>
		<title>Giving credit where due</title>
		<link>http://andrewgelman.com/2013/04/29/giving-credit-where-due/</link>
		<comments>http://andrewgelman.com/2013/04/29/giving-credit-where-due/#comments</comments>
		<pubDate>Tue, 30 Apr 2013 00:47:27 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Sports]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18862</guid>
		<description><![CDATA[<p>Gregg Easterbrook may not always be on the ball, but I 100% endorse the last section of his recent column (scroll down to &#8220;Absurd Specificity Watch&#8221;). Earlier in the column, Easterbrook has a plug for Tim Tebow. I&#8217;d forgotten about Tim Tebow.</p><p>The post <a href="http://andrewgelman.com/2013/04/29/giving-credit-where-due/">Giving credit where due</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Gregg Easterbrook may not <a href="http://deadspin.com/5970264/the-27-dumbest-things-gregg-easterbrook-wrote-in-2012">always</a> be on the ball, but I 100% endorse the last section of his recent <a href="http://espn.go.com/espn/playbook/story/_/id/9200935/tuesday-morning-quarterback-returns-mock-nfl-mock-drafts">column</a> (scroll down to &#8220;Absurd Specificity Watch&#8221;).</p>
<p>Earlier in the column, Easterbrook has a plug for Tim Tebow.  I&#8217;d forgotten about Tim Tebow.</p>
<p>The post <a href="http://andrewgelman.com/2013/04/29/giving-credit-where-due/">Giving credit where due</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/29/giving-credit-where-due/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>The Great Race</title>
		<link>http://andrewgelman.com/2013/04/29/the-great-race/</link>
		<comments>http://andrewgelman.com/2013/04/29/the-great-race/#comments</comments>
		<pubDate>Mon, 29 Apr 2013 17:53:13 +0000</pubDate>
		<dc:creator>Phil</dc:creator>
				<category><![CDATA[Sports]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18613</guid>
		<description><![CDATA[<p>This post is by Phil. Last summer my wife and I took a 3.5-month vacation that included a wide range of activities. When I got back, people would ask &#8220;what were the highlights or your trip?&#8221;, and I was somewhat at a loss: we had done so many things that were so different, many of [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/29/the-great-race/">The Great Race</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>This post is by Phil.</p>
<p>Last summer my wife and I took a 3.5-month vacation that included a wide range of activities. When I got back, people would ask &#8220;what were the highlights or your trip?&#8221;, and I was somewhat at a loss: we had done so many things that were so different, many of which seemed really great&#8230;how could I pick? Someone said, wisely, that in six months or a year I&#8217;d be able to answer the question because some memories would be more vivid than others. They were right, and I was recently thinking back on our vacation and putting together a list of highlights &#8212; enjoyable in itself, but also worth doing to help plan future vacations.</p>
<p>One of the things we did was go to four evenings of track and field events at the London Olympics. After we got back, people would ask what we had seen at the Olympics. I would say &#8220;We saw Usain Bolt run the 200m, we saw the women&#8217;s 4x100m relay and the men&#8217;s 4&#215;400, we saw the last events of the decathlon&#8230;lots of great stuff. But my favorite was the men&#8217;s 800m.&#8221;</p>
<p>Trying to figure out why that was one of my favorite events to watch, I looked up some <a href="http://en.wikipedia.org/wiki/Athletics_at_the_2012_Summer_Olympics_–_Men's_800_metres" target="_blank">facts and statistics</a> about the race. Perhaps unexpectedly, I think that some of the things that made it great, as both an athletic contest and a spectacle, are reflected in the stats.</p>
<p><span id="more-18613"></span></p>
<p>The 800m is not one of track&#8217;s marquee events: it&#8217;s much too long for an all-out sprint, so it doesn&#8217;t have the intensity of a 100m or 200m. It has even less of a following than most track events, which is saying something: I&#8217;m going to guess that most readers of this blog cannot name a single U.S. track athlete, and that only a tiny fraction can name a competitor in that 800m Olympics final.</p>
<p>Me, I&#8217;m more of a track fan than most &#8212; my wife and I have gotten interested, especially since discovering that during our fitness workouts we were sometimes sharing the track with <a href="http://alysiamontano.com">Alysia Montano</a> and  <a href="http://en.wikipedia.org/wiki/Magdalena_Lewy-Boulet">Magdalena Lewy-Boulet</a>, and we spectate at track meets occasionally &#8212; but although I recognized a few of the names in the 800m Olympics final I didn&#8217;t really have any context for what I was about to see. Like most people in the stadium, I expect, I was looking forward to the highly-anticipated 200m final later in the evening and hadn&#8217;t even thought about the 800. That 200m final was exciting and electrifying, but to me it was no match for the 800.</p>
<p>When the race started&#8230;it&#8217;s hard to say how this could possibly be, but I could just tell that something special was happening. Maybe everybody in the stadium could tell.  One of the runners &#8212; David Rudisha of Kenya, the record-holder at the time &#8212; went to the front and was pushing hard, and everyone else was going hard too.  Of course.  It&#8217;s the Olympics! Finals! Of course they&#8217;re running hard!  Well, yes, but&#8230; it&#8217;s hard to explain&#8230;it just seemed more intense even than the other finals we had seen, which seems paradoxical because, this being the 800m, the runners weren&#8217;t running as fast.  I&#8217;ve tried to tell if you can see in<a href="http://www.youtube.com/watch?v=YKEOjWEzVGs" target="_blank"> the video</a> what I&#8217;m talking about (the race starts at about 3:15), but I&#8217;m not really sure, you tell me. (One thing that you can see on the video is how easily these guys seem to cruise through the 400m mark in under 50 seconds, incredible).</p>
<p>So, here&#8217;s what happened:</p>
<p>Rudisha, the pre-race favorite, won the event, beating his own previous world record.</p>
<p>The times flashed up on the scoreboard, with indicators to say if the time was a World Record, National Record, Junior Record, Personal Record, or Season&#8217;s Best.  That column on the scoreboard looked like this:WR, NR, PB, PB, PB, NR, SB, PB  In other words, seven out of the eight runners had run their personal best time.</p>
<p>That&#8217;s all I knew at the time, and it&#8217;s only recently that I learned some other facts: All eight athletes went under 1:44, the first time that has ever happened; and the first place time was (obviously) the fastest first place in history, but also, the second place was the fastest second place in history, and third was the fastest third in history, and so on all the way down the line.</p>
<p>&nbsp;</p>
<p>It obviously takes a special effort to set a personal record, and those records usually come when the athlete has something at stake: the chance to finish on the podium at their national championships, or to qualify for the Olympic team, or whatever.  For a guy who is rounding the final curve 20 meters behind the leaders, a guy who is going to finish in 8th place, to push himself to the fastest that he has ever run, that seems to me to be even more remarkable than someone running a personal best in order to set a new world record. I tip my hat.</p>
<p>So  I think it wasn&#8217;t the effort by the top finishers that made the race seem so special at the time &#8212; <del>we had seen other records set</del>[correction, this was the first WR we saw], we would see records set the following day, and they were great and exciting but they didn&#8217;t have the same magic for me. I think what made this special was the effort all up and down the line, every runner trying his hardest right to the end. Maybe the best 800m races to watch are the ones where the last-place finishers run their hardest, not where the first-place finishers run theirs. Just a thought.</p>
<p>This post is by Phil.</p>
<p>The post <a href="http://andrewgelman.com/2013/04/29/the-great-race/">The Great Race</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/29/the-great-race/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>The blogroll</title>
		<link>http://andrewgelman.com/2013/04/29/the-blogroll/</link>
		<comments>http://andrewgelman.com/2013/04/29/the-blogroll/#comments</comments>
		<pubDate>Mon, 29 Apr 2013 13:03:06 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Decision Theory]]></category>
		<category><![CDATA[Economics]]></category>
		<category><![CDATA[Literature]]></category>
		<category><![CDATA[Miscellaneous Science]]></category>
		<category><![CDATA[Miscellaneous Statistics]]></category>
		<category><![CDATA[Political Science]]></category>
		<category><![CDATA[Statistical graphics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18837</guid>
		<description><![CDATA[<p>I encourage you to check out our linked blogs. Here&#8217;s what they&#8217;re all about: Cognitive and Behavioral Science BPS Research Digest: I haven&#8217;t been following this one recently, but it has lots of good links, I should probably check it more often. There are a couple things that bother me, though. The blog is sponsored [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/29/the-blogroll/">The blogroll</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2013/04/links.jpg?8b0eec" alt="Chain Links" width="400" height="300" class="alignnone size-full wp-image-18843" /></p>
<p>I encourage you to check out our <a href="http://andrewgelman.com/blogroll/">linked blogs</a>.  Here&#8217;s what they&#8217;re all about:</p>
<p><strong>Cognitive and Behavioral Science</strong></p>
<p><a href="http://www.bps-research-digest.blogspot.com">BPS Research Digest</a>:  I haven&#8217;t been following this one recently, but it has lots of good links, I should probably check it more often.  There are a couple things that bother me, though.  The blog is sponsored by the British Psychological Society, so this sounds pretty serious.  But then they run things like advertising <a href="http://www.bps-research-digest.blogspot.com/2013/04/5-chances-to-win-cutting-edge.html">promotions</a> sponsored by a textbook company and highlight iffy experimental claims.  For example, in 2010 they ran a wholly uncritical <a href="http://bps-research-digest.blogspot.com/2010/11/dramatic-study-shows-participants-are.html">post</a> on the notorious Daryl Bem study that purported to find ESP.  After being called on it in the comments, the blogger (Christian Jarrett) responded with, &#8220;The stats appear sound. . . . it&#8217;s a great study. Rigorously conducted&#8221; and even defended &#8220;the discussion of quantum physics in the paper.&#8221;  To be fair, though, and as he <a href="http://andrewgelman.com/2013/04/29/the-blogroll/#comment-145487">points out</a> in comments, Jarrett wrote of Bem&#8217;s study:  &#8220;this isn’t proof of psi, far from it. Needs to be replicated. I like how Bem has used standard psychological tasks as a way to explore psi. Makes it easier for other labs to try to replicate.&#8221;  Jarrett writes that he tries to &#8220;strike a balance between promotion and skepticism of new findings.&#8221;  Fair enough.</p>
<p><a href="http://www.decisionsciencenews.com">Decision Science News</a>:  A mix of conference announcements and reports of new research.  <a href="http://www.decisionsciencenews.com/2013/04/16/3-of-doctors-receive-half-the-complaints/">Here&#8217;s</a> a typical example.  I love this stuff; others might find it a bit technical.  Also, this blog runs ads.  I wonder how much the advertisers pay?  I can&#8217;t imagine anyone would pay enough to a niche blogger to make the ads worth it.  I mean, sure, if an advertiser offered me enough money for me to hire a postdoc, I&#8217;d do it, but I can only imagine we&#8217;re talking really small amounts of money.  A topic of discussion for Decision Science News, perhaps?</p>
<p><a href="http://languagelog.ldc.upenn.edu/nll/">Language Log</a>:  Not much needs to be said here.  This one&#8217;s a classic blog with lots of statistical content, remains strong after all these years.</p>
<p><a href="http://blog.sethroberts.net">Seth Roberts</a>:  I disagree with him on climate change denial, Holocaust denial, etc.  Still, he&#8217;s a pioneer of self-experimentation.  I hope that the next generation of psychology or medical research involves an integration of informal experimentation with statistical controls.</p>
<p><a href="http://hardsci.wordpress.com">The Hardest Science</a>:  Mostly revolves around reproducible research.  It&#8217;s where I heard the <a href="http://andrewgelman.com/2012/12/16/its-unscientific-behavior-not-to-admit-an-error/">story</a> of the lamest, grudgingest, non-retraction retraction ever.</p>
<p><strong>Cultural</strong></p>
<p><a href="http://jennydavidson.blogspot.com">Light Reading</a>:  She&#8217;s like me, she likes to write and has a lot of energy.  I&#8217;m still <a href="http://andrewgelman.com/2013/04/06/calling-jenny-davidson/">wondering</a> what she will think of Debutante Hill (I&#8217;ll lend her my copy).</p>
<p><a href="http://www.listsofnote.com">Lists and Letters of Note</a>:  Great stuff but not much new material lately; he says he&#8217;s busy working on a book.</p>
<p><a href="http://lovetheliberry.blogspot.com">Love the Liberry</a>:  Amazingly enough, they keep coming up with good material.</p>
<p><a href="http://paperpools.blogspot.com">Paperpools</a>:  Not much material lately.  As it should be.  We want Helen DeWitt to be writing novels, not blogging!</p>
<p><a href="http://secondlanguage.blogspot.com">Research as a Second Language</a>:  Anti-charismatic self-help advice.  The alternative to those omnipresent shouting, obnoxious internet gurus.</p>
<p><a href="http://www.streetsblog.org">Streetsblog</a>:  Good stuff.  Ideally this would all be in your daily newspaper.  I don&#8217;t read it too often; if I did, I&#8217;d be too angry to think about anything else all day.</p>
<p><strong>Sister Blogs</strong></p>
<p><a href="http://www.themonkeycage.org">The Monkey Cage</a>:  Sometimes I simul-post, other times I&#8217;ll rant there and then link from here.  (<a href="http://themonkeycage.org/2013/04/26/i-dont-know-whether-to-call-it-communism-or-crony-capitalism/">for example</a>)</p>
<p><a href="http://statisticsforum.wordpress.com">The Statistics Forum</a>:  I recently formulated the plan to fill it up with 365 stories.  So far, though, I&#8217;ve only received a few.  So maybe just a story a week?  I&#8217;m not sure what to do with this blog. An official American Statistical Association blog seems like a good thing but I don&#8217;t really know what to do with it.</p>
<p><strong>Social and Political Science</strong></p>
<p><a href="http://chrisblattman.com">Chris Blattman</a>:  International development, politics, economics, and policy.</p>
<p><a href="http://fivethirtyeight.blogs.nytimes.com">Fivethirtyeight</a>:  Nate does a good job.  I like how he can focus on whatever question he&#8217;s answering without getting overwhelmed.  <a href="http://fivethirtyeight.blogs.nytimes.com/2013/04/16/in-state-governments-signs-of-a-healthier-g-o-p/">Here&#8217;s</a> a good recent example.</p>
<p><a href="http://lanekenworthy.net">Lane Kenworthy</a> is a completely serious and reasonable person, just as his name would suggest.</p>
<p><a href="http://marginalrevolution.com">Marginal Revolution</a>:  You&#8217;ve heard of these guys.</p>
<p><a href="http://www.bls.gov/opub/mlr/archive.precis.htm">Monthly Labor Review Precis</a>:  Direct links to research on things that matter.  Good stuff.</p>
<p><a href="http://www.overcomingbias.com">Overcoming Bias</a>:  He recently <a href="http://www.overcomingbias.com/2013/04/in-praise-of-ads.html">wrote</a>, &#8220;most people we know talk as if they hate, revile, and despise ads. They say ads are an evil destructive manipulative force that exists only because big bad firms run the world, and use ads to control us all.&#8221;  I was surprised to hear that most people Robin Hanson knows talk that way, and this gives me a new perspective on why he writes the way he does.  It&#8217;s gotta be frustrating, hanging around people who talk about big bad firms and evil destructive manipulative forces.</p>
<p><a href="http://rajivsethi.blogspot.com">Rajiv Sethi</a>:  He only blogs a couple times a month, but he always has something interesting to say.  (The opposite of this blog, I suppose.)</p>
<p><a href="http://www.babynamewizard.com/blog">The Baby Name Wizard</a>:  The one and only, by the people who, among other things, debunked the <a href="http://andrewgelman.com/2009/11/30/laura_baby_name/">myth</a> that there&#8217;s something special about the word &#8220;orange.&#8221;  But you can just skip directly to the <a href="http://www.babynamewizard.com/voyager#">Name Voyager</a>.</p>
<p><a href="http://blogs.census.gov">U.S. Census Blog</a>:  Not the funnest thing out there to read, but it&#8217;s good that the people at the Census are doing this for us.  When you need good data, the Census is there for you.</p>
<p><strong>Statistics and Machine Learning</strong></p>
<p><a href="http://lingpipe-blog.com">Bob Carpenter</a>:  He wrote Stan.</p>
<p><a href="http://www.dartmouth.edu/~chance/chance_news/news.html">Chance News</a>:  The original statistics blog.</p>
<p><a href="http://xianblog.wordpress.com/category/statistics/">Christian Robert</a>:  People who used to do theoretical statistics, now do computational statistics.  This is a good thing.</p>
<p><a href="http://vserver1.cscs.lsa.umich.edu/~crshalizi/weblog/">Cosma Shalizi</a>:  He has an odd retro style and enough combination of common sense and knowledge of philosophy that I asked him to collaborate on my paper that became <a href="http://www.stat.columbia.edu/~gelman/research/published/philosophy.pdf">this</a>.  His set of interests and frustrations seems to overlap a lot with mine, except that he doesn&#8217;t really ride a bike and I&#8217;m sure there are some big parts of his life that don&#8217;t match to anything in mine.</p>
<p><a href="http://errorstatistics.com">Deborah Mayo</a>:  I learned about her through Shalizi.  Mayo believes in learning through model checking, just like Jaynes (and me).  Her blog features long comment threads and contributions from the likes of Stephen Senn.</p>
<p><a href="http://www.johndcook.com/blog/">John Cook</a>:  Like Tyler Cowen, a guy who does a lot of things but is best known for his blogging.  He throws in some applied math and numerical analysis along with the statistics.</p>
<p><a href="http://junkcharts.typepad.com/numbersruleyourworld/">Kaiser Fung</a>:  Fun to read and utterly sensible.  Among many other things, he offered a good probabilistic summary of the Lance Armstrong story, well before it finally broke.</p>
<p><a href="http://normaldeviate.wordpress.com">Larry Wasserman</a>:  His perspective on statistics is different from mine (for example, he <a href="http://normaldeviate.wordpress.com/2013/03/14/double-misunderstandings-about-p-values/">defines</a> p(a|b) = p(a,b)/p(b), whereas I <a href="http://andrewgelman.com/">define</a> p(a,b)=p(a|b)p(b)), but it&#8217;s good that he can get his views out there.  Research proceeds in many different ways, and if everyone agreed with me (or with any single perspective), the field of statistics would make a lot less progress.</p>
<p><a href="http://messymatters.com">Messy Matters</a>:  This one reads a bit more like a draft of a pop-science book than like a blog.  The trouble is, there are already so many pop-science books about economics and data.  They&#8217;ll have to come up with their own unique twist.</p>
<p><a href="http://nuit-blanche.blogspot.com">Nuit Blanche</a>:  Compressive sensing:  that&#8217;s cool stuff!  I&#8217;m impressed by these CS guys who can effortlessly throw around terabytes of data.</p>
<p><a href="http://observationalepidemiology.blogspot.com">Observational Epidemiology</a>:  These guys are thoughtful and I admire the effort they put into their blogging.  If they&#8217;d started blogging in 2003, they would&#8217;ve been on everyone&#8217;s blogroll.</p>
<p><a href="http://www.statsblogs.com">Stats Blogs</a>:  A convenient compendium, with links back to the originals.</p>
<p><a href="http://www.statsblogs.com">The Numbers Guy</a>:  Carl Bialik is one of the original data journalists.  He, Falix Salmon, and Nate Silver have very similar profiles (as Bill James might say).</p>
<p><strong>Visualization</strong></p>
<p><a href="http://chartsnthings.tumblr.com">Chartsnthings</a>:  This is the ultimate graphics blog.  The New York Times graphics team presents some great data visualizations along with the stories behind them.  I love this sort of insider&#8217;s perspective.</p>
<p><a href="http://eagereyes.org">Eager Eyes</a>:  Graphics research.</p>
<p><a href="http://infosthetics.com">Information Aesthetics</a>:  Seriously pretty.</p>
<p><a href="http://junkcharts.typepad.com">Junk Charts</a>:  The nitty gritty.  What to read if you want to make your own graphs better.</p>
<p>The post <a href="http://andrewgelman.com/2013/04/29/the-blogroll/">The blogroll</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/29/the-blogroll/feed/</wfw:commentRss>
		<slash:comments>22</slash:comments>
		</item>
		<item>
		<title>Plain old everyday Bayesianism!</title>
		<link>http://andrewgelman.com/2013/04/28/plain-old-everyday-bayesianism/</link>
		<comments>http://andrewgelman.com/2013/04/28/plain-old-everyday-bayesianism/#comments</comments>
		<pubDate>Sun, 28 Apr 2013 13:25:23 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Bayesian Statistics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18734</guid>
		<description><![CDATA[<p>Sam Behseta writes: There is a report by Martin Tingley and Peter Huybers in Nature on the unprecedented high temperatures at northern latitudes (Russia, Greenland, etc). What is more interesting is the authors are have used a straightforward hierarchical Bayes model, and for the first time (as far as I can remember) the results are [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/28/plain-old-everyday-bayesianism/">Plain old everyday Bayesianism!</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Sam Behseta writes:</p>
<blockquote><p>There is a <a href="http://www.nature.com/nature/journal/v496/n7444/full/nature11969.html">report</a> by Martin Tingley and Peter Huybers in Nature on the unprecedented high temperatures at northern latitudes (Russia, Greenland, etc). What is more interesting is the authors are have used a straightforward hierarchical Bayes model, and for the first time (as far as I can remember) the results are reported with a probability attached to them (P>0.99), as opposed to the usual p-value<0.01 business. This might be a sign that editors of big time science journals are welcoming Bayesian approaches.</p></blockquote>
<p>I agree.  This is a good sign for statistical communication.  Here are the key sentences from the abstract:</p>
<blockquote><p>Here, using a hierarchical Bayesian analysis of instrumental, tree-ring, ice-core and lake-sediment records, we show that the magnitude and frequency of recent warm temperature extremes at high northern latitudes are unprecedented in the past 600 years. The summers of 2005, 2007, 2010 and 2011 were warmer than those of all prior years back to 1400 (probability P > 0.95), in terms of the spatial average. The summer of 2010 was the warmest in the previous 600 years in western Russia (P > 0.99) and probably the warmest in western Greenland and the Canadian Arctic as well (P > 0.90). These and other recent extremes greatly exceed those expected from a stationary climate, but can be understood as resulting from constant space–time variability about an increased mean temperature.</p></blockquote>
<p>As with classical p-values, these probability statements depend on an assumed model, but I agree with Sam that the expression of direct probabilities is a huge step forward from traditional practice.</p>
<p>The post <a href="http://andrewgelman.com/2013/04/28/plain-old-everyday-bayesianism/">Plain old everyday Bayesianism!</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/28/plain-old-everyday-bayesianism/feed/</wfw:commentRss>
		<slash:comments>50</slash:comments>
		</item>
		<item>
		<title>Time-Sharing Experiments for the Social Sciences</title>
		<link>http://andrewgelman.com/2013/04/27/time-sharing-experiments-for-the-social-sciences/</link>
		<comments>http://andrewgelman.com/2013/04/27/time-sharing-experiments-for-the-social-sciences/#comments</comments>
		<pubDate>Sat, 27 Apr 2013 22:24:24 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Science]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18833</guid>
		<description><![CDATA[<p>James Druckman and Jeremy Freese write: We are pleased to announce that Time-Sharing Experiments for the Social Sciences (TESS) was renewed for another round of funding by NSF starting last Fall. TESS allows researchers to submit proposals for experiments to be conducted on a nationally-representative, probability-based Internet platform, and successful proposals are fielded at no [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/27/time-sharing-experiments-for-the-social-sciences/">Time-Sharing Experiments for the Social Sciences</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>James Druckman and Jeremy Freese write:<br />
<span id="more-18833"></span></p>
<blockquote><p>We are pleased to announce that Time-Sharing Experiments for the Social Sciences (TESS) was renewed for another round of funding by NSF starting last Fall. TESS allows researchers to submit proposals for experiments to be conducted on a nationally-representative, probability-based Internet platform, and successful proposals are fielded at no cost to investigators.  More information about how TESS works and how to submit proposals is available at http://www.tessexperiments.org.</p>
<p>Additionally, we are pleased to announce the development of two new proposal mechanisms. TESS’s Short Studies Program (SSP) is accepting proposals for fielding very brief population-based survey experiments on a general population of at least 2000 adults. SSP recruits participants from within the U.S. using the same Internet-based platform as other TESS studies. More information about SSP and proposal requirements is available at http://www.tessexperiments.org/ssp.html.</p>
<p>TESS’s Special Competition for Young Investigators is accepting proposals from June 15th-September 15th. The competition is meant to enable younger scholars to field large-scale studies and is limited to graduate students and individuals who are no more than 3 years post-Ph.D. More information about the Special Competition and proposal requirements is available at http://www.tessexperiments.org/yic.html.</p>
<p>We are being assisted by a new team of over 65 Associate PIs and peer reviewers across the social sciences. More information about our APIs is available at http://www.tessexperiments.org/associatepi.html.</p></blockquote>
<p>This seems cool to me, both for the obvious reason that it will reduce the intellectual overhead required to run a good social-science experiment, and indirectly because I assume the resulting experiments will be open with shared data.</p>
<p>The post <a href="http://andrewgelman.com/2013/04/27/time-sharing-experiments-for-the-social-sciences/">Time-Sharing Experiments for the Social Sciences</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/27/time-sharing-experiments-for-the-social-sciences/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Continued fractions!!</title>
		<link>http://andrewgelman.com/2013/04/27/continued-fractions-an-assignment-for-william-goldman/</link>
		<comments>http://andrewgelman.com/2013/04/27/continued-fractions-an-assignment-for-william-goldman/#comments</comments>
		<pubDate>Sat, 27 Apr 2013 14:00:29 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Literature]]></category>
		<category><![CDATA[Statistical computing]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=17768</guid>
		<description><![CDATA[<p>Upon reading this note by John Cook on continued fractions, I wrote: If you like continued fractions, I recommend you read the relevant parts of the classic Numerical Methods That Work. The details are probably obsolete but it’s fun reading (at least, if you think that sort of thing is fun to read). I then [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/27/continued-fractions-an-assignment-for-william-goldman/">Continued fractions!!</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Upon reading <a href="http://www.johndcook.com/blog/2013/01/30/continued-fractions-with-sage/">this note</a> by John Cook on continued fractions, I wrote:</p>
<blockquote><p>If you like continued fractions, I recommend you read the relevant parts of the classic Numerical Methods That Work. The details are probably obsolete but it’s fun reading (at least, if you think that sort of thing is fun to read).</p></blockquote>
<p>I then looked up Acton in Wikipedia and was surprised to see he’s still alive. And he wrote a second book (published at the age of 77!). I wonder if it’s any good. It’s sobering to read Numerical Methods That Work: it’s so wonderful and so readable, yet in this modern era there’s really not much reason to read it. Perhaps William Goldman (hey, I checked: he’s still alive too!) or some equivalent could prepare a 50-page “good parts” version that could be still be useful as a basic textbook.</p>
<p>The post <a href="http://andrewgelman.com/2013/04/27/continued-fractions-an-assignment-for-william-goldman/">Continued fractions!!</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/27/continued-fractions-an-assignment-for-william-goldman/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>“A Vast Graveyard of Undead Theories:  Publication Bias and Psychological Science’s Aversion to the Null”</title>
		<link>http://andrewgelman.com/2013/04/26/a-vast-graveyard-of-undead-theories-publication-bias-and-psychological-sciences-aversion-to-the-null-2/</link>
		<comments>http://andrewgelman.com/2013/04/26/a-vast-graveyard-of-undead-theories-publication-bias-and-psychological-sciences-aversion-to-the-null-2/#comments</comments>
		<pubDate>Fri, 26 Apr 2013 13:09:31 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Zombies]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18670</guid>
		<description><![CDATA[<p>Erin Jonaitis points us to this article by Christopher Ferguson and Moritz Heene, who write: Publication bias remains a controversial issue in psychological science. . . . that the field often constructs arguments to block the publication and interpretation of null results and that null results may be further extinguished through questionable researcher practices. Given [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/26/a-vast-graveyard-of-undead-theories-publication-bias-and-psychological-sciences-aversion-to-the-null-2/">&#8220;A Vast Graveyard of Undead Theories:  Publication Bias and Psychological Science’s Aversion to the Null&#8221;</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Erin Jonaitis points us to <a href="http://pps.sagepub.com/content/7/6/555.full">this article</a> by Christopher Ferguson and Moritz Heene, who write:</p>
<blockquote><p>Publication bias remains a controversial issue in psychological science. . . . that the field often constructs arguments to block the publication and interpretation of null results and that null results may be further extinguished through questionable researcher practices. Given that science is dependent on the process of falsification, we argue that these problems reduce psychological science’s capability to have a proper mechanism for theory falsification, thus resulting in the promulgation of numerous “undead” theories that are ideologically popular but have little basis in fact.</p></blockquote>
<p>They mention the infamous Daryl Bem article.  It is pretty much only because Bem&#8217;s claims are (presumably) false that they got published in a major research journal.  Had the claims been true&#8212;that is, had Bem run identical experiments, analyzed his data more carefully and objectively, and reported that the results were consistent with the null hypothesis&#8212;then the result would be entirely unpublishable.  After all, you can&#8217;t publish an article in a top journal demonstrating that a study is consistent with there being no ESP.  Everybody knows that ESP, to the extent it exists, has such small effects as to be essentially undetectable in any direct study.  So here you have the extreme case of a field in which errors are the <em>only</em> thing that gets published.</p>
<p>It&#8217;s science as <em>Slate</em> magazine is reputed to be: if it&#8217;s true, it&#8217;s obvious so no need to publish.  If it&#8217;s counterintuitive, go for it.  (Just to be clear, I&#8217;m not saying the actual Slate magazine is like that; this is just its reputation.)</p>
<p>This is indeed disturbing and I applaud yet another publication on the topic.  The authors go beyond previous research by Gregory Francis and Uri Simonsohn by focusing specifically on difficulties with meta-analyses that unsuccessfully try to overcome problems of publication bias.</p>
<p>There&#8217;s something called the fail-safe number (FSN) of Rosenthal (1979) and Rosenthal and Rubin (1978), &#8220;an early and still widely used attempt to estimate the number of unpublished studies, averaging null results, that are required to bring the meta-analytic mean Z value of effect sizes down to an insignificant level,&#8221; but,</p>
<blockquote><p>The FSN treats the file drawer of unpublished studies as unbiased by assuming that their average Z value is zero. This wrong assumption appears mostly not to be recognized by researchers who use the FSN to demonstrate the stability of their results. . . . Without making this computational error, the FSN turns out to be a gross overestimate of the number of unpublished studies required to bring the mean Z value of published studies to an insignificant level. The FSN thus gives the meta-analytic researcher a false sense of security.</p></blockquote>
<p>The false sense of security persists:</p>
<blockquote><p>Although this fundamental flaw had been spotted early, the number of applications of the FSN has grown exponentially since its publication. Ironically, getting critiques of the FSN published was far from an easy task . . .</p></blockquote>
<p><strong>Problems with meta-analysis</strong></p>
<p>Ferguson and Heene continue:</p>
<blockquote><p>Meta-analyses should be more objective arbiters of review for a field than are narrative reviews, but we argue that this is not the case in practice. . . . The selection and interpretation of effect sizes from individual studies requires decisions that may be susceptible to researcher biases.</p>
<p>It is thus not surprising that we have seldom seen a meta-analysis resolve a controversial debate in a field. Typically, the antagonists simply decry the meta-analysis as fundamentally flawed or produce a competing meta-analysis of their own . . . meta-analyses may be used in such debates to essentially confound the process of replication and falsification.</p></blockquote>
<p>Thus:</p>
<blockquote><p>The average effect size may be largely meaningless and spurious due to the avoidance of null findings in the published literature. This aversion to the null is arguably one of the most pernicious and unscientific aspects of modern social science.</p></blockquote>
<p>Let me interject here that, although I am in general agreement with Ferguson and Heene on these issues, I have a bit of &#8220;aversion to the null&#8221; myself.  I think it&#8217;s important to separate the <em>statistical</em> from the <em>scientific</em> null hypothesis.</p>
<p>- The <em>statistical null hypothesis</em> is typically that a particular comparison is exactly zero in the population.</p>
<p>- The <em>scientific null hypothesis</em> is typically that a certain effect is nonexistent or, more generally, that the effect depends so much on situation as to be unreplicable in general.</p>
<p>I might well believe in the scientific null but not in the statistical null.</p>
<p><strong>Virtually unkillable</strong></p>
<p>Ferguson and Heene continue:</p>
<blockquote><p>The aversion to the null and the persistence of publication bias and denial of the same, renders a situation in which psychological theories are virtually unkillable. Instead of rigid adherence to an objective process of replication and falsification, debates within psychology too easily degenerate into ideological snowball fights, the end result of which is to allow poor quality theories to survive indefinitely. Proponents of a theory may, in effect, reverse the burden of proof, insisting that their theory is true unless skeptics can prove it false (a fruitless invitation, as any falsifying data would certainly be rejected as flawed were it even able to pass through the null-aversive peer review process described above).</p></blockquote>
<p>Indeed.  We see this reversal of the burden of proof all the time.  For example, after a data alignment error was uncovered in their research, Neil Anderson and Deniz Ones <a href="http://andrewgelman.com/2012/12/16/its-unscientific-behavior-not-to-admit-an-error/">notoriously wrote</a>:  &#8220;When any call is made for the retraction of two peer-reviewed and published articles, the onus of proof is on the claimant and the duty of scientific care and caution is manifestly high. . . . Goldberg et al. do not and cannot provide irrefutable proof of the alleged clerical errors.. . . . We continue to stand by the findings and conclusions reported in our previous publications&#8221;  Ugh!  This bothered me so much when I saw it, it made me want to barf.  At the time, I wrote that it’s unscientific behavior not to admit error.  Unfortunately, for reasons discussed by Ferguson and Heene, much of the scientific enterprise seems to be set up to avoid admission of error.  These are serious issues, and it&#8217;s interesting to me that we as a field haven&#8217;t been thinking much about them until recently.</p>
<p>The post <a href="http://andrewgelman.com/2013/04/26/a-vast-graveyard-of-undead-theories-publication-bias-and-psychological-sciences-aversion-to-the-null-2/">&#8220;A Vast Graveyard of Undead Theories:  Publication Bias and Psychological Science’s Aversion to the Null&#8221;</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/26/a-vast-graveyard-of-undead-theories-publication-bias-and-psychological-sciences-aversion-to-the-null-2/feed/</wfw:commentRss>
		<slash:comments>47</slash:comments>
		</item>
		<item>
		<title>Fascinating graphs from facebook data</title>
		<link>http://andrewgelman.com/2013/04/25/fascinating-graphs-from-facebook/</link>
		<comments>http://andrewgelman.com/2013/04/25/fascinating-graphs-from-facebook/#comments</comments>
		<pubDate>Thu, 25 Apr 2013 13:29:41 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Sociology]]></category>
		<category><![CDATA[Statistical graphics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18809</guid>
		<description><![CDATA[<p>Yair points us to this page full of wonderful graphs from the Stephen Wolfram blog. Here are a few: And some words: People talk less about video games as they get older, and more about politics and the weather. Men typically talk more about sports and technology than women&#8212;and, somewhat surprisingly to me, they also [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/25/fascinating-graphs-from-facebook/">Fascinating graphs from facebook data</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Yair points us to <a href="http://blog.stephenwolfram.com/2013/04/data-science-of-the-facebook-world/">this</a> page full of wonderful graphs from the Stephen Wolfram blog.  Here are a few:</p>
<p><img src="/wp-content/uploads/2013/04/wolfram1.png?8b0eec" alt="wolfram1" width="512" height="252" class="alignnone size-full wp-image-18810" /></p>
<p><img src="/wp-content/uploads/2013/04/wolfram2.png?8b0eec" alt="wolfram2" width="499" height="229" class="alignnone size-full wp-image-18811" /></p>
<p><img src="/wp-content/uploads/2013/04/wolfram3.png?8b0eec" alt="wolfram3" width="505" height="922" class="alignnone size-full wp-image-18812" /></p>
<p>And some words:</p>
<blockquote><p>People talk less about video games as they get older, and more about politics and the weather. Men typically talk more about sports and technology than women&#8212;and, somewhat surprisingly to me, they also talk more about movies, television and music. Women talk more about pets+animals, family+friends, relationships—and, at least after they reach child-bearing years, health. . . . Some of this is rather depressingly stereotypical. And most of it isn’t terribly surprising to anyone who’s known a reasonable diversity of people of different ages. But what to me is remarkable is how we can see everything laid out in such quantitative detail in the pictures above&#8212;kind of a signature of people’s thinking as they go through life. </p>
<p>Of course, the pictures above are all based on aggregate data, carefully anonymized. But if we start looking at individuals, we’ll see all sorts of other interesting things. . . .</p></blockquote>
<p>Good stuff, and I like the flexible, open attitude.  And great graphs.  That&#8217;s why I&#8217;m posting this, in order to spread the word, to inspire others to do this sort of statistical exploration.  Follow the <a href="http://blog.stephenwolfram.com/2013/04/data-science-of-the-facebook-world/">link</a> for lots more.</p>
<p><strong>By the way . . .</strong></p>
<p>I wonder who did the analysis, who made the graphs, and who wrote the text.  No authors are listed.  It&#8217;s posted on the Stephen Wolfram Blog, but Wolfram is known for contracting out his research.  It&#8217;s certainly <em>possible</em> that he did all the statistical analysis, computing, graphics, and writing himself, I just have no idea.  It&#8217;s funny:  in academia, allocation of credit and attribution of authorship is huge.  In industry, not so much.  As an academic, I&#8217;d like to give credit to whoever made these pretty graphs, but perhaps from Wolfram&#8217;s perspective, whoever made the graphs is just doing a job, just like whoever sweeps the floors in the lab or whoever cleans the erasers in the classroom.  In any case, I give Wolfram credit, no joke.  Even if he didn&#8217;t do any of the work on this, it takes skill to hire the right people to do the job.</p>
<p>The post <a href="http://andrewgelman.com/2013/04/25/fascinating-graphs-from-facebook/">Fascinating graphs from facebook data</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/25/fascinating-graphs-from-facebook/feed/</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
		<item>
		<title>It’s binless!  A program for computing normalizing functions</title>
		<link>http://andrewgelman.com/2013/04/25/its-binless-a-program-for-computing-normalizing-functions/</link>
		<comments>http://andrewgelman.com/2013/04/25/its-binless-a-program-for-computing-normalizing-functions/#comments</comments>
		<pubDate>Thu, 25 Apr 2013 13:22:40 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Statistical computing]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18676</guid>
		<description><![CDATA[<p>Zhiqiang Tan writes: I have created an R package to implement the full likelihood method in Kong et al. (2003). The method can be seen as a binless extension of so-called Weighted Histogram Analysis Method (UWHAM) widely used in physics and chemistry. The method has also been introduced to the physics literature and called the [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/25/its-binless-a-program-for-computing-normalizing-functions/">It&#8217;s binless!  A program for computing normalizing functions</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2013/04/binless-300x199.jpg?8b0eec" alt="binless" width="300" height="199" class="alignnone size-medium wp-image-18677" /></p>
<p>Zhiqiang Tan writes:</p>
<blockquote><p>I have created <a href="http://cran.r-project.org/web/packages/UWHAM/index.html">an R package</a> to implement the full likelihood method in Kong et al. (2003). The method can be seen as a binless extension of so-called Weighted Histogram Analysis Method (UWHAM) widely used in physics and chemistry. The method has also been introduced to the physics literature and called the Multivariate Bennet Acceptance Ratio (MBAR) method. But a key point of my implementation is to compute the free energy estimates by minimizing a convex function, instead of solving nonlinear equations by the self-consistency or the Newton-Raphson algorithm.</p></blockquote>
<p>The post <a href="http://andrewgelman.com/2013/04/25/its-binless-a-program-for-computing-normalizing-functions/">It&#8217;s binless!  A program for computing normalizing functions</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/25/its-binless-a-program-for-computing-normalizing-functions/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Samurai sword-wielding Mormon bishop pharmaceutical statistician stops mugger</title>
		<link>http://andrewgelman.com/2013/04/24/samurai-sword-wielding-mormon-bishop-pharmaceutical-statistician-stops-mugger/</link>
		<comments>http://andrewgelman.com/2013/04/24/samurai-sword-wielding-mormon-bishop-pharmaceutical-statistician-stops-mugger/#comments</comments>
		<pubDate>Wed, 24 Apr 2013 20:12:58 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Miscellaneous Statistics]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18801</guid>
		<description><![CDATA[<p>Brett Keller points us to this feel-good story of the day: A Samurai sword-wielding Mormon bishop helped a neighbor woman escape a Tuesday morning attack by a man who had been stalking her. Kent Hendrix woke up Tuesday to his teenage son pounding on his bedroom door and telling him somebody was being mugged in [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/24/samurai-sword-wielding-mormon-bishop-pharmaceutical-statistician-stops-mugger/">Samurai sword-wielding Mormon bishop pharmaceutical statistician stops mugger</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2013/04/042413_edge_sword_640.jpg?8b0eec" alt="042413_edge_sword_640" width="320" height="180" class="alignnone size-full wp-image-18802" /></p>
<p>Brett Keller points us to <a href="http://www.foxnews.com/us/2013/04/23/samurai-sword-wielding-mormon-bishop-comes-to-aid-woman-being-attacked/">this</a> feel-good story of the day:</p>
<blockquote><p>A Samurai sword-wielding Mormon bishop helped a neighbor woman escape a Tuesday morning attack by a man who had been stalking her.</p>
<p>Kent Hendrix woke up Tuesday to his teenage son pounding on his bedroom door and telling him somebody was being mugged in front of their house. The 47-year-old father of six rushed out the door and grabbed the weapon closest to him &#8212; a 29-inch high carbon steel Samurai sword. . . .</p>
<p>Hendrix, a pharmaceutical statistician, was one of several neighbors who came to the woman&#8217;s aid after she began yelling for help . . .</p></blockquote>
<p>Too bad the whole &#8220;statistician&#8221; thing got buried in the middle of the article.  Fair enough, though:  I don&#8217;t know what it takes to become a Mormon bishop, but I assume it&#8217;s more effort than what it takes to learn statistics.</p>
<p>The post <a href="http://andrewgelman.com/2013/04/24/samurai-sword-wielding-mormon-bishop-pharmaceutical-statistician-stops-mugger/">Samurai sword-wielding Mormon bishop pharmaceutical statistician stops mugger</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/24/samurai-sword-wielding-mormon-bishop-pharmaceutical-statistician-stops-mugger/feed/</wfw:commentRss>
		<slash:comments>21</slash:comments>
		</item>
		<item>
		<title>My talk midtown this Friday noon (and at Columbia Monday afternoon)</title>
		<link>http://andrewgelman.com/2013/04/24/my-talk-midtown-this-friday-noon/</link>
		<comments>http://andrewgelman.com/2013/04/24/my-talk-midtown-this-friday-noon/#comments</comments>
		<pubDate>Wed, 24 Apr 2013 19:16:18 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Causal Inference]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18788</guid>
		<description><![CDATA[<p>At the City University of New York Graduate Center, 365 Fifth Avenue (between 34th and 35th street), room 6002. The topic: causality and statistical learning. Announcement is here (scroll down). It says that if you would like to attend any event, please respond by emailing datamining@gc.cuny.edu I&#8217;m also giving a shorter talk on the same [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/24/my-talk-midtown-this-friday-noon/">My talk midtown this Friday noon (and at Columbia Monday afternoon)</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>At the City University of New York Graduate Center, 365 Fifth Avenue (between 34th and 35th street), room 6002.  The topic:  <a href="http://www.stat.columbia.edu/~gelman/presentations/causaltalk3_handout.pdf">causality and statistical learning</a>.  Announcement is <a href="http://datamining.ws.gc.cuny.edu/2013/03/07/draft/">here</a> (scroll down).  It says that if you would like to attend any event, please respond by emailing datamining@gc.cuny.edu</p>
<p>I&#8217;m also giving a shorter talk on the same topic in the Sustainable Development Seminar <a href="https://docs.google.com/document/d/1z7p3l8cSY5RJKh55rD1CaeHbVdrji_rEzw2OijW4Qjk/edit">Series</a> 4pm Monday 29 Apr in room 407 International Affairs Bldg (at 118th St. and Amsterdam Ave.).</p>
<p>It&#8217;s just a coincidence that I&#8217;m giving the same talk twice.  I was asked at different times to speak for these groups.  When someone asks me to speak, I let them pick from recent talks on the <a href="http://www.stat.columbia.edu/~gelman/presentations/">list</a>.</p>
<p>The post <a href="http://andrewgelman.com/2013/04/24/my-talk-midtown-this-friday-noon/">My talk midtown this Friday noon (and at Columbia Monday afternoon)</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/24/my-talk-midtown-this-friday-noon/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>The Tweets-Votes Curve</title>
		<link>http://andrewgelman.com/2013/04/24/the-tweets-votes-curve/</link>
		<comments>http://andrewgelman.com/2013/04/24/the-tweets-votes-curve/#comments</comments>
		<pubDate>Wed, 24 Apr 2013 13:15:37 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Political Science]]></category>
		<category><![CDATA[Sociology]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18791</guid>
		<description><![CDATA[<p>Fabio Rojas points me to this excellently-titled working paper by Joseph DiGrazia, Karissa McKelvey, Johan Bollen, and himself: Is social media a valid indicator of political behavior? We answer this ques- tion using a random sample of 537,231,508 tweets from August 1 to November 1, 2010 and data from 406 competitive U.S. congressional elections provided [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/24/the-tweets-votes-curve/">The Tweets-Votes Curve</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Fabio Rojas points me to <a href="http://orgtheory.wordpress.com/2013/04/23/more-tweets-more-votes-social-media-as-a-quantitative-indicator-of-political-behavior/">this excellently-titled working paper</a> by Joseph DiGrazia, Karissa McKelvey, Johan Bollen, and himself:</p>
<blockquote><p>Is social media a valid indicator of political behavior? We answer this ques- tion using a random sample of 537,231,508 tweets from August 1 to November 1, 2010 and data from 406 competitive U.S. congressional elections provided by the Federal Election Commission. Our results show that the percentage of Republican-candidate name mentions correlates with the Republican vote margin in the subsequent election. This finding persists even when controlling for incumbency, district partisanship, media coverage of the race, time, and demographic variables such as the district’s racial and gender composi- tion. With over 500 million active users in 2012, Twitter now represents a new frontier for the study of human behavior. This research provides a framework for incorporating this emerging medium into the computational social science toolkit.</p></blockquote>
<p>One charming thing about this paper&#8212;and I know this is going to sound patronizing but I don&#8217;t mean it to be&#8212;is that the authors (or, at least, whatever subset of the authors who did the statistical work) are amateurs.  They analyze the outcome in terms of total votes rather than vote proportion, even while coding the predictor as a proportion.  They present regression coefficients to 7 significant figures.  They report that they have data from two different election cycles but present only one in the paper (but they do have the other in their blog post).</p>
<p>But that&#8217;s all ok.  They pulled out some interesting data.  And, as I often say, <em>the most important aspect of a statistical analysis is not what you do with the data, it&#8217;s what data you use</em>.</p>
<p><strong>Tweets and votes</strong></p>
<p>As to the result itself, I&#8217;m not quite sure what to do with it.  Here&#8217;s the key graph:</p>
<p><img src="http://i2.wp.com/themonkeycage.org/wp-content/uploads/2013/04/tweet1.png?resize=512%2C353" alt="tweet1" class="alignnone size-large wp-image-29635" data-recalc-dims="1" /></p>
<p>More tweets, more votes, indeed.<br />
<del datetime="2013-04-24T15:30:05+00:00">Of course most congressional elections are predictable.  But the elections that are between 40-60 and 60-40, maybe not so much.  So let&#8217;s look at the data there . . . Not such a strong pattern (and for the 2012 data in the 40-60% range it looks even worse; any correlation is swamped by the noise).  That&#8217;s fine, and it&#8217;s not unexpected, it&#8217;s not a criticism of the paper but it indicates that the real gain in this analysis is not for predicting votes.</del></p>
<p>I&#8217;m not so convinced that tweets will be so useful in predicting votes&#8212;most congressional elections are predictable, but perhaps the prediction tool could be more relevant in low-information or multicandidate elections where prediction is not so easy.</p>
<p>Instead, it might make sense to flip it around and predict twitter mentions given candidate popularity.  That is, rotate the graph 90 degrees, and see how much variation there is in tweet shares for elections of different degrees of closeness.  Also, while you&#8217;re at it, re-express vote share as vote proportion.  And scale the size of each dot to the total number of tweets for the two candidates in the election.</p>
<p>Move away from trying to predict votes and move toward trying to understand tweets.  DiGrazia et al. write, &#8220;the models show that social media matters . . .&#8221;  No, not at all.  They find a correlation between candidate popularity and social media mentions.  No-name and fringe candidates get fewer mentions (on average) than competitive and dominant candidates. That&#8217;s fine, you can go with that.</p>
<p>Again, I fear the above sounds patronizing, but I don&#8217;t mean it to be.  You gotta start somewhere, and you&#8217;re nowhwere without the data.  As someone who was (originally) an outsider to the field of political science, I do think that researchers coming from other fields can offer valuable perspectives.</p>
<p><strong>Sharing the data</strong></p>
<p>What I want to know is, is this dataset publicly available?  What would really make this go viral is if DiGrazia et al. post the data online.  Then everyone will take a hack at it, and each of those people will cite them.</p>
<p>There&#8217;s been a lot of talk about reproducible research lately.  In this case, they have a perfect incentive to make they data public:  it will help them out, it will help out the research project, and it will be a great inspiration for others to follow in their footsteps.  Releasing data as a publicity intensifier:  that&#8217;s my new idea.</p>
<p>P.S.  In the first version of this post I included a graph showing votes given tweet shares between 40% and 60%.  I intended this to illustrate the difficulty of predicting close elections, but my graph really missed the point, because the x-axis represented close elections in tweet shares, not in votes.  So I crossed that part out.  If nothing else, I&#8217;ve demonstrated the difficulty of thinking about this sort of analysis!</p>
<p>The post <a href="http://andrewgelman.com/2013/04/24/the-tweets-votes-curve/">The Tweets-Votes Curve</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/24/the-tweets-votes-curve/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Foundation for Open Access Statistics</title>
		<link>http://andrewgelman.com/2013/04/23/foundation-for-open-access-statistics/</link>
		<comments>http://andrewgelman.com/2013/04/23/foundation-for-open-access-statistics/#comments</comments>
		<pubDate>Tue, 23 Apr 2013 21:23:28 +0000</pubDate>
		<dc:creator>Bob Carpenter</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18745</guid>
		<description><![CDATA[<p>Now here&#8217;s a foundation I (Bob) can get behind: Foundation for Open Access Statistics (FOAS) Their mission is to &#8220;promote free software, open access publishing, and reproducible research in statistics.&#8221; To me, that&#8217;s like supporting motherhood and apple pie! FOAS spun out of and is partially designed to support the Journal of Statistical Software (aka [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/23/foundation-for-open-access-statistics/">Foundation for Open Access Statistics</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<div style="float:right; margin: 2em"><img src="http://i2.wp.com/citypragmatist.files.wordpress.com/2010/08/vintage-pic-apple-pie.jpg?w=150" data-recalc-dims="1"/></div>
<p>Now here&#8217;s a foundation I (Bob) can get behind: </p>
<blockquote><p>
<a href="http://www.foastat.org">Foundation for Open Access Statistics</a> (FOAS)
</p></blockquote>
<p>Their <a href="http://www.foastat.org/philosophy.html">mission</a> is to &#8220;promote free software, open access publishing, and reproducible research in statistics.&#8221;  To me, that&#8217;s like supporting <a href="http://en.wiktionary.org/wiki/motherhood_and_apple_pie">motherhood and apple pie</a>!</p>
<p>FOAS spun out of and is partially designed to support the <a href="http://www.jstatsoft.org"><i>Journal of Statistical Software</i></a> (aka <i>JSS</i>, aka <i>JStatSoft</i>). I adore <i>JSS</i> because it (a) is open access, (b) publishes systems papers on statistical software, (c) has fast reviewing turnaround times, and (d) is free for authors and readers.  One of the next items on my to-do list is to write up the <a href="http://mc-stan.org/">Stan</a> modeling language and submit it to <i>JSS</i>.</p>
<p>As a not-for-profit with no visible source of income, they are quite sensibly <a href="http://www.foastat.org/donate.html">asking for donations</a> (don&#8217;t complain &#8212; it beats $3K author fees or not being able to read papers).  </p>
<p>The post <a href="http://andrewgelman.com/2013/04/23/foundation-for-open-access-statistics/">Foundation for Open Access Statistics</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/23/foundation-for-open-access-statistics/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Charles Murray’s “Coming Apart” and the measurement of social and political divisions</title>
		<link>http://andrewgelman.com/2013/04/23/charles-murrays-coming-apart-and-the-measurement-of-social-and-political-divisions/</link>
		<comments>http://andrewgelman.com/2013/04/23/charles-murrays-coming-apart-and-the-measurement-of-social-and-political-divisions/#comments</comments>
		<pubDate>Tue, 23 Apr 2013 13:27:00 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Political Science]]></category>
		<category><![CDATA[Sociology]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18079</guid>
		<description><![CDATA[<p>Following up on our blog discussions a year ago, I published a review of Charles Murray&#8217;s recent book, &#8220;Coming Apart,&#8221; for the journal Statistics, Politics, and Policy. I invited Murray to publish a response, and he did so. Here&#8217;s the abstract to my review: This article examines some claims made in a recent popular book [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/23/charles-murrays-coming-apart-and-the-measurement-of-social-and-political-divisions/">Charles Murray’s &#8220;Coming Apart&#8221; and the measurement of social and political divisions</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2013/04/heather-has-two-mommies.jpg?8b0eec" alt="heather-has-two-mommies" width="458" height="600" class="alignnone size-full wp-image-18664" /></p>
<p>Following up on our <a href="http://andrewgelman.com/2012/02/15/some-reactions-to-charles-murrays-thoughts-on-income-and-politics/">blog discussions</a> a year ago, I published a review of Charles Murray&#8217;s recent book, &#8220;Coming Apart,&#8221; for the journal Statistics, Politics, and Policy.  I invited Murray to publish a response, and he did so.</p>
<p>Here&#8217;s the abstract to <a href="http://www.stat.columbia.edu/~gelman/research/published/murray1.pdf">my review</a>:</p>
<blockquote><p>This article examines some claims made in a recent popular book of political sociology, with the intent not being to debunk any claims but rather to connect some important social and policy positions to statistical data on income, social class, and political attitudes. The thesis of Charles Murray’s book is that America’s upper and lower classes have become increasingly separate, with elites living more disciplined, orderly lives (characterized by marriage, work, and stable families) while being largely unaware of the lifestyles of the majority of Americans. I argue that some of Murray’s conclusions are sensitive to particular choices of whom to label as elite or upper-class. From my analysis of survey data, I see the big culture war occurring within the upper class, whereas Murray focuses on differences in attitudes and lifestyles comparing rich to poor. Coming Apart is a lively contribution to current debates and complements more statistical analyses of political and social polarization.</p></blockquote>
<p>Murray&#8217;s reply had no abstract, but <a href="/wp-content/uploads/2013/02/murray2.pdf?8b0eec">here&#8217;s the text</a> of the whole thing.  I think the discussion was productive.</p>
<p>Before leaving the conversation, I just want to point to a bit in Murray&#8217;s article that I thought inadvertently illustrated Murray&#8217;s points about implicit assumptions and cultural divides.  Near the end of his reply to my review, Murray writes:</p>
<blockquote><p>When a society’s elite is confident that its own values are the ones that all of society ought to adopt, those values get communicated. They’re in the air&#8212;in the way journalists cover stories, editors write editorials, television networks choose the new season’s series, and screenwriters create plots. They are reflected in the way that members of the elite talk with their children, with their professional colleagues, and whenever those topics relating to their values come up in a public setting. In all of those settings, today’s new upper class tends to be obsessively nonjudgmental.</p>
<p>If you doubt it, try bringing up the issue of single women having babies at your next dinner party, and see how many of your companions are willing to say, even in a private gathering of friends, that it is morally wrong for a woman to bring a baby into the world knowing that it will not have a father, and morally wrong for a man to impregnate a woman knowing he will not be a father to the child. Fifty years ago, no one at the same kind of dinner party would have said that it was not morally wrong.</p>
<p>It is statistically highly likely that all of the biological children of the people at a dinner table of today’s upper-class adults have been born within wedlock. If there are childless never-married women at the table, it is likely that they have deliberately foregone having a baby, even though they might want one, because they have decided it is unfair to the child not to have a father. Put another way, it is likely that all of the people at the table have made moral evaluations and behaved accordingly. “Preach what you practice” simply means to stop being nonjudgmental in public about moral principles that you hold in private.</p></blockquote>
<p>I have no idea what Murray&#8217;s personal view is on the matter, but from the above passage I take it that he thinks it is OK for people to think that &#8220;it is morally wrong for a woman to bring a baby into the world knowing that it will not have a father,&#8221; and that it is &#8220;obsessively nonjudgmental&#8221; to <em>not</em> think it is morally wrong for a woman etc.</p>
<p>I think there&#8217;s something generational going on here.  I actually know several women who&#8217;ve knowingly brought fatherless babies into the world&#8212;and they and their kids seem to be doing just fine.  Yes, I realize there are people who find this sort of thing objectionable, but I&#8217;d actually think it was kind of rude for them to say something like that over dinner&#8212;at least, it would be rude if one of our friends with children were around.</p>
<p>So, yes, I agree with Murray that there have been big changes over the past few decades&#8212;but these changes go deeper than he might realize.  In the last two sentences of the quotation above, Murray implies that members of the upper class hold in private the principle that it is morally wrong for a woman to bring a baby into the world knowing that it will not have a father.  That might be true among the upper class of Dallas or other conservative parts of the country, but where I live, not so much.</p>
<p>At this point, Murray could of course respond that I&#8217;m making his point for him, that I&#8217;m illustrating how distant I am, in my Manhattan lifestyle, from average Americans.  And I could respond with some poll data showing liberal attitudes on social issues among Americans under 40.  But really this particular discussion would not be needed, because my point is simpler than that.  Murray thinks it&#8217;s obsessionally nonjudgmental to think it&#8217;s OK for a woman to have a baby with no father, and I think it&#8217;s an odd sort of retro throwback to believe that such a woman&#8217;s behavior is morally wrong.  In this case it can be difficult to communicate in general, although perhaps it would be possible in special cases.  (Murray might, for example, agree that my friends might be OK having brought babies into the world knowing they will not have a father.  Perhaps he could consider it morally wrong in general but accept my friends as special cases, perhaps arguing that this sort of think can work in Morningside Heights but not in many other places in the country.)</p>
<p>P.S.  There&#8217;s a bunch of discussion below but I think some of it misses my main point, which is the following.  Murray wrote, &#8220;try bringing up the issue of single women having babies at your next dinner party, and see how many of your companions are willing to say, even in a private gathering of friends, that it is morally wrong for a woman to bring a baby into the world knowing that it will not have a father.&#8221;  I agree that my friends wouldn&#8217;t be &#8220;willing to say&#8221; this, but for the simple reason that I think most of my friends <em>don&#8217;t</em> think it is wrong for a woman to do that.  Times have <a href="http://www.usatoday.com/story/news/politics/2012/12/05/poll-from-gay-marriage-to-adoption-attitudes-changing-fast/1748873/">changed</a>.  Then again, lots of Americans would agree with Murray on the moral condemnation.  Attitudes vary by geography, by generation, by many factors.</p>
<p>The post <a href="http://andrewgelman.com/2013/04/23/charles-murrays-coming-apart-and-the-measurement-of-social-and-political-divisions/">Charles Murray’s &#8220;Coming Apart&#8221; and the measurement of social and political divisions</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/23/charles-murrays-coming-apart-and-the-measurement-of-social-and-political-divisions/feed/</wfw:commentRss>
		<slash:comments>115</slash:comments>
		</item>
		<item>
		<title>Goal:  Rules for Turing chess</title>
		<link>http://andrewgelman.com/2013/04/22/new-ideas-on-turing-chess/</link>
		<comments>http://andrewgelman.com/2013/04/22/new-ideas-on-turing-chess/#comments</comments>
		<pubDate>Mon, 22 Apr 2013 14:25:37 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Sports]]></category>

		<guid isPermaLink="false">http://andrewgelman.com/?p=18706</guid>
		<description><![CDATA[<p>Daniel Murell has more thoughts on Turing chess (last discussed here): When I played with my brother, we had it that if you managed to lap someone while running around the house, then you got an additional move. This means that if you had the option to take the king on your additional move, you [...]</p><p>The post <a href="http://andrewgelman.com/2013/04/22/new-ideas-on-turing-chess/">Goal:  Rules for Turing chess</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img src="/wp-content/uploads/2013/04/turing.jpg?8b0eec" alt="turing" width="146" height="286" class="alignnone size-full wp-image-18722" /></p>
<p>Daniel Murell has more thoughts on <a href="http://andrewgelman.com/2010/01/a_problem_with/">Turing chess</a> (last discussed <a href="http://andrewgelman.com/2012/08/28/turing-chess-run-update/">here</a>):</p>
<blockquote><p>When I played with my brother, we had it that if you managed to lap someone while running around the house, then you got an additional move. This means that if you had the option to take the king on your additional move, you could, and doing so won you the game. He was fitter at the time so he slipped in two additional moves over the course of the game. I still won :) I am much better at him at chess though, so I&#8217;m sure he would have beaten me had we been more even.</p>
<p>W.r.t. dsquared&#8217;s <a href="http://andrewgelman.com/2010/01/24/a_problem_with/">comment</a> and your response, I&#8217;m not overly concerned about the first move, because you can enforce that white must reach a halfway point or that some time interval elapse before black makes his first move.</p>
<p>This version though does have one significant weakness that is evident to me. If you wait a little for your opponent to return to make his second move in a row against you, you get your breath back. He couldn&#8217;t plan for this tactic since it was your decision to wait. So he&#8217;s probably not played his first move to check your king. If he does play is second move and start running and you are in the position to take his king in two moves you can easily do it since you have had a short rest. His only defense would be to see exactly what you are up to when he comes back for his second move, and wait for you to move before he does so he can respond and so that you don&#8217;t get two moves in a row. If you also don&#8217;t respond, then this give time for him to catch his breath back too :) and so he will now be harder to lap when he finally does move. If you can out sprint him naturally though then he still has a problem and you would both wait indefinitely if both playing optimally. Can you see any solution to this problem? I&#8217;m still trying to decide on a rule set that works.</p></blockquote>
<p>I agree that it would be great to have a standard set of rules for this game, a set of rules that are non-arbitrary (that is, no clock, no tuning parameters other than the length of the circuit to be run) for which the game is playable and is a reasonable balance between chess and running skills.</p>
<p>P.S.  A google search turns up <a href="http://www.chess.com/blog/qixel/alan-turing-and-run-around-chess">this idea</a>:</p>
<blockquote><p>The rules are the same as standard chess but the chess clock is set up some distance away, say 10 meters, on a separate table.</p></blockquote>
<p>That&#8217;s ok, I guess, but the use of the chess clock makes it less beautiful.  I want a pure version where the running is its own clock.</p>
<p>The post <a href="http://andrewgelman.com/2013/04/22/new-ideas-on-turing-chess/">Goal:  Rules for Turing chess</a> appeared first on <a href="http://andrewgelman.com">Statistical Modeling, Causal Inference, and Social Science</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://andrewgelman.com/2013/04/22/new-ideas-on-turing-chess/feed/</wfw:commentRss>
		<slash:comments>22</slash:comments>
		</item>
	</channel>
</rss><!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk: enhanced
Object Caching 5419/5898 objects using xcache

 Served from: andrewgelman.com @ 2013-05-22 09:51:57 by W3 Total Cache -->
