<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>[citation needed]</title>
	
	<link>http://www.talyarkoni.org/blog</link>
	<description>...and a clue would also be helpful.</description>
	<lastBuildDate>Mon, 26 Jul 2010 08:27:20 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/citationNeeded" /><feedburner:info uri="citationneeded" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><creativeCommons:license>http://creativecommons.org/licenses/by-nc-sa/3.0/</creativeCommons:license><item>
		<title>babygate blues: a neuromarketing tale</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/5jALIywgEjk/</link>
		<comments>http://www.talyarkoni.org/blog/2010/07/26/babygate-blues-a-neuromarketing-tale/#comments</comments>
		<pubDate>Mon, 26 Jul 2010 08:27:20 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[fiction]]></category>
		<category><![CDATA[general silliness]]></category>
		<category><![CDATA[Cory Doctorow]]></category>
		<category><![CDATA[Dampers]]></category>
		<category><![CDATA[funding]]></category>
		<category><![CDATA[neuromarketing]]></category>
		<category><![CDATA[politics]]></category>
		<category><![CDATA[science fiction]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=655</guid>
		<description><![CDATA[Cory Doctorow has a new short story (&#8220;Ghosts in my Head&#8220;) about the undesirable consequences of neuromarketing run amok up on the Subterranean Press website.  I liked the story, but thought the premise was pretty unrealistic (and, yes, I do know it&#8217;s called science fiction for a reason&#8211;I&#8217;m just sayin&#8217;). So as a counterpoint, here&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p><em><a href="http://en.wikipedia.org/wiki/Cory_Doctorow">Cory Doctorow</a> has a new short story (&#8220;<a href="http://subterraneanpress.com/index.php/magazine/summer-2010/fiction-ghosts-in-my-head-by-cory-doctorow/">Ghosts in my Head</a>&#8220;) about the undesirable consequences of neuromarketing run amok up on the Subterranean Press website.  I liked the story, but thought the premise was pretty unrealistic (and, yes, I do know it&#8217;s called science <strong>fiction </strong>for a reason&#8211;I&#8217;m just sayin&#8217;). So as a counterpoint, here&#8217;s an alternative neuromarketing future that I personally find much more plausible</em>.</p>
<p>Deborah Stojko didn&#8217;t care much for Pockter and Gramble&#8217;s corporate headquarters. The building smelled of disinfectant and organization; the halogen corridors all blended together into one giant dimly-lit maze. Stojko had been visiting P&amp;G regularly for several years now; it was never a pleasant experience, but it couldn&#8217;t be avoided. Communicating with major stakeholders was a large part of her job as director of the International Consortium for Neuromarketing Research. And P&amp;G was by far the largest stakeholder, contributing over 70% of the money that supported the consortium&#8217;s work.</p>
<p>For several years now, ICNR had been pumping out first-class scientific research on the neural mechanisms of economic decision-making. The Richelieu effect, Preinforcement Learning, the neurometric satisficing theorem&#8230; ICNR was behind any number of recent discoveries; its members were continually in the news. And all of it was made possible only through the generosity of the marketing and R&amp;D wings of P&amp;G.</p>
<p>The generosity, or the naivete? Stojko asked herself as she reached her destination and knocked softly on an office door. Somehow, the executives at Pockter and Gramble had managed to convince themselves that the survival of P&amp;G rested on their ability to mine the deep secrets of the brain. For years now, they&#8217;d been throwing sums of money at cognitive neuroscientists that would make European royalty blush. That streak of good fortune, Stojko suspected, was now about to end. Recent events had rendered P&amp;G&#8217;s massive investment in ICNR something of a political liability; she had the feeling this was the last time she&#8217;d be making the trip to P&amp;G headquarters.</p>
<p>And not a moment too soon, she thought, as the door opened in front of her.</p>
<p style="text-align: center;">
*    *    *</p>
<p>&#8220;How long has Pockter and Gramble been funding you, Deborah,&#8221; Bob Ramsey, Chief Executive Officer, asked, once Stojko was seated and they&#8217;d gotten the standard pleasantries out of the way.</p>
<p>Stojko did the arithmetic in her head. The International Neuromarketing Consortium had formed in 2013, following a massive infusion of P&amp;G cash, so&#8230;</p>
<p>&#8220;Six years,&#8221; she said.</p>
<p>&#8220;Right. And do you know how much money Pockter and Gramble has given your consortium in those six years?&#8221;</p>
<p>&#8220;I&#8217;d put it somewhere between 251.8 and 251.9 million dollars.&#8221;</p>
<p>&#8220;Very clever. A quarter of a billion dollars. We&#8217;ve given you a quarter. Of a billion. Dollars.&#8221;</p>
<p>&#8220;Well, to be fair, that amount is spread out over 8 sites and 30 other investigators,&#8221; Stojko pointed out. &#8220;It&#8217;s not like you wrote <em>me</em> a check for 250 million. <em>My</em> institution only got about forty-five million.&#8221;</p>
<p>Ramsey didn&#8217;t say anything, but his expression bespoke a thinly-veiled irritation. He picked up a remote control on the desk and pushed a button. Behind Stojko, the wall turned translucent as the embedded display lit up.</p>
<p>&#8220;No doubt you&#8217;ll recognize this clip,&#8221; Ramsey said.</p>
<p>Stojko swiveled around to watch the giant screen. The camera faded in on a bright and comfortable-looking living room somewhere in America. Almost immediately, six or seven babies in diapers filed into the room and began dancing synchronously in a circle. After a few seconds of dancing, the babies started babbling an Eastern-sounding melody in a totally incomprehensible&#8211;and, Stojko suspected, nonexistent&#8211;language. And a few seconds after that, they started banging spoons on the tabletop in perfect unison, all the while still dancing and singing in tongues. The whole thing lasted exactly thirty seconds, and occupied a very narrow emotional niche between really adorable and utterly creepy.</p>
<p>Stojko did recognize the clip, of course; it was an ad for Dampers, a P&amp;G-owned diaper brand. The consortium had selected the ad from over two dozen candidates that P&amp;G had asked them to test. For reasons that remained unclear to Stojko&#8211;and to pretty much everyone else&#8211;singing, dancing, spoon-banging babies lit the brain up like a christmas tree.</p>
<p>Stojko had had her reservations about declaring a &#8216;winner&#8217;; she&#8217;d written several long emails to the P&amp;G marketing brain trust explaining that, brain activation notwithstanding, there really wasn&#8217;t any evidence yet that this particular ad was going to help sell more diapers, and many more studies were needed before the consortium could confidently interpret its own results. But marketing wasn&#8217;t into the whole waiting thing, and the ad was on the air within three months of the consortium&#8217;s initial report.</p>
<p>As it turned out, it didn&#8217;t do so well.</p>
<p>&#8220;That ad <em>bombed</em>,&#8221; Ramsey said, wagging his finger in the general direction of the screen, &#8220;According to you people, it was supposed to push all of the brain&#8217;s buttons <em>at once</em>. You spent three million dollars of our money just on that one testing program. Two dozen ads to choose from, and the one you pick completely tanked. It was an epic failure. At this very moment, people in living rooms all over America are laughing at Pockter and Gramble because of that ad.&#8221;</p>
<p>&#8220;I&#8217;m sure it&#8217;s not <em>that </em>bad&#8221; said Stojko, smirking almost imperceptibly. She was well aware of the PR disaster P&amp;G had on its hands, of course. But she couldn&#8217;t deny the warm feeling of schadenfreude that accompanied the knowledge that P&amp;G was now paying many times over for disregarding just about every recommendation the consortium had made in its 480-page report. She was pretty sure the suits had never made it past the fifth or sixth page.</p>
<p>&#8220;It <em>is</em> that bad,&#8221; Ramsey shot back. &#8220;We blew half of our network budget for the year on this ad. Our initial focus groups were already pretty positive, and then we received your report saying things like&#8211;and I quote&#8211;&#8221;of all the ads tested, number seventeen elicited the largest response in brain areas associated with reward.&#8221; So we figured it was a sure thing, and started airing the ad in all the major markets. And then, out of nowhere, we get this massive backlash. Thousands of angry emails from people complaining that the ad was trite and we were shamefully &#8220;exploiting babies&#8221;. People saying they would never buy Dampers diapers again; that the CEO&#8211;that&#8217;s me, mind you&#8211;should resign; that someone should &#8220;just torch Pockter and Gramble headquarters&#8221;. And those were just the <em>serious</em> complaints. There were also the people who apparently thought the whole thing was just a big joke that gave them an opening to do their own thing. We had forty YouTube videos <em>a day</em> uploaded by people spoofing the ad. There was one clip of six guys in giraffe suits singing and doing our baby dance. Sixteen million hits.&#8221;</p>
<p>&#8220;All publicity is good publicity, right?&#8221;</p>
<p>&#8220;No. Not even close.&#8221;</p>
<p>Stojko chuckled just loudly enough for Ramsey to hear.</p>
<p>&#8220;Is this funny to you?&#8221; Ramsey asked. &#8220;We give you a quarter of a billion dollars for commercials designed to push the brain&#8217;s reward buttons, and we get grown men in giraffe suits?&#8221;</p>
<p>&#8220;Well, let me put it this way, Bob. If your goal was really to make commercials that light up the brain&#8217;s reward circuitry, you wouldn&#8217;t have needed to do any serious research in the first place; you could have just run 30-second clips of semi-nude women making out with each other, or couples giggling and cuddling in bed. That&#8217;d cover most of the bases. You&#8217;d have all the reward-related activation you could want. But how many deodorant sticks do you think commercials like that would sell?&#8221;</p>
<p>Ramsey stared at Stojko blankly.</p>
<p>&#8220;Porn, flashing lights, pictures of hundred-dollar bills, a basket of shiny fresh fruit&#8230; lots of things activate the brain&#8217;s reward centers,&#8221; Stojko continued. &#8220;What makes you think a commercial that tangentially elicits reward-related activation is going to make people buy any more of a product?&#8221;</p>
<p>&#8220;Well, can&#8217;t you tell that?&#8221;</p>
<p>&#8220;Can we?&#8221; asked Stojko rhetorically. &#8220;I don&#8217;t know. Can <em>you</em> tell that? You guys probably have labs full of people trying to figure out whether the fact that people tell you they like a commercial means they&#8217;re going to buy more of the product featured in that commercial. And what&#8217;s the answer?&#8221;</p>
<p>&#8220;I don&#8217;t know that myself,&#8221; Ramsey replied abruptly. &#8220;It&#8217;s not my job to know that. I can have marketing come up here and tell you the answer if you like.&#8221;</p>
<p>Stojko shook her head.</p>
<p>&#8220;Doesn&#8217;t matter. I mean, it can only go one of two ways. If marketing <em>doesn&#8217;t</em> know what makes a commercial good or bad, you can&#8217;t really expect <em>us</em> to tell you what it is about the brain that makes people buy things. We don&#8217;t track how well your products sell after different ads go into circulation; how the hell would we know which commercials have the largest impact on sales? I can tell you which commercials activate the nucleus accumbens more than others, but so what? How am I supposed to know if nucleus accumbens activation is a good predictor of actual purchases without actually knowing anything about real-world purchases?&#8221;</p>
<p>Ramsey had nothing to say to that; he stared down at his shoes.</p>
<p>&#8220;So clearly, that&#8217;s not going to help us,&#8221; Stojko continued. &#8220;But suppose instead we pretend that the people in your marketing department are smart cookies, and they <em>do</em> know what it is about commercials that makes people buy your products. Well, in that case, what the hell would you need <em>us</em>? If you&#8217;ve figured out that people are more likely to buy your anti-dandruff shampoo after watching ads they rate &#8216;extremely interesting&#8217;, what is peering into the brain going to tell you?&#8221;</p>
<p>&#8220;Well, I guess you could use brain imaging to figure out what it is that people find extremely interesting, right?&#8221;</p>
<p>&#8220;Sure, Bob, we could do that. And you know how we&#8217;d do that? By asking people which commercials they found interesting, and then correlating their verbal responses with what their brains were doing while they watched those commercials. And you know what that means? It means we can never do any better than your people can do with your focus groups and spreadsheets. Because basically, we&#8217;re stuck trying to predict the same variables that you guys are using to predict people&#8217;s buying behavior. We&#8217;re just one step further removed.&#8221;</p>
<p>Ramsey listened quietly, but anger visibly colored his face as Stojko spoke.</p>
<p>&#8220;This is the kind of thing that might have been good to bring up, oh, say, five years ago,&#8221; he said.</p>
<p>&#8220;Oh, believe me, we did bring it up,&#8221; Stojko smiled bitterly. &#8220;Or at least, we tried to.&#8221;</p>
<p>She tapped a few keys on her holoboard.</p>
<p>&#8220;Here&#8217;s an email dated June 18th, 2014: &#8220;Dear Mr. Chauahan&#8211;I believe that&#8217;s your VP of marketing, right?&#8211;senior members of the consortium continue to express their frustration at Pockter and Gramble&#8217;s failure to provide us with the sales data we requested. As we indicated in our letter dated April 21st, it is not possible for us to properly evaluate the efficacy of our program without the use of real-world performance metrics. We understand your concerns about sharing private data with outside contractors; however&#8230;&#8221;</p>
<p>Stojko shot Ramsey a pointed look.</p>
<p>&#8220;I&#8217;ll spare you the rest; it goes on like that for three pages. See, we&#8217;ve been asking for the data we need for six years now&#8211;pretty much since we started. And every time we ask, you throw more money at us and tell us to go back to work, that you&#8217;re not going to share your numbers with us because they&#8217;re confidential and we shouldn&#8217;t need that information anyway.&#8221;</p>
<p>She tapped a few more keys.</p>
<p>&#8220;Here&#8217;s another similar one. September 30th: Dear Mr. Chauahan, the consortium is at a loss to understand&#8230;&#8221;</p>
<p>&#8220;Enough!&#8221; yelled Ramsey, slamming his fist down on the desk. &#8220;I get the point! We&#8217;ve spent a quarter of a billion buying you new toys to play with, and all the while you&#8217;ve been playing us for idiots. Well, you know what&#8211;enjoy your toys while they last, because we&#8217;re going to have Legal look at our options for recovering that money first thing Monday morning. Those fancy new scanners of yours are going <em>away</em>.&#8221;</p>
<p>He wheeled his chair away from Stojko and sat there fuming. Stojko took it as a sign the meeting was over; she shrugged and got up to leave.</p>
<p>The falling out was unfortunate, she thought as she walked down the long sterile corridor towards the elevator. But it had been a long time coming, and after the whole Babygate episode (as the scientists at ICNR had started calling it), no one at ICNR would be surprised to hear that P&amp;G was pulling the plug.</p>
<p>Nor would most of them mind terribly much. Stojko had always planned for a six or seven-year run, and had stopped hiring people on short-term contracts a couple of years ago. There would be no massive lay-offs, no collective plunge into obscurity for the many researchers invested in the project. The data was already collected, and she and her colleagues would be kept busy analyzing and publishing the results for years to come.</p>
<p>As for Ramsey&#8217;s legal threats, Stojko wasn&#8217;t the least bit worried. Universities had lawyers too, and there wasn&#8217;t a judge in the country who&#8217;d award P&amp;G a single nickel for breach of contract; not after reading the long series of emails from the consortium that already explained in excruciating detail exactly why P&amp;G was never going to recoup its financial investment unless it fundamentally changed the way it did things. Which, of course, hadn&#8217;t happened&#8211;and probably never would.</p>
<p>Stojko left Pockter and Gramble headquarters with a clear conscience. At the end of the day, she thought as she walked to her car, all you could do was represent yourself honestly to the other party and let the chips fall where they may. And that was what she&#8217;d done. She&#8217;d told P&amp;G all along exactly how the consortium was going to spend the money they received; the service agreements she signed were very clearly delineated in legalese that several lawyers on the institutional payroll had contributed to and pored over. Stojko and her colleagues had worked hard to ensure that no one at P&amp;G was laboring under false pretenses about the likely outcome of ICNR&#8217;s work. As she&#8217;d once put it to a mid-level P&amp;G executive over dinner, neuromarketing research was great for science, and (in her estimation) utterly useless for advertising. But if the suits were willing to pay for it, she was willing to do the research. That, after all, was her job; it was what she&#8217;d be doing with her time anyway, ICNR or no ICNR.</p>
<p>No, she thought, turning the key in the ignition. She&#8217;d been right to take the industry money; ICNR had conducted itself impeccably over the past six years. If someone insisted on filling your cup up with change even after you very carefully explained to them that you were only going to buy beer with it, who could blame you for paying a visit to the bar once panhandling hours were over?</p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/5jALIywgEjk" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/07/26/babygate-blues-a-neuromarketing-tale/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/07/26/babygate-blues-a-neuromarketing-tale/</feedburner:origLink></item>
		<item>
		<title>elsewhere on the net, vacation edition</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/4eHL0bljQhg/</link>
		<comments>http://www.talyarkoni.org/blog/2010/07/13/elsewhere-on-the-net-vacation-edition/#comments</comments>
		<pubDate>Wed, 14 Jul 2010 04:50:15 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[fun with data]]></category>
		<category><![CDATA[general silliness]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[authorship]]></category>
		<category><![CDATA[bisexuality]]></category>
		<category><![CDATA[bloggers]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[dehydrator]]></category>
		<category><![CDATA[ed yong]]></category>
		<category><![CDATA[john zogby]]></category>
		<category><![CDATA[magnetic vision]]></category>
		<category><![CDATA[neuroethics]]></category>
		<category><![CDATA[neurolaw]]></category>
		<category><![CDATA[neuroplasticity]]></category>
		<category><![CDATA[okcupid]]></category>
		<category><![CDATA[plagiarism]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=651</guid>
		<description><![CDATA[I&#8217;m hanging out in Boston for a few days, so blogging will probably be sporadic or nonexistent. Which is to say, you probably won&#8217;t notice any difference. The last post on the Dunning-Kruger effect somehow managed to rack up 10,000 hits in 48 hours; but that was last week. Today I looked at my stats [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m hanging out in Boston for a few days, so blogging will probably be sporadic or nonexistent. Which is to say, you probably won&#8217;t notice any difference.</p>
<p>The last post on the Dunning-Kruger effect somehow managed to rack up 10,000 hits in 48 hours; but that was last week. Today I looked at my stats again, and the blog is back to a more normal 300 hits, so I feel like it&#8217;s safe to blog again. Here are some neat (and totally unrelated) links from the past week:</p>
<ul>
<li>OKCupid has <a href="http://blog.okcupid.com/index.php/2010/07/07/the-biggest-lies-in-online-dating/">another one of those nifty posts</a> showing off all the cool things they can learn from their gigantic userbase (who else gets to say things like &#8220;this analysis includes 1.51 million users&#8217; data&#8221;???). Apparently, tall people (claim to) have more sex, attractive photos are more likely to be out of date, and most people who claim to be bisexual aren&#8217;t really bisexual.</li>
<li>After a few months off, my department-mate Chris Chatham is posting furiously again over at <a href="http://scienceblogs.com/developingintelligence/">Developing Intelligence</a>, with a series of excellent posts reviewing recent work on cognitive control and the perils of fMRI research. I&#8217;m not really sure what Chris spent his blogging break doing, but given the frequency with which he&#8217;s been posting lately, my suspicion is that he spent it secretly writing blog posts.</li>
<li>Mark Liberman <a href="http://languagelog.ldc.upenn.edu/nll/?p=2441">points out a fundamental inconsistency</a> in the way we view attributions of authorship: we get appropriately angry at academics who pass someone else&#8217;s work off as their own, but think it&#8217;s just fine for politicians to pay speechwriters to write for them. It&#8217;s an interesting question, and leads to an intimately related, and even more important question&#8211;namely, will anyone get mad at me if I pay someone else to write a blog post for me about someone else&#8217;s blog post discussing people getting angry at people paying or not paying other people to write material for other people that they do or don&#8217;t own the copyright on?</li>
<li>I like oohing and aahing over large datasets, and the <a href="http://www.guardian.co.uk/news/datablog">Guardian&#8217;s Data Blog</a> provides a nice interface to some of the most ooh- and aah-able datasets out there. [via <a href="http://www.r-chart.com/2010/07/world-government-data-store-api-r-and.html">R-Chart</a>]</li>
<li>Ed Yong has a characteristically excellent write-up about recent work on <a href="http://blogs.discovermagazine.com/notrocketscience/2010/07/08/robins-can-literally-see-magnetic-fields-but-only-if-their-vision-is-sharp/">the magnetic vision of birds</a>. Yong also does link dump posts better than anyone else, so you should probably stop reading this one right now and <a href="http://blogs.discovermagazine.com/notrocketscience/2010/07/10/ive-got-your-missing-links-right-here/">read his instead</a>.</li>
<li>You&#8217;ve probably heard about this already, but some time last week, the brain trust at ScienceBlogs made the <a href="http://www.guardian.co.uk/science/blog/2010/jul/07/scienceblogs-blogging-pepsi">amazingly clever decision</a> to throw away their integrity by selling PepsiCo its very own &#8220;science&#8221; blog. Predictably, a lot of the bloggers weren&#8217;t happy with the decision, and many have now moved onto greener pastures; Carl Zimmer&#8217;s <a href="http://blogs.discovermagazine.com/loom/2010/07/07/oh-pepsi-what-hath-thou-wrought/">keeping score</a>. Personally, I don&#8217;t have anything intelligent to add to everything that&#8217;s already been said; I&#8217;m literally dumbfounded.</li>
<li>Andrew Gelman <a href="http://www.stat.columbia.edu/~cook/movabletype/archives/2010/07/a_note_to_john.html">takes apart</a> an obnoxious letter from pollster John Zogby to Nate Silver of <a href="http://fivethirtyeight.com">fivethirtyeight.com</a>. I guess now we know that Zogby didn&#8217;t get where he is by not being an ass to other people.</li>
<li>Vaughan Bell of Mind Hacks points out that <a href="http://www.mindhacks.com/blog/2010/07/neuroplasticity_is_n.html">neuroplasticity isn&#8217;t a new concept</a>, and was discussed seriously in the literature as far back as the 1800s. Apparently our collective views about the malleability of mind are not, themselves, very plastic.</li>
<li>NPR ran a <a href="http://www.wbur.org/npr/127888976">three-part story</a> by Barbara Bradley Hagerty on the emerging and somewhat uneasy relationship between neuroscience and the law. The articles are pretty good, but much better, in my opinion, was the <a href="http://www.npr.org/templates/story/story.php?storyId=128339306 ">Talk of the Nation episode</a> that featured Hagerty as a guest alongside Joshua Greene, Kent Kiehl, and Stephen Morse&#8211;people who&#8217;ve all contributed in various ways to the emerging discipline of NeuroLaw. It&#8217;s a really interesting set of interviews and discussions. For what it&#8217;s worth, I think I agree with just about everything Greene has to say about these issues&#8211;except that he says things much more eloquently than I think them.</li>
<li>Okay, this one&#8217;s totally frivolous, but does anyone want to buy me one of <a href="http://www.kk.org/cooltools/archives/004590.php">these things</a>? I don&#8217;t even like dried food; I just think it would be fun to stick random things in there and watch them come out pale, dried husks of their former selves. Is it morbid to enjoy watching the life slowly being sucked out of apples and mushrooms?</li>
</ul>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/4eHL0bljQhg" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/07/13/elsewhere-on-the-net-vacation-edition/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/07/13/elsewhere-on-the-net-vacation-edition/</feedburner:origLink></item>
		<item>
		<title>what the Dunning-Kruger effect is and isn’t</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/iJN9zNO7Awo/</link>
		<comments>http://www.talyarkoni.org/blog/2010/07/07/what-the-dunning-kruger-effect-is-and-isnt/#comments</comments>
		<pubDate>Wed, 07 Jul 2010 06:46:13 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[psychology]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[confirmation bias]]></category>
		<category><![CDATA[decision making]]></category>
		<category><![CDATA[dunning-kruger effect]]></category>
		<category><![CDATA[metacognition]]></category>
		<category><![CDATA[regression to the mean]]></category>
		<category><![CDATA[self-enhancement]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=641</guid>
		<description><![CDATA[If you regularly read cognitive science or psychology blogs (or even just the lowly New York Times!), you&#8217;ve probably heard of something called the Dunning-Kruger effect. The Dunning-Kruger effect refers to the seemingly pervasive tendency of poor performers to overestimate their abilities relative to other people&#8211;and, to a lesser extent, for high performers to underestimate [...]]]></description>
			<content:encoded><![CDATA[<p>If you regularly read cognitive science or psychology blogs (or even just the lowly <a href="http://opinionator.blogs.nytimes.com/2010/06/20/the-anosognosics-dilemma-1/">New York Times</a>!), you&#8217;ve probably heard of something called the <a href="http://en.wikipedia.org/wiki/Dunning-kruger_effect">Dunning-Kruger effect</a>. The Dunning-Kruger effect refers to the seemingly pervasive tendency of poor performers to overestimate their abilities relative to other people&#8211;and, to a lesser extent, for high performers to underestimate their abilities. The explanation for this, according to Kruger and Dunning, who first reported the effect in an <a href="http://www.jerwood-no.org.uk/pdf/Dunning%20Kruger.pdf">extremely influential 1999 article</a> in the Journal of Personality and Social Psychology, is that incompetent people by lack the skills they&#8217;d need in order to be able to distinguish good performers from bad performers:</p>
<blockquote><p>&#8230;people who lack the knowledge or wisdom to perform well are often unaware of this fact. We attribute this lack of awareness to a deficit in metacognitive skill. That is, the same incompetence that leads them to make wrong choices also deprives them of the savvy necessary to recognize competence, be it their own or anyone else&#8217;s.</p></blockquote>
<p>For reasons I&#8217;m not really clear on, the Dunning-Kruger effect seems to be experiencing something of a renaissance over the past few months; it&#8217;s <em>everywhere</em> in the blogosphere and media. For instance, here are just a few alleged Dunning-Krugerisms from the past few weeks:</p>
<blockquote><p>&#8230;<a href="http://www.arkansasbusiness.com/article.aspx?aid=122918.54928.135046">So what does this mean in business?</a> Well, it&#8217;s all over the place. Even the title of Dunning and Kruger&#8217;s paper, the part about inflated self-assessments, reminds me of a truism that was pointed out by a supervisor early in my career: The best employees will invariably be the hardest on themselves in self-evaluations, while the lowest performers can be counted on to think they are doing excellent work&#8230;</p>
<p>&#8230;<a href="http://youarenotsosmart.com/2010/05/11/the-dunning-kruger-effect/">Heidi Montag and Spencer Pratt</a> are great examples of the Dunning-Kruger effect. A whole industry of assholes are making a living off of encouraging two attractive yet untalented people they are actually genius auteurs. The bubble around them is so thick, they may never escape it. At this point, all of America (at least those who know who they are), is in on the joke – yet the two people in the center of this tragedy are completely unaware&#8230;</p>
<p>&#8230;<a href="http://feedproxy.google.com/~r/scienceblogs/pharyngula/~3/he9-XnPeozU/the_sorry_state_of_the_public.php">Not so fast there</a> — the Dunning-Kruger effect comes into play here. People in the United States do not have a high level of understanding of evolution, and this survey did not measure actual competence. I&#8217;ve found that the people most likely to declare that they have a thorough knowledge of evolution are the creationists…but that a brief conversation is always sufficient to discover that all they&#8217;ve really got is a confused welter of misinformation&#8230;</p></blockquote>
<p>As you can see, the findings reported by Kruger and Dunning are often interpreted to suggest that the less competent people are, the more competent they <em>think </em>they are. People who perform worst at a task tend to think they&#8217;re god&#8217;s gift to said task, and the people who can actually do said task often display excessive modesty. I suspect we find this sort of explanation compelling because it appeals to our implicit <a href="http://en.wikipedia.org/wiki/Just-world_phenomenon">just-world theories</a>: we&#8217;d like to believe that people who obnoxiously proclaim their excellence at X, Y, and Z must really not be so very good at X, Y, and Z at all, and must be (over)compensating for some actual deficiency; it&#8217;s much less pleasant to imagine that people who go around shoving their (alleged) superiority in our faces might really be better than us at what they do.</p>
<p>Unfortunately, Kruger and Dunning never actually provided any support for this type of just-world view; their studies categorically <em>didn&#8217;t</em> show that incompetent people are more confident or arrogant than competent people. What they <em>did</em> show is this:</p>
<p><a href="http://www.talyarkoni.org/blog/wp-content/uploads/2010/07/dunning_kruger.png"><img class="alignnone size-full wp-image-642" src="http://www.talyarkoni.org/blog/wp-content/uploads/2010/07/dunning_kruger.png" alt="" width="434" height="395" /></a></p>
<p>This is one of the key figures from Kruger and Dunning&#8217;s 1999 paper (and the basic effect has been replicated many times since). The critical point to note is that there&#8217;s a clear <em>positive</em> correlation between actual performance (gray line) and perceived performance (black line): the people in the top quartile for actual performance think they perform better than the people in the second quartile, who in turn think they perform better than the people in the third quartile, and so on. So the bias is definitively <em>not</em> that incompetent people think they&#8217;re better than competent people. Rather, it&#8217;s that <em>incompetent people think they&#8217;re much better than they actually are</em>. But they typically still don&#8217;t think they&#8217;re quite as good as people who, you know, actually <em>are</em> good. (It&#8217;s important to note that Dunning and Kruger never claimed to show that the unskilled think they&#8217;re better than the skilled; that&#8217;s just the way the finding is often interpreted by others.)</p>
<p>That said, it&#8217;s clear that there <em>is</em> a very large discrepancy between the way incompetent people actually perform and the way they perceive their own performance level, whereas the discrepancy is much smaller for highly competent individuals. So the big question is why. Kruger and Dunning&#8217;s explanation, as I mentioned above, is that incompetent people lack the skills they&#8217;d need in order to know they&#8217;re incompetent. For example, if you&#8217;re not very good at learning languages, it might be hard for you to tell that you&#8217;re not very good, because the very skills that you&#8217;d need in order to distinguish someone who&#8217;s good from someone who&#8217;s not are the ones you lack. If you can&#8217;t hear <a href="http://en.wikipedia.org/wiki/Speech_perception">the distinction between two different phonemes</a>, how could you ever know who has native-like pronunciation ability and who doesn&#8217;t? If you don&#8217;t understand very many words in another language, how can you evaluate the size of your own vocabulary in relation to other people&#8217;s?</p>
<p>This appeal to people&#8217;s meta-cognitive abilities (i.e., their knowledge about their knowledge) has some intuitive plausibility, and Kruger, Dunning and their colleagues have provided quite a bit of evidence for it over the past decade. That said, it&#8217;s by no means the only explanation around; over the past few years, a fairly sizeable literature criticizing or extending Kruger and Dunning&#8217;s work has developed. I&#8217;ll mention just three plausible (and mutually compatible) alternative accounts people have proposed (but <a href="http://opim.wharton.upenn.edu/risk/library/J2007JPers+SocPsy_DAM,DAS_Error,Bias.pdf">there</a> are <a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1114332">others</a>!)</p>
<p><strong><em>1. </em><em>Regression toward the mean</em>. </strong>Probably the most common criticism of the Dunning-Kruger effect is that it simply reflects <a href="http://en.wikipedia.org/wiki/Regression_to_the_mean">regression to the mean</a>&#8211;that is, it&#8217;s a statistical artifact. Regression to the mean refers to the fact that any time you select a group of individuals based on some criterion, and then measure the standing of those individuals on some other dimension, performance levels will tend to shift (or regress) toward the mean level. It&#8217;s a notoriously underappreciated problem, and probably explains many, many phenomena that people have tried to interpret substantively. For instance, in placebo-controlled clinical trials of <a href="http://en.wikipedia.org/wiki/SSRIs">SSRIs</a>, depressed people tend to get better in both the drug and placebo conditions. Some of this is undoubtedly due to the <a href="http://en.wikipedia.org/wiki/Placebo_effect">placebo effect</a>, but much of it is probably also due to what&#8217;s often referred to as &#8220;natural history&#8221;. Depression, like most things, tends to be cyclical: people get better or worse better over time, often for no apparent rhyme or reason. But since people tend to seek help (and sign up for drug trials) primarily when they&#8217;re doing particularly badly, it follows that most people would get better to some extent even without any treatment. That&#8217;s regression to the mean (the Wikipedia entry has other nice examples&#8211;for example, the famous <a href="http://en.wikipedia.org/wiki/Sports_Illustrated_Cover_Jinx">Sports Illustrated Cover Jinx</a>).</p>
<p>In the context of the Dunning-Kruger effect, the argument is that incompetent people simply regress toward the mean when you ask them to evaluate their own performance. Since perceived performance is influenced not only by actual performance, but also by many other factors (e.g., one&#8217;s personality, meta-cognitive ability, measurement error, etc.), it follows that, on average, people with extreme levels of actual performance won&#8217;t be quite as extreme in terms of their perception of their performance. So, much of the Dunning-Kruger effect arguably doesn&#8217;t need to be explained at all, and in fact, it would be quite surprising if you <em>didn&#8217;t</em> see a pattern of results that looks at least somewhat like the figure above.</p>
<p><strong><em>2. </em><em>Regression to the mean plus better-than-average.</em></strong> Having said that, it&#8217;s clear that regression to the mean can&#8217;t explain everything about the Dunning-Kruger effect. One problem is that it doesn&#8217;t explain why the effect is greater at the low end than at the high end. That is, incompetent people tend to overestimate their performance to a much greater extent than competent people underestimate their performance. This asymmetry can&#8217;t be explained solely by regression to the mean. It can, however, be explained by a combination of RTM and a &#8220;better-than-average&#8221; (or self-enhancement) heuristic which says that, in general, most people have a tendency to view themselves excessively positively. This two-pronged explanation was proposed by <a href="http://www.ncbi.nlm.nih.gov/pubmed/11831408">Krueger and Mueller in a 2002 study</a> (note that Krueger and Kruger are different people!), who argued that poor performers suffer from a double whammy: not only do their perceptions of their own performance regress toward the mean, but those perceptions are also further inflated by the self-enhancement bias. In contrast, for high performers, these two effects largely balance each other out: regression to the mean causes high performers to underestimate their performance, but to some extent that underestimation is offset by the self-enhancement bias. As a result, it looks as though high performers make more accurate judgments than low performers, when in reality the high performers are just lucky to be where they are in the distribution.</p>
<p><strong><em>3. </em><em>The instrumental role of task difficulty.</em></strong> Consistent with the notion that the Dunning-Kruger effect is at least partly a statistical artifact, some studies have shown that the asymmetry reported by Kruger and Dunning (i.e., the smaller discrepancy for high performers than for low performers) actually goes away, and even reverses, when the ability tests given to participants are very difficult. For instance, <a href="http://deepblue.lib.umich.edu/bitstream/2027.42/39168/1/956.pdf">Burson and colleagues (2006)</a>, writing in JPSP, showed that when University of Chicago undergraduates were asked moderately difficult trivia questions about their university, the subjects who performed best were just as poorly calibrated as the people who performed worst, in the sense that their estimates of how well they did relative to other people were wildly inaccurate. Here&#8217;s what that looks like:</p>
<p><a href="http://www.talyarkoni.org/blog/wp-content/uploads/2010/07/burson_figure.png"><img class="alignnone size-full wp-image-643" title="burson_figure" src="http://www.talyarkoni.org/blog/wp-content/uploads/2010/07/burson_figure.png" alt="" width="440" height="375" /></a></p>
<p>Notice that this finding wasn&#8217;t anomalous with respect to the Kruger and Dunning findings; when participants were given easier trivia (the diamond-studded line), Burson et al observed the standard pattern, with poor performers seemingly showing worse calibration. Simply knocking about 10% off the accuracy rate on the trivia questions was enough to induce a large shift in the relative mismatch between perceptions of ability and actual ability. Burson et al then went on to replicate this pattern in two additional studies involving a number of different judgments and tasks, so this result isn&#8217;t specific to trivia questions. In fact, in the later studies, Burson et al showed that when the task was <em>really</em> difficult, poor performers were actually considerably <em>better</em> calibrated than high performers.</p>
<p>Looking at the figure above, it&#8217;s not hard to see why this would be. Since the slope of the line tends to be pretty constant in these types of experiments, any change in mean performance levels (i.e., a shift in intercept on the y-axis) will necessarily result in a larger difference between actual and perceived performance at the high end. Conversely, if you raise the line, you maximize the difference between actual and perceived performance at the lower end.</p>
<p>To get an intuitive sense of what&#8217;s happening here, just think of it this way: if you&#8217;re performing a very difficult task, you&#8217;re probably going to find the experience subjectively demanding even if you&#8217;re at the high end relative to other people. Since people&#8217;s judgments about their own relative standing depends to a substantial extent on their subjective perception of their <em>own</em> performance (i.e., you use your sense of how easy a task was as a proxy of how good you must be at it), high performers are going to end up systematically underestimating how well they did. When a task is difficult, most people assume they must have done relatively poorly compared to other people. Conversely, when a task is relatively easy (and the tasks Dunning and Kruger studied were on the easier side), most people assume they must be pretty good compared to others. As a result, it&#8217;s going to look like the people who perform well are well-calibrated when the task is easy and poorly-calibrated when the task is difficult; less competent people are going to show exactly the opposite pattern. And note that this doesn&#8217;t require us to assume <em>any</em> relationship between actual performance and perceived performance. You would expect to get the Dunning-Kruger effect for easy tasks even if there was exactly zero correlation between how good people actually are at something and how good they think they are.</p>
<p>Here&#8217;s how Burson et al summarized their findings:</p>
<blockquote><p>Our studies replicate, eliminate, or reverse the association between task performance and judgment accuracy reported by Kruger and Dunning (1999) as a function of task difficulty. On easy tasks, where there is a positive bias, the best performers are also the most accurate in estimating their standing, but on difficult tasks, where there is a negative bias, the worst performers are the most accurate. This pattern is consistent with a combination of noisy estimates and overall bias, with no need to invoke differences in metacognitive abilities. In this  regard, our findings support Krueger and Mueller’s (2002) reinterpretation of Kruger and Dunning’s (1999) findings. An association between task-related skills and metacognitive insight may indeed exist, and later we offer some suggestions for ways to test for it. However, our analyses indicate that the primary drivers of errors in judging relative standing are general inaccuracy and overall biases tied to task difficulty. Thus, it is important to know more about those sources of error in order to better understand and ameliorate them.</p></blockquote>
<p>What should we conclude from these (and other) studies? I think the jury&#8217;s still out to some extent, but at minimum, I think it&#8217;s clear that much of the Dunning-Kruger effect reflects either statistical artifact (regression to the mean), or much more general cognitive biases (the tendency to self-enhance and/or to use one&#8217;s subjective experience as a guide to one&#8217;s standing in relation to others). This doesn&#8217;t mean that the meta-cognitive explanation preferred by Dunning, Kruger and colleagues can&#8217;t hold in some situations; it very well may be that in <em>some</em> cases, and to <em>some</em> extent, people&#8217;s lack of skill is really what prevents them from accurately determining their standing in relation to others. But I think our default position should be to prefer the alternative explanations I&#8217;ve discussed above, because they&#8217;re (a) simpler, (b) more general (they explain lots of other phenomena), and (c) necessary (frankly, it&#8217;d be amazing if regression to the mean <em>didn&#8217;t</em> explain at least part of the effect!).</p>
<p>We should also try to be aware of another very powerful cognitive bias whenever we use the Dunning-Kruger effect to explain the people or situations around us&#8211;namely, <a href="http://en.wikipedia.org/wiki/Confirmation_bias">confirmation bias</a>. If you believe that incompetent people don&#8217;t know enough to know they&#8217;re incompetent, it&#8217;s not hard to find anecdotal evidence for that; after all, we all know people who are both arrogant and not very good at what they do. But if you stop to look for it, it&#8217;s probably also not hard to find disconfirming evidence. After all, there are clearly plenty of people who <em>are</em> good at what they do, but not nearly as good as they think they are (i.e., they&#8217;re above average, and still totally miscalibrated in the positive direction). Just like there are plenty of people who are lousy at what they do and recognize their limitations (e.g., I don&#8217;t need to be a great runner in order to be able to tell that I&#8217;m <em>not</em> a great runner&#8211;I&#8217;m perfectly well aware that I have terrible endurance, precisely because I can&#8217;t finish runs that most other runners find trivial!). But <a href="http://www.talyarkoni.org/blog/2010/05/14/in-defense-of-three-of-my-favorite-sayings/">the plural of anecdote is not data</a>, and the data appear to be equivocal. Next time you&#8217;re inclined to chalk your obnoxious co-worker&#8217;s delusions of grandeur down to the Dunning-Kruger effect, consider the possibility that your co-worker&#8217;s simply a jerk&#8211;no meta-cognitive incompetence necessary.</p>
<p><span style="float: left; padding: 5px;"><a href="http://www.researchblogging.org"><img style="border: 0;" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" alt="ResearchBlogging.org" /></a></span><span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Journal+of+personality+and+social+psychology&amp;rft_id=info%3Apmid%2F10626367&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Unskilled+and+unaware+of+it%3A+how+difficulties+in+recognizing+one%27s+own+incompetence+lead+to+inflated+self-assessments.&amp;rft.issn=0022-3514&amp;rft.date=1999&amp;rft.volume=77&amp;rft.issue=6&amp;rft.spage=1121&amp;rft.epage=34&amp;rft.artnum=&amp;rft.au=Kruger+J&amp;rft.au=Dunning+D&amp;rfe_dat=bpr3.included=1;bpr3.tags=Psychology%2CSocial+Psychology%2C+Judgment+and+Decision+Making">Kruger J, &amp; Dunning D (1999). Unskilled and unaware of it: how difficulties in recognizing one&#8217;s own incompetence lead to inflated self-assessments. <span style="font-style: italic;">Journal of personality and social psychology, 77</span> (6), 1121-34 PMID: <a rev="review" href="http://www.ncbi.nlm.nih.gov/pubmed/10626367">10626367</a><a title="Search Columbia Collections for this item" href="http://rd8hp6du2b.search.serialssolutions.com/?rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Journal%20of%20personality%20and%20social%20psychology&amp;rft_id=info%3Apmid%2F10626367&amp;rft.atitle=Unskilled%20and%20unaware%20of%20it%3A%20how%20difficulties%20in%20recognizing%20one's%20own%20incompetence%20lead%20to%20inflated%20self-assessments.&amp;rft.issn=0022-3514&amp;rft.date=1999&amp;rft.volume=77&amp;rft.issue=6&amp;rft.spage=1121&amp;rft.epage=34&amp;rft.artnum=&amp;rft.au=Kruger%20J&amp;rft.au=Dunning%20D&amp;rfe_dat=bpr3.included&amp;url_ver=Z39.88-2004&amp;rfr_id=info:sid/libx"><img src="chrome://libx/skin/g_100x25_DB__ot__4407_2.gif" border="0" alt="" /></a></span><br />
<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Journal+of+personality+and+social+psychology&amp;rft_id=info%3Apmid%2F11831408&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Unskilled%2C+unaware%2C+or+both%3F+The+better-than-average+heuristic+and+statistical+regression+predict+errors+in+estimates+of+own+performance.&amp;rft.issn=0022-3514&amp;rft.date=2002&amp;rft.volume=82&amp;rft.issue=2&amp;rft.spage=180&amp;rft.epage=8&amp;rft.artnum=&amp;rft.au=Krueger+J&amp;rft.au=Mueller+RA&amp;rfe_dat=bpr3.included=1;bpr3.tags=Psychology%2CSocial+Psychology%2C+Judgment+and+Decision+Making%2C+Statistics">Krueger J, &amp; Mueller RA (2002). Unskilled, unaware, or both? The better-than-average heuristic and statistical regression predict errors in estimates of own performance. <span style="font-style: italic;">Journal of personality and social psychology, 82</span> (2), 180-8 PMID: <a rev="review" href="http://www.ncbi.nlm.nih.gov/pubmed/11831408">11831408</a><a title="Search Columbia Collections for this item" href="http://rd8hp6du2b.search.serialssolutions.com/?rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Journal%20of%20personality%20and%20social%20psychology&amp;rft_id=info%3Apmid%2F11831408&amp;rft.atitle=Unskilled%2C%20unaware%2C%20or%20both%3F%20The%20better-than-average%20heuristic%20and%20statistical%20regression%20predict%20errors%20in%20estimates%20of%20own%20performance.&amp;rft.issn=0022-3514&amp;rft.date=2002&amp;rft.volume=82&amp;rft.issue=2&amp;rft.spage=180&amp;rft.epage=8&amp;rft.artnum=&amp;rft.au=Krueger%20J&amp;rft.au=Mueller%20RA&amp;rfe_dat=bpr3.included&amp;url_ver=Z39.88-2004&amp;rfr_id=info:sid/libx"><img src="chrome://libx/skin/g_100x25_DB__ot__4407_2.gif" border="0" alt="" /></a></span><br />
<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Journal+of+personality+and+social+psychology&amp;rft_id=info%3Apmid%2F16448310&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Skilled+or+unskilled%2C+but+still+unaware+of+it%3A+how+perceptions+of+difficulty+drive+miscalibration+in+relative+comparisons.&amp;rft.issn=0022-3514&amp;rft.date=2006&amp;rft.volume=90&amp;rft.issue=1&amp;rft.spage=60&amp;rft.epage=77&amp;rft.artnum=&amp;rft.au=Burson+KA&amp;rft.au=Larrick+RP&amp;rft.au=Klayman+J&amp;rfe_dat=bpr3.included=1;bpr3.tags=Psychology%2CSocial+Psychology%2C+Judgment+and+Decision+Making">Burson KA, Larrick RP, &amp; Klayman J (2006). Skilled or unskilled, but still unaware of it: how perceptions of difficulty drive miscalibration in relative comparisons. <span style="font-style: italic;">Journal of personality and social psychology, 90</span> (1), 60-77 PMID: <a rev="review" href="http://www.ncbi.nlm.nih.gov/pubmed/16448310">16448310</a><a title="Search Columbia Collections for this item" href="http://rd8hp6du2b.search.serialssolutions.com/?rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Journal%20of%20personality%20and%20social%20psychology&amp;rft_id=info%3Apmid%2F16448310&amp;rft.atitle=Skilled%20or%20unskilled%2C%20but%20still%20unaware%20of%20it%3A%20how%20perceptions%20of%20difficulty%20drive%20miscalibration%20in%20relative%20comparisons.&amp;rft.issn=0022-3514&amp;rft.date=2006&amp;rft.volume=90&amp;rft.issue=1&amp;rft.spage=60&amp;rft.epage=77&amp;rft.artnum=&amp;rft.au=Burson%20KA&amp;rft.au=Larrick%20RP&amp;rft.au=Klayman%20J&amp;rfe_dat=bpr3.included&amp;url_ver=Z39.88-2004&amp;rfr_id=info:sid/libx"><img src="chrome://libx/skin/g_100x25_DB__ot__4407_2.gif" border="0" alt="" /></a></span></p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/iJN9zNO7Awo" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/07/07/what-the-dunning-kruger-effect-is-and-isnt/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/07/07/what-the-dunning-kruger-effect-is-and-isnt/</feedburner:origLink></item>
		<item>
		<title>this year, i backed new zealand to go all the way</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/XpT43Wvc6LQ/</link>
		<comments>http://www.talyarkoni.org/blog/2010/07/06/this-year-i-backed-new-zealand-to-go-all-the-way/#comments</comments>
		<pubDate>Wed, 07 Jul 2010 04:49:21 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[general silliness]]></category>
		<category><![CDATA[chance]]></category>
		<category><![CDATA[football]]></category>
		<category><![CDATA[oranje]]></category>
		<category><![CDATA[reinforcement learning]]></category>
		<category><![CDATA[soccer]]></category>
		<category><![CDATA[sports]]></category>
		<category><![CDATA[world cup]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=638</guid>
		<description><![CDATA[Jerry Coyne ponders whether the best football/soccer team generally wins the World Cup. The answer is clearly no: any sporting event where games are settled on the basis of rare events (e.g., only one or two goals per match), and teams only play each other once to determine a winner, is going to be at [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://whyevolutionistrue.wordpress.com">Jerry Coyne</a> ponders <a href="http://whyevolutionistrue.wordpress.com/2010/07/04/does-the-best-team-win-the-world-cup/">whether the best football/soccer team generally wins the World Cup</a>. The answer is clearly no: any sporting event where games are settled on the basis of rare events (e.g., only one or two goals per match), and teams only play each other once to determine a winner, is going to be at the mercy of Lady Luck a good deal of the time. If we really wanted the best team to come out on top reliably, we&#8217;d probably need teams to play multiple games at every stage of the Cup, which isn&#8217;t very practical. Coyne discusses an (old) paper demonstrating that the occurrence of goals during World Cup matches is well fit by a poisson distribution, allowing one to calculate the probability of various unjust outcomes taking place (which turn out to be surprisingly high).</p>
<p>The curious thing, I think, is that it&#8217;s not really clear that sporting fans really <em>do</em> want the best team to come out on top. We don&#8217;t want outcomes to be determined by a coin toss, of course; it would kind of suck if, say, New Zealand had as much chance of lifting the cup as Brazil did. But it would also be pretty boring if it were a foregone conclusion that Brazil was going to win it all every time around. We want events to make sense, but we don&#8217;t want them to be <em>too</em> predictable. I suppose you could tell an interesting <a href="http://en.wikipedia.org/wiki/Reinforcement_learning">prediction error</a> story about this kind of thing&#8211;e.g., that maximally engaging stimuli may be ones that<em> seem</em> to occur systematically yet defy easy explanation&#8211;but it&#8217;s probably more fun to sit around and curse at the television set as the Netherlands make short work of the <a href="http://en.wikipedia.org/wiki/Brazil_soccer_team">Samba Kings</a> (I don&#8217;t know if anyone actually uses that nickname; I just picked it off Wikipedia to make it look like I know what I&#8217;m talking about). Go Oranje!</p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/XpT43Wvc6LQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/07/06/this-year-i-backed-new-zealand-to-go-all-the-way/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/07/06/this-year-i-backed-new-zealand-to-go-all-the-way/</feedburner:origLink></item>
		<item>
		<title>speaking the language of personality</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/G_oNHdlUJ3E/</link>
		<comments>http://www.talyarkoni.org/blog/2010/07/01/speaking-the-language-of-personality/#comments</comments>
		<pubDate>Fri, 02 Jul 2010 05:42:15 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[personality]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[bloggers]]></category>
		<category><![CDATA[EAR]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[narcissism]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=635</guid>
		<description><![CDATA[Nick Holtzman and I have a very short piece in the latest ARP (Association for Research in Personality) newsletter. We touch on some recent work at the interface of personality and language, including Nick&#8217;s work on narcissism, Mathias Mehl and Simine Vazire&#8216;s work with the Electronically-Activated Recorder (EAR), and a recent paper of mine that [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://nickholtzman.com">Nick Holtzman</a> and I have <a href="http://www.personality-arp.org/newsletter05/article_language.html">a very short piece</a> in the latest ARP (Association for Research in Personality) <a href="http://www.personality-arp.org/newsletter05/index.html">newsletter</a>. We touch on some recent work at the interface of personality and language, including Nick&#8217;s <a href="http://nickholtzman.com/Holtzman%20Vazire%20Mehl%20in%20press%20Sounds%20like%20a%20narcissist%20Behavioral%20Manifestions%20of%20narcissism%20in%20everyday%20life.pdf">work on narcissism</a>, <a href="http://dingo.sbs.arizona.edu/~mehl/">Mathias Mehl</a> and <a href="http://www.simine.com">Simine Vazire</a>&#8216;s work with the <a href="http://dingo.sbs.arizona.edu/~mehl/EAR.htm">Electronically-Activated Recorder</a> (EAR), and a recent paper of mine that explored <a href="http://talyarkoni.org/papers/Yarkoni_blogging_JRP_in_press.pdf">the relation between personality and word use in a large sample of bloggers</a> (who knew bloggers had personalities!?). Then we speculate wildly about where this line of research might go in future. I find I quite enjoy speculating wildly; it&#8217;s much more pleasant than having to write factually correct, well-researched sentences.</p>
<p>At any rate, facts or no facts, I think it&#8217;s a nice (and short) read. And I think I can say that without much bias, since I mostly just lounged around while Nick did all the hard work.</p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/G_oNHdlUJ3E" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/07/01/speaking-the-language-of-personality/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/07/01/speaking-the-language-of-personality/</feedburner:origLink></item>
		<item>
		<title>will trade two Methods sections for twenty-two subjects worth of data</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/M3J5VoUOrdc/</link>
		<comments>http://www.talyarkoni.org/blog/2010/07/01/630/#comments</comments>
		<pubDate>Thu, 01 Jul 2010 06:08:43 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[academics]]></category>
		<category><![CDATA[opinion]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[benchwork]]></category>
		<category><![CDATA[care bears]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[division of labor]]></category>
		<category><![CDATA[preferences]]></category>
		<category><![CDATA[programming]]></category>
		<category><![CDATA[writing]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=630</guid>
		<description><![CDATA[The excellent and ever-candid Candid Engineer in Academia has an interesting post discussing the love-hate relationship many scientists who work in wet labs have with benchwork. She compares two very different perspectives: She [a current student] then went on to say that, despite wanting to go to grad school, she is pretty sure she doesn&#8217;t [...]]]></description>
			<content:encoded><![CDATA[<p>The excellent and ever-candid <a href="http://candidengineer.blogspot.com">Candid Engineer in Academia</a> has an interesting post discussing <a href="http://candidengineer.blogspot.com/2010/06/loving-bench.html">the love-hate relationship many scientists who work in wet labs have with benchwork</a>. She compares two very different perspectives:</p>
<blockquote><p>She [a current student] then went on to say that, despite wanting to go to grad school, she is pretty sure she doesn&#8217;t want to continue in academia beyond the Ph.D. because she just loves doing the science so much and she can&#8217;t imagine ever not being at the bench.</p>
<p>Being young and into the benchwork, I remember once asking my grad advisor if he missed doing experiments. His response: &#8220;Hell no.&#8221; I didn&#8217;t understand it at the time, but now I do. So I wonder if my student will always feel the way she does now- possessing of that unbridled passion for the pipet, that unquenchable thirst for the cell culture hood.</p></blockquote>
<p>Wet labs are pretty much nonexistent in psychology&#8211;I&#8217;ve never had to put on gloves or goggles to do anything that I&#8217;d consider an &#8220;experiment&#8221;, and I&#8217;ve certainly never run the risk of  <a href="http://science-professor.blogspot.com/2010/03/my-grandmother-was-right.html">spilling dangerous chemicals all over myself</a>&#8211;so I have no opinion at all about benchwork. Maybe I&#8217;d love it, maybe I&#8217;d hate it; I couldn&#8217;t tell you. But Candid Engineer&#8217;s post did get me thinking about opinions surrounding the psychological equivalent of benchwork&#8211;namely, collecting data form human subjects. My sense is that there&#8217;s somewhat more consensus among psychologists, in that most of us don&#8217;t seem to like data collection very much. But there are plenty of exceptions, and there certainly are strong feelings on both sides.</p>
<p>More generally, I&#8217;m perpetually amazed at the wide range of opinions people can hold about the various elements of scientific research, even when the people doing the different-opinion-holding all work in very similar domains. For instance, my favorite aspect of the research I do, hands down, is data analysis. I&#8217;d be ecstatic if I could analyze data all day and never have to worry about actually communicating the results to anyone (though I enjoy doing that too). After that, there are activities like writing and software development, which I spend a lot of time doing, and occasionally enjoy, but also frequently find very frustrating. And then, at the other end, there are aspects of research that I find have little redeeming value save for their instrumental value in supporting other, more pleasant, activities&#8211;nasty, evil activities like writing <a href="http://en.wikipedia.org/wiki/Institutional_review_board">IRB</a> proposals and, yes, collecting data.</p>
<p>To me, collecting data is something you do because you&#8217;re fundamentally interested in some deep (or maybe not so deep) question about how the mind works, and the only way to get an answer is to actually interrogate people while they do stuff in a controlled environment. It isn&#8217;t something I do for <em>fun</em>. Yet I know people who genuinely seem to love collecting data&#8211;or, for that matter, writing Methods sections or designing new experiments&#8211;even as they loathe perfectly pleasant activities like, say, sitting down to analyze the data they&#8217;ve collected, or writing a few lines of code that could save them hours&#8217; worth of manual data entry. On a personal level, I find this almost incomprehensible: how could <em>anyone</em> possibly enjoy collecting data more than actually crunching the numbers and learning new things? But I know these people exist, because I&#8217;ve talked to them. And I recognize that, from their perspective, <em>I&#8217;m</em> the guy with the strange views. They&#8217;re sitting there thinking: what kind of joker actually <em>likes</em> to turn his data inside out several dozen times? What&#8217;s wrong with just running a simple t-test and writing up the results as fast as possible, so you can get back to the pleasure of designing and running new experiments?</p>
<p>This of course leads us directly to the <a href="http://scienceblogs.com/drugmonkey/2008/06/academic_science_not_a_care_be.php">care bears fucking tea party</a> moment where I tell you how wonderful it is that we all have these different likes and dislikes. I&#8217;m not being sarcastic; it really<em> is</em> great. Ultimately, it works to everyone&#8217;s advantage that we enjoy different things, because it means we get to collaborate on projects and take advantage of complementary strengths and interests, instead of all having to fight over who gets to write the same part of the Methods section. It&#8217;s good that there are some people who love benchwork and some people who hate it, and it&#8217;s good that there are people who&#8217;re happy to write software that other people who hate writing software can use. We don&#8217;t all have to pretend we understand each other; it&#8217;s enough just to nod and smile and say &#8220;but <em>of course</em> you can write the Methods for that paper; I really don&#8217;t mind. And yes, I <em>guess</em> I can run some additional analyses for you, really, it&#8217;s not too much trouble at all.&#8221;</p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/M3J5VoUOrdc" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/07/01/630/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/07/01/630/</feedburner:origLink></item>
		<item>
		<title>fourteen questions about selection bias, circularity, nonindependence, etc.</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/3w5BNBUzYHM/</link>
		<comments>http://www.talyarkoni.org/blog/2010/06/27/fourteen-questions-about-selection-bias-circularity-nonindependence-etc/#comments</comments>
		<pubDate>Mon, 28 Jun 2010 05:19:49 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[fmri]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[circular analysis]]></category>
		<category><![CDATA[nonindependence]]></category>
		<category><![CDATA[selection bias]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=623</guid>
		<description><![CDATA[A new paper published online this week in the Journal of Cerebral Blood Flow &#38; Metabolism this week discusses the infamous problem of circular analysis in fMRI research. The paper is aptly titled &#8220;Everything you never wanted to know about circular analysis, but were afraid to ask,&#8221; and is authored by several well-known biostatisticians and [...]]]></description>
			<content:encoded><![CDATA[<p>A new paper published online this week in the Journal of Cerebral Blood Flow &amp; Metabolism this week discusses <a href="http://www.nature.com/jcbfm/journal/vaop/ncurrent/full/jcbfm201086a.html">the infamous problem of circular analysis in fMRI research</a>. The paper is aptly titled &#8220;Everything you never wanted to know about circular analysis, but were afraid to ask,&#8221; and is authored by several well-known biostatisticians and cognitive neuroscientists&#8211;to wit, <a href="http://www.mrc-cbu.cam.ac.uk/people/nikolaus.kriegeskorte/">Niko Kriegeskorte</a>, <a href="http://www.stat.columbia.edu/~martin/">Martin Lindquist</a>, <a href="http://www.sph.umich.edu/~nichols/">Tom Nichols</a>, <a href="http://www.psy.utexas.edu/psy/faculty/Poldrack/poldrack.html">Russ Poldrack</a>, and <a href="http://www.edvul.com/">Ed Vul</a>. The paper has an interesting format, and one that I really like: it&#8217;s set up as a series of fourteen questions related to circular analysis, and each author answers each question in 100 words or less.</p>
<p>I won&#8217;t bother going over the gist of the paper, because the Neuroskeptic <a href="http://neuroskeptic.blogspot.com/2010/06/a-team-sets-fmri-to-rights.html">already  beat me to the punch</a> in an excellent post a couple of days ago  (actually, that&#8217;s how I found out about the paper); instead,  I&#8217;ll just give my own answers to the same set of questions raised in the paper. And since blog posts don&#8217;t have the same length constraints as <a href="http://www.nature.com">NPG journals</a>, I&#8217;m going to be characteristically long-winded and ignore the 100 word limit&#8230;</p>
<p><em>(1) Is circular analysis a problem in systems and cognitive neuroscience?</em></p>
<p>Yes, it&#8217;s a huge problem. That said, I think the term &#8216;circular&#8217; is somewhat misleading here, because it has the connotation than an analysis is completely vacuous. Truly circular analyses&#8211;i.e., those where an initial analysis is performed, and the researchers then conduct a &#8220;follow-up&#8221; analysis that literally adds <em>no</em> new information&#8211;are relatively rare in fMRI research. Much more common are cases where there&#8217;s some dependency between two different analyses, but the second one still adds <em>some</em> novel information.</p>
<p><em>(2) How widespread are slight distortions and serious errors caused by circularity in the neuroscience literature?</em></p>
<p>I think Nichols sums it up nicely here:</p>
<blockquote><p>TN: False positives due to circularity are minimal; biased estimates of effect size are common. False positives due to brushing off the multiple testing problem (e.g., ‘P&lt;0.001 uncorrected’ and crossing your fingers) remain pervasive.</p></blockquote>
<p>The only thing I&#8217;d add to this is that the bias in effect size estimates is not only common, but, in most cases, is probably <a href="http://www.talyarkoni.org/blog/2009/11/21/ioannidis-on-effect-size-inflation-with-guest-appearance-by-bozo-the-clown/">very large</a>.</p>
<p><em>(3) Are circular estimates useful measures of effect size?</em></p>
<p>Yes and no. They&#8217;re less useful than unbiased measures of effect size. But given that the vast majority of effects reported in whole-brain fMRI analyses (and, more generally, <a href="http://www.dcscience.net/ioannidis-associations-2008.pdf">analyses in most fields</a>) are likely to be inflated to some extent, the only way to ensure we don&#8217;t rely on circular estimates of effect size would be to disregard effect size estimates entirely, which doesn&#8217;t seem prudent.</p>
<p><em>(4) Should circular estimates of effect size be presented in papers and, if so, how?</em></p>
<p>Yes, because the only principled alternatives are to either (a) <em>never</em> report effect sizes (which seems much too drastic), or (b) report the results of every single test performed, irrespective of the result (i.e., to never give selection bias an opportunity to rear its head). Neither of these is reasonable. We should generally report effect sizes for all key effects, but they should be accompanied by appropriate confidence intervals. As Lindquist notes:</p>
<blockquote><p>In general, it may be useful to present any effect size estimate as confidence intervals, so that readers can see for themselves how much uncertainty is related to the point estimate.</p></blockquote>
<p>A key point I&#8217;d add is that the width of the reported CIs should match the threshold used to identify results in the first place. In other words, if you conduct a whole brain analysis at p &lt; .001, you should report all resulting effects with 99.9% CIs, and not 95% CIs. I think this simple step would go a considerable ways towards conveying the true uncertainty surrounding most point estimates in fMRI studies.</p>
<p><em>(5) Are effect size estimates important/useful for neuroscience research, and why?</em></p>
<p>I think my view here is closest to Ed Vul&#8217;s:</p>
<blockquote><p>Yes, very much so. Null-hypothesis testing is insufficient for most goals of neuroscience because it can only indicate that a brain region is involved to some nonzero degree in some task contrast. This is likely to be true of most combinations of task contrasts and brain regions when measured with sufficient power.</p></blockquote>
<p>I&#8217;d go further than Ed does though, and say that in a sense, effect size estimates are the <em>only </em>things that matter. As Ed notes, there are few if any cases where it&#8217;s plausible to suppose that the effect of some manipulation on brain activation is really <em>zero</em>. The brain is a very dense causal system&#8211;almost any change in one variable is going to have downstream effects on many, and perhaps most, others. So the real question we care about is almost never &#8220;is there or isn&#8217;t there an effect,&#8221; it&#8217;s whether there&#8217;s an effect that&#8217;s big enough to actually care about. (This problem isn&#8217;t specific to fMRI research, of course; it&#8217;s been a persistent source of criticism of null hypothesis significance testing for many decades.)</p>
<p>People sometimes try to deflect this concern by saying that they&#8217;re not trying to make any claims about how <em>big</em> an effect is, but only about whether or not one can reject the null&#8211;i.e., whether <em>any</em> kind of effect is present or not. I&#8217;ve never found this argument convincing, because whether or not you own up to it, you&#8217;re <em>always</em> making an effect size claim whenever you conduct a hypothesis test. Testing against a null of zero is equivalent to saying that you care about any effect that isn&#8217;t exactly zero, which is simply false. No one in fMRI research cares about <em>r</em> or <em>d</em> values of 0.0001, yet we routinely conduct tests whose results could be consistent with those types of effect sizes.</p>
<p>Since we&#8217;re always making implicit claims about effect sizes when we conduct hypothesis tests, we may as well make them explicit so that they can be evaluated properly. If you only care about correlations greater than 0.1, there&#8217;s no sense in hiding that fact; why not explicitly test against a null range of -0.1 to 0.1, instead of a meaningless null of zero?</p>
<p><em>(6) What is the best way to accurately estimate effect sizes from imaging data?</em></p>
<p>Use large samples, conduct multivariate analyses, report results comprehensively, use meta-analysis&#8230; I don&#8217;t think there&#8217;s any single way to ensure accurate effect size estimates, but plenty of things help. Maybe the most general recommendation is to ensure adequate power (see below), which will naturally minimize effect size inflation.</p>
<p><em>(7) What makes data sets independent? Are different sets of subjects required?</em></p>
<p>Most of the authors think (as I do too) that different sets of subjects are indeed required in order to ensure independence. Here&#8217;s Nichols:</p>
<blockquote><p>Only data sets collected on distinct individuals can be assured to be independent. Splitting an individual&#8217;s data (e.g., using run 1 and run 2 to create two data sets) does not yield independence at the group level, as each subject&#8217;s true random effect will correlate the data sets.</p></blockquote>
<p>Put differently, splitting data within subjects only eliminates measurement error, and not sampling error. You could in theory measure activation perfectly reliably (in which case the two halves of subjects&#8217; data would be perfectly correlated) and still have grossly inflated effects, simply because the multivariate distribution of scores in your sample doesn&#8217;t accurately reflect the distribution in the population. So, as Nichols points out, you always need new subjects if you want to be absolutely certain your analyses are independent. But since this generally isn&#8217;t feasible, I&#8217;d argue we should worry less about whether or not our data sets are completely independent, and more about reporting results in a way that makes the presence of any bias as clear as possible.</p>
<p><em>(8) What information can one glean from data selected for a certain effect?</em></p>
<p>I think this is kind of a moot question, since virtually all data are susceptible to <em>some </em>form of selection bias (scientists generally don&#8217;t write papers detailing all the analyses they conducted that didn&#8217;t pan out!). As I note above, I think it&#8217;s a bad idea to disregard effect sizes entirely; they&#8217;re actually what we should be focusing most of our attention on. Better to report confidence intervals that accurately reflect the selection procedure and make the uncertainty around the point estimate clear.</p>
<p><em>(9) Are visualizations of nonindependent data helpful to illustrate the claims of a paper?</em></p>
<p>Not in cases where there&#8217;s an extremely strong dependency between the selection criteria and the effect size estimate. In cases of weak to moderate dependency, visualization is fine so long as confidence bands are plotted alongside the best fit. Again, the key is to always be explicit about the limitations of the analysis and provide some indication of the uncertainty involved.</p>
<p><em>(10) Should data exploration be discouraged in favor of valid confirmatory analyses?</em></p>
<p>No. I agree with Poldrack&#8217;s sentiment here:</p>
<blockquote><p>Our understanding of brain function remains incredibly crude, and limiting research to the current set of models and methods would virtually guarantee scientific failure. Exploration of new approaches is thus critical, but the findings must be confirmed using new samples and convergent methods.</p></blockquote>
<p><em>(11) Is a confirmatory analysis safer than an exploratory analysis in terms of drawing neuroscientific conclusions?</em></p>
<p>In principle, sure, but in practice, it&#8217;s virtually impossible to determine which reported analyses really started out their lives as confirmatory analyses and which started life out as exploratory analyses and then mysteriously evolved into &#8220;a priori&#8221; predictions once the paper was written. I&#8217;m not saying there&#8217;s anything wrong with this&#8211;everyone reports results strategically to some extent&#8211;just that I don&#8217;t know that the distinction between confirmatory and exploratory analyses is all that meaningful in practice. Also, as the previous point makes clear, safety isn&#8217;t the only criterion we care about; we also want to discover new and unexpected findings, which requires exploration.</p>
<p><em>(12) What makes a whole-brain mapping analysis valid? What constitutes sufficient adjustment for multiple testing?</em></p>
<p>From a hypothesis testing standpoint, you need to ensure adequate control of the family-wise error (FWE) rate or false discovery rate (FDR). But as I suggested above, I think this only ensures validity in a limited sense; it doesn&#8217;t ensure that the results are actually going to be worth <em>caring about</em>. If you want to feel confident that any effects that survive are meaningfully large, you need to do the extra work up front and define what constitutes a meaningful effect size (and then test against that).</p>
<p><em>(13) How much power should a brain-mapping analysis have to be useful?</em></p>
<p>As much as possible! Concretely, the conventional target of 80% seems like a good place to start. But as I&#8217;ve argued before (e.g., <a href="http://talyarkoni.org/papers/Yarkoni_PPS_commentary.pdf">here</a>), that would require more than doubling conventional sample sizes in most cases. The reality is that fMRI studies are expensive, so we&#8217;re probably stuck with underpowered analyses for the foreseeable future. So we need to find other ways to compensate for that (e.g., relying more heavily on meta-analytic effect size estimates).</p>
<p><em>(14) In which circumstances are nonindependent selective analyses acceptable for scientific publication?</em></p>
<p>It depends on exactly what&#8217;s problematic about the analysis. Analyses that are truly circular and provide no new information should never be reported, but those constitute only a small fraction of all analyses. More commonly, the nonindependence simply amounts to selection bias: researchers tend to report only those results that achieve statistical significance, thereby inflating apparent effect sizes. I think the solution to this is to still report all key effect sizes, but to ensure they&#8217;re accompanied by confidence intervals and appropriate qualifiers.</p>
<p><span style="float: left; padding: 5px;"><a href="http://www.researchblogging.org"><img style="border:0;" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" alt="ResearchBlogging.org" /></a></span><span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Journal+of+cerebral+blood+flow+and+metabolism+%3A+official+journal+of+the+International+Society+of+Cerebral+Blood+Flow+and+Metabolism&amp;rft_id=info%3Apmid%2F20571517&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Everything+you+never+wanted+to+know+about+circular+analysis%2C+but+were+afraid+to+ask.&amp;rft.issn=0271-678X&amp;rft.date=2010&amp;rft.volume=&amp;rft.issue=&amp;rft.spage=&amp;rft.epage=&amp;rft.artnum=&amp;rft.au=Kriegeskorte+N&amp;rft.au=Lindquist+MA&amp;rft.au=Nichols+TE&amp;rft.au=Poldrack+RA&amp;rft.au=Vul+E&amp;rfe_dat=bpr3.included=1;bpr3.tags=Neuroscience%2CCognitive+Neuroscience%2C+Probability+and+Statistics%2C+Statistics%2C+fMRI">Kriegeskorte N, Lindquist MA, Nichols TE, Poldrack RA, &amp; Vul E (2010). Everything you never wanted to know about circular analysis, but were afraid to ask. <span style="font-style: italic;">Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism</span> PMID: <a rev="review" href="http://www.ncbi.nlm.nih.gov/pubmed/20571517">20571517</a></span></p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/3w5BNBUzYHM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/06/27/fourteen-questions-about-selection-bias-circularity-nonindependence-etc/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/06/27/fourteen-questions-about-selection-bias-circularity-nonindependence-etc/</feedburner:origLink></item>
		<item>
		<title>estimating bias in text with Ruby</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/EL2T23okVdM/</link>
		<comments>http://www.talyarkoni.org/blog/2010/06/25/estimating-bias-in-text-with-ruby/#comments</comments>
		<pubDate>Fri, 25 Jun 2010 06:29:33 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[methods]]></category>
		<category><![CDATA[software]]></category>
		<category><![CDATA[tools]]></category>
		<category><![CDATA[BEAGLE]]></category>
		<category><![CDATA[bias]]></category>
		<category><![CDATA[CASS]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[Ruby]]></category>
		<category><![CDATA[semantics]]></category>
		<category><![CDATA[vector space models]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=620</guid>
		<description><![CDATA[Over the past couple of months, I&#8217;ve been working on and off on a collaboration with my good friend Nick Holtzman and some other folks that focuses on ways to automatically extract bias from text using a vector space model. The paper is still in progress, so I won&#8217;t give much away here, except to [...]]]></description>
			<content:encoded><![CDATA[<p>Over the past couple of months, I&#8217;ve been working on and off on a collaboration with my good friend <a href="http://nickholtzman.com">Nick Holtzman</a> and <a href="http://psychweb.wustl.edu/node/444">some</a> <a href="http://psych.indiana.edu/faculty/pages/mjones.asp">other</a> <a href="http://www.psych.wustl.edu/coglab/balota.htm">folks</a> that focuses on ways to automatically extract bias from text using a <a href="http://en.wikipedia.org/wiki/Vector_space_model">vector space model</a>. The paper is still in progress, so I won&#8217;t give much away here, except to say that Nick&#8217;s figured out what I think is a pretty clever way to show that, yes, Fox likes Republicans more than Democrats, and MSNBC likes Democrats more than Republicans. It&#8217;s not meant to be a surprising result, but simply a nice validation of the underlying method, which can be flexibly applied to all sorts of interesting questions.</p>
<p>The model we&#8217;re using is a simplified variant of <a href="http://www.indiana.edu/~clcl/BEAGLE/Jones_Mewhort_PR.pdf">Jones and Mewhort&#8217;s (2007) BEAGLE</a> model. Essentially, similarity between words is quantified by looking at the degree to which words have similar co-occurrence patterns with other words. This basic idea is actually common to pretty much all vector space models, so in that sense, there&#8217;s not much new here (there&#8217;s plenty that&#8217;s new in Jones and Mewhort (2007), but we&#8217;re mostly leaving those features out for the sake of simplicity and computational speed). The novel aspect is the contrast coding of similarity terms in order to produce bias estimates. But you&#8217;ll have to wait for the paper to read more about that.</p>
<p>In the meantime, one thing we&#8217;ve tried to do is develop software that can be used to easily implement the kind of analyses we describe in the paper. With plenty of input from Nick and Mike Jones, I&#8217;ve written a set of tools in <a href="http://www.ruby-lang.org/en/">Ruby</a> that&#8217;s now freely available for download <a href="http://casstools.org">here</a>. The tools are actually bundled as a Ruby <a href="http://rubygems.org/">gem</a>, so installation should be a snap on most platforms. We&#8217;re still working on documentation, so there&#8217;s no full-blown manual yet, but the <a href="http://casstools.org/doc/">quick-start guide</a> should be sufficient to get many users up and running. And for people who share my love of Ruby and are interested in using the tools programmatically, there&#8217;s a fairly well-commented <a href="http://casstools.org/doc/">RDoc</a>.</p>
<p>The code should really be considered an <a href="http://en.wikipedia.org/wiki/Alpha_release">alpha release</a> at the moment; I&#8217;m sure there are plenty of bugs (if you find any, <a href="mailto:tyarkoni@gmail.com">email</a> me!), and the feature set is currently pretty limited. Hopefully it&#8217;ll grow over time. I also plan to throw the code up on <a href="https://github.com/">GitHub</a> at some point in the near future so that anyone who&#8217;s interested can help out with the development. In the meantime, if you&#8217;re interested in semantic space models and want to play around with a crude (but relatively fast) implementation of one, there&#8217;s a (very) small chance you might find these tools useful.</p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/EL2T23okVdM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/06/25/estimating-bias-in-text-with-ruby/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/06/25/estimating-bias-in-text-with-ruby/</feedburner:origLink></item>
		<item>
		<title>not really a pyramid scheme; maybe a giant cesspool of little white lies?</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/eWND-amQroM/</link>
		<comments>http://www.talyarkoni.org/blog/2010/06/23/not-really-a-pyramid-scheme-maybe-a-giant-cesspool-of-little-white-lies/#comments</comments>
		<pubDate>Wed, 23 Jun 2010 06:36:27 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[academic life]]></category>
		<category><![CDATA[ethics]]></category>
		<category><![CDATA[training]]></category>
		<category><![CDATA[academia]]></category>
		<category><![CDATA[game theory]]></category>
		<category><![CDATA[jobs]]></category>
		<category><![CDATA[phd programs]]></category>
		<category><![CDATA[pyramid schemes]]></category>
		<category><![CDATA[tenure track]]></category>
		<category><![CDATA[Walt Whitman]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=614</guid>
		<description><![CDATA[There&#8217;s a long tradition in the academic blogosphere (and the offlinesphere too, I presume) of complaining that academia is a pyramid scheme. In a strict sense, I guess you could liken academia to a pyramid scheme, inasmuch as there are fewer open positions at each ascending level, and supply generally exceeds demand. But as The [...]]]></description>
			<content:encoded><![CDATA[<p>There&#8217;s a long tradition in the academic blogosphere (and the offlinesphere too, I presume) of complaining that academia is a pyramid scheme. In a strict sense, I guess you <em>could</em> liken academia to a pyramid scheme, inasmuch as there are fewer open positions at each ascending level, and supply generally exceeds demand. But as The Prodigal Academic points out in a post today, <a href="http://theprodigalacademic.blogspot.com/2010/06/academia-and-pyramid-schemes.html">this phenomenon is hardly exclusive to academia</a>:</p>
<blockquote><p>I guess I don&#8217;t really see much difference between academic job hunting, and job hunting in general. Starting out with undergrad admissions, there are many more qualified people for desirable positions than available slots. Who gets those slots is a matter of hard work (to get qualified) and luck (to be one of the qualified people who is &#8220;chosen&#8221;). So how is the TT any different from grad school admissions (in ANY prestige program), law firm partnership, company CEO, professional artist/athlete/performer, attending physician, investment banking, etc? The pool of qualified applicants is many times larger than the number of slots, and there are desirable perks to success (money/prestige/fame/security/intellectual freedom) making the supply of those willing to try for the goal pretty much infinite.</p>
<p>Maybe I have rose colored glasses on because I have always been lucky enough to find a position in research, but there are no guarantees in life. When I was interviewing in industry, I saw many really interesting jobs available to science PhD holders that were not in research. If I hadn&#8217;t gone to National Lab, I would have been happy to take on one of those instead. Sure, my life would be different, but it wouldn&#8217;t make my PhD a waste of time or a failed opportunity.</p></blockquote>
<p>For the most part, I agree with this sentiment. I love doing research, and can&#8217;t imagine ever voluntarily leaving academia. But If I do end up having to leave&#8211;meaning, if I can&#8217;t find a faculty position when I go on the job market in the next year or two&#8211;I don&#8217;t think it&#8217;ll be the end of the world. I see job ads in industry all the time that looks really interesting, and on some level, I think I&#8217;d find almost any job that involves creative analysis of very large datasets (which there are plenty of these days!) pretty gratifying. And no matter what happens, I don&#8217;t think I&#8217;d ever view the time I&#8217;ve spent on my PhD and postdoc training as a waste of time, for the simple reason that I&#8217;ve really enjoyed most of it (there are, of course, the nasty bits, like writing the <em>N</em>th chapter of a dissertation&#8211;but those are transient, fortunately). So in that sense, I think all the talk about academia being a pyramid scheme is kind of silly.</p>
<p>That said, there is one sticking point to the standard pyramid scheme argument I do agree with, which is that, when you&#8217;re starting out as a graduate student, no one really goes out of their way to tell you what the odds of getting a tenure-track faculty position actually are (and they&#8217;re not good). The problem being that most of the professors that prospective graduate students have interacted with, either as undergraduates, or in the context of applying to grad school, are precisely those lucky souls who&#8217;ve managed to secure faculty positions. So the difficulty of obtaining the same type of position isn&#8217;t always very salient to them.</p>
<p>I&#8217;m not saying faculty members <em>lie</em> outright to prospective graduate students, of course; I don&#8217;t doubt that if you asked most faculty point blank &#8220;what proportion of students in your department have managed to find tenure-track positions,&#8221; they&#8217;d give you an honest answer. But when you&#8217;re 22 or 23 years old (and yes, I recognize some graduate students are much older, but this is the mode) and you&#8217;re thinking of a career in research, it doesn&#8217;t always occur to you to ask that question. And naturally, departments that are trying to recruit your services are unlikely to begin their pitch by saying, &#8220;in the past 10 years, only about 12% of our graduates have gone on to tenure-track faculty positions&#8221;. So in that sense, I don&#8217;t think new graduate students are always aware of just how difficult it is to obtain an independent research position, statistically speaking. That&#8217;s not a problem for the (many) graduate students who don&#8217;t really have any intention of going into academia anyway, but I do think a large part of the disillusionment graduate students often experience is about the realization that you can bust your ass for five or six years working sixty hours a week, and still have no guarantee of finding a research job when you&#8217;re done. And that could be avoided to some extent by making a concerted effort to inform students up front of the odds they face if they&#8217;re planning on going down that path. So long as that information is made readily available, I don&#8217;t really see a problem.</p>
<p>Having said that, I&#8217;m now going to blatantly contradict myself (so what if I do? <a href="http://www.daypoems.net/plainpoems/1900.html">I am large! I contain multitudes!</a>). You could, I think, reasonably argue that this type of deception <em>isn&#8217;t</em> really a problem, and that it&#8217;s actually <em>necessary</em>. For one thing, the white lies cut both ways. It isn&#8217;t just faculty who conveniently forget to mention that relatively few students will successfully obtain tenure-track positions; many graduate students nod and smile when asked if they&#8217;re planning a career in research, despite having no intention of continuing down that path past the PhD. I&#8217;ve occasionally heard faculty members complain that they need to do a better job filtering out those applicants who <em>really truly</em> are interested in a career in research, because they&#8217;re losing a lot of students to industry at the tail end. But I think this kind of magical mind-reading filter is a pipe dream, for precisely the reasons outlined above: if faculty aren&#8217;t willing to begin their recruitment speeches by saying &#8220;most of you probably won&#8217;t get research positions even if you want them,&#8221; they shouldn&#8217;t really complain when most students don&#8217;t come right out and say &#8220;actually, I just want a PhD because I think it&#8217;ll be something interesting to do for a few years and then I&#8217;ll be able to find a decent job with better hours later&#8221;.</p>
<p>The reality is that the whole enterprise may actually <em>require</em> subtle misdirection about people&#8217;s intentions. If every student applying to grad school knew exactly what the odds of getting a research position were, I imagine many fewer people who were serious about research would bother applying; you&#8217;d then get predominantly people who don&#8217;t really want to do research anyway. And if you <em>could</em> magically weed out the students who don&#8217;t want to do research, then (a) there probably wouldn&#8217;t be enough highly qualified students left to keep research programs afloat, and/or (b) there would be even more candidates applying for research positions, making things even harder for those students who <em>do</em> want careers in research. There&#8217;s probably no magical allocation of resources that optimizes everyone&#8217;s needs simultaneously; it could be that we&#8217;re more or less at a stable equilibrium point built on little white lies.</p>
<p><a href="http://en.wiktionary.org/wiki/TLDR">tl;dr</a> : I don&#8217;t think academia is really a pyramid scheme; more like a giant cesspool of little white lies and subtle misinformation that indirectly serves most people&#8217;s interests. So, basically, it&#8217;s kind of like most other domains of life that involve interactions between many groups of people.</p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/eWND-amQroM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/06/23/not-really-a-pyramid-scheme-maybe-a-giant-cesspool-of-little-white-lies/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/06/23/not-really-a-pyramid-scheme-maybe-a-giant-cesspool-of-little-white-lies/</feedburner:origLink></item>
		<item>
		<title>and the runner up is…</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/Z_4tj5D6BSQ/</link>
		<comments>http://www.talyarkoni.org/blog/2010/06/22/and-the-runner-up-is/#comments</comments>
		<pubDate>Tue, 22 Jun 2010 06:27:14 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[general silliness]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[publishing]]></category>
		<category><![CDATA[impact factor]]></category>
		<category><![CDATA[journals]]></category>
		<category><![CDATA[rankings]]></category>
		<category><![CDATA[thomson-reuters]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=609</guid>
		<description><![CDATA[This one&#8217;s a bit of a head-scratcher. Thomson-Reuters just released its 2009 Journal Citation Report&#8211;essentially a comprehensive ranking of scientific journals by their impact factor (IF). The odd part, as reported by Bob Grant in The Scientist, is that the journal with the second-highest IF is Acta Crystallographica &#8211; Section A&#8211;ahead of heavyweights like the [...]]]></description>
			<content:encoded><![CDATA[<p>This one&#8217;s a bit of a head-scratcher. Thomson-Reuters just released its 2009 Journal Citation Report&#8211;essentially a comprehensive ranking of scientific journals by their impact factor (IF). The odd part, <a href="http://www.the-scientist.com/blog/display/57500/">as reported by Bob Grant in The S</a><a href="http://www.the-scientist.com/blog/display/57500/">cientist</a>, is that the journal with the second-highest IF is <em>Acta Crystallographica &#8211; Section A</em>&#8211;ahead of heavyweights like the <em>New England Journal of Medicine</em>. For perspective, the same journal had an IF of 2.051 in 2008. The reason for the jump?</p>
<blockquote><p>A single article published in a 2008 issue of the journal seems to be  responsible for the meteoric rise in the <em>Acta Crystallographica &#8211;  Section A</em>&#8216;s impact factor. <a href="http://www3.interscience.wiley.com/journal/119398457/abstract">&#8220;A  short history of SHELX,&#8221;</a> by University of Göttingen crystallographer  <a href="http://shelx.uni-ac.gwdg.de/%3Csub%3Egsheldr/">George  Sheldrick,</a> which reviewed the development of the computer system  SHELX, has been cited more than 6,600 times, according to ISI. This  paper includes a sentence that essentially instructs readers to cite the  paper they&#8217;re reading &#8212; &#8220;This paper could serve as a general  literature citation when one or more of the open-source SHELX programs  (and the Bruker AXS version SHELXTL) are employed in the course of a  crystal-structure determination.&#8221; (Note: This may be a good way to boost  your citations.)</p></blockquote>
<p>Setting aside the good career advice (and yes, I&#8217;ve made a mental note to include the phrase &#8220;this paper could serve as a general literature citation&#8230;&#8221; in my next paper), it&#8217;s perplexing that Thomson-Reuters didn&#8217;t downweight <em>Acta Crystallographica</em>&#8216;s IF considerably given the obvious outlier. There&#8217;s no question they would have noticed that the second-ranked journal was only there in virtue of one article, so I&#8217;m curious what the thought process was. Perhaps the deliberation went something like this:</p>
<blockquote><p>Thomson-Reuters statistician A: We need to take it out! We can&#8217;t have a journal with an impact factor of 2 last year beat out the NEJM!</p>
<p>Thomson-Reuters statistician B: But if we take it out, it&#8217;ll look like we tampered with the IF!</p>
<p>TRS-A: But we already tamper with the IF! No one knows how we come up with these numbers! <a href="http://jgp.rupress.org/content/131/1/3.full">Sometimes we can&#8217;t even replicate our own results ourselves</a>! And anyway, it&#8217;s really not a big deal if we just leave the article in; scientists know better than to think Acta Crystallographica is the second most influential science journal on the planet. They&#8217;ll figure it out.</p>
<p>TRS-B: But that&#8217;s like asking them to just disregard our numbers! If you&#8217;re supposed to ignore the impact factor in cases where it contradicts your perception of journal quality, what&#8217;s the point of having an impact factor at all?</p>
<p>TRS-A: Beats me.</p></blockquote>
<div style="overflow: hidden; color: #000000; background-color: transparent; text-align: left; text-decoration: none; border: medium none;">So okay, I&#8217;m sure it didn&#8217;t go down quite like that. But it&#8217;s still pretty weird.</div>
<div style="overflow: hidden; color: #000000; background-color: transparent; text-align: left; text-decoration: none; border: medium none;"></div>
<div style="overflow: hidden; color: #000000; background-color: transparent; text-align: left; text-decoration: none; border: medium none;">And now, having bitched about how arbitrary the IF is, I&#8217;m going to go off and spend the next 15 minutes perusing the psychology and neuroscience journal rankings&#8230;</div>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/Z_4tj5D6BSQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/06/22/and-the-runner-up-is/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/06/22/and-the-runner-up-is/</feedburner:origLink></item>
	</channel>
</rss>
