<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/rss2full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>[citation needed]</title>
	
	<link>http://www.talyarkoni.org/blog</link>
	<description>...and a clue would also be helpful.</description>
	<lastBuildDate>Fri, 03 Sep 2010 07:59:45 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/rss+xml" href="http://feeds.feedburner.com/citationNeeded" /><feedburner:info uri="citationneeded" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><creativeCommons:license>http://creativecommons.org/licenses/by-nc-sa/3.0/</creativeCommons:license><item>
		<title>does functional specialization exist in the language system?</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/nHo4pOlq2XA/</link>
		<comments>http://www.talyarkoni.org/blog/2010/09/03/does-functional-specialization-exist-in-the-language-system/#comments</comments>
		<pubDate>Fri, 03 Sep 2010 07:59:45 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[fmri]]></category>
		<category><![CDATA[methods]]></category>
		<category><![CDATA[ffa]]></category>
		<category><![CDATA[functional specialization]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[science on the web]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=668</guid>
		<description><![CDATA[One of the central questions in cognitive neuroscience&#8211;according to some people, at least&#8211;is how selective different chunks of cortex are for specific cognitive functions. The paradigmatic examples of functional selectivity are pretty much all located in sensory cortical regions or adjacent association cortices. For instance, the fusiform face area (FFA), is so named because it [...]]]></description>
			<content:encoded><![CDATA[<p>One of the central questions in cognitive neuroscience&#8211;according to some people, at least&#8211;is how selective different chunks of cortex are for specific cognitive functions. The paradigmatic examples of functional selectivity are pretty much all located in sensory cortical regions or adjacent association cortices. For instance, the <a href="http://en.wikipedia.org/wiki/Fusiform_face_area">fusiform face area</a> (FFA), is so named because it (allegedly) responds selectively to faces but not to other stimuli. Other regions with varying selectivity profiles are similarly named: the visual word form area (VWFA), parahippocampal place area (PPA), extrastriate body area (EBA), and so on.</p>
<p>In a recent review paper, <a href="http://web.mit.edu/evelina9/www/Downloads/Journal%20pubs/Fedorenko%20&amp;%20Kanwisher%202009.pdf">Fedorenko and Kanwisher (2009)</a> sought to apply insights from the study of functionally selective visual regions to the study of language. They posed the following question with respect to the neuroimaging of language in the title of their paper: <em>Why hasn&#8217;t a clearer picture emerged?</em> And they gave the following answer: <em>it&#8217;s because brains differ from one another, stupid</em>.</p>
<p>Admittedly, I&#8217;m paraphrasing; they don&#8217;t use <em>exactly</em> those words. But the basic point they make is that it&#8217;s difficult to identify functionally selective regions when you&#8217;re averaging over a bunch of very different brains. And the solution they propose&#8211;again, imported from the study of visual areas&#8211;is to identify potentially selective language regions-of-interest (ROIs) on a subject-specific basis rather than relying on group-level analyses.</p>
<p>The Fedorenko and Kanwisher paper apparently didn&#8217;t please Greg Hickok of <a href="http://www.talkingbrains.org/">Talking Brains</a>, who&#8217;s done a lot of very elegant work on the neurobiology of language.  A summary of <a href="http://www.talkingbrains.org/2010/08/neuroimaging-of-language-why-hasnt.html">Hickok&#8217;s take:</a></p>
<blockquote><p>What I found a bit on the irritating side though was the extremely dim  and distressingly myopic view of progress in the field of the neural  basis of language.</p></blockquote>
<p>He objects to Fedorenko and Kanwisher on several grounds, and the post is well worth reading. But since I&#8217;m very <span style="text-decoration: line-through;">lazy</span> tired, I&#8217;ll just summarize his points as follows:</p>
<ul>
<li>There&#8217;s more functional specialization in the language system than F&amp;K give the field credit for</li>
<li>The use of subject-specific analyses in the domain of language isn&#8217;t new, and many researchers (including Hickok) have used procedures similar to those F&amp;K recommend in the past</li>
<li>Functional selectivity is not necessarily a criterion we should care about all that much anyway</li>
</ul>
<p>As you might expect, F&amp;K disagree with Hickok on these points, and Hickok was kind enough to post <a href="http://www.talkingbrains.org/2010/08/response-from-fedorenko-kanwisher.html">their response</a>. He then responded to their response in <a href="https://www.blogger.com/comment.g?blogID=9048879464910781933&amp;postID=7464140007627040086&amp;isPopup=true">the comments</a> (which are also worth reading), which in turn spawned a back-and-forth with F&amp;K, a cameo by <a href="http://flowbrain.blogspot.com">Brad Buchsbaum</a> (who posted his own excellent thoughts on the matter <a href="http://flowbrain.blogspot.com/2010/09/more-work-needed.html">here</a>), and eventually, an intervention by a team of professional arbitrators. Okay, I made that last bit up; it was a very civil disagreement, and is exactly what scientific debates on the internet <em>should</em> look like, in my opinion.</p>
<p>Anyway, rather than revisit the entire thread, which you can read for yourself, I&#8217;ll just summarize my thoughts:</p>
<ul>
<li>On the whole, I think my view lines up pretty closely with Hickok&#8217;s and Buchsbaum&#8217;s. Although I&#8217;m very far from an expert on the neurobiology of language (is there a word in English for someone&#8217;s who&#8217;s the diametric opposite of an expert&#8211;i.e., someone who consistently and confidently asserts exactly the wrong thing? Cause that&#8217;s what I am), I agree with Hickok&#8217;s argument that the temporal poles show a response profile that looks suspiciously like sentence- or narrative-specific processing (I have a paper on <a href="http://talyarkoni.org/papers/yarkoni_speer_zacks.pdf">the neural mechanisms of narrative comprehension</a> that supports that claim to some extent), and think F&amp;K&#8217;s review of the literature is probably not as balanced as it could have been.</li>
<li>More generally, I agree with Hickok that demonstrating functional specialization isn&#8217;t necessarily that important to the study of language (or most other domains). This seems to be a major point of contention for F&amp;K, but I don&#8217;t think they make a very strong case for their view. They suggest that they &#8220;are not sure what other goals (besides understanding a region&#8217;s  computations) could drive studies aimed at understanding how  functionally specialized a region is,&#8221; which I think is reasonable, but affirms the consequent. Hickok isn&#8217;t saying there&#8217;s no reason to search for functional specialization in the F&amp;K sense; as I read him, he&#8217;s simply saying that you can study the nature of neural computation in lots of interesting ways that don&#8217;t require you to demonstrate functional specialization to the degree F&amp;K seem to require. Seems hard to disagree with that.</li>
<li>Buchsbaum points out that it&#8217;s questionable whether there are<em> any</em> brain regions that meet the criteria F&amp;K set out for functional specialization&#8211;namely that &#8220;A brain region R is  specialized for cognitive function x if this region (i) is engaged in  tasks that rely on cognitive function x, and (ii) is not engaged in  tasks that do not rely on cognitive function x.Buchsbaum and Hickok both point out that the two examples F&amp;K give of putatively specialized regions (the FFA and the temporo-parietal junction, which some people believe is selectively involved in theory of mind) are hardly uncontroversial. Plenty of people have argued that the FFA <em>isn&#8217;t </em>really selective to faces, and even more people have argued that the TPJ isn&#8217;t selective to theory of mind. As far as I can tell, F&amp;K don&#8217;t really address this issue in the comments. They do refer to a recent paper of Kanwisher&#8217;s that discusses the evidence for functional specificity in the FFA, but I&#8217;m not sure the argument made in that paper is itself uncontroversial, and in any case, Kanwisher does concede that there&#8217;s good evidence for at least some representation of non-preferred stimuli (i.e., non-faces in the FFA). In any case, the central question here is whether or not F&amp;K really unequivocally believe that FFA and TPJ aren&#8217;t engaged by any tasks that don&#8217;t involve face or theory of mind processing. If not, then it&#8217;s unfair to demand or expect the same of regions implicated in language.</li>
<li>Although I think there&#8217;s a good deal to be said for subject-specific analyses, I&#8217;m not as sanguine as F&amp;K that a subject-specific approach offers a remedy to the problems that they perceive afflict the study of the neural mechanisms of language. While there&#8217;s no denying that group analyses suffer from a number of limitations, subject-specific analyses have their own weaknesses, which F&amp;K don&#8217;t really mention in their paper. One is that such analyses typically require the assumption that two clusters located in slightly different places for different subjects must be carrying out the same cognitive operations if they respond similarly to a localizer task. That&#8217;s a very strong assumption for which there&#8217;s very little evidence (at least in the language domain)&#8211;especially because the localizer task F&amp;K promote in <a href="http://web.mit.edu/evelina9/www/Downloads/Journal%20pubs/Fedorenko%20et%20al%202010%20JN%20with%20suppl%20figs.pdf">this paper</a> involves a rather strong manipulation that may confound several different aspects of language processing.<br />
Another problem is that it&#8217;s not at all obvious how you determine which regions are the &#8220;same&#8221; (in their <a href="http://web.mit.edu/evelina9/www/Downloads/Journal%20pubs/Fedorenko%20et%20al%202010%20JN%20with%20suppl%20figs.pdf">2010 paper</a>, F&amp;K argue for an algorithmic parcellation approach, but the fact that you get sensible-looking results is no guarantee that your parcellation actually reflects meaningful functional divisions in individual subjects). And yet another is that serious statistical problems can arise in cases where one or more subjects fail to show activation in a putative region (which is generally the norm rather than the exception). Say you have 25 subjects in your sample, and 7 don&#8217;t show activation anywhere in a region that can broadly be called Broca&#8217;s area. What do you do? You can&#8217;t just throw those subjects out of the analysis, because that would grossly and misleadingly inflate your effect sizes. Conversely, you can&#8217;t just identify any old region that <em>does</em> activate and lump it in with the regions identified in all the other subjects. This is a very serious problem, but it&#8217;s one that group analyses, for all their weaknesses, don&#8217;t have to contend with.</li>
</ul>
<p>Disagreements aside, I think it&#8217;s really great to see serious scientific discussion taking place in this type of forum. In principle, this is the kind of debate that should be resolved (or not) in the peer-reviewed literature; in practice, peer review is slow, writing full-blown articles takes time, and journal space is limited. So I think blogs have a really important role to play in scientific communication, and frankly, I envy Hickok and Poeppel for the excellent discussion they consistently manage to stimulate over at Talking Brains!</p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/nHo4pOlq2XA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/09/03/does-functional-specialization-exist-in-the-language-system/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/09/03/does-functional-specialization-exist-in-the-language-system/</feedburner:origLink></item>
		<item>
		<title>the Bactrian camel and prefrontal cortex: evidence from somatosensory function</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/3C50stHmqqA/</link>
		<comments>http://www.talyarkoni.org/blog/2010/08/30/the-bactrian-camel-and-prefrontal-cortex-evidence-from-somatosensory-function/#comments</comments>
		<pubDate>Mon, 30 Aug 2010 07:05:23 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[general silliness]]></category>
		<category><![CDATA[neuroscience]]></category>
		<category><![CDATA[cognitive neuroscience]]></category>
		<category><![CDATA[fun]]></category>
		<category><![CDATA[generative software]]></category>
		<category><![CDATA[toys]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=664</guid>
		<description><![CDATA[I&#8217;ve been swamped with work lately, and don&#8217;t expect to see the light at the end of the tunnel for a few more weeks, so there won&#8217;t be any serious blogging here for the foreseeable future. But on a completely frivolous note, someone reminded me the other day of a cognitive neuroscience paper title generator [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been swamped with work lately, and don&#8217;t expect to see the light at the end of the tunnel for a few more weeks, so there won&#8217;t be any serious blogging here for the foreseeable future. But on a completely frivolous note, someone reminded me the other day of a <a href="http://talyarkoni.org/cngen">cognitive neuroscience paper title generator</a> I wrote a few years ago and had forgotten about. So I brushed it off and added a small amount of new content, and now it&#8217;s alive again <a href="http://talyarkoni.org/cngen">here</a>. I think it&#8217;s good for a few moments of entertainment, and occasionally produces a rare gem&#8211;like the one in the title of this post, or my all-time favorite, <em>Neural correlates of nicotine withdrawal in infants</em>.</p>
<p>Feel free to post any other winners in the comments&#8230;</p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/3C50stHmqqA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/08/30/the-bactrian-camel-and-prefrontal-cortex-evidence-from-somatosensory-function/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/08/30/the-bactrian-camel-and-prefrontal-cortex-evidence-from-somatosensory-function/</feedburner:origLink></item>
		<item>
		<title>trouble with biomarkers and press releases</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/RPAev64JuGA/</link>
		<comments>http://www.talyarkoni.org/blog/2010/08/15/trouble-with-biomarkers-and-press-releases/#comments</comments>
		<pubDate>Mon, 16 Aug 2010 03:25:15 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[neuroscience]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[science journalism]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[asd]]></category>
		<category><![CDATA[autism]]></category>
		<category><![CDATA[base rates]]></category>
		<category><![CDATA[biomarkers]]></category>
		<category><![CDATA[classifiers]]></category>
		<category><![CDATA[mri]]></category>
		<category><![CDATA[science reporting]]></category>
		<category><![CDATA[sensitivity]]></category>
		<category><![CDATA[specificity]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=660</guid>
		<description><![CDATA[The latest issue of the Journal of Neuroscience contains an interesting article by Ecker et al in which the authors attempted to classify people with autism spectrum disorder (ASD) and health controls based on their brain anatomy, and report achieving &#8220;a sensitivity and specificity of up to 90% and 80%, respectively.&#8221; Before unpacking what that [...]]]></description>
			<content:encoded><![CDATA[<p>The latest issue of the Journal of Neuroscience contains an interesting article by Ecker et al in which the authors <a href="http://www.jneurosci.org/cgi/content/full/30/32/10612">attempted to classify people with autism spectrum disorder (ASD)</a> and health controls based on their brain anatomy, and report achieving &#8220;a sensitivity and specificity of up to 90% and 80%, respectively.&#8221; Before unpacking what that means, and why you probably shouldn&#8217;t get too excited (about the clinical implications, at any rate; the science is pretty cool), here&#8217;s a snippet from the decidedly optimistic <a href="http://www.mrc.ac.uk/Newspublications/News/MRC007083">press release</a> that accompanied the study:</p>
<p>&#8220;Scientists funded by the Medical Research Council (MRC) have developed a pioneering new method of diagnosing autism in adults. For the first time, a quick brain scan that takes just 15 minutes can identify adults with autism with over 90% accuracy. The method could lead to the screening for autism spectrum disorders in children in the future.&#8221;</p>
<p>If you think this sounds too good to be true, that&#8217;s because it is. <a href="http://www.guardian.co.uk/science/blog/2010/aug/12/autism-brain-scan-statistics">Carl Heneghan</a> explains why in an excellent article in the Guardian:</p>
<blockquote><p>How the brain scans results are portrayed is one of the simplest mistakes in interpreting diagnostic test accuracy to make. What has happened is, the sensitivity has been taken to be the positive predictive value, which is what you want to know: if I have a positive test do I have the disease? Not, if I have the disease, do I have a positive test? It would help if the results included a measure called the likelihood ratio (LR), which is the likelihood that a given test result would be expected in a patient with the target disorder compared to the likelihood that the same result would be expected in a patient without that disorder. In this case the LR is 4.5. We&#8217;ve put up an article if you want to know more on how to calculate the LR.</p>
<p>In the general population the prevalence of autism is 1 in 100; the actual chances of having the disease are 4.5 times more likely given a positive test. This gives a positive predictive value of 4.5%; about 5 in every 100 with a positive test would have autism.</p>
<p>For those still feeling confused and not convinced, let&#8217;s think of 10,000 children. Of these 100 (1%) will have autism, 90 of these 100 would have a positive test, 10 are missed as they have a negative test: there&#8217;s the 90% reported accuracy by the media.</p>
<p>But what about the 9,900 who don&#8217;t have the disease? 7,920 of these will test negative (the specificity3 in the Ecker paper is 80%). But, the real worry though, is the numbers without the disease who test positive. This will be substantial: 1,980 of the 9,900 without the disease. This is what happens at very low prevalences, the numbers falsely misdiagnosed rockets. Alarmingly, of the 2,070 with a positive test, only 90 will have the disease, which is roughly 4.5%.</p></blockquote>
<p>In other words, if you screened everyone in the population for autism, and assume the best about the classifier reported in the JNeuro article (e.g., that the sample of 20 ASD participants they used is perfectly representative of the broader ASD population, which seems unlikely), only about 1 in 20 people who receive a positive diagnosis would actually deserve one.</p>
<p>Ecker et al object to this characterization, and <a href="http://www.guardian.co.uk/science/blog/2010/aug/12/autism-brain-scan-statistics?showallcomments=true#CommentKey:6a367062-3655-4002-bebb-fbdc5b43cdfc">reply to Heneghan in the comments</a> (through the MRC PR office):</p>
<blockquote><p>Our test was never designed to screen the entire population of the UK. This is simply not practical in terms of costs and effort, and besides totally  unjustified- why would we screen everybody in the UK for autism if there is no evidence whatsoever that an individual is affected?. The same case applies to other diagnostic tests. Not every single individual in the UK is tested for HIV. Clearly this would be too costly and unnecessary. However, in the group of individuals that are test for the virus, we can be very confident that if the test is positive that means a patient is infected. The same goes for our approach.</p></blockquote>
<p>Essentially, the argument is that, since people would presumably be sent for an MRI scan because they were already under consideration for an ASD diagnosis, and not at random, the false positive rate would in fact be much lower than 95%, and closer to the 20% reported in the article.</p>
<p>One response to this reply&#8211;which is in fact Heneghan&#8217;s <a href="http://www.guardian.co.uk/science/blog/2010/aug/12/autism-brain-scan-statistics?showallcomments=true#CommentKey:3b7a457c-d161-4bb9-a1c8-9d55dd63605c">response in the comments</a>&#8211;is to point out that the pre-test probability of ASD would need to be pretty high already in order for the classifier to add much. For instance, even if fully 30% of people who were sent for a scan actually had ASD, the posterior probability of ASD given a positive result would still be only 66% (Heneghan&#8217;s numbers, which I haven&#8217;t checked). Heneghan nicely contrasts these results with the standard for HIV testing, which &#8220;reports sensitivity of 99.7% and specificity of 98.5% for enzyme immunoassay.&#8221; Clearly, we have a long way to go before doctors can order MRI-based tests for ASD and feel reasonably confident that a positive result is sufficient grounds for an ASD diagnosis.</p>
<p>Setting Heneghan&#8217;s concerns about base rates aside, there&#8217;s a more general issue that he doesn&#8217;t touch on. It&#8217;s one that&#8217;s not specific to this particular study, and applies to nearly all studies that attempt to develop &#8220;biomarkers&#8221; for existing disorders. The problem is that the sensitivity and specificity values that people report for their new diagnostic procedure in these types of studies generally aren&#8217;t the <em>true</em> parameters of the procedure. Rather, they&#8217;re the sensitivity and specificity <em>under the assumption that the diagnostic procedures used to classify patients and controls in the first place are themselves correct</em>. In other words, in order to believe the results, you have to assume that the researchers correctly classified the subjects into patient and control groups using other procedures. In cases where the gold standard test used to make the initial classification is known to have near 100% sensitivity and specificity (e.g., for the aforementioned HIV tests), one can reasonably ignore this concern. But when we&#8217;re talking about mental health disorders, where diagnoses are fuzzy and borderline cases abound, it&#8217;s very likely that the &#8220;gold standard&#8221; isn&#8217;t really all that great to begin with.</p>
<p>Concretely,  studies that attempt to develop biomarkers for mental health disorders face two substantial problems. One is that it&#8217;s extremely unlikely that the clinical diagnoses are ever perfect; after all, if they <em>were</em> perfect, there&#8217;d be little point in trying to develop other diagnostic procedures! In this particular case, the authors selected subjects into the ASD group based on standard clinical instruments and structured interviews. I don&#8217;t know that there are many clinicians who&#8217;d claim with a straight face that the current diagnostic criteria for ASD (and there are multiple sets to choose from!) are perfect. From my limited knowledge, the criteria for ASD seem to be even more controversial than those for most other mental health disorders (which is saying something, if you&#8217;ve been following the ongoing <a href="http://online.wsj.com/article/SB10001424052748703525704575061851569968656.html?mod=WSJ_Opinion_LEFTThirdBucket">DSM-V saga</a>). So really, the accuracy of the classifier in the present study, even if you put the best face on it and ignore the base rate issue Heneghan brings up, is undoubtedly south of the 90% sensitivity / 80% specificity the authors report. How much south, we just don&#8217;t know, because we don&#8217;t really have any independent, objective way to determine who &#8220;really&#8221; should get an ASD diagnosis and who shouldn&#8217;t (assuming you think it makes sense to make that kind of dichotomous distinction at all). But 90% accuracy is probably a pipe dream, if for no other reason than it&#8217;s hard to imagine that level of consensus about autism spectrum diagnoses.</p>
<p>The second problem is that, because the researchers are using the MRI-based classifier to predict the clinician-based diagnosis, it simply isn&#8217;t possible for the former to exceed the accuracy of the latter. That bears repeating, because it&#8217;s important: <em>no matter how good the MRI-based classifier is, it can only be as good as the procedures used to make the original diagnosis, and no better</em>. It cannot, by definition, make diagnoses that are any more accurate than the clinicians who screened the participants in the authors&#8217; ASD sample. So when you see the press release say this:</p>
<blockquote><p>For the first time, a quick brain scan that takes just 15 minutes can identify adults with autism with over 90% accuracy.</p></blockquote>
<p>You should really read it as this:</p>
<blockquote><p>The method relies on structural (MRI) brain scans and has an accuracy rate approaching that of conventional clinical diagnosis.</p></blockquote>
<p>That&#8217;s not quite as exciting, obviously, but it&#8217;s more accurate.</p>
<p>To be fair, there&#8217;s something of a catch-22 here, in that the authors didn&#8217;t really have a choice about whether or not to diagnose the ASD group using conventional criteria. If they hadn&#8217;t, reviewers and other researchers would have complained that we can&#8217;t tell if the ASD group is really an ASD group, because they authors used non-standard criteria. Under the circumstances, they did the only thing they could do. But that doesn&#8217;t change the fact that it&#8217;s misleading to intimate, as the press release does, that the new procedure might be any better than the old ones. It can&#8217;t be, by definition.</p>
<p>Ultimately, if we want to develop brain-based diagnostic tools that are <em>more</em> accurate than conventional clinical diagnoses, we&#8217;re going to need to show that these tools are capable of predicting meaningful outcomes that clinician diagnoses can&#8217;t. This isn&#8217;t an impossible task, but it&#8217;s a very difficult one. One approach you could take, for instance, would be to compare the ability of clinician diagnosis and MRI-based diagnosis to predict functional outcomes among subjects at a later point in time. If you could show that MRI-based classification of subjects at an early age was a <em>stronger</em> predictor of receiving an ASD diagnosis later in life than conventional criteria, that would make a really strong case for using the former approach in the real world. Short of that type of demonstration though, the only reason I can imagine wanting to use a procedure that was developed by trying to duplicate the results of an existing procedure is in the event that the new procedure is substantially cheaper or more efficient than the old one. Meaning, it would be reasonable enough to say &#8220;well, look, we don&#8217;t do <em>quite</em> as well with this approach as we do with a full clinical evaluation, but at least this new approach costs much less.&#8221; Unfortunately, that&#8217;s not really true in this case, since the price of even a short MRI scan is generally going to outweigh that of a comprehensive evaluation by a psychiatrist or psychotherapist. And while it could theoretically be much faster to get an MRI scan than an appointment with a mental health professional, I suspect that that&#8217;s not generally going to be true in practice either.</p>
<p>Having said all that, I hasten to note that all this is really a critique of the MRC press release and <a href="http://news.google.com/news/more?pz=1&amp;cf=all&amp;cf=all&amp;ncl=dYR0ovuKzmfoyaMhIc2o0BGbxKd8M">subsequently lousy science reporting</a>, and not of the science itself. I actually think the science itself is very cool (but the Neuroskeptic just wrote <a href="http://neuroskeptic.blogspot.com/2010/08/is-your-brain-autistic.html">a great rundown of the methods and results</a>, so there&#8217;s not much point in me describing them here). People have been doing really interesting work with pattern-based classifiers for several years now in the neuroimaging literature, but relatively few studies have applied this kind of technique to try and discriminate between different groups of individuals in a clinical setting. While I&#8217;m not really optimistic that the technique the authors introduce in this paper is going to change the way diagnosis happens any time soon (or at least, I&#8217;d argue that it <em>shouldn&#8217;t</em>), there&#8217;s no question that the general approach will be an important piece of future efforts to improve clinical diagnoses by integrating biological data with existing approaches. But that&#8217;s not going to happen overnight, and in the meantime, I think it&#8217;s pretty irresponsible of the MRC to be issuing press releases claiming that its researchers can diagnose autism in adults with 90% accuracy.</p>
<p><span style="float: left; padding: 5px;"><a href="http://www.researchblogging.org"><img style="border: 0;" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" alt="ResearchBlogging.org" /></a></span><span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=The+Journal+of+neuroscience+%3A+the+official+journal+of+the+Society+for+Neuroscience&amp;rft_id=info%3Apmid%2F20702694&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Describing+the+brain+in+autism+in+five+dimensions--magnetic+resonance+imaging-assisted+diagnosis+of+autism+spectrum+disorder+using+a+multiparameter+classification+approach.&amp;rft.issn=0270-6474&amp;rft.date=2010&amp;rft.volume=30&amp;rft.issue=32&amp;rft.spage=10612&amp;rft.epage=23&amp;rft.artnum=&amp;rft.au=Ecker+C&amp;rft.au=Marquand+A&amp;rft.au=Mour%C3%A3o-Miranda+J&amp;rft.au=Johnston+P&amp;rft.au=Daly+EM&amp;rft.au=Brammer+MJ&amp;rft.au=Maltezos+S&amp;rft.au=Murphy+CM&amp;rft.au=Robertson+D&amp;rft.au=Williams+SC&amp;rft.au=Murphy+DG&amp;rfe_dat=bpr3.included=1;bpr3.tags=Clinical+Research%2CPsychology%2CResearch+%2F+Scholarship%2CNeuroscience%2CCognitive+Neuroscience%2C+Abnormal+Psychology%2C+Policy%2C+Psychiatry">Ecker C, Marquand A, Mourão-Miranda J, Johnston P, Daly EM, Brammer MJ, Maltezos S, Murphy CM, Robertson D, Williams SC, &amp; Murphy DG (2010). Describing the brain in autism in five dimensions&#8211;magnetic resonance imaging-assisted diagnosis of autism spectrum disorder using a multiparameter classification approach. <span style="font-style: italic;">The Journal of neuroscience : the official journal of the Society for Neuroscience, 30</span> (32), 10612-23 PMID: <a rev="review" href="http://www.ncbi.nlm.nih.gov/pubmed/20702694">20702694</a></span></p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/RPAev64JuGA" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/08/15/trouble-with-biomarkers-and-press-releases/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/08/15/trouble-with-biomarkers-and-press-releases/</feedburner:origLink></item>
		<item>
		<title>babygate blues: a neuromarketing tale</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/5jALIywgEjk/</link>
		<comments>http://www.talyarkoni.org/blog/2010/07/26/babygate-blues-a-neuromarketing-tale/#comments</comments>
		<pubDate>Mon, 26 Jul 2010 08:27:20 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[fiction]]></category>
		<category><![CDATA[general silliness]]></category>
		<category><![CDATA[Cory Doctorow]]></category>
		<category><![CDATA[Dampers]]></category>
		<category><![CDATA[funding]]></category>
		<category><![CDATA[neuromarketing]]></category>
		<category><![CDATA[politics]]></category>
		<category><![CDATA[science fiction]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=655</guid>
		<description><![CDATA[Cory Doctorow has a new short story (&#8220;Ghosts in my Head&#8220;) about the undesirable consequences of neuromarketing run amok up on the Subterranean Press website.  I liked the story, but thought the premise was pretty unrealistic (and, yes, I do know it&#8217;s called science fiction for a reason&#8211;I&#8217;m just sayin&#8217;). So as a counterpoint, here&#8217;s [...]]]></description>
			<content:encoded><![CDATA[<p><em><a href="http://en.wikipedia.org/wiki/Cory_Doctorow">Cory Doctorow</a> has a new short story (&#8220;<a href="http://subterraneanpress.com/index.php/magazine/summer-2010/fiction-ghosts-in-my-head-by-cory-doctorow/">Ghosts in my Head</a>&#8220;) about the undesirable consequences of neuromarketing run amok up on the Subterranean Press website.  I liked the story, but thought the premise was pretty unrealistic (and, yes, I do know it&#8217;s called science <strong>fiction </strong>for a reason&#8211;I&#8217;m just sayin&#8217;). So as a counterpoint, here&#8217;s an alternative neuromarketing future that I personally find much more plausible</em>.</p>
<p>Deborah Stojko didn&#8217;t care much for Pockter and Gramble&#8217;s corporate headquarters. The building smelled of disinfectant and organization; the halogen corridors all blended together into one giant dimly-lit maze. Stojko had been visiting P&amp;G regularly for several years now; it was never a pleasant experience, but it couldn&#8217;t be avoided. Communicating with major stakeholders was a large part of her job as director of the International Consortium for Neuromarketing Research. And P&amp;G was by far the largest stakeholder, contributing over 70% of the money that supported the consortium&#8217;s work.</p>
<p>For several years now, ICNR had been pumping out first-class scientific research on the neural mechanisms of economic decision-making. The Richelieu effect, Preinforcement Learning, the neurometric satisficing theorem&#8230; ICNR was behind any number of recent discoveries; its members were continually in the news. And all of it was made possible only through the generosity of the marketing and R&amp;D wings of P&amp;G.</p>
<p>The generosity, or the naivete? Stojko asked herself as she reached her destination and knocked softly on an office door. Somehow, the executives at Pockter and Gramble had managed to convince themselves that the survival of P&amp;G rested on their ability to mine the deep secrets of the brain. For years now, they&#8217;d been throwing sums of money at cognitive neuroscientists that would make European royalty blush. That streak of good fortune, Stojko suspected, was now about to end. Recent events had rendered P&amp;G&#8217;s massive investment in ICNR something of a political liability; she had the feeling this was the last time she&#8217;d be making the trip to P&amp;G headquarters.</p>
<p>And not a moment too soon, she thought, as the door opened in front of her.</p>
<p style="text-align: center;">
*    *    *</p>
<p>&#8220;How long has Pockter and Gramble been funding you, Deborah,&#8221; Bob Ramsey, Chief Executive Officer, asked, once Stojko was seated and they&#8217;d gotten the standard pleasantries out of the way.</p>
<p>Stojko did the arithmetic in her head. The International Neuromarketing Consortium had formed in 2013, following a massive infusion of P&amp;G cash, so&#8230;</p>
<p>&#8220;Six years,&#8221; she said.</p>
<p>&#8220;Right. And do you know how much money Pockter and Gramble has given your consortium in those six years?&#8221;</p>
<p>&#8220;I&#8217;d put it somewhere between 251.8 and 251.9 million dollars.&#8221;</p>
<p>&#8220;Very clever. A quarter of a billion dollars. We&#8217;ve given you a quarter. Of a billion. Dollars.&#8221;</p>
<p>&#8220;Well, to be fair, that amount is spread out over 8 sites and 30 other investigators,&#8221; Stojko pointed out. &#8220;It&#8217;s not like you wrote <em>me</em> a check for 250 million. <em>My</em> institution only got about forty-five million.&#8221;</p>
<p>Ramsey didn&#8217;t say anything, but his expression bespoke a thinly-veiled irritation. He picked up a remote control on the desk and pushed a button. Behind Stojko, the wall turned translucent as the embedded display lit up.</p>
<p>&#8220;No doubt you&#8217;ll recognize this clip,&#8221; Ramsey said.</p>
<p>Stojko swiveled around to watch the giant screen. The camera faded in on a bright and comfortable-looking living room somewhere in America. Almost immediately, six or seven babies in diapers filed into the room and began dancing synchronously in a circle. After a few seconds of dancing, the babies started babbling an Eastern-sounding melody in a totally incomprehensible&#8211;and, Stojko suspected, nonexistent&#8211;language. And a few seconds after that, they started banging spoons on the tabletop in perfect unison, all the while still dancing and singing in tongues. The whole thing lasted exactly thirty seconds, and occupied a very narrow emotional niche between really adorable and utterly creepy.</p>
<p>Stojko did recognize the clip, of course; it was an ad for Dampers, a P&amp;G-owned diaper brand. The consortium had selected the ad from over two dozen candidates that P&amp;G had asked them to test. For reasons that remained unclear to Stojko&#8211;and to pretty much everyone else&#8211;singing, dancing, spoon-banging babies lit the brain up like a christmas tree.</p>
<p>Stojko had had her reservations about declaring a &#8216;winner&#8217;; she&#8217;d written several long emails to the P&amp;G marketing brain trust explaining that, brain activation notwithstanding, there really wasn&#8217;t any evidence yet that this particular ad was going to help sell more diapers, and many more studies were needed before the consortium could confidently interpret its own results. But marketing wasn&#8217;t into the whole waiting thing, and the ad was on the air within three months of the consortium&#8217;s initial report.</p>
<p>As it turned out, it didn&#8217;t do so well.</p>
<p>&#8220;That ad <em>bombed</em>,&#8221; Ramsey said, wagging his finger in the general direction of the screen, &#8220;According to you people, it was supposed to push all of the brain&#8217;s buttons <em>at once</em>. You spent three million dollars of our money just on that one testing program. Two dozen ads to choose from, and the one you pick completely tanked. It was an epic failure. At this very moment, people in living rooms all over America are laughing at Pockter and Gramble because of that ad.&#8221;</p>
<p>&#8220;I&#8217;m sure it&#8217;s not <em>that </em>bad&#8221; said Stojko, smirking almost imperceptibly. She was well aware of the PR disaster P&amp;G had on its hands, of course. But she couldn&#8217;t deny the warm feeling of schadenfreude that accompanied the knowledge that P&amp;G was now paying many times over for disregarding just about every recommendation the consortium had made in its 480-page report. She was pretty sure the suits had never made it past the fifth or sixth page.</p>
<p>&#8220;It <em>is</em> that bad,&#8221; Ramsey shot back. &#8220;We blew half of our network budget for the year on this ad. Our initial focus groups were already pretty positive, and then we received your report saying things like&#8211;and I quote&#8211;&#8221;of all the ads tested, number seventeen elicited the largest response in brain areas associated with reward.&#8221; So we figured it was a sure thing, and started airing the ad in all the major markets. And then, out of nowhere, we get this massive backlash. Thousands of angry emails from people complaining that the ad was trite and we were shamefully &#8220;exploiting babies&#8221;. People saying they would never buy Dampers diapers again; that the CEO&#8211;that&#8217;s me, mind you&#8211;should resign; that someone should &#8220;just torch Pockter and Gramble headquarters&#8221;. And those were just the <em>serious</em> complaints. There were also the people who apparently thought the whole thing was just a big joke that gave them an opening to do their own thing. We had forty YouTube videos <em>a day</em> uploaded by people spoofing the ad. There was one clip of six guys in giraffe suits singing and doing our baby dance. Sixteen million hits.&#8221;</p>
<p>&#8220;All publicity is good publicity, right?&#8221;</p>
<p>&#8220;No. Not even close.&#8221;</p>
<p>Stojko chuckled just loudly enough for Ramsey to hear.</p>
<p>&#8220;Is this funny to you?&#8221; Ramsey asked. &#8220;We give you a quarter of a billion dollars for commercials designed to push the brain&#8217;s reward buttons, and we get grown men in giraffe suits?&#8221;</p>
<p>&#8220;Well, let me put it this way, Bob. If your goal was really to make commercials that light up the brain&#8217;s reward circuitry, you wouldn&#8217;t have needed to do any serious research in the first place; you could have just run 30-second clips of semi-nude women making out with each other, or couples giggling and cuddling in bed. That&#8217;d cover most of the bases. You&#8217;d have all the reward-related activation you could want. But how many deodorant sticks do you think commercials like that would sell?&#8221;</p>
<p>Ramsey stared at Stojko blankly.</p>
<p>&#8220;Porn, flashing lights, pictures of hundred-dollar bills, a basket of shiny fresh fruit&#8230; lots of things activate the brain&#8217;s reward centers,&#8221; Stojko continued. &#8220;What makes you think a commercial that tangentially elicits reward-related activation is going to make people buy any more of a product?&#8221;</p>
<p>&#8220;Well, can&#8217;t you tell that?&#8221;</p>
<p>&#8220;Can we?&#8221; asked Stojko rhetorically. &#8220;I don&#8217;t know. Can <em>you</em> tell that? You guys probably have labs full of people trying to figure out whether the fact that people tell you they like a commercial means they&#8217;re going to buy more of the product featured in that commercial. And what&#8217;s the answer?&#8221;</p>
<p>&#8220;I don&#8217;t know that myself,&#8221; Ramsey replied abruptly. &#8220;It&#8217;s not my job to know that. I can have marketing come up here and tell you the answer if you like.&#8221;</p>
<p>Stojko shook her head.</p>
<p>&#8220;Doesn&#8217;t matter. I mean, it can only go one of two ways. If marketing <em>doesn&#8217;t</em> know what makes a commercial good or bad, you can&#8217;t really expect <em>us</em> to tell you what it is about the brain that makes people buy things. We don&#8217;t track how well your products sell after different ads go into circulation; how the hell would we know which commercials have the largest impact on sales? I can tell you which commercials activate the nucleus accumbens more than others, but so what? How am I supposed to know if nucleus accumbens activation is a good predictor of actual purchases without actually knowing anything about real-world purchases?&#8221;</p>
<p>Ramsey had nothing to say to that; he stared down at his shoes.</p>
<p>&#8220;So clearly, that&#8217;s not going to help us,&#8221; Stojko continued. &#8220;But suppose instead we pretend that the people in your marketing department are smart cookies, and they <em>do</em> know what it is about commercials that makes people buy your products. Well, in that case, what the hell would you need <em>us</em>? If you&#8217;ve figured out that people are more likely to buy your anti-dandruff shampoo after watching ads they rate &#8216;extremely interesting&#8217;, what is peering into the brain going to tell you?&#8221;</p>
<p>&#8220;Well, I guess you could use brain imaging to figure out what it is that people find extremely interesting, right?&#8221;</p>
<p>&#8220;Sure, Bob, we could do that. And you know how we&#8217;d do that? By asking people which commercials they found interesting, and then correlating their verbal responses with what their brains were doing while they watched those commercials. And you know what that means? It means we can never do any better than your people can do with your focus groups and spreadsheets. Because basically, we&#8217;re stuck trying to predict the same variables that you guys are using to predict people&#8217;s buying behavior. We&#8217;re just one step further removed.&#8221;</p>
<p>Ramsey listened quietly, but anger visibly colored his face as Stojko spoke.</p>
<p>&#8220;This is the kind of thing that might have been good to bring up, oh, say, five years ago,&#8221; he said.</p>
<p>&#8220;Oh, believe me, we did bring it up,&#8221; Stojko smiled bitterly. &#8220;Or at least, we tried to.&#8221;</p>
<p>She tapped a few keys on her holoboard.</p>
<p>&#8220;Here&#8217;s an email dated June 18th, 2014: &#8220;Dear Mr. Chauahan&#8211;I believe that&#8217;s your VP of marketing, right?&#8211;senior members of the consortium continue to express their frustration at Pockter and Gramble&#8217;s failure to provide us with the sales data we requested. As we indicated in our letter dated April 21st, it is not possible for us to properly evaluate the efficacy of our program without the use of real-world performance metrics. We understand your concerns about sharing private data with outside contractors; however&#8230;&#8221;</p>
<p>Stojko shot Ramsey a pointed look.</p>
<p>&#8220;I&#8217;ll spare you the rest; it goes on like that for three pages. See, we&#8217;ve been asking for the data we need for six years now&#8211;pretty much since we started. And every time we ask, you throw more money at us and tell us to go back to work, that you&#8217;re not going to share your numbers with us because they&#8217;re confidential and we shouldn&#8217;t need that information anyway.&#8221;</p>
<p>She tapped a few more keys.</p>
<p>&#8220;Here&#8217;s another similar one. September 30th: Dear Mr. Chauahan, the consortium is at a loss to understand&#8230;&#8221;</p>
<p>&#8220;Enough!&#8221; yelled Ramsey, slamming his fist down on the desk. &#8220;I get the point! We&#8217;ve spent a quarter of a billion buying you new toys to play with, and all the while you&#8217;ve been playing us for idiots. Well, you know what&#8211;enjoy your toys while they last, because we&#8217;re going to have Legal look at our options for recovering that money first thing Monday morning. Those fancy new scanners of yours are going <em>away</em>.&#8221;</p>
<p>He wheeled his chair away from Stojko and sat there fuming. Stojko took it as a sign the meeting was over; she shrugged and got up to leave.</p>
<p>The falling out was unfortunate, she thought as she walked down the long sterile corridor towards the elevator. But it had been a long time coming, and after the whole Babygate episode (as the scientists at ICNR had started calling it), no one at ICNR would be surprised to hear that P&amp;G was pulling the plug.</p>
<p>Nor would most of them mind terribly much. Stojko had always planned for a six or seven-year run, and had stopped hiring people on short-term contracts a couple of years ago. There would be no massive lay-offs, no collective plunge into obscurity for the many researchers invested in the project. The data was already collected, and she and her colleagues would be kept busy analyzing and publishing the results for years to come.</p>
<p>As for Ramsey&#8217;s legal threats, Stojko wasn&#8217;t the least bit worried. Universities had lawyers too, and there wasn&#8217;t a judge in the country who&#8217;d award P&amp;G a single nickel for breach of contract; not after reading the long series of emails from the consortium that already explained in excruciating detail exactly why P&amp;G was never going to recoup its financial investment unless it fundamentally changed the way it did things. Which, of course, hadn&#8217;t happened&#8211;and probably never would.</p>
<p>Stojko left Pockter and Gramble headquarters with a clear conscience. At the end of the day, she thought as she walked to her car, all you could do was represent yourself honestly to the other party and let the chips fall where they may. And that was what she&#8217;d done. She&#8217;d told P&amp;G all along exactly how the consortium was going to spend the money they received; the service agreements she signed were very clearly delineated in legalese that several lawyers on the institutional payroll had contributed to and pored over. Stojko and her colleagues had worked hard to ensure that no one at P&amp;G was laboring under false pretenses about the likely outcome of ICNR&#8217;s work. As she&#8217;d once put it to a mid-level P&amp;G executive over dinner, neuromarketing research was great for science, and (in her estimation) utterly useless for advertising. But if the suits were willing to pay for it, she was willing to do the research. That, after all, was her job; it was what she&#8217;d be doing with her time anyway, ICNR or no ICNR.</p>
<p>No, she thought, turning the key in the ignition. She&#8217;d been right to take the industry money; ICNR had conducted itself impeccably over the past six years. If someone insisted on filling your cup up with change even after you very carefully explained to them that you were only going to buy beer with it, who could blame you for paying a visit to the bar once panhandling hours were over?</p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/5jALIywgEjk" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/07/26/babygate-blues-a-neuromarketing-tale/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/07/26/babygate-blues-a-neuromarketing-tale/</feedburner:origLink></item>
		<item>
		<title>elsewhere on the net, vacation edition</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/4eHL0bljQhg/</link>
		<comments>http://www.talyarkoni.org/blog/2010/07/13/elsewhere-on-the-net-vacation-edition/#comments</comments>
		<pubDate>Wed, 14 Jul 2010 04:50:15 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[fun with data]]></category>
		<category><![CDATA[general silliness]]></category>
		<category><![CDATA[news]]></category>
		<category><![CDATA[authorship]]></category>
		<category><![CDATA[bisexuality]]></category>
		<category><![CDATA[bloggers]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[dehydrator]]></category>
		<category><![CDATA[ed yong]]></category>
		<category><![CDATA[john zogby]]></category>
		<category><![CDATA[magnetic vision]]></category>
		<category><![CDATA[neuroethics]]></category>
		<category><![CDATA[neurolaw]]></category>
		<category><![CDATA[neuroplasticity]]></category>
		<category><![CDATA[okcupid]]></category>
		<category><![CDATA[plagiarism]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=651</guid>
		<description><![CDATA[I&#8217;m hanging out in Boston for a few days, so blogging will probably be sporadic or nonexistent. Which is to say, you probably won&#8217;t notice any difference. The last post on the Dunning-Kruger effect somehow managed to rack up 10,000 hits in 48 hours; but that was last week. Today I looked at my stats [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m hanging out in Boston for a few days, so blogging will probably be sporadic or nonexistent. Which is to say, you probably won&#8217;t notice any difference.</p>
<p>The last post on the Dunning-Kruger effect somehow managed to rack up 10,000 hits in 48 hours; but that was last week. Today I looked at my stats again, and the blog is back to a more normal 300 hits, so I feel like it&#8217;s safe to blog again. Here are some neat (and totally unrelated) links from the past week:</p>
<ul>
<li>OKCupid has <a href="http://blog.okcupid.com/index.php/2010/07/07/the-biggest-lies-in-online-dating/">another one of those nifty posts</a> showing off all the cool things they can learn from their gigantic userbase (who else gets to say things like &#8220;this analysis includes 1.51 million users&#8217; data&#8221;???). Apparently, tall people (claim to) have more sex, attractive photos are more likely to be out of date, and most people who claim to be bisexual aren&#8217;t really bisexual.</li>
<li>After a few months off, my department-mate Chris Chatham is posting furiously again over at <a href="http://scienceblogs.com/developingintelligence/">Developing Intelligence</a>, with a series of excellent posts reviewing recent work on cognitive control and the perils of fMRI research. I&#8217;m not really sure what Chris spent his blogging break doing, but given the frequency with which he&#8217;s been posting lately, my suspicion is that he spent it secretly writing blog posts.</li>
<li>Mark Liberman <a href="http://languagelog.ldc.upenn.edu/nll/?p=2441">points out a fundamental inconsistency</a> in the way we view attributions of authorship: we get appropriately angry at academics who pass someone else&#8217;s work off as their own, but think it&#8217;s just fine for politicians to pay speechwriters to write for them. It&#8217;s an interesting question, and leads to an intimately related, and even more important question&#8211;namely, will anyone get mad at me if I pay someone else to write a blog post for me about someone else&#8217;s blog post discussing people getting angry at people paying or not paying other people to write material for other people that they do or don&#8217;t own the copyright on?</li>
<li>I like oohing and aahing over large datasets, and the <a href="http://www.guardian.co.uk/news/datablog">Guardian&#8217;s Data Blog</a> provides a nice interface to some of the most ooh- and aah-able datasets out there. [via <a href="http://www.r-chart.com/2010/07/world-government-data-store-api-r-and.html">R-Chart</a>]</li>
<li>Ed Yong has a characteristically excellent write-up about recent work on <a href="http://blogs.discovermagazine.com/notrocketscience/2010/07/08/robins-can-literally-see-magnetic-fields-but-only-if-their-vision-is-sharp/">the magnetic vision of birds</a>. Yong also does link dump posts better than anyone else, so you should probably stop reading this one right now and <a href="http://blogs.discovermagazine.com/notrocketscience/2010/07/10/ive-got-your-missing-links-right-here/">read his instead</a>.</li>
<li>You&#8217;ve probably heard about this already, but some time last week, the brain trust at ScienceBlogs made the <a href="http://www.guardian.co.uk/science/blog/2010/jul/07/scienceblogs-blogging-pepsi">amazingly clever decision</a> to throw away their integrity by selling PepsiCo its very own &#8220;science&#8221; blog. Predictably, a lot of the bloggers weren&#8217;t happy with the decision, and many have now moved onto greener pastures; Carl Zimmer&#8217;s <a href="http://blogs.discovermagazine.com/loom/2010/07/07/oh-pepsi-what-hath-thou-wrought/">keeping score</a>. Personally, I don&#8217;t have anything intelligent to add to everything that&#8217;s already been said; I&#8217;m literally dumbfounded.</li>
<li>Andrew Gelman <a href="http://www.stat.columbia.edu/~cook/movabletype/archives/2010/07/a_note_to_john.html">takes apart</a> an obnoxious letter from pollster John Zogby to Nate Silver of <a href="http://fivethirtyeight.com">fivethirtyeight.com</a>. I guess now we know that Zogby didn&#8217;t get where he is by not being an ass to other people.</li>
<li>Vaughan Bell of Mind Hacks points out that <a href="http://www.mindhacks.com/blog/2010/07/neuroplasticity_is_n.html">neuroplasticity isn&#8217;t a new concept</a>, and was discussed seriously in the literature as far back as the 1800s. Apparently our collective views about the malleability of mind are not, themselves, very plastic.</li>
<li>NPR ran a <a href="http://www.wbur.org/npr/127888976">three-part story</a> by Barbara Bradley Hagerty on the emerging and somewhat uneasy relationship between neuroscience and the law. The articles are pretty good, but much better, in my opinion, was the <a href="http://www.npr.org/templates/story/story.php?storyId=128339306 ">Talk of the Nation episode</a> that featured Hagerty as a guest alongside Joshua Greene, Kent Kiehl, and Stephen Morse&#8211;people who&#8217;ve all contributed in various ways to the emerging discipline of NeuroLaw. It&#8217;s a really interesting set of interviews and discussions. For what it&#8217;s worth, I think I agree with just about everything Greene has to say about these issues&#8211;except that he says things much more eloquently than I think them.</li>
<li>Okay, this one&#8217;s totally frivolous, but does anyone want to buy me one of <a href="http://www.kk.org/cooltools/archives/004590.php">these things</a>? I don&#8217;t even like dried food; I just think it would be fun to stick random things in there and watch them come out pale, dried husks of their former selves. Is it morbid to enjoy watching the life slowly being sucked out of apples and mushrooms?</li>
</ul>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/4eHL0bljQhg" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/07/13/elsewhere-on-the-net-vacation-edition/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/07/13/elsewhere-on-the-net-vacation-edition/</feedburner:origLink></item>
		<item>
		<title>what the Dunning-Kruger effect is and isn’t</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/iJN9zNO7Awo/</link>
		<comments>http://www.talyarkoni.org/blog/2010/07/07/what-the-dunning-kruger-effect-is-and-isnt/#comments</comments>
		<pubDate>Wed, 07 Jul 2010 06:46:13 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[psychology]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[confirmation bias]]></category>
		<category><![CDATA[decision making]]></category>
		<category><![CDATA[dunning-kruger effect]]></category>
		<category><![CDATA[metacognition]]></category>
		<category><![CDATA[regression to the mean]]></category>
		<category><![CDATA[self-enhancement]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=641</guid>
		<description><![CDATA[If you regularly read cognitive science or psychology blogs (or even just the lowly New York Times!), you&#8217;ve probably heard of something called the Dunning-Kruger effect. The Dunning-Kruger effect refers to the seemingly pervasive tendency of poor performers to overestimate their abilities relative to other people&#8211;and, to a lesser extent, for high performers to underestimate [...]]]></description>
			<content:encoded><![CDATA[<p>If you regularly read cognitive science or psychology blogs (or even just the lowly <a href="http://opinionator.blogs.nytimes.com/2010/06/20/the-anosognosics-dilemma-1/">New York Times</a>!), you&#8217;ve probably heard of something called the <a href="http://en.wikipedia.org/wiki/Dunning-kruger_effect">Dunning-Kruger effect</a>. The Dunning-Kruger effect refers to the seemingly pervasive tendency of poor performers to overestimate their abilities relative to other people&#8211;and, to a lesser extent, for high performers to underestimate their abilities. The explanation for this, according to Kruger and Dunning, who first reported the effect in an <a href="http://www.jerwood-no.org.uk/pdf/Dunning%20Kruger.pdf">extremely influential 1999 article</a> in the Journal of Personality and Social Psychology, is that incompetent people by lack the skills they&#8217;d need in order to be able to distinguish good performers from bad performers:</p>
<blockquote><p>&#8230;people who lack the knowledge or wisdom to perform well are often unaware of this fact. We attribute this lack of awareness to a deficit in metacognitive skill. That is, the same incompetence that leads them to make wrong choices also deprives them of the savvy necessary to recognize competence, be it their own or anyone else&#8217;s.</p></blockquote>
<p>For reasons I&#8217;m not really clear on, the Dunning-Kruger effect seems to be experiencing something of a renaissance over the past few months; it&#8217;s <em>everywhere</em> in the blogosphere and media. For instance, here are just a few alleged Dunning-Krugerisms from the past few weeks:</p>
<blockquote><p>&#8230;<a href="http://www.arkansasbusiness.com/article.aspx?aid=122918.54928.135046">So what does this mean in business?</a> Well, it&#8217;s all over the place. Even the title of Dunning and Kruger&#8217;s paper, the part about inflated self-assessments, reminds me of a truism that was pointed out by a supervisor early in my career: The best employees will invariably be the hardest on themselves in self-evaluations, while the lowest performers can be counted on to think they are doing excellent work&#8230;</p>
<p>&#8230;<a href="http://youarenotsosmart.com/2010/05/11/the-dunning-kruger-effect/">Heidi Montag and Spencer Pratt</a> are great examples of the Dunning-Kruger effect. A whole industry of assholes are making a living off of encouraging two attractive yet untalented people they are actually genius auteurs. The bubble around them is so thick, they may never escape it. At this point, all of America (at least those who know who they are), is in on the joke – yet the two people in the center of this tragedy are completely unaware&#8230;</p>
<p>&#8230;<a href="http://feedproxy.google.com/~r/scienceblogs/pharyngula/~3/he9-XnPeozU/the_sorry_state_of_the_public.php">Not so fast there</a> — the Dunning-Kruger effect comes into play here. People in the United States do not have a high level of understanding of evolution, and this survey did not measure actual competence. I&#8217;ve found that the people most likely to declare that they have a thorough knowledge of evolution are the creationists…but that a brief conversation is always sufficient to discover that all they&#8217;ve really got is a confused welter of misinformation&#8230;</p></blockquote>
<p>As you can see, the findings reported by Kruger and Dunning are often interpreted to suggest that the less competent people are, the more competent they <em>think </em>they are. People who perform worst at a task tend to think they&#8217;re god&#8217;s gift to said task, and the people who can actually do said task often display excessive modesty. I suspect we find this sort of explanation compelling because it appeals to our implicit <a href="http://en.wikipedia.org/wiki/Just-world_phenomenon">just-world theories</a>: we&#8217;d like to believe that people who obnoxiously proclaim their excellence at X, Y, and Z must really not be so very good at X, Y, and Z at all, and must be (over)compensating for some actual deficiency; it&#8217;s much less pleasant to imagine that people who go around shoving their (alleged) superiority in our faces might really be better than us at what they do.</p>
<p>Unfortunately, Kruger and Dunning never actually provided any support for this type of just-world view; their studies categorically <em>didn&#8217;t</em> show that incompetent people are more confident or arrogant than competent people. What they <em>did</em> show is this:</p>
<p><a href="http://www.talyarkoni.org/blog/wp-content/uploads/2010/07/dunning_kruger.png"><img class="alignnone size-full wp-image-642" src="http://www.talyarkoni.org/blog/wp-content/uploads/2010/07/dunning_kruger.png" alt="" width="434" height="395" /></a></p>
<p>This is one of the key figures from Kruger and Dunning&#8217;s 1999 paper (and the basic effect has been replicated many times since). The critical point to note is that there&#8217;s a clear <em>positive</em> correlation between actual performance (gray line) and perceived performance (black line): the people in the top quartile for actual performance think they perform better than the people in the second quartile, who in turn think they perform better than the people in the third quartile, and so on. So the bias is definitively <em>not</em> that incompetent people think they&#8217;re better than competent people. Rather, it&#8217;s that <em>incompetent people think they&#8217;re much better than they actually are</em>. But they typically still don&#8217;t think they&#8217;re quite as good as people who, you know, actually <em>are</em> good. (It&#8217;s important to note that Dunning and Kruger never claimed to show that the unskilled think they&#8217;re better than the skilled; that&#8217;s just the way the finding is often interpreted by others.)</p>
<p>That said, it&#8217;s clear that there <em>is</em> a very large discrepancy between the way incompetent people actually perform and the way they perceive their own performance level, whereas the discrepancy is much smaller for highly competent individuals. So the big question is why. Kruger and Dunning&#8217;s explanation, as I mentioned above, is that incompetent people lack the skills they&#8217;d need in order to know they&#8217;re incompetent. For example, if you&#8217;re not very good at learning languages, it might be hard for you to tell that you&#8217;re not very good, because the very skills that you&#8217;d need in order to distinguish someone who&#8217;s good from someone who&#8217;s not are the ones you lack. If you can&#8217;t hear <a href="http://en.wikipedia.org/wiki/Speech_perception">the distinction between two different phonemes</a>, how could you ever know who has native-like pronunciation ability and who doesn&#8217;t? If you don&#8217;t understand very many words in another language, how can you evaluate the size of your own vocabulary in relation to other people&#8217;s?</p>
<p>This appeal to people&#8217;s meta-cognitive abilities (i.e., their knowledge about their knowledge) has some intuitive plausibility, and Kruger, Dunning and their colleagues have provided quite a bit of evidence for it over the past decade. That said, it&#8217;s by no means the only explanation around; over the past few years, a fairly sizeable literature criticizing or extending Kruger and Dunning&#8217;s work has developed. I&#8217;ll mention just three plausible (and mutually compatible) alternative accounts people have proposed (but <a href="http://opim.wharton.upenn.edu/risk/library/J2007JPers+SocPsy_DAM,DAS_Error,Bias.pdf">there</a> are <a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1114332">others</a>!)</p>
<p><strong><em>1. </em><em>Regression toward the mean</em>. </strong>Probably the most common criticism of the Dunning-Kruger effect is that it simply reflects <a href="http://en.wikipedia.org/wiki/Regression_to_the_mean">regression to the mean</a>&#8211;that is, it&#8217;s a statistical artifact. Regression to the mean refers to the fact that any time you select a group of individuals based on some criterion, and then measure the standing of those individuals on some other dimension, performance levels will tend to shift (or regress) toward the mean level. It&#8217;s a notoriously underappreciated problem, and probably explains many, many phenomena that people have tried to interpret substantively. For instance, in placebo-controlled clinical trials of <a href="http://en.wikipedia.org/wiki/SSRIs">SSRIs</a>, depressed people tend to get better in both the drug and placebo conditions. Some of this is undoubtedly due to the <a href="http://en.wikipedia.org/wiki/Placebo_effect">placebo effect</a>, but much of it is probably also due to what&#8217;s often referred to as &#8220;natural history&#8221;. Depression, like most things, tends to be cyclical: people get better or worse better over time, often for no apparent rhyme or reason. But since people tend to seek help (and sign up for drug trials) primarily when they&#8217;re doing particularly badly, it follows that most people would get better to some extent even without any treatment. That&#8217;s regression to the mean (the Wikipedia entry has other nice examples&#8211;for example, the famous <a href="http://en.wikipedia.org/wiki/Sports_Illustrated_Cover_Jinx">Sports Illustrated Cover Jinx</a>).</p>
<p>In the context of the Dunning-Kruger effect, the argument is that incompetent people simply regress toward the mean when you ask them to evaluate their own performance. Since perceived performance is influenced not only by actual performance, but also by many other factors (e.g., one&#8217;s personality, meta-cognitive ability, measurement error, etc.), it follows that, on average, people with extreme levels of actual performance won&#8217;t be quite as extreme in terms of their perception of their performance. So, much of the Dunning-Kruger effect arguably doesn&#8217;t need to be explained at all, and in fact, it would be quite surprising if you <em>didn&#8217;t</em> see a pattern of results that looks at least somewhat like the figure above.</p>
<p><strong><em>2. </em><em>Regression to the mean plus better-than-average.</em></strong> Having said that, it&#8217;s clear that regression to the mean can&#8217;t explain everything about the Dunning-Kruger effect. One problem is that it doesn&#8217;t explain why the effect is greater at the low end than at the high end. That is, incompetent people tend to overestimate their performance to a much greater extent than competent people underestimate their performance. This asymmetry can&#8217;t be explained solely by regression to the mean. It can, however, be explained by a combination of RTM and a &#8220;better-than-average&#8221; (or self-enhancement) heuristic which says that, in general, most people have a tendency to view themselves excessively positively. This two-pronged explanation was proposed by <a href="http://www.ncbi.nlm.nih.gov/pubmed/11831408">Krueger and Mueller in a 2002 study</a> (note that Krueger and Kruger are different people!), who argued that poor performers suffer from a double whammy: not only do their perceptions of their own performance regress toward the mean, but those perceptions are also further inflated by the self-enhancement bias. In contrast, for high performers, these two effects largely balance each other out: regression to the mean causes high performers to underestimate their performance, but to some extent that underestimation is offset by the self-enhancement bias. As a result, it looks as though high performers make more accurate judgments than low performers, when in reality the high performers are just lucky to be where they are in the distribution.</p>
<p><strong><em>3. </em><em>The instrumental role of task difficulty.</em></strong> Consistent with the notion that the Dunning-Kruger effect is at least partly a statistical artifact, some studies have shown that the asymmetry reported by Kruger and Dunning (i.e., the smaller discrepancy for high performers than for low performers) actually goes away, and even reverses, when the ability tests given to participants are very difficult. For instance, <a href="http://deepblue.lib.umich.edu/bitstream/2027.42/39168/1/956.pdf">Burson and colleagues (2006)</a>, writing in JPSP, showed that when University of Chicago undergraduates were asked moderately difficult trivia questions about their university, the subjects who performed best were just as poorly calibrated as the people who performed worst, in the sense that their estimates of how well they did relative to other people were wildly inaccurate. Here&#8217;s what that looks like:</p>
<p><a href="http://www.talyarkoni.org/blog/wp-content/uploads/2010/07/burson_figure.png"><img class="alignnone size-full wp-image-643" title="burson_figure" src="http://www.talyarkoni.org/blog/wp-content/uploads/2010/07/burson_figure.png" alt="" width="440" height="375" /></a></p>
<p>Notice that this finding wasn&#8217;t anomalous with respect to the Kruger and Dunning findings; when participants were given easier trivia (the diamond-studded line), Burson et al observed the standard pattern, with poor performers seemingly showing worse calibration. Simply knocking about 10% off the accuracy rate on the trivia questions was enough to induce a large shift in the relative mismatch between perceptions of ability and actual ability. Burson et al then went on to replicate this pattern in two additional studies involving a number of different judgments and tasks, so this result isn&#8217;t specific to trivia questions. In fact, in the later studies, Burson et al showed that when the task was <em>really</em> difficult, poor performers were actually considerably <em>better</em> calibrated than high performers.</p>
<p>Looking at the figure above, it&#8217;s not hard to see why this would be. Since the slope of the line tends to be pretty constant in these types of experiments, any change in mean performance levels (i.e., a shift in intercept on the y-axis) will necessarily result in a larger difference between actual and perceived performance at the high end. Conversely, if you raise the line, you maximize the difference between actual and perceived performance at the lower end.</p>
<p>To get an intuitive sense of what&#8217;s happening here, just think of it this way: if you&#8217;re performing a very difficult task, you&#8217;re probably going to find the experience subjectively demanding even if you&#8217;re at the high end relative to other people. Since people&#8217;s judgments about their own relative standing depends to a substantial extent on their subjective perception of their <em>own</em> performance (i.e., you use your sense of how easy a task was as a proxy of how good you must be at it), high performers are going to end up systematically underestimating how well they did. When a task is difficult, most people assume they must have done relatively poorly compared to other people. Conversely, when a task is relatively easy (and the tasks Dunning and Kruger studied were on the easier side), most people assume they must be pretty good compared to others. As a result, it&#8217;s going to look like the people who perform well are well-calibrated when the task is easy and poorly-calibrated when the task is difficult; less competent people are going to show exactly the opposite pattern. And note that this doesn&#8217;t require us to assume <em>any</em> relationship between actual performance and perceived performance. You would expect to get the Dunning-Kruger effect for easy tasks even if there was exactly zero correlation between how good people actually are at something and how good they think they are.</p>
<p>Here&#8217;s how Burson et al summarized their findings:</p>
<blockquote><p>Our studies replicate, eliminate, or reverse the association between task performance and judgment accuracy reported by Kruger and Dunning (1999) as a function of task difficulty. On easy tasks, where there is a positive bias, the best performers are also the most accurate in estimating their standing, but on difficult tasks, where there is a negative bias, the worst performers are the most accurate. This pattern is consistent with a combination of noisy estimates and overall bias, with no need to invoke differences in metacognitive abilities. In this  regard, our findings support Krueger and Mueller’s (2002) reinterpretation of Kruger and Dunning’s (1999) findings. An association between task-related skills and metacognitive insight may indeed exist, and later we offer some suggestions for ways to test for it. However, our analyses indicate that the primary drivers of errors in judging relative standing are general inaccuracy and overall biases tied to task difficulty. Thus, it is important to know more about those sources of error in order to better understand and ameliorate them.</p></blockquote>
<p>What should we conclude from these (and other) studies? I think the jury&#8217;s still out to some extent, but at minimum, I think it&#8217;s clear that much of the Dunning-Kruger effect reflects either statistical artifact (regression to the mean), or much more general cognitive biases (the tendency to self-enhance and/or to use one&#8217;s subjective experience as a guide to one&#8217;s standing in relation to others). This doesn&#8217;t mean that the meta-cognitive explanation preferred by Dunning, Kruger and colleagues can&#8217;t hold in some situations; it very well may be that in <em>some</em> cases, and to <em>some</em> extent, people&#8217;s lack of skill is really what prevents them from accurately determining their standing in relation to others. But I think our default position should be to prefer the alternative explanations I&#8217;ve discussed above, because they&#8217;re (a) simpler, (b) more general (they explain lots of other phenomena), and (c) necessary (frankly, it&#8217;d be amazing if regression to the mean <em>didn&#8217;t</em> explain at least part of the effect!).</p>
<p>We should also try to be aware of another very powerful cognitive bias whenever we use the Dunning-Kruger effect to explain the people or situations around us&#8211;namely, <a href="http://en.wikipedia.org/wiki/Confirmation_bias">confirmation bias</a>. If you believe that incompetent people don&#8217;t know enough to know they&#8217;re incompetent, it&#8217;s not hard to find anecdotal evidence for that; after all, we all know people who are both arrogant and not very good at what they do. But if you stop to look for it, it&#8217;s probably also not hard to find disconfirming evidence. After all, there are clearly plenty of people who <em>are</em> good at what they do, but not nearly as good as they think they are (i.e., they&#8217;re above average, and still totally miscalibrated in the positive direction). Just like there are plenty of people who are lousy at what they do and recognize their limitations (e.g., I don&#8217;t need to be a great runner in order to be able to tell that I&#8217;m <em>not</em> a great runner&#8211;I&#8217;m perfectly well aware that I have terrible endurance, precisely because I can&#8217;t finish runs that most other runners find trivial!). But <a href="http://www.talyarkoni.org/blog/2010/05/14/in-defense-of-three-of-my-favorite-sayings/">the plural of anecdote is not data</a>, and the data appear to be equivocal. Next time you&#8217;re inclined to chalk your obnoxious co-worker&#8217;s delusions of grandeur down to the Dunning-Kruger effect, consider the possibility that your co-worker&#8217;s simply a jerk&#8211;no meta-cognitive incompetence necessary.</p>
<p><span style="float: left; padding: 5px;"><a href="http://www.researchblogging.org"><img style="border: 0;" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" alt="ResearchBlogging.org" /></a></span><span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Journal+of+personality+and+social+psychology&amp;rft_id=info%3Apmid%2F10626367&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Unskilled+and+unaware+of+it%3A+how+difficulties+in+recognizing+one%27s+own+incompetence+lead+to+inflated+self-assessments.&amp;rft.issn=0022-3514&amp;rft.date=1999&amp;rft.volume=77&amp;rft.issue=6&amp;rft.spage=1121&amp;rft.epage=34&amp;rft.artnum=&amp;rft.au=Kruger+J&amp;rft.au=Dunning+D&amp;rfe_dat=bpr3.included=1;bpr3.tags=Psychology%2CSocial+Psychology%2C+Judgment+and+Decision+Making">Kruger J, &amp; Dunning D (1999). Unskilled and unaware of it: how difficulties in recognizing one&#8217;s own incompetence lead to inflated self-assessments. <span style="font-style: italic;">Journal of personality and social psychology, 77</span> (6), 1121-34 PMID: <a rev="review" href="http://www.ncbi.nlm.nih.gov/pubmed/10626367">10626367</a><a title="Search Columbia Collections for this item" href="http://rd8hp6du2b.search.serialssolutions.com/?rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Journal%20of%20personality%20and%20social%20psychology&amp;rft_id=info%3Apmid%2F10626367&amp;rft.atitle=Unskilled%20and%20unaware%20of%20it%3A%20how%20difficulties%20in%20recognizing%20one's%20own%20incompetence%20lead%20to%20inflated%20self-assessments.&amp;rft.issn=0022-3514&amp;rft.date=1999&amp;rft.volume=77&amp;rft.issue=6&amp;rft.spage=1121&amp;rft.epage=34&amp;rft.artnum=&amp;rft.au=Kruger%20J&amp;rft.au=Dunning%20D&amp;rfe_dat=bpr3.included&amp;url_ver=Z39.88-2004&amp;rfr_id=info:sid/libx"><img src="chrome://libx/skin/g_100x25_DB__ot__4407_2.gif" border="0" alt="" /></a></span><br />
<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Journal+of+personality+and+social+psychology&amp;rft_id=info%3Apmid%2F11831408&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Unskilled%2C+unaware%2C+or+both%3F+The+better-than-average+heuristic+and+statistical+regression+predict+errors+in+estimates+of+own+performance.&amp;rft.issn=0022-3514&amp;rft.date=2002&amp;rft.volume=82&amp;rft.issue=2&amp;rft.spage=180&amp;rft.epage=8&amp;rft.artnum=&amp;rft.au=Krueger+J&amp;rft.au=Mueller+RA&amp;rfe_dat=bpr3.included=1;bpr3.tags=Psychology%2CSocial+Psychology%2C+Judgment+and+Decision+Making%2C+Statistics">Krueger J, &amp; Mueller RA (2002). Unskilled, unaware, or both? The better-than-average heuristic and statistical regression predict errors in estimates of own performance. <span style="font-style: italic;">Journal of personality and social psychology, 82</span> (2), 180-8 PMID: <a rev="review" href="http://www.ncbi.nlm.nih.gov/pubmed/11831408">11831408</a><a title="Search Columbia Collections for this item" href="http://rd8hp6du2b.search.serialssolutions.com/?rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Journal%20of%20personality%20and%20social%20psychology&amp;rft_id=info%3Apmid%2F11831408&amp;rft.atitle=Unskilled%2C%20unaware%2C%20or%20both%3F%20The%20better-than-average%20heuristic%20and%20statistical%20regression%20predict%20errors%20in%20estimates%20of%20own%20performance.&amp;rft.issn=0022-3514&amp;rft.date=2002&amp;rft.volume=82&amp;rft.issue=2&amp;rft.spage=180&amp;rft.epage=8&amp;rft.artnum=&amp;rft.au=Krueger%20J&amp;rft.au=Mueller%20RA&amp;rfe_dat=bpr3.included&amp;url_ver=Z39.88-2004&amp;rfr_id=info:sid/libx"><img src="chrome://libx/skin/g_100x25_DB__ot__4407_2.gif" border="0" alt="" /></a></span><br />
<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Journal+of+personality+and+social+psychology&amp;rft_id=info%3Apmid%2F16448310&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Skilled+or+unskilled%2C+but+still+unaware+of+it%3A+how+perceptions+of+difficulty+drive+miscalibration+in+relative+comparisons.&amp;rft.issn=0022-3514&amp;rft.date=2006&amp;rft.volume=90&amp;rft.issue=1&amp;rft.spage=60&amp;rft.epage=77&amp;rft.artnum=&amp;rft.au=Burson+KA&amp;rft.au=Larrick+RP&amp;rft.au=Klayman+J&amp;rfe_dat=bpr3.included=1;bpr3.tags=Psychology%2CSocial+Psychology%2C+Judgment+and+Decision+Making">Burson KA, Larrick RP, &amp; Klayman J (2006). Skilled or unskilled, but still unaware of it: how perceptions of difficulty drive miscalibration in relative comparisons. <span style="font-style: italic;">Journal of personality and social psychology, 90</span> (1), 60-77 PMID: <a rev="review" href="http://www.ncbi.nlm.nih.gov/pubmed/16448310">16448310</a><a title="Search Columbia Collections for this item" href="http://rd8hp6du2b.search.serialssolutions.com/?rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Journal%20of%20personality%20and%20social%20psychology&amp;rft_id=info%3Apmid%2F16448310&amp;rft.atitle=Skilled%20or%20unskilled%2C%20but%20still%20unaware%20of%20it%3A%20how%20perceptions%20of%20difficulty%20drive%20miscalibration%20in%20relative%20comparisons.&amp;rft.issn=0022-3514&amp;rft.date=2006&amp;rft.volume=90&amp;rft.issue=1&amp;rft.spage=60&amp;rft.epage=77&amp;rft.artnum=&amp;rft.au=Burson%20KA&amp;rft.au=Larrick%20RP&amp;rft.au=Klayman%20J&amp;rfe_dat=bpr3.included&amp;url_ver=Z39.88-2004&amp;rfr_id=info:sid/libx"><img src="chrome://libx/skin/g_100x25_DB__ot__4407_2.gif" border="0" alt="" /></a></span></p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/iJN9zNO7Awo" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/07/07/what-the-dunning-kruger-effect-is-and-isnt/feed/</wfw:commentRss>
		<slash:comments>18</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/07/07/what-the-dunning-kruger-effect-is-and-isnt/</feedburner:origLink></item>
		<item>
		<title>this year, i backed new zealand to go all the way</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/XpT43Wvc6LQ/</link>
		<comments>http://www.talyarkoni.org/blog/2010/07/06/this-year-i-backed-new-zealand-to-go-all-the-way/#comments</comments>
		<pubDate>Wed, 07 Jul 2010 04:49:21 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[general silliness]]></category>
		<category><![CDATA[chance]]></category>
		<category><![CDATA[football]]></category>
		<category><![CDATA[oranje]]></category>
		<category><![CDATA[reinforcement learning]]></category>
		<category><![CDATA[soccer]]></category>
		<category><![CDATA[sports]]></category>
		<category><![CDATA[world cup]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=638</guid>
		<description><![CDATA[Jerry Coyne ponders whether the best football/soccer team generally wins the World Cup. The answer is clearly no: any sporting event where games are settled on the basis of rare events (e.g., only one or two goals per match), and teams only play each other once to determine a winner, is going to be at [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://whyevolutionistrue.wordpress.com">Jerry Coyne</a> ponders <a href="http://whyevolutionistrue.wordpress.com/2010/07/04/does-the-best-team-win-the-world-cup/">whether the best football/soccer team generally wins the World Cup</a>. The answer is clearly no: any sporting event where games are settled on the basis of rare events (e.g., only one or two goals per match), and teams only play each other once to determine a winner, is going to be at the mercy of Lady Luck a good deal of the time. If we really wanted the best team to come out on top reliably, we&#8217;d probably need teams to play multiple games at every stage of the Cup, which isn&#8217;t very practical. Coyne discusses an (old) paper demonstrating that the occurrence of goals during World Cup matches is well fit by a poisson distribution, allowing one to calculate the probability of various unjust outcomes taking place (which turn out to be surprisingly high).</p>
<p>The curious thing, I think, is that it&#8217;s not really clear that sporting fans really <em>do</em> want the best team to come out on top. We don&#8217;t want outcomes to be determined by a coin toss, of course; it would kind of suck if, say, New Zealand had as much chance of lifting the cup as Brazil did. But it would also be pretty boring if it were a foregone conclusion that Brazil was going to win it all every time around. We want events to make sense, but we don&#8217;t want them to be <em>too</em> predictable. I suppose you could tell an interesting <a href="http://en.wikipedia.org/wiki/Reinforcement_learning">prediction error</a> story about this kind of thing&#8211;e.g., that maximally engaging stimuli may be ones that<em> seem</em> to occur systematically yet defy easy explanation&#8211;but it&#8217;s probably more fun to sit around and curse at the television set as the Netherlands make short work of the <a href="http://en.wikipedia.org/wiki/Brazil_soccer_team">Samba Kings</a> (I don&#8217;t know if anyone actually uses that nickname; I just picked it off Wikipedia to make it look like I know what I&#8217;m talking about). Go Oranje!</p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/XpT43Wvc6LQ" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/07/06/this-year-i-backed-new-zealand-to-go-all-the-way/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/07/06/this-year-i-backed-new-zealand-to-go-all-the-way/</feedburner:origLink></item>
		<item>
		<title>speaking the language of personality</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/G_oNHdlUJ3E/</link>
		<comments>http://www.talyarkoni.org/blog/2010/07/01/speaking-the-language-of-personality/#comments</comments>
		<pubDate>Fri, 02 Jul 2010 05:42:15 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[personality]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[bloggers]]></category>
		<category><![CDATA[EAR]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[narcissism]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=635</guid>
		<description><![CDATA[Nick Holtzman and I have a very short piece in the latest ARP (Association for Research in Personality) newsletter. We touch on some recent work at the interface of personality and language, including Nick&#8217;s work on narcissism, Mathias Mehl and Simine Vazire&#8216;s work with the Electronically-Activated Recorder (EAR), and a recent paper of mine that [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://nickholtzman.com">Nick Holtzman</a> and I have <a href="http://www.personality-arp.org/newsletter05/article_language.html">a very short piece</a> in the latest ARP (Association for Research in Personality) <a href="http://www.personality-arp.org/newsletter05/index.html">newsletter</a>. We touch on some recent work at the interface of personality and language, including Nick&#8217;s <a href="http://nickholtzman.com/Holtzman%20Vazire%20Mehl%20in%20press%20Sounds%20like%20a%20narcissist%20Behavioral%20Manifestions%20of%20narcissism%20in%20everyday%20life.pdf">work on narcissism</a>, <a href="http://dingo.sbs.arizona.edu/~mehl/">Mathias Mehl</a> and <a href="http://www.simine.com">Simine Vazire</a>&#8216;s work with the <a href="http://dingo.sbs.arizona.edu/~mehl/EAR.htm">Electronically-Activated Recorder</a> (EAR), and a recent paper of mine that explored <a href="http://talyarkoni.org/papers/Yarkoni_blogging_JRP_in_press.pdf">the relation between personality and word use in a large sample of bloggers</a> (who knew bloggers had personalities!?). Then we speculate wildly about where this line of research might go in future. I find I quite enjoy speculating wildly; it&#8217;s much more pleasant than having to write factually correct, well-researched sentences.</p>
<p>At any rate, facts or no facts, I think it&#8217;s a nice (and short) read. And I think I can say that without much bias, since I mostly just lounged around while Nick did all the hard work.</p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/G_oNHdlUJ3E" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/07/01/speaking-the-language-of-personality/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/07/01/speaking-the-language-of-personality/</feedburner:origLink></item>
		<item>
		<title>will trade two Methods sections for twenty-two subjects worth of data</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/M3J5VoUOrdc/</link>
		<comments>http://www.talyarkoni.org/blog/2010/07/01/630/#comments</comments>
		<pubDate>Thu, 01 Jul 2010 06:08:43 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[academics]]></category>
		<category><![CDATA[opinion]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[benchwork]]></category>
		<category><![CDATA[care bears]]></category>
		<category><![CDATA[data analysis]]></category>
		<category><![CDATA[division of labor]]></category>
		<category><![CDATA[preferences]]></category>
		<category><![CDATA[programming]]></category>
		<category><![CDATA[writing]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=630</guid>
		<description><![CDATA[The excellent and ever-candid Candid Engineer in Academia has an interesting post discussing the love-hate relationship many scientists who work in wet labs have with benchwork. She compares two very different perspectives: She [a current student] then went on to say that, despite wanting to go to grad school, she is pretty sure she doesn&#8217;t [...]]]></description>
			<content:encoded><![CDATA[<p>The excellent and ever-candid <a href="http://candidengineer.blogspot.com">Candid Engineer in Academia</a> has an interesting post discussing <a href="http://candidengineer.blogspot.com/2010/06/loving-bench.html">the love-hate relationship many scientists who work in wet labs have with benchwork</a>. She compares two very different perspectives:</p>
<blockquote><p>She [a current student] then went on to say that, despite wanting to go to grad school, she is pretty sure she doesn&#8217;t want to continue in academia beyond the Ph.D. because she just loves doing the science so much and she can&#8217;t imagine ever not being at the bench.</p>
<p>Being young and into the benchwork, I remember once asking my grad advisor if he missed doing experiments. His response: &#8220;Hell no.&#8221; I didn&#8217;t understand it at the time, but now I do. So I wonder if my student will always feel the way she does now- possessing of that unbridled passion for the pipet, that unquenchable thirst for the cell culture hood.</p></blockquote>
<p>Wet labs are pretty much nonexistent in psychology&#8211;I&#8217;ve never had to put on gloves or goggles to do anything that I&#8217;d consider an &#8220;experiment&#8221;, and I&#8217;ve certainly never run the risk of  <a href="http://science-professor.blogspot.com/2010/03/my-grandmother-was-right.html">spilling dangerous chemicals all over myself</a>&#8211;so I have no opinion at all about benchwork. Maybe I&#8217;d love it, maybe I&#8217;d hate it; I couldn&#8217;t tell you. But Candid Engineer&#8217;s post did get me thinking about opinions surrounding the psychological equivalent of benchwork&#8211;namely, collecting data form human subjects. My sense is that there&#8217;s somewhat more consensus among psychologists, in that most of us don&#8217;t seem to like data collection very much. But there are plenty of exceptions, and there certainly are strong feelings on both sides.</p>
<p>More generally, I&#8217;m perpetually amazed at the wide range of opinions people can hold about the various elements of scientific research, even when the people doing the different-opinion-holding all work in very similar domains. For instance, my favorite aspect of the research I do, hands down, is data analysis. I&#8217;d be ecstatic if I could analyze data all day and never have to worry about actually communicating the results to anyone (though I enjoy doing that too). After that, there are activities like writing and software development, which I spend a lot of time doing, and occasionally enjoy, but also frequently find very frustrating. And then, at the other end, there are aspects of research that I find have little redeeming value save for their instrumental value in supporting other, more pleasant, activities&#8211;nasty, evil activities like writing <a href="http://en.wikipedia.org/wiki/Institutional_review_board">IRB</a> proposals and, yes, collecting data.</p>
<p>To me, collecting data is something you do because you&#8217;re fundamentally interested in some deep (or maybe not so deep) question about how the mind works, and the only way to get an answer is to actually interrogate people while they do stuff in a controlled environment. It isn&#8217;t something I do for <em>fun</em>. Yet I know people who genuinely seem to love collecting data&#8211;or, for that matter, writing Methods sections or designing new experiments&#8211;even as they loathe perfectly pleasant activities like, say, sitting down to analyze the data they&#8217;ve collected, or writing a few lines of code that could save them hours&#8217; worth of manual data entry. On a personal level, I find this almost incomprehensible: how could <em>anyone</em> possibly enjoy collecting data more than actually crunching the numbers and learning new things? But I know these people exist, because I&#8217;ve talked to them. And I recognize that, from their perspective, <em>I&#8217;m</em> the guy with the strange views. They&#8217;re sitting there thinking: what kind of joker actually <em>likes</em> to turn his data inside out several dozen times? What&#8217;s wrong with just running a simple t-test and writing up the results as fast as possible, so you can get back to the pleasure of designing and running new experiments?</p>
<p>This of course leads us directly to the <a href="http://scienceblogs.com/drugmonkey/2008/06/academic_science_not_a_care_be.php">care bears fucking tea party</a> moment where I tell you how wonderful it is that we all have these different likes and dislikes. I&#8217;m not being sarcastic; it really<em> is</em> great. Ultimately, it works to everyone&#8217;s advantage that we enjoy different things, because it means we get to collaborate on projects and take advantage of complementary strengths and interests, instead of all having to fight over who gets to write the same part of the Methods section. It&#8217;s good that there are some people who love benchwork and some people who hate it, and it&#8217;s good that there are people who&#8217;re happy to write software that other people who hate writing software can use. We don&#8217;t all have to pretend we understand each other; it&#8217;s enough just to nod and smile and say &#8220;but <em>of course</em> you can write the Methods for that paper; I really don&#8217;t mind. And yes, I <em>guess</em> I can run some additional analyses for you, really, it&#8217;s not too much trouble at all.&#8221;</p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/M3J5VoUOrdc" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/07/01/630/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/07/01/630/</feedburner:origLink></item>
		<item>
		<title>fourteen questions about selection bias, circularity, nonindependence, etc.</title>
		<link>http://feedproxy.google.com/~r/citationNeeded/~3/3w5BNBUzYHM/</link>
		<comments>http://www.talyarkoni.org/blog/2010/06/27/fourteen-questions-about-selection-bias-circularity-nonindependence-etc/#comments</comments>
		<pubDate>Mon, 28 Jun 2010 05:19:49 +0000</pubDate>
		<dc:creator>Tal Yarkoni</dc:creator>
				<category><![CDATA[fmri]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[circular analysis]]></category>
		<category><![CDATA[nonindependence]]></category>
		<category><![CDATA[selection bias]]></category>

		<guid isPermaLink="false">http://www.talyarkoni.org/blog/?p=623</guid>
		<description><![CDATA[A new paper published online this week in the Journal of Cerebral Blood Flow &#38; Metabolism this week discusses the infamous problem of circular analysis in fMRI research. The paper is aptly titled &#8220;Everything you never wanted to know about circular analysis, but were afraid to ask,&#8221; and is authored by several well-known biostatisticians and [...]]]></description>
			<content:encoded><![CDATA[<p>A new paper published online this week in the Journal of Cerebral Blood Flow &amp; Metabolism this week discusses <a href="http://www.nature.com/jcbfm/journal/vaop/ncurrent/full/jcbfm201086a.html">the infamous problem of circular analysis in fMRI research</a>. The paper is aptly titled &#8220;Everything you never wanted to know about circular analysis, but were afraid to ask,&#8221; and is authored by several well-known biostatisticians and cognitive neuroscientists&#8211;to wit, <a href="http://www.mrc-cbu.cam.ac.uk/people/nikolaus.kriegeskorte/">Niko Kriegeskorte</a>, <a href="http://www.stat.columbia.edu/~martin/">Martin Lindquist</a>, <a href="http://www.sph.umich.edu/~nichols/">Tom Nichols</a>, <a href="http://www.psy.utexas.edu/psy/faculty/Poldrack/poldrack.html">Russ Poldrack</a>, and <a href="http://www.edvul.com/">Ed Vul</a>. The paper has an interesting format, and one that I really like: it&#8217;s set up as a series of fourteen questions related to circular analysis, and each author answers each question in 100 words or less.</p>
<p>I won&#8217;t bother going over the gist of the paper, because the Neuroskeptic <a href="http://neuroskeptic.blogspot.com/2010/06/a-team-sets-fmri-to-rights.html">already  beat me to the punch</a> in an excellent post a couple of days ago  (actually, that&#8217;s how I found out about the paper); instead,  I&#8217;ll just give my own answers to the same set of questions raised in the paper. And since blog posts don&#8217;t have the same length constraints as <a href="http://www.nature.com">NPG journals</a>, I&#8217;m going to be characteristically long-winded and ignore the 100 word limit&#8230;</p>
<p><em>(1) Is circular analysis a problem in systems and cognitive neuroscience?</em></p>
<p>Yes, it&#8217;s a huge problem. That said, I think the term &#8216;circular&#8217; is somewhat misleading here, because it has the connotation than an analysis is completely vacuous. Truly circular analyses&#8211;i.e., those where an initial analysis is performed, and the researchers then conduct a &#8220;follow-up&#8221; analysis that literally adds <em>no</em> new information&#8211;are relatively rare in fMRI research. Much more common are cases where there&#8217;s some dependency between two different analyses, but the second one still adds <em>some</em> novel information.</p>
<p><em>(2) How widespread are slight distortions and serious errors caused by circularity in the neuroscience literature?</em></p>
<p>I think Nichols sums it up nicely here:</p>
<blockquote><p>TN: False positives due to circularity are minimal; biased estimates of effect size are common. False positives due to brushing off the multiple testing problem (e.g., ‘P&lt;0.001 uncorrected’ and crossing your fingers) remain pervasive.</p></blockquote>
<p>The only thing I&#8217;d add to this is that the bias in effect size estimates is not only common, but, in most cases, is probably <a href="http://www.talyarkoni.org/blog/2009/11/21/ioannidis-on-effect-size-inflation-with-guest-appearance-by-bozo-the-clown/">very large</a>.</p>
<p><em>(3) Are circular estimates useful measures of effect size?</em></p>
<p>Yes and no. They&#8217;re less useful than unbiased measures of effect size. But given that the vast majority of effects reported in whole-brain fMRI analyses (and, more generally, <a href="http://www.dcscience.net/ioannidis-associations-2008.pdf">analyses in most fields</a>) are likely to be inflated to some extent, the only way to ensure we don&#8217;t rely on circular estimates of effect size would be to disregard effect size estimates entirely, which doesn&#8217;t seem prudent.</p>
<p><em>(4) Should circular estimates of effect size be presented in papers and, if so, how?</em></p>
<p>Yes, because the only principled alternatives are to either (a) <em>never</em> report effect sizes (which seems much too drastic), or (b) report the results of every single test performed, irrespective of the result (i.e., to never give selection bias an opportunity to rear its head). Neither of these is reasonable. We should generally report effect sizes for all key effects, but they should be accompanied by appropriate confidence intervals. As Lindquist notes:</p>
<blockquote><p>In general, it may be useful to present any effect size estimate as confidence intervals, so that readers can see for themselves how much uncertainty is related to the point estimate.</p></blockquote>
<p>A key point I&#8217;d add is that the width of the reported CIs should match the threshold used to identify results in the first place. In other words, if you conduct a whole brain analysis at p &lt; .001, you should report all resulting effects with 99.9% CIs, and not 95% CIs. I think this simple step would go a considerable ways towards conveying the true uncertainty surrounding most point estimates in fMRI studies.</p>
<p><em>(5) Are effect size estimates important/useful for neuroscience research, and why?</em></p>
<p>I think my view here is closest to Ed Vul&#8217;s:</p>
<blockquote><p>Yes, very much so. Null-hypothesis testing is insufficient for most goals of neuroscience because it can only indicate that a brain region is involved to some nonzero degree in some task contrast. This is likely to be true of most combinations of task contrasts and brain regions when measured with sufficient power.</p></blockquote>
<p>I&#8217;d go further than Ed does though, and say that in a sense, effect size estimates are the <em>only </em>things that matter. As Ed notes, there are few if any cases where it&#8217;s plausible to suppose that the effect of some manipulation on brain activation is really <em>zero</em>. The brain is a very dense causal system&#8211;almost any change in one variable is going to have downstream effects on many, and perhaps most, others. So the real question we care about is almost never &#8220;is there or isn&#8217;t there an effect,&#8221; it&#8217;s whether there&#8217;s an effect that&#8217;s big enough to actually care about. (This problem isn&#8217;t specific to fMRI research, of course; it&#8217;s been a persistent source of criticism of null hypothesis significance testing for many decades.)</p>
<p>People sometimes try to deflect this concern by saying that they&#8217;re not trying to make any claims about how <em>big</em> an effect is, but only about whether or not one can reject the null&#8211;i.e., whether <em>any</em> kind of effect is present or not. I&#8217;ve never found this argument convincing, because whether or not you own up to it, you&#8217;re <em>always</em> making an effect size claim whenever you conduct a hypothesis test. Testing against a null of zero is equivalent to saying that you care about any effect that isn&#8217;t exactly zero, which is simply false. No one in fMRI research cares about <em>r</em> or <em>d</em> values of 0.0001, yet we routinely conduct tests whose results could be consistent with those types of effect sizes.</p>
<p>Since we&#8217;re always making implicit claims about effect sizes when we conduct hypothesis tests, we may as well make them explicit so that they can be evaluated properly. If you only care about correlations greater than 0.1, there&#8217;s no sense in hiding that fact; why not explicitly test against a null range of -0.1 to 0.1, instead of a meaningless null of zero?</p>
<p><em>(6) What is the best way to accurately estimate effect sizes from imaging data?</em></p>
<p>Use large samples, conduct multivariate analyses, report results comprehensively, use meta-analysis&#8230; I don&#8217;t think there&#8217;s any single way to ensure accurate effect size estimates, but plenty of things help. Maybe the most general recommendation is to ensure adequate power (see below), which will naturally minimize effect size inflation.</p>
<p><em>(7) What makes data sets independent? Are different sets of subjects required?</em></p>
<p>Most of the authors think (as I do too) that different sets of subjects are indeed required in order to ensure independence. Here&#8217;s Nichols:</p>
<blockquote><p>Only data sets collected on distinct individuals can be assured to be independent. Splitting an individual&#8217;s data (e.g., using run 1 and run 2 to create two data sets) does not yield independence at the group level, as each subject&#8217;s true random effect will correlate the data sets.</p></blockquote>
<p>Put differently, splitting data within subjects only eliminates measurement error, and not sampling error. You could in theory measure activation perfectly reliably (in which case the two halves of subjects&#8217; data would be perfectly correlated) and still have grossly inflated effects, simply because the multivariate distribution of scores in your sample doesn&#8217;t accurately reflect the distribution in the population. So, as Nichols points out, you always need new subjects if you want to be absolutely certain your analyses are independent. But since this generally isn&#8217;t feasible, I&#8217;d argue we should worry less about whether or not our data sets are completely independent, and more about reporting results in a way that makes the presence of any bias as clear as possible.</p>
<p><em>(8) What information can one glean from data selected for a certain effect?</em></p>
<p>I think this is kind of a moot question, since virtually all data are susceptible to <em>some </em>form of selection bias (scientists generally don&#8217;t write papers detailing all the analyses they conducted that didn&#8217;t pan out!). As I note above, I think it&#8217;s a bad idea to disregard effect sizes entirely; they&#8217;re actually what we should be focusing most of our attention on. Better to report confidence intervals that accurately reflect the selection procedure and make the uncertainty around the point estimate clear.</p>
<p><em>(9) Are visualizations of nonindependent data helpful to illustrate the claims of a paper?</em></p>
<p>Not in cases where there&#8217;s an extremely strong dependency between the selection criteria and the effect size estimate. In cases of weak to moderate dependency, visualization is fine so long as confidence bands are plotted alongside the best fit. Again, the key is to always be explicit about the limitations of the analysis and provide some indication of the uncertainty involved.</p>
<p><em>(10) Should data exploration be discouraged in favor of valid confirmatory analyses?</em></p>
<p>No. I agree with Poldrack&#8217;s sentiment here:</p>
<blockquote><p>Our understanding of brain function remains incredibly crude, and limiting research to the current set of models and methods would virtually guarantee scientific failure. Exploration of new approaches is thus critical, but the findings must be confirmed using new samples and convergent methods.</p></blockquote>
<p><em>(11) Is a confirmatory analysis safer than an exploratory analysis in terms of drawing neuroscientific conclusions?</em></p>
<p>In principle, sure, but in practice, it&#8217;s virtually impossible to determine which reported analyses really started out their lives as confirmatory analyses and which started life out as exploratory analyses and then mysteriously evolved into &#8220;a priori&#8221; predictions once the paper was written. I&#8217;m not saying there&#8217;s anything wrong with this&#8211;everyone reports results strategically to some extent&#8211;just that I don&#8217;t know that the distinction between confirmatory and exploratory analyses is all that meaningful in practice. Also, as the previous point makes clear, safety isn&#8217;t the only criterion we care about; we also want to discover new and unexpected findings, which requires exploration.</p>
<p><em>(12) What makes a whole-brain mapping analysis valid? What constitutes sufficient adjustment for multiple testing?</em></p>
<p>From a hypothesis testing standpoint, you need to ensure adequate control of the family-wise error (FWE) rate or false discovery rate (FDR). But as I suggested above, I think this only ensures validity in a limited sense; it doesn&#8217;t ensure that the results are actually going to be worth <em>caring about</em>. If you want to feel confident that any effects that survive are meaningfully large, you need to do the extra work up front and define what constitutes a meaningful effect size (and then test against that).</p>
<p><em>(13) How much power should a brain-mapping analysis have to be useful?</em></p>
<p>As much as possible! Concretely, the conventional target of 80% seems like a good place to start. But as I&#8217;ve argued before (e.g., <a href="http://talyarkoni.org/papers/Yarkoni_PPS_commentary.pdf">here</a>), that would require more than doubling conventional sample sizes in most cases. The reality is that fMRI studies are expensive, so we&#8217;re probably stuck with underpowered analyses for the foreseeable future. So we need to find other ways to compensate for that (e.g., relying more heavily on meta-analytic effect size estimates).</p>
<p><em>(14) In which circumstances are nonindependent selective analyses acceptable for scientific publication?</em></p>
<p>It depends on exactly what&#8217;s problematic about the analysis. Analyses that are truly circular and provide no new information should never be reported, but those constitute only a small fraction of all analyses. More commonly, the nonindependence simply amounts to selection bias: researchers tend to report only those results that achieve statistical significance, thereby inflating apparent effect sizes. I think the solution to this is to still report all key effect sizes, but to ensure they&#8217;re accompanied by confidence intervals and appropriate qualifiers.</p>
<p><span style="float: left; padding: 5px;"><a href="http://www.researchblogging.org"><img style="border:0;" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" alt="ResearchBlogging.org" /></a></span><span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Journal+of+cerebral+blood+flow+and+metabolism+%3A+official+journal+of+the+International+Society+of+Cerebral+Blood+Flow+and+Metabolism&amp;rft_id=info%3Apmid%2F20571517&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Everything+you+never+wanted+to+know+about+circular+analysis%2C+but+were+afraid+to+ask.&amp;rft.issn=0271-678X&amp;rft.date=2010&amp;rft.volume=&amp;rft.issue=&amp;rft.spage=&amp;rft.epage=&amp;rft.artnum=&amp;rft.au=Kriegeskorte+N&amp;rft.au=Lindquist+MA&amp;rft.au=Nichols+TE&amp;rft.au=Poldrack+RA&amp;rft.au=Vul+E&amp;rfe_dat=bpr3.included=1;bpr3.tags=Neuroscience%2CCognitive+Neuroscience%2C+Probability+and+Statistics%2C+Statistics%2C+fMRI">Kriegeskorte N, Lindquist MA, Nichols TE, Poldrack RA, &amp; Vul E (2010). Everything you never wanted to know about circular analysis, but were afraid to ask. <span style="font-style: italic;">Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism</span> PMID: <a rev="review" href="http://www.ncbi.nlm.nih.gov/pubmed/20571517">20571517</a></span></p>
<img src="http://feeds.feedburner.com/~r/citationNeeded/~4/3w5BNBUzYHM" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://www.talyarkoni.org/blog/2010/06/27/fourteen-questions-about-selection-bias-circularity-nonindependence-etc/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		<feedburner:origLink>http://www.talyarkoni.org/blog/2010/06/27/fourteen-questions-about-selection-bias-circularity-nonindependence-etc/</feedburner:origLink></item>
	</channel>
</rss>
