<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/atom10full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><feed xmlns="http://www.w3.org/2005/Atom" xmlns:openSearch="http://a9.com/-/spec/opensearch/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:gd="http://schemas.google.com/g/2005" xmlns:thr="http://purl.org/syndication/thread/1.0" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" gd:etag="W/&quot;D0YGQ30yeyp7ImA9WhVbFUg.&quot;"><id>tag:blogger.com,1999:blog-27862247</id><updated>2012-06-01T13:52:02.393+01:00</updated><category term="repeated measure ANOVA" /><category term="compound symmetry" /><category term="Bayesian statistics" /><category term="Loftus and Masson" /><category term="loglinear models" /><category term="p curves" /><category term="sample chapter" /><category term="MLwin" /><category term="serious stats" /><category term="robust statistics" /><category term="moderator effects" /><category term="contrasts" /><category term="data sets" /><category term="memory" /><category term="displaying means" /><category term="puzzle" /><category term="philosophy" /><category term="dynamite plots" /><category term="PsyPAG" /><category term="Greenhouse-Geisser" /><category term="multilevel models" /><category term="blog" /><category term="sphericity" /><category term="effect size" /><category term="likelihood" /><category term="statistical significance" /><category term="psychology" /><category term="text book" /><category term="Goldstein and Healy" /><category term="nonparametric statistics" /><category term="interaction" /><category term="simple main effects" /><category term="comparing correlations" /><category term="exploratory data analysis" /><category term="Bayes factors" /><category term="ANOVA" /><category term="BPS" /><category term="meta-analysis" /><category term="statistics" /><category term="moderated multiple regression" /><category term="confidence intervals" /><category term="correlation" /><category term="replication" /><category term="NTU" /><category term="science" /><category term="R" /><category term="multiple imputation" /><title>Psychological Statistics</title><subtitle type="html" /><link rel="http://schemas.google.com/g/2005#feed" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/posts/default" /><link rel="alternate" type="text/html" href="http://psychologicalstatistics.blogspot.com/" /><link rel="next" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default?start-index=26&amp;max-results=25&amp;redirect=false&amp;v=2" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><generator version="7.00" uri="http://www.blogger.com">Blogger</generator><openSearch:totalResults>30</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/atom+xml" href="http://feeds.feedburner.com/PsychologicalStatistics" /><feedburner:info uri="psychologicalstatistics" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><entry gd:etag="W/&quot;Dk8CRnwyeip7ImA9WhVbEUg.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-8611486187331986229</id><published>2012-05-27T21:28:00.000+01:00</published><updated>2012-05-27T22:41:07.292+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-05-27T22:41:07.292+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="displaying means" /><category scheme="http://www.blogger.com/atom/ns#" term="exploratory data analysis" /><category scheme="http://www.blogger.com/atom/ns#" term="dynamite plots" /><category scheme="http://www.blogger.com/atom/ns#" term="ANOVA" /><category scheme="http://www.blogger.com/atom/ns#" term="interaction" /><category scheme="http://www.blogger.com/atom/ns#" term="confidence intervals" /><category scheme="http://www.blogger.com/atom/ns#" term="serious stats" /><category scheme="http://www.blogger.com/atom/ns#" term="text book" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>The aesthetics of error bars</title><content type="html">This blog and &lt;a href="http://seriousstats.wordpress.com/"&gt;my other main blog (the companion blog for my book)&lt;/a&gt; are now syndicated via &lt;a href="http://www.r-bloggers.com/"&gt;R-bloggers&lt;/a&gt; (posts tagged R only) and &lt;a href="http://statsblogs.com/"&gt;statsblogs.com&lt;/a&gt;. The latter is a relatively new blog aggregator but looks to have some interesting content. R-bloggers it quite well established and I was already an occasional reader.&lt;br /&gt;
&lt;br /&gt;
Looking at some recent content I noticed an &lt;a href="http://emdbolker.wikidot.com/blog:dynamite"&gt;interesting piece by Ben Bolker&lt;/a&gt; (author, among other things, of the excellent &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;bbmle&lt;/span&gt; package in R) on dynamite plots. Until a few years ago (possibly in the early research for my book) I had not heard the term 'dynamite plot' or the negative press the attract in some research fields. In my own discipline (psychology) and in experimental psychology in particular bar plots with error bars (looking like sticks of dynamite stacked in a row) are rather popular. In fact, I was taught to use them in preference to dot plots when plotting interactions in ANOVA (their main application in experimental psychology). The main arguments against dot plots are that it is easy to manipulate them to make effects look large large by adjusting the scale (and sometimes software does this automatically). The advantage of switching to a bar plot is that these are supposed to be zero-referenced and thus effects are likely to more appropriately scaled.&lt;br /&gt;
&lt;br /&gt;
Here is an example of a dynamite plot adapted from chapter 3 of &lt;a href="http://www.blogger.com/%3Cscript%20type=%22text/javascript%22%20src=%22http://wms.assoc-amazon.co.uk/20070822/GB/js/link-enhancer-common.js?tag=psychologic05-21%22%3E%20%3C/script%3E%20%3Cnoscript%3E%20%20%20%20%20%3Cimg%20src=%22http://wms.assoc-amazon.co.uk/20070822/GB/img/noscript.gif?tag=psychologic05-21%22%20alt=%22%22%20/%3E%20%3C/noscript%3E"&gt;my book&lt;/a&gt;:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-5gkFgYdKyWE/T8KFMQAaMlI/AAAAAAAAAFs/n7F5SxcjV0c/s1600/dynamite+plot.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="http://4.bp.blogspot.com/-5gkFgYdKyWE/T8KFMQAaMlI/AAAAAAAAAFs/n7F5SxcjV0c/s320/dynamite+plot.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;div style="text-align: left;"&gt;
(Colleagues of mine will note that the quantity displayed by the error bars is not labeled. This should always be clear from plot or figure caption - here they are 95% CIs).&lt;/div&gt;
&lt;br /&gt;
Some of the material on dynamite plots on the web is somewhat one-sided (e.g., see &lt;a href="http://pablomarin-garcia.blogspot.co.nz/2010/02/why-dynamite-plots-are-bad.html"&gt;here&lt;/a&gt;, &lt;a href="http://biostat.mc.vanderbilt.edu/wiki/pub/Main/TatsukiKoyama/Poster3.pdf"&gt;here&lt;/a&gt;, &lt;a href="http://sharpstatistics.co.uk/stats/comparing-data-sets/"&gt;here&lt;/a&gt; or the comments &lt;a href="http://flowingdata.com/2008/02/15/how-to-read-and-use-a-box-and-whisker-plot/"&gt;here&lt;/a&gt;).&amp;nbsp;Ben Bolker bravely presents a more balanced picture. He also gets to the heart of the issue by noting that most criticisms of dynamite plots suggest box plots or plots of raw data as alternatives. This doesn't seem appropriate if your goal is inference rather than description. As Bolker notes, if you've decided to something like ANOVA you are already implicitly assuming approximate normality of the errors and so forth. Thus if the main purpose of the plot is inferential or to display key patterns among the data, box plots or raw data plots are not so useful. (Don't get me wrong I think think that plotting raw data is a good idea - but exploratory work and model checking are different from inference). So for a plot of means with error bars, the choice of dot plot or bar plot is one of aesthetics. These days my preference is for dot plots (which are more versatile and have a better information to ink ratio), but I think a well constructed dynamite plot can be appropriate in some situations. I would usually save these for a situation in which the pattern was quite simple (e.g., a 2 by 2 interaction), there was a meaningful zero or other reference point and when my audience are familiar with this style of plot and may prefer them.&lt;br /&gt;
&lt;br /&gt;
A further aesthetic point here is how to plot the error bars themselves. I am persuaded by &lt;a href="http://andrewgelman.com/2011/07/any-good-articles-on-the-use-of-error-bars/"&gt;Andrew Gelman's argument&lt;/a&gt;&amp;nbsp;that the crossbars on conventional error bar plots are ugly and counterproductive. They draw your attention to the extremes of the error bar - when values closer to the statistic being estimated are more plausible. Here is the earlier dyamite plot redrawn as a conventional error bar plot and in cleaner Gelman-approved style:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-stcvR9hl5qg/T8KF9TYH3-I/AAAAAAAAAF0/Bhv5cQ46hQY/s1600/error+bar+plots.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="196" src="http://4.bp.blogspot.com/-stcvR9hl5qg/T8KF9TYH3-I/AAAAAAAAAF0/Bhv5cQ46hQY/s400/error+bar+plots.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
I find the version on the right to be much prettier. Furthermore it makes it easier to adapt them into two-tiered error bar plots. I like to use two-tier plots to convey 95% CIs for individual means (outer tier) and &lt;a href="http://psychologicalstatistics.blogspot.co.uk/2012/03/graphing-between-subject-confidence.html"&gt;inferential (difference-adjusted) 95% CIs&lt;/a&gt; (inner tier). The inner tier approximates to a 95% CI for the difference - so that the means can be considered different by conventional criteria if the inner tier error bars don't overlap:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-cmVbr3eqU6s/T8KI0xrrYSI/AAAAAAAAAGA/Lt47DTS4W2g/s1600/twotiered.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="200" src="http://3.bp.blogspot.com/-cmVbr3eqU6s/T8KI0xrrYSI/AAAAAAAAAGA/Lt47DTS4W2g/s400/twotiered.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
&lt;br /&gt;
In my paper on &lt;a href="http://psychologicalstatistics.blogspot.co.uk/2011/10/calculating-and-graphing-within-subject.html"&gt;within-subject CIs&lt;/a&gt;&amp;nbsp;I used the style on the left. However, with hindsight I wish I'd included the style on the right. Varying the width of the bars avoids the ugly crossbars but may make detecting a 'statistically significant' difference trickier. I think that aesthetics win here because graphical methods aim to support informal inference - they are not supposed to be there for fine-grain, formal inference (which can be supported by formal hypothesis tests of various kinds - not just null hypothesis significance tests).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-8611486187331986229?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/xlFN8kmfTICUoG7PxqH0WjG2kJw/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/xlFN8kmfTICUoG7PxqH0WjG2kJw/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/xlFN8kmfTICUoG7PxqH0WjG2kJw/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/xlFN8kmfTICUoG7PxqH0WjG2kJw/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/GAMfRPAd9r0" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/8611486187331986229/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2012/05/aesthetics-of-error-bars.html#comment-form" title="5 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/8611486187331986229?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/8611486187331986229?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/GAMfRPAd9r0/aesthetics-of-error-bars.html" title="The aesthetics of error bars" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/-5gkFgYdKyWE/T8KFMQAaMlI/AAAAAAAAAFs/n7F5SxcjV0c/s72-c/dynamite+plot.png" height="72" width="72" /><thr:total>5</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2012/05/aesthetics-of-error-bars.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C08HSH4_fip7ImA9WhVXE00.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-7702174590325559825</id><published>2012-03-23T20:52:00.000Z</published><updated>2012-04-13T08:23:59.046+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-04-13T08:23:59.046+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="blog" /><category scheme="http://www.blogger.com/atom/ns#" term="meta-analysis" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="sample chapter" /><category scheme="http://www.blogger.com/atom/ns#" term="replication" /><category scheme="http://www.blogger.com/atom/ns#" term="serious stats" /><category scheme="http://www.blogger.com/atom/ns#" term="text book" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><category scheme="http://www.blogger.com/atom/ns#" term="contrasts" /><category scheme="http://www.blogger.com/atom/ns#" term="effect size" /><category scheme="http://www.blogger.com/atom/ns#" term="data sets" /><category scheme="http://www.blogger.com/atom/ns#" term="loglinear models" /><category scheme="http://www.blogger.com/atom/ns#" term="science" /><category scheme="http://www.blogger.com/atom/ns#" term="multiple imputation" /><title>Serious stats - free statistics resources</title><content type="html">The companion web site for Serious Stats is now live:&lt;br /&gt;
&lt;br /&gt;
&lt;a href="http://www.palgrave.com/psychology/baguley/"&gt;http://www.palgrave.com/psychology/baguley/&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
The web site includes:&lt;br /&gt;
&lt;br /&gt;
- &lt;a href="http://www.palgrave.com/psychology/baguley/about/"&gt;a free sample chapter (Chapter 15: Contrasts)&lt;/a&gt;&lt;br /&gt;
- &lt;a href="http://www.palgrave.com/psychology/baguley/students/datasets.html"&gt;data sets&lt;/a&gt;&lt;br /&gt;
- &lt;a href="http://www.palgrave.com/psychology/baguley/students/rcode.html"&gt;R scripts&lt;/a&gt;&lt;br /&gt;
- &lt;a href="http://www.palgrave.com/psychology/baguley/students/supplements.html"&gt;5 online supplements&lt;/a&gt; (for meta-analysis, multiple imputation, replication probabilities, pseudo-R squared and loglinear models)&lt;br /&gt;
&lt;br /&gt;
Also don't forget the &lt;a href="http://seriousstats.wordpress.com/"&gt;Serious stats blog&lt;/a&gt; to accompany the book.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-7702174590325559825?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/zkBmdZxxkniGRbICcwb4BT0_-9M/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/zkBmdZxxkniGRbICcwb4BT0_-9M/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/zkBmdZxxkniGRbICcwb4BT0_-9M/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/zkBmdZxxkniGRbICcwb4BT0_-9M/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/RkQUX5ndQig" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/7702174590325559825/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2012/03/serious-stats-free-statics-resources.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7702174590325559825?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7702174590325559825?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/RkQUX5ndQig/serious-stats-free-statics-resources.html" title="Serious stats - free statistics resources" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2012/03/serious-stats-free-statics-resources.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CEUDQHg8fSp7ImA9WhVREUs.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-8291477012342848370</id><published>2012-03-19T11:58:00.002Z</published><updated>2012-03-19T13:04:31.675Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-03-19T13:04:31.675Z</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="Loftus and Masson" /><category scheme="http://www.blogger.com/atom/ns#" term="displaying means" /><category scheme="http://www.blogger.com/atom/ns#" term="Goldstein and Healy" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="ANOVA" /><category scheme="http://www.blogger.com/atom/ns#" term="statistical significance" /><category scheme="http://www.blogger.com/atom/ns#" term="confidence intervals" /><category scheme="http://www.blogger.com/atom/ns#" term="repeated measure ANOVA" /><category scheme="http://www.blogger.com/atom/ns#" term="serious stats" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>Graphing between-subject confidence intervals for ANOVA</title><content type="html">&lt;br /&gt;
This is a quick follow up to &lt;a href="http://psychologicalstatistics.blogspot.co.uk/2011/10/calculating-and-graphing-within-subject.html"&gt;my earlier post that discussed how to graph CIs for&amp;nbsp;within-subjects (repeated measures)&amp;nbsp;ANOVA designs&lt;/a&gt;. My forthcoming book &lt;a href="http://www.amazon.co.uk/Serious-Stats-advanced-statistics-behavioral/dp/0230577180"&gt;Serious stats&lt;/a&gt; describes how to do this for between-subjects designs (a much simpler problem). The blog that accompanies the book now has &lt;a href="http://seriousstats.wordpress.com/2012/03/18/cis-for-anova/"&gt;a post summarizing the main options and explaining how to plot difference-adjusted CIs&lt;/a&gt; (95% CIs constructed so that non-overlapping intervals correspond to a statistically significant difference between means at &lt;i&gt;p&lt;/i&gt; &amp;lt; .05). In addition, the post includes R functions to calculate and plot difference-adjusted CIs (though the calculations are not difficult to reproduce by hand).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-8291477012342848370?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/xxmDERrTcPf58OdeYeBxJgcXYX0/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/xxmDERrTcPf58OdeYeBxJgcXYX0/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/xxmDERrTcPf58OdeYeBxJgcXYX0/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/xxmDERrTcPf58OdeYeBxJgcXYX0/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/ngG0vjhA01I" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/8291477012342848370/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2012/03/graphing-between-subject-confidence.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/8291477012342848370?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/8291477012342848370?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/ngG0vjhA01I/graphing-between-subject-confidence.html" title="Graphing between-subject confidence intervals for ANOVA" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2012/03/graphing-between-subject-confidence.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CUIAQ3k7fSp7ImA9WhVREUw.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-7898274551397259397</id><published>2012-03-15T23:35:00.000Z</published><updated>2012-03-18T23:32:22.705Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-03-18T23:32:22.705Z</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="p curves" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="statistical significance" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>p curves revisited</title><content type="html">&lt;br /&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;I finally found some time to take a closer look at &lt;i&gt;p&lt;/i&gt; curves. I haven't had a chance to follow-up my simulations (and probably won't for a few weeks if not months), but I have had time to think through the ideas the &lt;i&gt;p&lt;/i&gt; curve approach raises based on some of the comments I've received and a brief exchange with Uri Simonsohn (who has answered a few of my questions).&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;First, I got a couple of things at least partly wrong.&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;i) how &lt;i&gt;p&lt;/i&gt; curves work&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;ii) the potential for correlated &lt;i&gt;p&lt;/i&gt; values&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;i&gt;How p curves work&lt;/i&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;I made the (I think) reasonable assumption that &lt;i&gt;p&lt;/i&gt; curve analysis involved focusing on a bump just under the &lt;i&gt;p&lt;/i&gt; = .05 threshold. Other work (Wicherts et al., 2011) has shown that there is indeed some distortion around this value. My crude simulation suggested that p curves could maybe be used to detect this kind of bump - but that the method was noisy and required large N.&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;All good so far except my assumption was completely wrong. This isn't what Simonsohn and colleagues are proposing at all. They are focusing on the whole of the distribution between &lt;i&gt;p&lt;/i&gt; = 0 and &lt;i&gt;p&lt;/i&gt; = .05. This is a very different kind of analysis because it uses all the available &lt;i&gt;p&lt;/i&gt; value information about '&lt;i&gt;p&lt;/i&gt; hacking' (if you accept the highly plausible premise that &lt;i&gt;p&lt;/i&gt; hacking is concentrated on statistically significant &lt;i&gt;p&lt;/i&gt; values).&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;Null effects will therefore produce a flat &lt;i&gt;p&lt;/i&gt; curve (because the distribution of &lt;i&gt;p&lt;/i&gt; under the null is uniform). Simonsohn argues that non-null effects should produce downward sloping &lt;i&gt;p&lt;/i&gt; curves. He and his colleagues have simulated &lt;i&gt;p&lt;/i&gt; curves under various ranges of effect size to confirm this - and there is also an analytic proof for the normal case (Hung et al., 1997).*&amp;nbsp; I also (inadvertently) confirmed this in my &lt;a href="http://psychologicalstatistics.blogspot.com/2012/02/simulating-p-curves-and-detecting-dodgy.html"&gt;original simulations&lt;/a&gt; - which show the downward sloping trend (but note that I include &lt;i&gt;p&lt;/i&gt; values up to &lt;i&gt;p&lt;/i&gt; = .10 in my plots).&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;However, mixing in &lt;i&gt;p&lt;/i&gt; hacked studies to a flat curve will produce an upward sloping curve - the feature that Simonshohn and his colleagues are focusing on. I haven't simulated this directly - but it seems sensible because &lt;i&gt;p&lt;/i&gt; hacking is (in essence) a flavour of optional stopping (adding data or iterating analyses until you squeeze a statistically significant effect out). Certainly, an upward sloping curve would be a signal of something wierd going on.&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;This approach uses more information than my mistaken '&lt;i&gt;p&lt;/i&gt; bump' approach and so should be much more stable.&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;* It is far from unreasonable to treat the distribution of effects as approximately normal - as is common in meta-analysis (and see also Gillett, 1994), but I don't think the pattern depends strongly on this assumption.&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;i&gt;Correlated p values&lt;/i&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;It is well known that &lt;i&gt;p&lt;/i&gt; values are inherently extremely noisy 'statistics' - they jump around all over the place for identical replications. Geoff Cumming and colleagues have published some good work on this (e.g., Cumming &amp;amp; Fidler, 2009). Thus the same effect in different studies or different effects of similar sizes will in general not tend to have correlated &lt;i&gt;p&lt;/i&gt; values. However, the noise that causes this jumping around will be crystalized if you use the same data to re-calculate the &lt;i&gt;p&lt;/i&gt; value. This could cause correlated &lt;i&gt;p&lt;/i&gt; values where data is re-used or where variables are very highly correlated. For example, this could happen if you add a covariate that is a modest predictor of Y and uncorrelated with and report &lt;i&gt;p&lt;/i&gt; values with and without the covariate. It could also happen if you report essentially the same analysis twice with a very similar variable (e.g., X correlated with children's age or X correlated with years of schooling).&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;There are two main solutions here: a) just filter out p values that re-use data or use highly-correlated data, or b) model the correlations in some way by accounting for within-study clustering - as you might in a multilevel model and some forms of meta-analysis (itself a form of multilevel model).&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;In summary, I think the &lt;i&gt;p&lt;/i&gt; curve approach looks very interesting, and&amp;nbsp; I'd certainly like to see more work on it (and hope to see the full version published some time soon).&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;i&gt;References&lt;/i&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Times; margin: 0.0px 0.0px 12.0px 32.0px; text-indent: -32.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;Cumming, G., &amp;amp; Fidler, F. (2009). Confidence Intervals. &lt;i&gt;Zeitschrift für Psychologie / Journal of Psychology&lt;/i&gt;, &lt;i&gt;217&lt;/i&gt;(1), 15-26.&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Times; margin: 0.0px 0.0px 12.0px 32.0px; text-indent: -32.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;Gillett, R. (1994). Post hoc power analysis. &lt;i&gt;Journal of Applied Psychology&lt;/i&gt;, &lt;i&gt;79&lt;/i&gt;(5), 783-785. doi:10.1037//0021-9010.79.5.783&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Times; margin: 0.0px 0.0px 12.0px 32.0px; text-indent: -32.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;Hung, H. M., O’Neill, R. T., Bauer, P., &amp;amp; Köhne, K. (1997). The behavior of the P-value when the alternative hypothesis is true. &lt;i&gt;Biometrics&lt;/i&gt;, &lt;i&gt;53&lt;/i&gt;(1), 11-22.&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Times; margin: 0.0px 0.0px 12.0px 32.0px; text-indent: -32.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;Wicherts, J. M., Bakker, M., &amp;amp; Molenaar, D. (2011). Willingness to share research data is related to the strength of the evidence and the quality of reporting of statistical results. &lt;i&gt;PloS one&lt;/i&gt;, &lt;i&gt;6&lt;/i&gt;(11), e26828. doi:10.1371/journal.pone.0026828&lt;/span&gt;&lt;/div&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;
&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-7898274551397259397?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/Ixcq3Uloi3dUjv6edaYWK0A9ldw/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/Ixcq3Uloi3dUjv6edaYWK0A9ldw/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/Ixcq3Uloi3dUjv6edaYWK0A9ldw/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/Ixcq3Uloi3dUjv6edaYWK0A9ldw/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/LdfojyhSPKY" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/7898274551397259397/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2012/03/p-curves-revisited.html#comment-form" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7898274551397259397?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7898274551397259397?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/LdfojyhSPKY/p-curves-revisited.html" title="p curves revisited" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>2</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2012/03/p-curves-revisited.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CUMMQ3wzeCp7ImA9WhVREUw.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-8018539215618229135</id><published>2012-03-14T20:23:00.000Z</published><updated>2012-03-18T23:31:22.280Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-03-18T23:31:22.280Z</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="p curves" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="statistical significance" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>R code for p curves</title><content type="html">&lt;div style="overflow: auto;"&gt;
&lt;div class="geshifilter"&gt;
&lt;pre class="r geshifilter-R"&gt;&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif;"&gt;I have finally got around to posting the R code for my &lt;a href="http://psychologicalstatistics.blogspot.com/2012/02/simulating-p-curves-and-detecting-dodgy.html"&gt;&lt;i&gt;p&lt;/i&gt; curve simulation&lt;/a&gt;. Those familiar with R will realize how crude it is (I've been caught up with other urgent stuff and had no time to explore further).&lt;/span&gt;&lt;/pre&gt;
&lt;pre class="r geshifilter-R"&gt;&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif;"&gt;You are welcome to play with (and improve!) the code. Changing delta will alter the (at present) fixed effect size. It would be more realistic to vary this (and the sample sizes). A good starting point for the effect size distribution (in the population) might be a normal distribution with say a mean of zero and a variance of 1 (see Gillett, 1994).&lt;/span&gt;&lt;/pre&gt;
&lt;pre class="r geshifilter-R" style="font-family: monospace;"&gt;&lt;/pre&gt;
&lt;pre class="r geshifilter-R" style="font-family: monospace;"&gt;&lt;/pre&gt;
&lt;pre class="r geshifilter-R" style="font-family: monospace;"&gt;delta &amp;lt;- &lt;span style="color: #cc66cc;"&gt;0.5&lt;/span&gt;
m1 &amp;lt;- &lt;span style="color: #cc66cc;"&gt;10&lt;/span&gt;
&lt;a href="http://inside-r.org/r-doc/stats/sd"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;sd&lt;/span&gt;&lt;/a&gt; &amp;lt;- &lt;span style="color: #cc66cc;"&gt;2&lt;/span&gt;
m2 &amp;lt;- m1 + &lt;a href="http://inside-r.org/r-doc/stats/sd"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;sd&lt;/span&gt;&lt;/a&gt;*delta
n1 &amp;lt;- n2 &amp;lt;-&lt;span style="color: #cc66cc;"&gt;25&lt;/span&gt;
&amp;nbsp;
n.sims &amp;lt;- &lt;span style="color: #cc66cc;"&gt;500&lt;/span&gt;
p.data &amp;lt;- &lt;a href="http://inside-r.org/r-doc/base/replicate"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;replicate&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;(&lt;/span&gt;n.sims&lt;span style="color: #339933;"&gt;,&lt;/span&gt; &lt;a href="http://inside-r.org/r-doc/stats/t.test"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;t.test&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;(&lt;/span&gt;&lt;a href="http://inside-r.org/r-doc/stats/rnorm"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;rnorm&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;(&lt;/span&gt;n1&lt;span style="color: #339933;"&gt;,&lt;/span&gt; m1&lt;span style="color: #339933;"&gt;,&lt;/span&gt;&lt;a href="http://inside-r.org/r-doc/stats/sd"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;sd&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;)&lt;/span&gt;&lt;span style="color: #339933;"&gt;,&lt;/span&gt; &lt;a href="http://inside-r.org/r-doc/stats/rnorm"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;rnorm&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;(&lt;/span&gt;n2&lt;span style="color: #339933;"&gt;,&lt;/span&gt; m2&lt;span style="color: #339933;"&gt;,&lt;/span&gt;&lt;a href="http://inside-r.org/r-doc/stats/sd"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;sd&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;)&lt;/span&gt;&lt;span style="color: #009900;"&gt;)&lt;/span&gt;$p.val&lt;span style="color: #339933;"&gt;,&lt;/span&gt; simplify=T&lt;span style="color: #009900;"&gt;)&lt;/span&gt;
&amp;nbsp;
&lt;a href="http://inside-r.org/r-doc/graphics/par"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;par&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;(&lt;/span&gt;mfrow=&lt;a href="http://inside-r.org/r-doc/base/c"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;c&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;(&lt;/span&gt;&lt;span style="color: #cc66cc;"&gt;5&lt;/span&gt;&lt;span style="color: #339933;"&gt;,&lt;/span&gt;&lt;span style="color: #cc66cc;"&gt;3&lt;/span&gt;&lt;span style="color: #009900;"&gt;)&lt;/span&gt;&lt;span style="color: #009900;"&gt;)&lt;/span&gt;
&lt;span style="color: black; font-weight: bold;"&gt;for&lt;/span&gt; &lt;span style="color: #009900;"&gt;(&lt;/span&gt;i &lt;span style="color: black; font-weight: bold;"&gt;in&lt;/span&gt; &lt;span style="color: #cc66cc;"&gt;1&lt;/span&gt;:&lt;span style="color: #cc66cc;"&gt;15&lt;/span&gt;&lt;span style="color: #009900;"&gt;)&lt;/span&gt; &lt;span style="color: #009900;"&gt;{&lt;/span&gt;
 p.data &amp;lt;- &lt;a href="http://inside-r.org/r-doc/base/replicate"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;replicate&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;(&lt;/span&gt;n.sims&lt;span style="color: #339933;"&gt;,&lt;/span&gt; &lt;a href="http://inside-r.org/r-doc/stats/t.test"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;t.test&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;(&lt;/span&gt;&lt;a href="http://inside-r.org/r-doc/stats/rnorm"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;rnorm&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;(&lt;/span&gt;n1&lt;span style="color: #339933;"&gt;,&lt;/span&gt; m1&lt;span style="color: #339933;"&gt;,&lt;/span&gt;&lt;a href="http://inside-r.org/r-doc/stats/sd"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;sd&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;)&lt;/span&gt;&lt;span style="color: #339933;"&gt;,&lt;/span&gt; &lt;a href="http://inside-r.org/r-doc/stats/rnorm"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;rnorm&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;(&lt;/span&gt;n2&lt;span style="color: #339933;"&gt;,&lt;/span&gt; m2&lt;span style="color: #339933;"&gt;,&lt;/span&gt;&lt;a href="http://inside-r.org/r-doc/stats/sd"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;sd&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;)&lt;/span&gt;&lt;span style="color: #009900;"&gt;)&lt;/span&gt;$p.val&lt;span style="color: #339933;"&gt;,&lt;/span&gt; simplify=T&lt;span style="color: #009900;"&gt;)&lt;/span&gt;
 &lt;a href="http://inside-r.org/r-doc/graphics/hist"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;hist&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;(&lt;/span&gt;p.data&lt;span style="color: #339933;"&gt;,&lt;/span&gt; xlim=&lt;a href="http://inside-r.org/r-doc/base/c"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;c&lt;/span&gt;&lt;/a&gt;&lt;span style="color: #009900;"&gt;(&lt;/span&gt;&lt;span style="color: #cc66cc;"&gt;0&lt;/span&gt;&lt;span style="color: #339933;"&gt;,&lt;/span&gt;&lt;span style="color: #cc66cc;"&gt;0.1&lt;/span&gt;&lt;span style="color: #009900;"&gt;)&lt;/span&gt;&lt;span style="color: #339933;"&gt;,&lt;/span&gt; breaks = &lt;span style="color: #cc66cc;"&gt;99&lt;/span&gt;&lt;span style="color: #339933;"&gt;,&lt;/span&gt; &lt;a href="http://inside-r.org/r-doc/base/col"&gt;&lt;span style="color: #003399; font-weight: bold;"&gt;col&lt;/span&gt;&lt;/a&gt; = &lt;span style="color: blue;"&gt;'gray'&lt;/span&gt;&lt;span style="color: #009900;"&gt;)&lt;/span&gt;
 &lt;span style="color: #009900;"&gt;}&lt;/span&gt;&lt;/pre&gt;
&lt;pre class="r geshifilter-R" style="font-family: monospace;"&gt;&lt;span style="color: #009900;"&gt;
&lt;/span&gt;&lt;/pre&gt;
&lt;pre class="r geshifilter-R" style="font-family: monospace;"&gt;&lt;span style="color: #009900;"&gt;
&lt;/span&gt;&lt;/pre&gt;
&lt;/div&gt;
&lt;/div&gt;
&lt;a href="http://www.inside-r.org/pretty-r" title="Created by Pretty R at inside-R.org"&gt;R code html script courtesy of Pretty R at inside-R.org&lt;/a&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif;"&gt;References&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;div style="font: 12.0px Times; margin: 0.0px 0.0px 12.0px 32.0px; text-indent: -32.0px;"&gt;
&lt;span class="Apple-style-span" style="font-family: 'Trebuchet MS', sans-serif; font-size: small;"&gt;Gillett, R. (1994). Post Hoc Power Analysis. &lt;i&gt;Journal of Applied Psychology&lt;/i&gt;, &lt;i&gt;79&lt;/i&gt;, 783-785.&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-8018539215618229135?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/IpY5UTZ7QXdEggAFk73RTNfBlvI/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/IpY5UTZ7QXdEggAFk73RTNfBlvI/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/IpY5UTZ7QXdEggAFk73RTNfBlvI/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/IpY5UTZ7QXdEggAFk73RTNfBlvI/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/SCmHuuefXGQ" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/8018539215618229135/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2012/03/r-code-for-p-curves.html#comment-form" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/8018539215618229135?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/8018539215618229135?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/SCmHuuefXGQ/r-code-for-p-curves.html" title="R code for p curves" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>1</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2012/03/r-code-for-p-curves.html</feedburner:origLink></entry><entry gd:etag="W/&quot;AkYDSXs4fSp7ImA9WhVSEkk.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-1452054299113177074</id><published>2012-02-16T21:08:00.000Z</published><updated>2012-03-08T23:09:38.535Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-03-08T23:09:38.535Z</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="p curves" /><category scheme="http://www.blogger.com/atom/ns#" term="statistical significance" /><category scheme="http://www.blogger.com/atom/ns#" term="science" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>Simulating p curves and detecting dodgy stats</title><content type="html">&lt;a href="http://psych-your-mind.blogspot.com/2012/02/friday-fun-one-researchers-p-curve.html"&gt;Psych your mind&lt;/a&gt;&amp;nbsp;has an interesting blog post on using &lt;i&gt;p&lt;/i&gt; curves to detect dodgy stats in a a volume of published work (e.g., for a researcher or journal). The idea apparently comes from Uri Simonsohn (one of the authors of &lt;a href="http://people.psych.cornell.edu/~jec7/pcd%20pubs/simmonsetal11.pdf"&gt;a recent paper on dodgy stats&lt;/a&gt;). The author (Michael W. Kraus) bravely plotted and published his own &lt;i&gt;p&lt;/i&gt; curve - which looks reasonably 'healthy'. However, he makes an interesting point - which is that we don't know how useful these curves are in practice - which depends among other things on the variability inherent in the profile of &lt;i&gt;p&lt;/i&gt; values.&lt;br /&gt;
&lt;br /&gt;
I quickly threw together a simulation to address this in R. It is pretty limited (as I don't have much time right now), but potentially interesting. It simulates independent&amp;nbsp;&lt;i&gt;t&lt;/i&gt; test &lt;i&gt;p&lt;/i&gt; values where the samples are drawn from independent, normal distributions with equal variances but different means (and &lt;i&gt;n&lt;/i&gt; = 25 per group). The population standardized effect size is fixed at &lt;i&gt;d&lt;/i&gt; = 0.5 (as psychology research generally reports median effect sizes around this value). Fixing the parameters is unrealistic, but is perhaps OK for a quick simulation.&lt;br /&gt;
&lt;br /&gt;
I ran this several times and plotted &lt;i&gt;p&lt;/i&gt; curves (really just histograms with bins collecting &lt;i&gt;p&lt;/i&gt; values at relevant intervals). First I plotted for an early career researcher with just a few publications reporting 50 &lt;i&gt;p&lt;/i&gt; values. I then repeated for more experienced researchers with &lt;i&gt;n&lt;/i&gt; = 100 or &lt;i&gt;n&lt;/i&gt; = 500 published &lt;i&gt;p&lt;/i&gt; values.&lt;br /&gt;
&lt;br /&gt;
Here are the 15 random plots for 50 &lt;i&gt;p&lt;/i&gt; values:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-nQW3rpxcJtI/Tz1s-5gJ7OI/AAAAAAAAAFI/GqBvXuYg1mo/s1600/simulated+p+curves+n+=+50.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="640" src="http://4.bp.blogspot.com/-nQW3rpxcJtI/Tz1s-5gJ7OI/AAAAAAAAAFI/GqBvXuYg1mo/s640/simulated+p+curves+n+=+50.png" width="516" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
At least one of the plots has a suspicious spike between &lt;i&gt;p&lt;/i&gt;&amp;nbsp;= .04 and .05 (exactly where dodgy practices would tend to push the &lt;i&gt;p&lt;/i&gt; values).&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
What about 100 &lt;i&gt;p&lt;/i&gt; values?&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-JEDvDlF0PAk/Tz1ts_gooWI/AAAAAAAAAFQ/ovBW4OfVrQM/s1600/simulated+p+curves+n+=+100.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="640" src="http://4.bp.blogspot.com/-JEDvDlF0PAk/Tz1ts_gooWI/AAAAAAAAAFQ/ovBW4OfVrQM/s640/simulated+p+curves+n+=+100.png" width="517" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
Here the plots are still variable (but closer to the theoretical ideal plotted on Kraus' blog).&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
You can see this pattern even more clearly with 500 p values:&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://1.bp.blogspot.com/-HTxkr_0W-UI/Tz1uXhc4ZeI/AAAAAAAAAFg/m7KM3bZfoDU/s1600/simulated+p+curves+n+=+500.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="640" src="http://1.bp.blogspot.com/-HTxkr_0W-UI/Tz1uXhc4ZeI/AAAAAAAAAFg/m7KM3bZfoDU/s640/simulated+p+curves+n+=+500.png" width="516" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
Some quick conclusions ... The method is too unreliable for use with early career researchers. You need a few hundred &lt;i&gt;p &lt;/i&gt;values to be pretty confidence of a nice flat pattern between &lt;i&gt;p&lt;/i&gt; = .01 and &lt;i&gt;p&lt;/i&gt; = .06. Varying the effect size and other parameters might well inject further noise (as would adding in null effects which have a uniform distribution of &lt;i&gt;p&lt;/i&gt; values and are thus probably rather noisy).&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
I'm also skeptical that this is useful for detecting fraud (as presumably deliberate fraud will tend to go for 'impressive' &lt;i&gt;p&lt;/i&gt; values such as &lt;i&gt;p&lt;/i&gt; &amp;lt; .0001). Also (going forward) fraudsters will be able to generate results to circumvent tools such as &lt;i&gt;p&lt;/i&gt; curves (if they are known to be in use).&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: left;"&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div style="text-align: center;"&gt;
&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-1452054299113177074?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/gY3o_GUgMqiybUT-DxvHZIIf_jE/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/gY3o_GUgMqiybUT-DxvHZIIf_jE/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/gY3o_GUgMqiybUT-DxvHZIIf_jE/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/gY3o_GUgMqiybUT-DxvHZIIf_jE/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/c0cvfRGkVpo" height="1" width="1"/&gt;</content><link rel="related" href="http://psych-your-mind.blogspot.com/2012/02/friday-fun-one-researchers-p-curve.html" title="Simulating p curves and detecting dodgy stats" /><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/1452054299113177074/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2012/02/simulating-p-curves-and-detecting-dodgy.html#comment-form" title="14 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/1452054299113177074?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/1452054299113177074?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/c0cvfRGkVpo/simulating-p-curves-and-detecting-dodgy.html" title="Simulating p curves and detecting dodgy stats" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/-nQW3rpxcJtI/Tz1s-5gJ7OI/AAAAAAAAAFI/GqBvXuYg1mo/s72-c/simulated+p+curves+n+=+50.png" height="72" width="72" /><thr:total>14</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2012/02/simulating-p-curves-and-detecting-dodgy.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CU4BQH07fip7ImA9WhRaEks.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-5330311759010315757</id><published>2012-02-14T23:54:00.003Z</published><updated>2012-02-14T23:59:11.306Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-02-14T23:59:11.306Z</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="robust statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="nonparametric statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="confidence intervals" /><title>On nonparametric statistics ...</title><content type="html">&lt;span class="Apple-style-span" style="font-family: Georgia, serif;"&gt;I'm not a big fan of the term "nonparametric statistics", or at least how it is used in psychology and related fields (e.g., education and health research). This is one reason why I don't make a big deal of the parametric/non-parametric distinction in my&amp;nbsp;&lt;/span&gt;&lt;span class="Apple-style-span" style="color: #333333; line-height: 23px;"&gt;&lt;span class="Apple-style-span" style="font-family: Georgia, serif;"&gt;&amp;nbsp;&lt;a href="http://www.amazon.co.uk/gp/product/0230577180/ref=as_li_tf_tl?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0230577180" style="border-bottom-width: 0px; border-color: initial; border-left-width: 0px; border-right-width: 0px; border-style: initial; border-top-width: 0px; color: #df0000; font-style: inherit; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px; outline-color: initial; outline-style: initial; outline-width: 0px; padding-bottom: 0px; padding-left: 0px; padding-right: 0px; padding-top: 0px; vertical-align: baseline;"&gt;Serious stats&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&amp;nbsp;book and probably partly why a &lt;a href="http://www.psychologicalscience.org/index.php/publications/observer/2012/february-11-2012-observer-publications/why-nonparametric-statistics-should-be-part-of-your-toolkit.html"&gt;recent article in APS Observer&lt;/a&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt; annoyed me so much.&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;Why did it annoy me so much? It says, in essence, what many standard psychology text books say (and also makes several good points - which I shall largely ignore for rhetorical purposes). I like in particular the points about statistical tests used to check statistical assumptions lacking power and making statistical assumptions themselves (which are typically unchecked and they are not robust to). Indeed, that's another thing that annoys me and I have blogged/bored people about in the past (e.g., see &lt;a href="http://psychologicalstatistics.blogspot.com/2006/05/what-is-all-this-stuff-about.html"&gt;here&lt;/a&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;). I also think the article might have been chopped about a bit by editors (as it contains at least one important reference not cited and some of my gripes are also contained in references cited by the article I'm annoyed with). So, on balance, it is (by psychology mainstream standards) not at all a bad piece. However, by buying into the standard nonparametric stats presentation it perpetuates a few myths or errors and may inadvertently gives some pretty poor advice mixed in with the good.&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;Here is a quick list of my criticisms:&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt; &lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;/span&gt; &lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;(1) The definition of nonparametric statistics is deeply confused. The author starts by writing: "Nonparametric statistical analyses are used to investigate research questions in which the dependent variable is ranked or categorical rather than quantified in a true numeric sense". This seems to suggest that non-parametric procedures are defined by having a discrete or bounded DV. Later it adds that "Traditional parametric statistics require a number of assumptions about the characteristics (i.e., parameters) of the data." This is an appeal to the idea that parametric statistics assume a particular probability distribution (the parameters of which are estimated by the data). This seems like a better definition to me, although like many psychologists, the author appears to assume that the probabilty distribution assumed is always a normal distribution. Mixing the two aspects of the definition is confusing. It is easy to find statistical procedures that are parametric in the second sense, but involve ranked or categorical DVs. I would argue that a chi-square test of independence or a sign test is parametric by the second definition. Not only is the definition problematic, but it could lead to poor analytic decisions. For instance, dichotomous outcomes are often best analyzed using parametric techniques such as logistic regression (a generalized linear model with a logistic link function and a binomial random component) rather than the methods surveyed in the article.&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;(2) The article also reads as if nonparametric tests are the same thing as rank randomization tests. Rank&amp;nbsp;randomization&amp;nbsp;tests are examples of nonparametric tests in the sense they are distribution free (making no assumption about a particular probability distribution for the data). However, there are many other nonparametric methods that don't involve ranks. Rank&amp;nbsp;randomization&amp;nbsp;tests are useful but limited in scope. In addition, if the raw data are in the form of ranks then a rank transformation is pretty pointless (you might as well jut run a regular test).&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;The main limitation of the rank approaches is that the rank transformation is irreversible; it destroys the link between the analysis and the raw scores. This is a serious problem if you care about the raw scores - perhaps because you want to test whether effects are non-additive or because you want to get an interval estimate or a effect sizes on the original scale. There are ways to help you do this with rank transformation procedures, but they are generally pretty fiddly. There are good reasons why people don't tend to use rank transformations in multiple regression, use them to test interaction effects or use them to construct confidence intervals.&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;Third, rank randomization (and other nonparametric) procedures &lt;i&gt;are&lt;/i&gt; bound by pesky assumptions! They tend to make weaker assumptions than parametric procedures, but all statistical procedures make assumptions. The assertion that the "main assumptions of nonparametric tests are that the dependent variable should be continuous and have independent random sampling, which means that nonparametric statistics do not require assumptions of homogeneity of variance and normality" is misleading. The precise assumptions vary from test to test and with the hypothesis being tested. As a rule, rank randomization and related rank transformation tests assume that samples with similar shapes of distribution are being compared. They can therefore be undermined by heterogeneity of variance or varying degrees of skew. If the distributions have very dissimilar shapes then the tests can sometimes behave very strangely (e.g., it is possible to get outcomes where A &amp;gt; B, B &amp;gt; C and yet C &amp;gt; A). Continuity is also not necessarily a requirement of the DV or of the underlying construct being measured (though it may be for some hypotheses).&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;(4) Many generalizations are made that simply don't hold up: "When the data violates the assumptions of a parametric test, nonparametric tests are again the more powerful analytic technique (Siegel, 1957)." This doesn't follow. Often this is true, but not always. There has been a lot of research on this topic since 1957 and parametric tests are not always inferior. This is particularly true if you don't restrict parametric techniques to &lt;i&gt;t&lt;/i&gt; tests, linear regression and ANOVA. Some nonparametric techniques are known to have very poor power. I also got irritated by the critique of log and similar transformations that states: "while these transformations can make variables more normally distributed, they can also diminish or alter experimental effects, which can reduce power." To the extent this is true, it is also true of the rank transformation (more so in some cases). Furthermore, the real problem is with arbitrary transformations. Log transformations - where appropriate - tend to aid interpretation of effects (e.g., by quantifying them as proportionate rather than additive effects). I also dislike the implication that "experimental effects" exist in a pristine form prior to transformation. This is simply not the case - how to quantify and scale measurement of an effect is a tricky business (e.g., memory researchers can use percentage correct, hits minus false alarms, &lt;i&gt;A&lt;/i&gt; prime &lt;i&gt;d&lt;/i&gt; prime etc.) and many effects come "pre-transformed" (e.g., measurement of loudness in decibels). Transformations (including the rank transformation) are useful tools that can increase power and aid interpretation if used carefully and appropriately.&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;(5) The use of parametric statistics to analyze Likert-style rating scales may be one of the "seven deadly sins of statistical analysis", but it is rarely a big problem in practice. Least squares methods are most messed up by heavy tailed distributions, severe skew or outliers. If anything, Likert-style rating scales tend not to have these problems (or to manifest them relatively mildly). Furthermore, where there are problems with Likert-style measures, rank randomization or transformation tests s are probably not the solution. A number of parametric procedures for ordinal outcomes exist - notably ordered logistic regression (though least squares methods such as &lt;i&gt;t&lt;/i&gt; tests, ANOVA or regression should work well when their assumptions are not badly violated).&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;(6) The variety of nonparametric tests referred to is slightly artificial. As a general rule there are advantages to sticking to standard 'parametric' tests such as the Welch-Satterthwaite &lt;i&gt;t&lt;/i&gt; test or one-way ANOVA rather than using named rank-transformation tests such as the Mann-Whitney U test. In some cases there may be advantages with specialized rank randomization tests where sample sizes are small (e.g., because software such as R implements exact versions). However, there are a few cases where the rank&amp;nbsp;randomization&amp;nbsp;tests are not robust (e.g., Mann-Whitney U test is not robust to heterogeneity of variance) or lack power (e.g., the Friedman test, Page's L test and most multiple comparison procedures available for ranks). Rank transforming the data and then running a &lt;i&gt;t&lt;/i&gt; test with Welch-Satterthwaite correction is superior to running the Mann-Whitney directly (Zimmerman &amp;amp; Zumbo, 1993a). For more on the low power of the Friedman test and better alternatives see &lt;a href="http://seriousstats.wordpress.com/2012/02/14/friedman/"&gt;here&lt;/a&gt;.&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;Rather than think in terms of nonparametric statistics, it is better to focus on checking assumptions (using graphical methods and simple descriptives) and checking our models against more robust procedures. If more robust methods show different results - the next step is to find out why (and definitely not report just the outcome you prefer). This should lead you to a superior model (using robust methods or perhaps a more appropriate parametric model). The consideration of robust methods is particularly important. This includes some rank transformation tests, but also includes robust regression, bootstrapping and other tools (e.g., see Wilcox &amp;amp; Keselman, 2003; 2004; Baguley, 2012).&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt; &lt;/span&gt; &lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;i&gt;References&lt;/i&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;                 &lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;div class="ReferenceStyle"&gt;&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="line-height: 23px;"&gt;Baguley, T. (2012, in press).&amp;nbsp;&lt;a data-mce-href="http://www.amazon.co.uk/gp/product/0230577180/ref=as_li_tf_tl?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0230577180" href="http://www.amazon.co.uk/gp/product/0230577180/ref=as_li_tf_tl?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0230577180" style="line-height: 1.7;"&gt;Serious stats: A guide to advanced statistics for the behavioral sciences&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="line-height: 23px;"&gt;. Basingstoke: Palgrave.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;Wilcox, R. R., &amp;amp; Keselman, H. J. (2003). Modern robust data analysis methods: Measures of central tendency. &lt;i&gt;Psychological Methods, 8&lt;/i&gt;&lt;span style="font-style: normal;"&gt;, 254-274.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="ReferenceStyle"&gt;&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span style="font-style: normal;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="ReferenceStyle"&gt;&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span style="font-style: normal;"&gt;                 &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="ReferenceStyle"&gt;&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;Wilcox, R. R.,&amp;amp;&amp;nbsp; Keselman, H. J. (2004). Robust regression methods: Achieving small standard errors when there is heteroscedasticity. &lt;i&gt;Understanding Statistics, 3&lt;/i&gt;, 349- 364.&lt;/span&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt; &lt;br /&gt;
&lt;div class="ReferenceStyle"&gt;&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;span lang="EN-US"&gt;Zimmerman, D. W., &amp;amp; Zumbo, B. D. (1993b). Rank transformations and the power of the Student &lt;i&gt;t&lt;/i&gt; test and Welch &lt;i&gt;t&lt;/i&gt;' test for non-normal populations with unequal variances. &lt;i&gt;Canadian Journal of Experimental Psychology, 47&lt;/i&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;, 523-539.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;b&gt;&lt;/b&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-5330311759010315757?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/Jn-_WoBSQYNr0gjilkUUXRILKWg/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/Jn-_WoBSQYNr0gjilkUUXRILKWg/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/Jn-_WoBSQYNr0gjilkUUXRILKWg/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/Jn-_WoBSQYNr0gjilkUUXRILKWg/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/FPNppjGSAMI" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/5330311759010315757/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2012/02/on-nonparametric-statistics.html#comment-form" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/5330311759010315757?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/5330311759010315757?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/FPNppjGSAMI/on-nonparametric-statistics.html" title="On nonparametric statistics ..." /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>1</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2012/02/on-nonparametric-statistics.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CEEGSX4-eSp7ImA9WhRbFkk.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-8574724576909358550</id><published>2012-02-05T17:42:00.002Z</published><updated>2012-02-07T19:23:48.051Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-02-07T19:23:48.051Z</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="blog" /><category scheme="http://www.blogger.com/atom/ns#" term="correlation" /><category scheme="http://www.blogger.com/atom/ns#" term="confidence intervals" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><category scheme="http://www.blogger.com/atom/ns#" term="comparing correlations" /><title>Comparing correlations update</title><content type="html">I have just published R code for &lt;a href="http://seriousstats.wordpress.com/2012/02/05/comparing-correlations/"&gt;calculating CIs for differences between correlations&lt;/a&gt; on the &lt;a href="http://www.amazon.co.uk/gp/product/0230577180/ref=as_li_tf_tl?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0230577180"&gt;Serious stats book&lt;/a&gt;&amp;nbsp;blog. This covers independent correlations (taken from chapter 6 of the book) and dependent correlations (new R code written as a supplement to chapter 6).&lt;br /&gt;
&lt;br /&gt;
UPDATE on the update ...&lt;br /&gt;
&lt;br /&gt;
I have also added an Excel spreadsheet that should match the R output (though the latter is probably more accurate and reliable).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-8574724576909358550?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/OAEOxFRpzjkKZCeg-ZtJ3CoYRZE/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/OAEOxFRpzjkKZCeg-ZtJ3CoYRZE/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/OAEOxFRpzjkKZCeg-ZtJ3CoYRZE/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/OAEOxFRpzjkKZCeg-ZtJ3CoYRZE/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/-YfmuH6qGJc" height="1" width="1"/&gt;</content><link rel="related" href="http://seriousstats.wordpress.com/2012/02/05/comparing-correlations/" title="Comparing correlations update" /><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/8574724576909358550/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2012/02/comparing-correlations-update.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/8574724576909358550?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/8574724576909358550?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/-YfmuH6qGJc/comparing-correlations-update.html" title="Comparing correlations update" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2012/02/comparing-correlations-update.html</feedburner:origLink></entry><entry gd:etag="W/&quot;AkUGSHc7cCp7ImA9WhVSEkk.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-525144403265746919</id><published>2012-02-02T16:50:00.001Z</published><updated>2012-03-08T23:10:29.908Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-03-08T23:10:29.908Z</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="blog" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="serious stats" /><category scheme="http://www.blogger.com/atom/ns#" term="science" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>Serious Stats book and blog update</title><content type="html">&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;This is a quick update to announce my new blog &lt;a href="http://seriousstats.wordpress.com/"&gt;Serious Stats&lt;/a&gt;. This is a companion to my forthcoming book of the same name:&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="line-height: 23px;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;Baguley, T. (2012, in press).&amp;nbsp;&lt;a data-mce-href="http://www.amazon.co.uk/gp/product/0230577180/ref=as_li_tf_tl?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0230577180" href="http://www.amazon.co.uk/gp/product/0230577180/ref=as_li_tf_tl?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0230577180" style="line-height: 1.7;"&gt;Serious stats: A guide to advanced statistics for the behavioral sciences&lt;/a&gt;. Basingstoke: Palgrave.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="line-height: 23px;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="line-height: 23px;"&gt;I think is better to separate book specific content out from my regular posts (though in some cases this will be a bit fuzzy). I will also try and post short updates here when something relevant gets published on the blog for the book.&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="line-height: 23px;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="color: #444444; line-height: 23px;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="color: #444444; font-family: Georgia, 'Bitstream Charter', serif; font-size: 14px; line-height: 23px;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-525144403265746919?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/HYfOGQuXbbIQZyTLI-Ynt4jf_FY/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/HYfOGQuXbbIQZyTLI-Ynt4jf_FY/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/HYfOGQuXbbIQZyTLI-Ynt4jf_FY/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/HYfOGQuXbbIQZyTLI-Ynt4jf_FY/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/1sjN9SPMz-4" height="1" width="1"/&gt;</content><link rel="related" href="http://seriousstats.wordpress.com/" title="Serious Stats book and blog update" /><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/525144403265746919/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2012/02/serious-stats-book-and-blog-update.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/525144403265746919?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/525144403265746919?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/1sjN9SPMz-4/serious-stats-book-and-blog-update.html" title="Serious Stats book and blog update" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2012/02/serious-stats-book-and-blog-update.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CE8BSX8_fyp7ImA9WhVTGUk.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-7472403449669426879</id><published>2012-02-01T20:38:00.009Z</published><updated>2012-03-05T10:20:58.147Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-03-05T10:20:58.147Z</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="correlation" /><category scheme="http://www.blogger.com/atom/ns#" term="statistical significance" /><category scheme="http://www.blogger.com/atom/ns#" term="interaction" /><category scheme="http://www.blogger.com/atom/ns#" term="confidence intervals" /><category scheme="http://www.blogger.com/atom/ns#" term="comparing correlations" /><title>More on "A problem of significance"</title><content type="html">&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;A longer version of my earlier post &lt;a href="http://psychologicalstatistics.blogspot.com/2011/09/problem-of-significance.html"&gt;A problem of significance&lt;/a&gt; just appeared in The Psychologist.&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="border-collapse: collapse; font-family: Times, 'Times New Roman', serif; line-height: 18px;"&gt;Baguley, T. (2012). &lt;a href="http://nottinghamtrent.academia.edu/ThomBaguley/Papers/1358154/Can_we_confident_in_our_statistics"&gt;Can we be confident in our statistics?&lt;/a&gt; The Psychologist. 25, 128-9.&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="border-collapse: collapse; font-family: Times, 'Times New Roman', serif; line-height: 18px;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="border-collapse: collapse; font-family: Times, 'Times New Roman', serif; line-height: 18px;"&gt;Shortly after publication I received an email asking about statistical analysis of differences in correlations. This is more tricky than you might think. I'm working on some R code to implement one of the better approaches and plan to blog on this shortly ...&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="border-collapse: collapse; font-family: Georgia, 'Times New Roman', serif; line-height: 18px;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="border-collapse: collapse; font-family: Georgia, 'Times New Roman', serif; line-height: 18px;"&gt;(See &lt;a href="http://psychologicalstatistics.blogspot.com/2012/02/comparing-correlations-update.html"&gt;update&lt;/a&gt; here.) &lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="border-collapse: collapse; color: #626262; font-family: 'Helvetica Neue', Arial, sans-serif; font-size: 11px; line-height: 18px;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-7472403449669426879?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/s92XuI8NSaFQmTrbButYaAXqJWk/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/s92XuI8NSaFQmTrbButYaAXqJWk/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/s92XuI8NSaFQmTrbButYaAXqJWk/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/s92XuI8NSaFQmTrbButYaAXqJWk/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/BA0MjtKV1Mo" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/7472403449669426879/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2012/02/more-on-problem-of-significance.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7472403449669426879?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7472403449669426879?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/BA0MjtKV1Mo/more-on-problem-of-significance.html" title="More on &quot;A problem of significance&quot;" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2012/02/more-on-problem-of-significance.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CEIGRX05fSp7ImA9WhVREUs.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-5477542586888203401</id><published>2011-10-04T15:43:00.006+01:00</published><updated>2012-03-19T13:08:44.325Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-03-19T13:08:44.325Z</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="Loftus and Masson" /><category scheme="http://www.blogger.com/atom/ns#" term="displaying means" /><category scheme="http://www.blogger.com/atom/ns#" term="Goldstein and Healy" /><category scheme="http://www.blogger.com/atom/ns#" term="sphericity" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="ANOVA" /><category scheme="http://www.blogger.com/atom/ns#" term="statistical significance" /><category scheme="http://www.blogger.com/atom/ns#" term="confidence intervals" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>Calculating and graphing within-subject confidence intervals for ANOVA</title><content type="html">&lt;span class="Apple-style-span" style="line-height: 24px;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;                     &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Psychologists are gradually coming round to the view that it is a good idea to present interval estimates alongside point estimates of statistics. The most common statistic reported in psychology research is almost certainly the mean (strictly the arithmetic mean). Presenting an interval estimate for the mean of a single sample is usually quite simple. This is usually done as 95% confidence interval about the mean – and most researchers in psychology are able to calculate this by hand or get their statistical software to calculate and graph it for them.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;  &lt;/span&gt;&lt;br /&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Extending this to more than one mean introduces an additional layer of complexity. This is because the difference between two means is a different quantity, and its CI (although related to those of the individual means) is different in width from the CIs of the individual means. This creates a problem when plotting the CI because a researcher might be interested in the CI for an individual mean, the CI for their difference (or both).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;The complexity increases further if the aim is to plot a set of means (e.g., from an ANOVA design). In this case, plotting all the possible differences (as is commonly done) obscures patterns in the individual means (e.g., linear or quadratic trends).&amp;nbsp; Last, but not least, if the means are not from independent samples, there are further difficulties. This happens in within-subjects or repeated measures designs.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;In these designs the variation around each mean is correlated with the variation around the other means. This correlation arises from individual differences. Statistical procedures such as ANOVA can capitalize on these individual differences to produce more sensitive statistical inferences (i.e., to increase statistical power or obtain narrower CIs). This is done by estimating the variation due to individual differences, and removing it from the error variance (the estimate of statistical noise in the data set).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;This is a problem for graphical presentation of means because the precision of individual means is influenced by individual differences, whereas the precision of differences between means is not (because the estimate of individual differences is common to repeated samples from the same people and thus can be removed). Further complications arise when the sphericity assumption of repeated measures ANOVA is violated.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Several solutions to these problems have been proposed in the literature. The best known of these in psychology is that of Loftus and Masson (1994). Another well-known solution is that of Goldstein and Healy (1995), extended to correlated samples by Afshartous and Preston (2010).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Despite a large literature on the problems of graphing a set of correlated means, many people avoid the problems altogether by not reporting (or graphing) CIs or report CIs that are misleading in some way. Researchers are often unaware of the problems or find the solutions hard to understand and implement.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;I recently reviewed the main approaches in the literature, describe how to obtain suitable intervals for individual means and differences between means and provide R code to calculate and plot the intervals.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;The main highlights are that:&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;i) for inferences about individual means the standard approach works fairly well for between-subject (independent measures) designs, but there is a case to use CIs from a multilevel model for within-subject (repeated measures) designs&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;ii) an approach proposed by Cousineau (2005) with a correction by Morey (2008) offers advantages over the Loftus and Masson (1994) approach for within-subject ANOVA designs. It simplifies the calculations and does not assume sphericity. The Loftus-Masson approach will however usually be superior when &lt;i style="mso-bidi-font-style: normal;"&gt;n&lt;/i&gt; is small.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;iii) if you are interested in differences between means then you should probably plot a version of the Cousineau-Morey (or Loftus-Masson) interval that is adjusted so that overlap of the CIs around two individual means corresponds to overlap of the CI for their difference. This can be done by incorporating a multiplier to the width of the individual CIs. This multiplier is equal to (2^0.5)/2.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;iv) if you are interested in both precision of individual means and their differences you can use a two-tiered error bar to display both quantities (Cleveland, 1985).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;v) the intervals (and graphical presentation of means) are useful for informal inference about a set of means. For formal inference it is better to set up precise hypotheses and test these via an a priori of contrast. This could be a traditional null hypothesis significance test, but other approaches are available. These include confidence intervals, Bayes factors, likelihood ratios and so forth (Baguley, in press; Dienes, 2008).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;br /&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: -38.6pt;"&gt;
&lt;span lang="EN-US" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: -38.6pt;"&gt;
&lt;span lang="EN-US" style="font-family: inherit;"&gt;The paper is available &lt;a href="http://nottinghamtrent.academia.edu/ThomBaguley/Papers/900446/Calculating_and_graphing_within-subject_confidence_intervals_for_ANOVA"&gt;here&lt;/a&gt;, the R code &lt;a href="https://docs.google.com/leaf?id=0B8YTvx5C8MjXNWRjMzhhMzQtZmE5YS00MWQ3LTk1NTAtMjc1MzNkOWU2MjYz&amp;amp;hl=en_GB"&gt;here&lt;/a&gt; and the data sets &lt;a href="https://docs.google.com/leaf?id=0B8YTvx5C8MjXMjc3ODY2YjgtZTAyMi00OTM3LTlkYTYtY2E0YzU4YTBjNGEy&amp;amp;hl=en_GB"&gt;here&lt;/a&gt;.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span lang="EN-US" style="font-family: inherit;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;
Update: &lt;a href="http://seriousstats.wordpress.com/2012/03/18/cis-for-anova/"&gt;R functions now available&lt;/a&gt; for the simpler between-subjects (independent measures) ANOVA&lt;br /&gt;
case (at the &lt;a href="http://www.amazon.co.uk/gp/product/0230577180/ref=as_li_tf_tl?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0230577180"&gt;Serious stats&lt;/a&gt; blog).&lt;br /&gt;
&lt;span lang="EN-US" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: -38.6pt;"&gt;
&lt;b style="mso-bidi-font-weight: normal;"&gt;&lt;i style="mso-bidi-font-style: normal;"&gt;&lt;span lang="EN-US" style="font-family: inherit;"&gt;References&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;
&lt;b style="mso-bidi-font-weight: normal;"&gt;&lt;i style="mso-bidi-font-style: normal;"&gt;&lt;span lang="EN-US" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span lang="EN-US" style="font-family: inherit;"&gt;Afshartous D., &amp;amp; Preston R. A. (2010). Confidence intervals for dependent data: equating nonoverlap with statistical significance. &lt;i&gt;Computational Statistics and Data&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;Analysis. 54&lt;/span&gt;&lt;/i&gt;&lt;span lang="EN-US"&gt;, 2296-2305.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;a href="http://www.blogger.com/goog_536825781"&gt;Baguley, T. (2011, in press).&amp;nbsp;&lt;/a&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;a href="http://nottinghamtrent.academia.edu/ThomBaguley/Papers/900446/Calculating_and_graphing_within-subject_confidence_intervals_for_ANOVA"&gt;Calculating and graphing within-subject confidence intervals for ANOVA. Behavior Research Methods.&amp;nbsp;DOI: 10.3758/s13428-011-0123-7&lt;/a&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt; Baguley, T. (2012, in press).&amp;nbsp;&lt;img alt="" border="0" height="1" src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;amp;l=as2&amp;amp;o=2&amp;amp;a=0230577180" style="border: none !important; margin: 0px !important;" width="1" /&gt;&lt;a href="http://www.amazon.co.uk/gp/product/0230577180/ref=as_li_tf_tl?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0230577180"&gt;Serious Stats: A guide to advanced statistics for the behavioral sciences&lt;/a&gt;. Basingstoke: Palgrave.&lt;/span&gt;&lt;br /&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;/div&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Cleveland. W. S. (1985).&amp;nbsp;&lt;/span&gt;&lt;a href="http://www.amazon.co.uk/gp/product/0412742705/ref=as_li_tf_tl?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0412742705"&gt;Elements Of Graphing Data&lt;/a&gt;&lt;span lang="EN-US"&gt;. New York, NY: Chapman &amp;amp; Hall.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span lang="EN-US" style="font-family: inherit;"&gt;Cousineau, D. (2005). Confidence intervals in within-subject designs: A simpler solution to Loftus and Masson’s method. &lt;i&gt;Tutorials in Quantitative Methods for Psychology,&lt;/i&gt; &lt;i&gt;1&lt;/i&gt;, 42-45.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Dienes, Z. (2008).&amp;nbsp;&lt;/span&gt;&lt;a href="http://www.amazon.co.uk/gp/product/023054231X/ref=as_li_tf_tl?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=023054231X"&gt;Understanding Psychology as a Science: An Introduction to Scientific and Statistical Inference&lt;/a&gt;&lt;img alt="" border="0" height="1" src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;amp;l=as2&amp;amp;o=2&amp;amp;a=023054231X" style="border-bottom-style: none !important; border-color: initial !important; border-left-style: none !important; border-right-style: none !important; border-top-style: none !important; border-width: initial !important; cursor: move; margin-bottom: 0px !important; margin-left: 0px !important; margin-right: 0px !important; margin-top: 0px !important;" width="1" /&gt;&lt;span lang="EN-US"&gt;. Basingstoke: Palgrave Macmillan.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Goldstein, H., &amp;amp; Healy, M. J. R. (1995). &lt;/span&gt;&lt;span lang="EN-US"&gt;The graphical presentation of a collection of means. &lt;/span&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;Journal of the Royal Statistical Society. Series A (Statistics in Society), 158&lt;/span&gt;&lt;/i&gt;&lt;span lang="EN-US"&gt;, 175-177.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span lang="EN-US" style="font-family: inherit;"&gt;Loftus, G. R., &amp;amp; Masson, M. E. J. (1994). Using confidence intervals in within-subject designs. &lt;i&gt;Psychonomic Bulletin &amp;amp; Review&lt;/i&gt;, &lt;i&gt;1&lt;/i&gt;, 476-490.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;
&lt;div class="Default"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Morey, R. D. (2008). &lt;/span&gt;&lt;span lang="EN-US"&gt;Confidence intervals from normalized data: A correction to Cousineau (2005). &lt;/span&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;Tutorials in Quantitative Methods for Psychology, 4&lt;/span&gt;&lt;/i&gt;&lt;span lang="EN-US"&gt;, 61-64.&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;
&lt;span class="Apple-style-span" style="font-family: Calibri;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-5477542586888203401?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/-LDsewU80NCl5VC8p8jacx1Wm5g/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/-LDsewU80NCl5VC8p8jacx1Wm5g/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/-LDsewU80NCl5VC8p8jacx1Wm5g/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/-LDsewU80NCl5VC8p8jacx1Wm5g/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/4y9e8wjv1z4" height="1" width="1"/&gt;</content><link rel="related" href="http://nottinghamtrent.academia.edu/ThomBaguley/Papers/900446/Calculating_and_graphing_within-subject_confidence_intervals_for_ANOVA" title="Calculating and graphing within-subject confidence intervals for ANOVA" /><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/5477542586888203401/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2011/10/calculating-and-graphing-within-subject.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/5477542586888203401?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/5477542586888203401?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/4y9e8wjv1z4/calculating-and-graphing-within-subject.html" title="Calculating and graphing within-subject confidence intervals for ANOVA" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2011/10/calculating-and-graphing-within-subject.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DkYFQ38_cCp7ImA9WhRbFEs.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-2280818886067466881</id><published>2011-09-15T23:27:00.006+01:00</published><updated>2012-02-05T17:48:32.148Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-02-05T17:48:32.148Z</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="ANOVA" /><category scheme="http://www.blogger.com/atom/ns#" term="statistical significance" /><category scheme="http://www.blogger.com/atom/ns#" term="interaction" /><title>A problem of significance</title><content type="html">Several people have drawn my attention to a recent article on a common error in published statistical analyses in neuroscience. Sander Nieuwenhuis, Birte Forstmann and Eric-Jan Wagenmakers published (in Nature Neuroscience) a critique of statistical analyses in the neuroscience literature. This paper has been written about by &lt;a href="http://www.guardian.co.uk/commentisfree/2011/sep/09/bad-science-research-error"&gt;Ben Goldacre&lt;/a&gt; and &lt;a href="http://andrewgelman.com/2011/09/the-difference-between-significant-and-not-significant/"&gt;Andrew Gelman&lt;/a&gt; (who published an article on the general problem some time ago) - so I won't go into too much detail.&lt;br /&gt;
&lt;br /&gt;
The point of interest for me is that the error concerns something that most psychologists should know all about (and hence should be expected not to make the error). It concerns the case of two differences, one statistically significant and one non-significant. For example, group 1 may show a significant difference between experimental condition and placebo (for a drug intervention), while group 2 do not. A naive interpretation is that the drug works for group 1 but not group 2. This is not necessarily true. The proper test of a difference in effects of the drug between groups is an interaction test. Psychologists tend to avoid this error because we have heavily trained in ANOVA as undergraduates (certainly in the UK and probably also in the US and most of Europe). Even if we fail to learn this, &amp;nbsp;reviewers and editors (in psychology) tend to spot the error.*&lt;br /&gt;
&lt;br /&gt;
Are psychologists then entitled to feel a little bit smug? Perhaps, but only a little. First, I think the reason we are relatively good performers on this point is because we tend to view many statistical analyses through an "ANOVA" lens. Factorial ANOVA (in which factors are orthogonal) includes the interaction term by default. The 2 by 2 factorial ANOVA is the workhorse of experimental psychology. Our familiarity with this type of design and analysis makes this easy to spot. Second, our ANOVA lens leads to other errors - notably dichotomizing continuous variables (e.g., via median split) in order to squeeze them into an ANOVA design. This always decreases statistical power, and can - albeit infrequently - produce spuriously significant effects (see MacCallum et al., 2002). These errors are sometimes less serious than the difference of differences/interaction error (but are not harmless).&lt;br /&gt;
&lt;br /&gt;
The real test then, is whether psychologists make the same (conceptual) error in a different context. The obvious context is that of association rather than difference. If males show a significant correlation between testosterone and aggression (e.g., &lt;i&gt;r&lt;/i&gt; = .5, &lt;i&gt;N&lt;/i&gt; = 25) and females don't (e.g.,&amp;nbsp;&lt;i&gt;r&lt;/i&gt; = .3, &lt;i&gt;N&lt;/i&gt; = 25), the correlation between testosterone and aggression is not significantly bigger for males than females. To confirm this you'd need to construct a test or (better still) confidence interval for the difference in correlations. This is hardly ever done - and, in my experience, psychologists frequently make this kind of claim without backing it up.** Methods for testing differences in correlations are a bit fiddly (e.g., depending on overlap or lack of overlap in the measurements), and rarely taught at undergraduate or even postgraduate level. The methods that are taught are also often a bit dodgy (see Zou, 2007, for some better alternatives).&lt;br /&gt;
&lt;br /&gt;
Also note that (in both cases) the error can work the other way. Two correlations could be non-significantly different from zero but different from each other (e.g., &lt;i&gt;r&lt;/i&gt; = .5 and &lt;i&gt;r&lt;/i&gt; = -.5 with &lt;i&gt;N&lt;/i&gt; = 10).&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;Postscript&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
There is, I think, a lesson or two here. A minor lesson is that interactions are bit more complicated than psychologists (particularly those very familiar with ANOVA) often think. I could write more on this (and do a bit in my forthcoming book). A major lesson is that this concept (the difference between significant and non-significant is not necessarily also statistically significant - see Gelman &amp;amp; Stern, 2006) is probably quite tricky. It may be worth exploring why ... I suspect it is because of several factors.&lt;br /&gt;
&lt;br /&gt;
See updates &lt;a href="http://psychologicalstatistics.blogspot.com/2012/02/more-on-problem-of-significance.html"&gt;here&lt;/a&gt; and &lt;a href="http://psychologicalstatistics.blogspot.com/2012/02/comparing-correlations-update.html"&gt;here&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;References&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Gelman, A., &amp;amp; Stern, H. (2006). The difference between “significant” and “not significant” is not itself statistically significant. &lt;i&gt;American Statistician, 60&lt;/i&gt;, 328–331.&lt;br /&gt;
&lt;br /&gt;
MacCallum, R. C., Zhang, S., Preacher, K. J., &amp;amp; Rucker, D. D. (2002). On the practice of dichotomization of quantitative variables. &lt;i&gt;Psychological Methods, 7&lt;/i&gt;, 19-40.&lt;br /&gt;
&lt;br /&gt;
Nieuwenhuis, S., Forstmann, B. U., &amp;amp; Wagenmakers, E.-J. (2011). Erroneous analyses of interactions in neuroscience: A problem of significance. &lt;i&gt;Nature Neuroscience, 14&lt;/i&gt;, 1105-1107.&lt;br /&gt;
&lt;br /&gt;
Zou, G. Y. (2007). Toward using confidence intervals to compare correlations. &lt;i&gt;Psychological Methods, 12,&lt;/i&gt; 399-413.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;*&amp;nbsp;&lt;/span&gt;&amp;nbsp;Do I have any support for this position? Yes: anecodotal support (e.g., from editing or reviewing many dozens of papers) and some support from&amp;nbsp;Nieuwenhuis et al. They found the error more prevalent in cellular and molecular neuroscience. ANOVA is core training in psychology and widely used in cognitive and behavioural neuroscience - and I'd argue that this reflects the influence of psychologists working in this area and of neuroscientists trained in and using similar methods.&lt;br /&gt;
&lt;div class="ReferenceStyle"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="ReferenceStyle"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;br /&gt;
**&amp;nbsp;Do I have any support for this position?&amp;nbsp;A little. It is easy to find basic psychology texts with ANOVA but without tests of differences in correlations being mentioned. It is rare to find tests of CIs of differences in published papers.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-2280818886067466881?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/gRPin-VjdXs_OFXExZcZmzdzdAg/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/gRPin-VjdXs_OFXExZcZmzdzdAg/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/gRPin-VjdXs_OFXExZcZmzdzdAg/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/gRPin-VjdXs_OFXExZcZmzdzdAg/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/m4FtucwOKj4" height="1" width="1"/&gt;</content><link rel="related" href="http://www.ejwagenmakers.com/2011/NieuwenhuisEtAl2011.pdf" title="A problem of significance" /><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/2280818886067466881/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2011/09/problem-of-significance.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/2280818886067466881?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/2280818886067466881?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/m4FtucwOKj4/problem-of-significance.html" title="A problem of significance" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2011/09/problem-of-significance.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0QCQHs5fip7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-3228682685795043970</id><published>2011-01-13T13:50:00.000Z</published><updated>2011-10-04T16:16:01.526+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:16:01.526+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>R: An introduction for psychologists</title><content type="html">&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;Here are the slides from the Introduction to R session Danny Kaye and I ran at the BPS Mathematics, Statistics &amp;amp; Computing section CPS Workshop (13 December 2010, Nottingham Trent University).&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;a href="https://docs.google.com/present/edit?id=0AcYTvx5C8MjXZHA4Nmg2dF8xMDJnY2c2eDdoYw&amp;amp;hl=en_GB&amp;amp;authkey=CMHxy54B"&gt;&lt;/a&gt;&lt;br /&gt;
&lt;iframe frameborder="0" height="451" src="https://docs.google.com/present/embed?id=dp86h6t_102gcg6x7hc&amp;amp;size=m" width="555"&gt;&lt;/iframe&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-3228682685795043970?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/eWJXVyqBzHP71er4E-Da_jXWdsU/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/eWJXVyqBzHP71er4E-Da_jXWdsU/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/eWJXVyqBzHP71er4E-Da_jXWdsU/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/eWJXVyqBzHP71er4E-Da_jXWdsU/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/qr7EtW4YEeE" height="1" width="1"/&gt;</content><link rel="related" href="https://docs.google.com/present/edit?id=0AcYTvx5C8MjXZHA4Nmg2dF8xMDJnY2c2eDdoYw&amp;hl=en_GB&amp;authkey=CMHxy54B" title="R: An introduction for psychologists" /><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/3228682685795043970/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2011/01/r-introduction-for-psychologists.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/3228682685795043970?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/3228682685795043970?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/qr7EtW4YEeE/r-introduction-for-psychologists.html" title="R: An introduction for psychologists" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2011/01/r-introduction-for-psychologists.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0QCSXk8eCp7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-1960747264016613408</id><published>2010-05-13T16:21:00.000+01:00</published><updated>2011-10-04T16:16:08.770+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:16:08.770+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>Introduction to using R in research</title><content type="html">I was recently asked to give a talk to our graduate school annual conference. I offered several titles and the one they picked was &lt;i&gt;Using R in research&lt;/i&gt;. I'm not sure if this was a good idea or not. The graduate school covers PhD students across three areas of the university: social sciences (including psychology), law and business. In addition the students tend to specialize in either qualitative or quantitive research methods, so I was talking to an audience who might know nothing about statistics or a great deal (e.g., several students have completed MSc courses in psychological research methods here or elsewhere).&lt;br /&gt;
&lt;br /&gt;
&lt;iframe src="http://docs.google.com/present/embed?id=dp86h6t_85c23cmxsp" frameborder="0" width="410" height="342"&gt;&lt;/iframe&gt;&lt;br /&gt;
&lt;br /&gt;
My solution was try and explain the advantages of R relative to alternatives such as SPSS (probably the most common statistic package in the University). I also focussed a lot on graphical methods and simulation. It seemed to go quite well, but I worry that quite a few members of the audience were overwhelmed by large chunks of it.&lt;br /&gt;
&lt;br /&gt;
I promised to put my slides on my blog - though I am not sure how useful they are to anybody who wasn't there. Without my commentary some (possibly most) of the slides won't make much sense. I spent a good deal of the time talking through exploratory plots of one data set (from Hayden, 2005). I use this example a lot in teaching and it involves a bit of class participating (guessing the origin of the data) - so I won't go into to detail here (lest I spoil it for future students), but you can google the original article if you are curious. I also spent some time on how R works (e.g., object types, assignment, basic modeling, plotting functions). My reasoning was that many of the audience have no familiarity with non-GUI interfaces in software and without explaining the basics of the interface they will not have the faintest clue how R works. For those with some familiarity (e.g., SPSS syntax) the examples were selected to show how powerful R can be for things like exploratory graphics.&lt;br /&gt;
&lt;br /&gt;
Several students ask about resources for learning R. I mentioned some in earlier blog posts, but for psychologists &lt;a href="http://www.psych.upenn.edu/~baron/rpsych/rpsych.html"&gt;Li and Baron's web resources&lt;/a&gt; are a good place to start. The other major resource is probably &lt;a href="http://www.statmethods.net/"&gt;Quick R&lt;/a&gt;, but there are hundreds of other places to look online (depending on what stuff you need most).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-1960747264016613408?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/JRhnInx5Zv29X_wO3Q6fKedgqCw/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/JRhnInx5Zv29X_wO3Q6fKedgqCw/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/JRhnInx5Zv29X_wO3Q6fKedgqCw/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/JRhnInx5Zv29X_wO3Q6fKedgqCw/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/BaAmi8ZfMqQ" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/1960747264016613408/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2010/05/introduction-to-using-r-in-research.html#comment-form" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/1960747264016613408?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/1960747264016613408?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/BaAmi8ZfMqQ/introduction-to-using-r-in-research.html" title="Introduction to using R in research" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>1</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2010/05/introduction-to-using-r-in-research.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0UBRH07fSp7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-7227981003814024457</id><published>2010-03-03T20:14:00.018Z</published><updated>2011-10-04T16:14:15.305+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:14:15.305+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="memory" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>Example of plotting a serial position curve in R</title><content type="html">&lt;div class="separator" style="clear: both; text-align: left;"&gt;A while ago I wrote a co-wrote chapter for an introductory psychology text book&amp;nbsp;&lt;a href="http://www.amazon.co.uk/gp/product/1847875386?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=1847875386"&gt;Essential Psychology: A Concise Introduction&lt;/a&gt;.&amp;nbsp;This is a book edited and written by members of the department where I work. My contribution was the chapter on human memory (cunningly titled &lt;i&gt;Memory&lt;/i&gt;).&lt;/div&gt;&lt;br /&gt;
I produced several plots for the chapter (some of which got cut due to severe space restrictions). One that stayed in was a serial position curve. For this plot I used data from Postman and Phillips (1965).&lt;br /&gt;
&lt;div style="text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_3fIauBYLLhY/S46_ENrgdVI/AAAAAAAAADw/I7TsDMvy6VQ/s1600-h/pp65+plot.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="400" src="http://1.bp.blogspot.com/_3fIauBYLLhY/S46_ENrgdVI/AAAAAAAAADw/I7TsDMvy6VQ/s400/pp65+plot.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
I feel particular proud of this plot because I was just beginning to use and learn R at the time (as opposed to dabbling) and because I had had a really hard time getting hold of the data. I first tried google, but had no joy (for some reason I thought someone would have put the raw data online, as it is a classic study - though maybe I just missed it). Then I searched for alternative data sets &amp;nbsp;(as around that period there were quite a few similar studies). I was probably being too picky, but whatever the reason I had no luck.&lt;br /&gt;
&lt;br /&gt;
It would have been trivial to make up fake data, but that didn't feel right. What I eventually did (and wished I'd done straight away) was print out the original figure and measure all the points by hand. I then entered these values into a spreadsheet and tweaked and remeasured until all the summary statistics matched those in the original paper to about one decimal place. This was a lot quicker than I had thought. I cheated slightly because I only needed data from the 20 word conditions (so I could leave out the 10 and 30 word conditions).&lt;br /&gt;
&lt;br /&gt;
(I'm pretty sure I could have used computer software to capture the raw data from an image file, but I'd have had to find the software, learn how to use it and do all the checking anyway. For a single figure I'm reasonably sure measuring by hand would be faster.)&lt;br /&gt;
&lt;br /&gt;
In re-plotting it I noticed a few things that I hadn't paid much attention to before. The main one was the authors report frequency of recalls for 18 participants with 6 lists each. This means all scores are out of 108 and I suspect lots of casual readers would (like me) assume they were percentages. For re-plotting I rescaled the data as percentages.&lt;br /&gt;
&lt;br /&gt;
The plot itself just uses basic R functions. I'm writing about it because:&amp;nbsp;&amp;nbsp;i) I think it is a fairly clear illustration of how basic plot functions in R can produce what I think is a rather nice Figure. (The published version has been edited by the publisher, adding colour and making the style match figures in other chapters), ii) people may find it useful for teaching purposes. So please feel free to use and adapt the R code for non-commercial (e.g., teaching use).&lt;br /&gt;
&lt;br /&gt;
First load the data from this&amp;nbsp;&lt;a href="https://docs.google.com/leaf?id=0B8YTvx5C8MjXYzExMmUwNDctZGMwZC00MzA5LTgyOWMtYzg2MDRhMzAwMjRj&amp;amp;hl=en_GB"&gt;.csv file&lt;/a&gt;&amp;nbsp;(you will need to specify the path or change the working directory if the file is saved elsewhere).&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace; font-size: 14px;"&gt;pp65 &amp;lt;- read.csv("&lt;a href="https://docs.google.com/leaf?id=0B8YTvx5C8MjXYzExMmUwNDctZGMwZC00MzA5LTgyOWMtYzg2MDRhMzAwMjRj&amp;amp;hl=en_GB"&gt;pp65.csv&lt;/a&gt;")&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace; font-size: 14px;"&gt;&lt;span class="Apple-style-span" style="font-family: Courier, Courier, monospace;"&gt;Then paste the following:&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace; font-size: 14px;"&gt;&lt;span class="Apple-style-span" style="font-family: Courier, Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;plot(pp65$SP, pch=NA, ylim=c(0,80), xlab= "Serial position", ylab= "Mean percentage recall", main = "Postman &amp;amp; Phillips (1965)", sub = '(20 word conditions only)')&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;points(pp65$C0, pch=19, col='black', cex=.7)&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;lines(pp65$C0, lty=3)&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;points(pp65$C15, pch=24, col='black', cex=.7)&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;lines(pp65$C15, lty=2)&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;points(pp65$C30, pch=22, col='black', cex=.7)&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;lines(pp65$C30, lty=5)&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;legend(3, 80, legend=c("No delay","15 second delay","30 second delay"), lty=c(3,2,5)&lt;/span&gt;)&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;If you are new to R you can find out more about these plotting functions by using R help:&amp;nbsp;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;?par&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;,&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&amp;nbsp;?plot&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;,&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&amp;nbsp;?points&amp;nbsp;&lt;span class="Apple-style-span" style="font-family: Courier;"&gt;and so on ...&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Courier, Courier, monospace;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-family: Courier;"&gt;&lt;span class="Apple-style-span" style="font-family: Courier; font-style: italic;"&gt;References&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;br /&gt;
&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-family: Helvetica; font-size: 12px; font-style: normal;"&gt;Baguley, T., &amp;amp; Edmonds, A. J. (2010). Memory. In P. Banyard, M. N. O. Davies, C. Norman, &amp;amp; B. Winder (Eds.) &lt;a href="http://www.amazon.co.uk/gp/product/1847875386?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=1847875386"&gt;Essential Psychology: A Concise Introduction&lt;/a&gt;&lt;img alt="" border="0" height="1" src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;amp;l=as2&amp;amp;o=2&amp;amp;a=1847875386" style="border: none !important; margin: 0px !important;" width="1" /&gt;&amp;nbsp;(pp. 65-82). London: Sage.&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;i&gt;&lt;/i&gt;&lt;br /&gt;
&lt;i&gt;&lt;/i&gt;&lt;br /&gt;
&lt;i&gt;&lt;/i&gt;&lt;br /&gt;
&lt;i&gt;&lt;/i&gt;&lt;br /&gt;
&lt;i&gt;&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;&lt;em&gt;&lt;span style="font-family: Arial; font-style: normal;"&gt;Postman&lt;/span&gt;&lt;/em&gt;&lt;span style="font-family: Arial;"&gt;, L. &amp;amp; &lt;em&gt;&lt;span style="font-style: normal;"&gt;Philips&lt;/span&gt;&lt;/em&gt;, L. W. (&lt;em&gt;&lt;span style="font-style: normal;"&gt;1965&lt;/span&gt;&lt;/em&gt;). Short-term temporal changes in free recall. &lt;i&gt;Quarterly Journal of Experimental Psychology, 17&lt;/i&gt;&lt;/span&gt;&lt;span style="font-family: Arial;"&gt;, 132-138.&lt;/span&gt;&lt;/div&gt;&lt;/i&gt;&lt;br /&gt;
&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: center;"&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://www.amazon.co.uk/gp/product/1847875386?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=1847875386"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/_3fIauBYLLhY/S47DhQ6agBI/AAAAAAAAAD4/8sltf9y5r5Y/s320/51JNUB6mI3L._SL160_.jpg" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;img alt="" border="0" height="1" src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;amp;l=as2&amp;amp;o=2&amp;amp;a=1847875386" style="border: none !important; margin: 0px !important;" width="1" /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;i&gt;&lt;br /&gt;
&lt;/i&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-7227981003814024457?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/DdOM_Otyf327FyY_d3M0N09HVMw/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/DdOM_Otyf327FyY_d3M0N09HVMw/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/DdOM_Otyf327FyY_d3M0N09HVMw/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/DdOM_Otyf327FyY_d3M0N09HVMw/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/6pTxjoGZiMk" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/7227981003814024457/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2010/03/example-of-plotting-serial-position.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7227981003814024457?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7227981003814024457?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/6pTxjoGZiMk/example-of-plotting-serial-position.html" title="Example of plotting a serial position curve in R" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/_3fIauBYLLhY/S46_ENrgdVI/AAAAAAAAADw/I7TsDMvy6VQ/s72-c/pp65+plot.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2010/03/example-of-plotting-serial-position.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0UFQns5fip7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-6683967616011422003</id><published>2010-02-25T00:23:00.001Z</published><updated>2011-10-04T16:13:33.526+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:13:33.526+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="displaying means" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="ANOVA" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>Interaction plot from cell means</title><content type="html">I needed to produce a few a interaction plots for my book in R and, while the &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;interaction.plot(&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;)&lt;/span&gt; function is useful it has a couple of drawbacks. First, the default output isn't very pretty. Second, it works from the raw data, whereas I often need plots from cell means. For teaching purposes it is quite common to produce plots without raw data (for hypothetical data or from published examples).&lt;br /&gt;
&lt;br /&gt;
My first attempts at the plots involved setting them up element by element. Just going over some examples I decided to turn the basic plot (for a 2 x 2 ANOVA) into a simple function. Nothing fancy, just a regular interaction plot in black and white that I think is prettier than the SPSS, Excel or R defaults. At some point I may have a go turning it into a general I x J ANOVA plot (or maybe even add CIs, but I'll probably do that from raw data if I ever get round to it).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;plot.2by2 &amp;lt;- function(A1B1,A1B2, A2B1, A2B2, group.names, legend = TRUE, leg.loc=NULL, factor.labels=c('Factor A', 'Factor B'), swap = FALSE, ylab= NULL, main = NULL){&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;group.means &amp;lt;- c(A1B1, A2B1, A1B2, A2B2)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;if(missing(ylab)) ylab &amp;lt;- expression(italic(DV))&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;if(swap==TRUE) {&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;group.names &amp;lt;- list(group.names[[2]], group.names[[1]]) ; group.means &amp;lt;- c(A1B1, A1B2, A2B1, A2B2); factor.labels &amp;lt;- c(factor.labels[2], factor.labels[1])&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;}&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;plot(group.means, pch=NA, ylim=c(min(group.means)*.95, max(group.means)*1.025), xlim=c(0.8,2.2), ylab=ylab, xaxt='n', xlab=factor.labels[1], main=main)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;points(group.means[1:2], pch = 21)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;points(group.means[3:4], pch = 19)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;axis(side = 1, at = c(1:2), labels = group.names[[1]])&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;lines(group.means[1:2], lwd = .6, lty = 2)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;lines(group.means[3:4], lwd = .6)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;if(missing(leg.loc)) leg.loc &amp;lt;- c(1,max(group.means))&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;if(legend ==TRUE) legend(leg.loc[1], leg.loc[2],legend = group.names[[2]], &amp;nbsp;title = factor.labels[2], lty = c(3,1))&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;}&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Call the function by entering the four cell means in conventional order: A1B1, A1B2 and so on where A1B1 is the mean of level 1 of factor A at level 1 of factor B. You also need a two item list containing text strings of the two level names of each factor. For instance:&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;lev.names &amp;lt;- list(c('A1', 'A2'), c('B1', 'B2'))&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;plot.2by2(5,15,10,20, lev.names)&lt;/span&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_3fIauBYLLhY/S4W72RriWwI/AAAAAAAAADY/cxsvT33MVYU/s1600-h/tempA.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="400" src="http://3.bp.blogspot.com/_3fIauBYLLhY/S4W72RriWwI/AAAAAAAAADY/cxsvT33MVYU/s400/tempA.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;You can swap the axes by adding the argument &lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;swap = TRUE&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;:&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;plot.2by2(5,15,10,20, lev.names, swap = TRUE)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_3fIauBYLLhY/TNvkiC5AeXI/AAAAAAAAAEg/V9Zmu23zz2Y/s1600/swap.plot.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="400" src="http://3.bp.blogspot.com/_3fIauBYLLhY/TNvkiC5AeXI/AAAAAAAAAEg/V9Zmu23zz2Y/s400/swap.plot.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;/div&gt;The default factor names are 'Factor A' and 'Factor B', but these are over-ridden&amp;nbsp;in the call:&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;plot.2by2(5,15,10,20,lev.names, swap = TRUE, factor.labels= c('Factor 1','Factor 2'))&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;You can also change the &lt;/span&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;y-&lt;/span&gt;&lt;/i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;axis label with &lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;ylab&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt; or add a main title with &lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;main&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;. &amp;nbsp;The legend can be dropped (&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;legend = FALSE&lt;/span&gt;) if you don't want one or need it to be located outside the plot. To move the legend just specify coordinates with an argument such as &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;leg.loc = c(1,10&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;)&lt;/span&gt;. You can also edit the source code directly.&amp;nbsp;Here is an example with title and meaningful labels:&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;group.names &amp;lt;- list(c('placebo','drug'), c('male', 'female'))&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;div class="RcodeStyle"&gt;&lt;span lang="EN-US"&gt;  &lt;!--EndFragment--&gt;   &lt;/span&gt;&lt;/div&gt;&lt;div class="RcodeStyle"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;plot.2by2(10,10,15,20, group.names, factor.labels=c('Drug', 'Sex'), swap = FALSE)&lt;o:p&gt;&lt;/o:p&gt;&lt;br /&gt;
&lt;!--EndFragment--&gt;   &lt;/span&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_3fIauBYLLhY/S4W--6-oydI/AAAAAAAAADo/ZZ8YsWrZX2E/s1600-h/example.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="400" src="http://3.bp.blogspot.com/_3fIauBYLLhY/S4W--6-oydI/AAAAAAAAADo/ZZ8YsWrZX2E/s400/example.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;As this just uses basic plotting functions in R you can also manipulate the plot in other ways: adding lines with &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;segments()&lt;/span&gt;, adding text with &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;text()&lt;/span&gt; changing graphical parameters with &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;par(&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;)&lt;/span&gt; and so on.&amp;nbsp;Depending on your platform it is also easy to extract the plot as a .pdf or .jpg file. On a mac I save it as a .pdf file and open it in preview which allows me to save it as .png, .gif or whatever I need.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-6683967616011422003?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/0i5cSdcSNnUzshoV9eqYpodVBUc/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/0i5cSdcSNnUzshoV9eqYpodVBUc/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/0i5cSdcSNnUzshoV9eqYpodVBUc/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/0i5cSdcSNnUzshoV9eqYpodVBUc/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/ntE9PjYdk60" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/6683967616011422003/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2010/02/interaction-plot-from-cell-means.html#comment-form" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/6683967616011422003?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/6683967616011422003?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/ntE9PjYdk60/interaction-plot-from-cell-means.html" title="Interaction plot from cell means" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://3.bp.blogspot.com/_3fIauBYLLhY/S4W72RriWwI/AAAAAAAAADY/cxsvT33MVYU/s72-c/tempA.png" height="72" width="72" /><thr:total>2</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2010/02/interaction-plot-from-cell-means.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CU4NSHoycSp7ImA9WhVSE04.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-4284625626024500904</id><published>2010-01-29T16:48:00.035Z</published><updated>2012-03-09T22:59:59.499Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-03-09T22:59:59.499Z</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="effect size" /><title>When correlations go bad ... (Or, I always wanted to write about the Society for the Suppression of the Correlation Coefficient)</title><content type="html">&lt;div style="text-align: left;"&gt;
&lt;div style="text-align: left;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;A while ago I promised a longer post on standardized effect size. This isn't it. Instead it is a link to &lt;/span&gt;&lt;a href="http://www.thepsychologist.org.uk/archive/archive_home.cfm?volumeID=23&amp;amp;editionID=185&amp;amp;ArticleID=1633"&gt;&lt;span class="Apple-style-span" style="color: black;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;my piece in the February 2010 issue of &lt;/span&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;The Psychologist&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;/a&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;.&lt;/span&gt;&lt;/div&gt;
&lt;/div&gt;
&lt;div style="text-align: left;"&gt;
&lt;div style="text-align: left;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt; &lt;/div&gt;
&lt;/div&gt;
&lt;div style="text-align: left;"&gt;
&lt;div style="text-align: auto;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;I had intended to write a short summary of my 2009 BJP paper, but that didn't really get off the ground. However, I had for a while wanted to write about the &lt;a href="http://societytosupressthecorrelationcoefficient.wordpress.com/"&gt;Society for the Suppression of the Correlation Coefficient&lt;/a&gt;. I had first read about this in Tukey's (1954) chapter on regression and path analysis. This is one of the earlier papers in the literature criticizing the preference for (standardized) correlation coefficients over (simple, unstandardized) regression coefficients. Read the article for an earlier example! I was reminded of its existence by Brillinger's (2001) paper.&lt;/span&gt;&lt;/div&gt;
&lt;/div&gt;
&lt;div style="text-align: left;"&gt;
&lt;div style="text-align: left;"&gt;
&lt;span class="Apple-style-span" style="-webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;/div&gt;
As soon as I tried writing about the &lt;a href="http://societytosupressthecorrelationcoefficient.wordpress.com/"&gt;Society for the Suppression of the Correlation Coefficient&lt;/a&gt; things (I think) fell into place. The result is a piece that is one part history of statistics trivia, one part mini-tutorial and one part a summary of my 2009 paper.&lt;br /&gt;
&lt;br /&gt;
For non-members of the BPS the link below contains a pre-publication version of 'When correlations go bad ...'.&lt;br /&gt;
&lt;br /&gt;
&lt;div&gt;
I may also get around to the web site one day.&lt;br /&gt;
&lt;br /&gt;
P.S. I adapted data and R code from Gelman and Hill (2007) for the example. I chose it because it a nice simple example of regression and because it is also a pointer to someone who argues in favour of standardization )at least in some situations). Both my 2009 paper and the 'When correlations go bad ...' are my attempts at getting people to rethink the use of correlation and standardization. The tone is deliberately (slightly?) polemic. There are other views so don't just take my word for it ... think about what you are trying to do and decide whether a correlation coefficient (or a standardized mean d&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;ifference) is the right option.&lt;/span&gt;&lt;br /&gt;
&lt;div style="text-align: left;"&gt;
&lt;div style="text-align: left;"&gt;
&lt;span class="Apple-style-span" style="-webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;
&lt;/div&gt;
&lt;div style="text-align: left;"&gt;
&lt;div style="text-align: left;"&gt;
&lt;span class="Apple-style-span" style="-webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; white-space: pre;"&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;References&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;/div&gt;
&lt;/div&gt;
&lt;div style="text-align: left;"&gt;
&lt;div style="text-align: left;"&gt;
&lt;span class="Apple-style-span" style="-webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; white-space: pre;"&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt; &lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;/div&gt;
&lt;/div&gt;
&lt;div style="text-align: left;"&gt;
&lt;div style="text-align: left;"&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;a href="http://dx.doi.org/10.1348/000712608X377117"&gt;Baguley, T. (2009). Standardized or simple effect size: What should be reported? British Journal of Psychology. 100, 603–617.&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;a href="http://dx.doi.org/10.1348/000712608X377117"&gt;&lt;/a&gt;&lt;span lang="EN-US"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Baguley, T. (2010). &lt;a href="http://nottinghamtrent.academia.edu/ThomBaguley/Papers/138898/"&gt;When correlations go bad&lt;/a&gt; … &lt;/span&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;The Psychologist, 23&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;, 122-3.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;a href="http://nottinghamtrent.academia.edu/ThomBaguley/Papers/138898/"&gt;&lt;/a&gt;Brillinger, D.R. (2001). John Tukey and the correlation coefficient, Computing Science and Statistics, 33, 204–218.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Gelman, A. &amp;amp; Hill, J. (2007). &lt;/span&gt;&lt;a href="http://www.amazon.co.uk/gp/product/052168689X?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=052168689X"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Data Analysis Using Regression and Multilevel/Hierarchical Models (Analytical Methods for Social Research)&lt;/span&gt;&lt;/a&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;img alt="" border="0" height="1" src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;amp;l=as2&amp;amp;o=2&amp;amp;a=052168689X" style="border: none !important; margin: 0px !important;" width="1" /&gt;. Cambridge: Cambridge University Press.&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Tukey, J.W. (1954). Causation, regression and path analysis. In O. Kempthorne, T.A. Bancroft, J.W. Gowen &amp;amp; J.L. Lush (Eds.) Statistics and mathematics in biology (pp.35–66). Ames, IA: Iowa State College Press.&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;/div&gt;
&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-4284625626024500904?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/bPBMvI_zArk_pLyLPU-O8VtpFSA/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/bPBMvI_zArk_pLyLPU-O8VtpFSA/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/bPBMvI_zArk_pLyLPU-O8VtpFSA/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/bPBMvI_zArk_pLyLPU-O8VtpFSA/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/sRdNPDxFf6I" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/4284625626024500904/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2010/01/when-correlations-go-bad-or-i-always.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/4284625626024500904?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/4284625626024500904?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/sRdNPDxFf6I/when-correlations-go-bad-or-i-always.html" title="When correlations go bad ... (Or, I always wanted to write about the Society for the Suppression of the Correlation Coefficient)" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2010/01/when-correlations-go-bad-or-i-always.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0QHSXg-fip7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-1846690160708647315</id><published>2010-01-25T09:40:00.003Z</published><updated>2011-10-04T16:15:38.656+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:15:38.656+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="multilevel models" /><category scheme="http://www.blogger.com/atom/ns#" term="sphericity" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="philosophy" /><title>Strathclyde multilevel modeling talk</title><content type="html">I recently gave an introduction to &lt;a href="http://bps-msc.blogspot.com/2010/01/psypag-statistics-workshop-in.html"&gt;multilevel modeling talk at Strathclyde University&lt;/a&gt; for &lt;a href="http://www.psypag.co.uk/"&gt;PsyPAG&lt;/a&gt;. I promised to make my slides available. I'm still using powerpoint (and regretting it) so some of the symbols may be garbled. (I noticed that powerpoint turned all my tilda symbols to colons during the talk). So I hope it is readable:&lt;br /&gt;
&lt;br /&gt;
Here is the link:&lt;br /&gt;
&lt;br /&gt;
&lt;iframe src="http://docs.google.com/present/embed?id=dp86h6t_49dt38wwdx&amp;interval=15" frameborder="0" width="410" height="342"&gt;&lt;/iframe&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-1846690160708647315?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/IEvLurkuQ79b-gVb793k3ibxlR4/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/IEvLurkuQ79b-gVb793k3ibxlR4/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/IEvLurkuQ79b-gVb793k3ibxlR4/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/IEvLurkuQ79b-gVb793k3ibxlR4/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/wxpeyMHsvL4" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/1846690160708647315/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2010/01/strathclyde-multilevel-modeling-talk.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/1846690160708647315?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/1846690160708647315?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/wxpeyMHsvL4/strathclyde-multilevel-modeling-talk.html" title="Strathclyde multilevel modeling talk" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2010/01/strathclyde-multilevel-modeling-talk.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0UNQn49fip7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-6707489846455399931</id><published>2009-12-13T10:26:00.119Z</published><updated>2011-10-04T16:14:53.066+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:14:53.066+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="multilevel models" /><category scheme="http://www.blogger.com/atom/ns#" term="puzzle" /><title>A statistical puzzle about averages II</title><content type="html">&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;div style="font: normal normal normal 16px/normal 'Times New Roman'; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;span style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;1. Who is correct?&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &amp;nbsp;Professor B is correct. If the average family has 1.8 children then the average child would be expected to have more than 0.8 siblings.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;2. Why?&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &amp;nbsp;The average child is not from the average family. The concepts of average child and average family are different. For this reason there should no expectation that the average child should be from a family with an average number of children. Although there are restricted circumstances under which this can happen, they are sufficiently implausible to be discounted in any real world application (e.g., if all families have exactly the same number of children).&lt;br /&gt;
&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;  &lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/div&gt;&lt;div style="font: normal normal normal 16px/normal 'Times New Roman'; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;/div&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;In one case the unit of analysis of the family and in the other it is the child. A concrete example may help. (I'll stick to defining average as arithmetic mean throughout, but the same logic extends to other averages such as the median - see the quotation from Kish below).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Assuming that the number of children varies between families (which it must do if the mean number of children per family is 1.8) then the average child will be from a family with a larger number of children than average. For example, imagine there are only four families:&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;table cellpadding="0" cellspacing="0" style="border-collapse: collapse;"&gt;&lt;tbody&gt;
&lt;tr&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 63.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 147.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td colspan="4" style="padding: 0.0px 7.0px 0.0px 7.0px; width: 332.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Number of siblings per child&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;/tr&gt;
&lt;tr&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 63.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Family&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 147.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Number of Children&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 70.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;1&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font: normal normal normal 13px/normal 'Times New Roman';"&gt;&lt;i&gt;&lt;sup&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;st&lt;/span&gt;&lt;/span&gt;&lt;/sup&gt;&lt;/i&gt;&lt;/span&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; child&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 73.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;2&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font: normal normal normal 13px/normal 'Times New Roman';"&gt;&lt;i&gt;&lt;sup&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;nd&lt;/span&gt;&lt;/span&gt;&lt;/sup&gt;&lt;/i&gt;&lt;/span&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; child&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 75.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;3&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font: normal normal normal 13px/normal 'Times New Roman';"&gt;&lt;i&gt;&lt;sup&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;rd&lt;/span&gt;&lt;/span&gt;&lt;/sup&gt;&lt;/i&gt;&lt;/span&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; child&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 72.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;4&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font: normal normal normal 13px/normal 'Times New Roman';"&gt;&lt;i&gt;&lt;sup&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;th&lt;/span&gt;&lt;/span&gt;&lt;/sup&gt;&lt;/i&gt;&lt;/span&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; child&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;/tr&gt;
&lt;tr&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 63.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;b&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;A&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 147.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;1&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 70.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 73.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 75.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 72.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;/tr&gt;
&lt;tr&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 63.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;b&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;B&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 147.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;1&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 70.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 73.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 75.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 72.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;/tr&gt;
&lt;tr&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 63.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;b&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;C&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 147.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;2&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 70.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;1&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 73.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;1&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 75.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 72.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;/tr&gt;
&lt;tr&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 63.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;b&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;D&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 147.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;4&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 70.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;3&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 73.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;3&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 75.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;3&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 72.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;3&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;/tr&gt;
&lt;/tbody&gt; &lt;/table&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;There are (1+1+2+4) = 8 children in the four families, thus the mean number of children per family is: 8/4 = 2.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;It follows that the mean number of siblings per child is therefore:&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;(0+0+1+1+3+3+3+3)/8 = 14/8 = 1.75&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;So although each family has only two children (on average) each child has 1.75 siblings (not 1 sibling).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Note is that there are &lt;/span&gt;&lt;/span&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;N&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; = 4 families and &lt;/span&gt;&lt;/span&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;n&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; = 8 children. So switching the unit of analysis changes the denominator. Also note that while the families can plausibly be considered independent of each other the children can't (all children in the same family have the same number of siblings in this example, and more generally the number of siblings will be correlated).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;What about zeroes?&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&amp;nbsp;In the calculations above I excluded childless households as families. If you include only households without children as families the discrepancy would be larger.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Does it matte&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;r? &amp;nbsp;Yes. Much real world data is clustered in this way. It is important to realize that the average &lt;/span&gt;&lt;/span&gt;&lt;category member=""&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;category memory=""&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;is not going to be from the average &lt;/span&gt;&lt;/span&gt;&lt;category&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; (e.g., the average worker isn't from the average firm)&lt;/span&gt;&lt;/span&gt;&lt;category&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;.&lt;/span&gt;&lt;/span&gt;&lt;/category&gt;&lt;/category&gt;&lt;/category&gt;&lt;/category&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;category memory=""&gt;&lt;category&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;In practical terms this means that careful attention needs to be paid to sampling from families, workplaces, schools and so forth. A random sample of children will disproportionately sample children from large families. There are also social policy implications (e.g., if you are interested in reducing child poverty). Another example is that a&amp;nbsp;random sample of schools will disproportionately sample small schools.&lt;/span&gt;&lt;/span&gt;&lt;/category&gt;&lt;/category&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;category memory=""&gt;&lt;category&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;There are also other problems with analysis of clustered data. For this reason anyone working with clustered data needs to seriously consider using multilevel modeling or other methods that i) &amp;nbsp;take into the clustering and ii) allow hypotheses about different levels of the model (e.g., children and families) to be explored.&lt;/span&gt;&lt;/span&gt;&lt;/category&gt;&lt;/category&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Postscript&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;This is quite an old puzzle. I first came across this puzzle in the article by Jenkins and Tuten (1992). They include formulae for deriving one average from the other an&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;d cite&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Huntington (1924) and other later observations of the same phenomenon.&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;I made the connection to multilevel models a little later. For a good (if slightly out of date) introduction to multilevel models see Snijders and Bosker (1999). Recently I noticed that Kish (1965) discusses the same phenomenon in passing.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;This quote from Kish sets out the problem quite clearly&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;:&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;blockquote&gt;&lt;span style="font-family: 'Trebuchet MS', sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;Although the mean number of adults per household is only 2.02, the mean number of household members is 2.24 for the average adult. The greater size ranges of large organizations produce more striking effects. In 1960, 50 million people lived in 130 U. S. cities that had 100,000 or more population; in this population, the average city size was 0.39 million, but the size of the city in which the average person lived was 2.0 millions. Using medians does not help: the median city size was 0.19 million, but the median person lived in a city of 0.62 million.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/blockquote&gt;&lt;div style="text-align: right;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;Kish (1965, p. 571).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-style: italic;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;References&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;Jenkins, J. J., &amp;amp; Tuten, J. T. (1992). Why isn’t the average child from the average family? – and similar puzzles. &lt;/span&gt;&lt;/span&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;American Journal of Psychology, 105&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;, 517-526.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;br /&gt;
&lt;em style="font-style: normal;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;Kish&lt;/span&gt;&lt;/span&gt;&lt;/em&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;, L. (&lt;/span&gt;&lt;/span&gt;&lt;em style="font-style: normal;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;1965&lt;/span&gt;&lt;/span&gt;&lt;/em&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;). Sampling Organizations and Groups of Unequal Sizes,&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;American Sociological Review, 20&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;, 564-72&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;br /&gt;
&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;Sniders, T. &amp;amp; Boskers, R. (1999).&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;a href="http://www.amazon.co.uk/gp/product/0761958908?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0761958908"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;Multilevel Analysis: An Introduction to Basic and Advanced Multilevel Modeling.&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;London: Sage.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-6707489846455399931?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/dnd7GYzKBHgtr4TwMKE8bsKN-To/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/dnd7GYzKBHgtr4TwMKE8bsKN-To/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/dnd7GYzKBHgtr4TwMKE8bsKN-To/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/dnd7GYzKBHgtr4TwMKE8bsKN-To/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/HDkW-cL8cbc" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/6707489846455399931/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2009/12/answer-to-statistical-puzzle-about.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/6707489846455399931?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/6707489846455399931?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/HDkW-cL8cbc/answer-to-statistical-puzzle-about.html" title="A statistical puzzle about averages II" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2009/12/answer-to-statistical-puzzle-about.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0UNQn49fyp7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-4127323298397205135</id><published>2009-12-09T22:46:00.002Z</published><updated>2011-10-04T16:14:53.067+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:14:53.067+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="multilevel models" /><category scheme="http://www.blogger.com/atom/ns#" term="puzzle" /><title>A statistical puzzle about averages I</title><content type="html">&lt;span style="font-style: italic;"&gt;I wrote this a few years ago for a departmental newsletter. For some reason the second part (with the answer) never got published. I stumbled across it almost by accident the other day and thought I'd share it. I'll publish the canonical answer in due course.&lt;/span&gt;&lt;br /&gt;
&lt;div class="Section1"&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;Professor Quack knows (from the UK census) that the average family&amp;nbsp;has an arithmetic mean of 1.8 children. He also knows that (due to a bizarre&amp;nbsp;mix-up in enrolment) that his &lt;i&gt;Psychology For Everyone&lt;/i&gt;&lt;span style="font-style: normal;"&gt; class is attended by a random sample of 50 people from the UK population. As&amp;nbsp;part of an in-class demonstration of&amp;nbsp;sampling theory he records the number&amp;nbsp;of siblings of each student and calculates the average (using the standard formula for the arithmetic mean).&amp;nbsp;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span style="font-style: normal;"&gt;To his dismay he&amp;nbsp;discovers that the students in his class have an arithmetic mean of 1.2&amp;nbsp;siblings. He later repeats the demonstration with two more random samples of&amp;nbsp;the UK population (again with &lt;/span&gt;&lt;i&gt;n&lt;/i&gt;&lt;span style="font-style: normal;"&gt; = 50) and obtains values of 1.1 and 1.3 siblings.&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;Profesor Quack consults two of his colleagues: Professor A&amp;nbsp;and Professor B.&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;Professor A replies thus:&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;“You are right to be dismayed. Bias has somehow entered&amp;nbsp;either your sampling procedure or your calculation of the mean. The true mean&amp;nbsp;number of siblings should be 1.8 - 1 = 0.8.”&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;Professor B interjects thus:&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;“Nonsense! All is as it should be. The expected number of&amp;nbsp;siblings in a random sample is most certainly not 0.8. Rather, one would expect&amp;nbsp;the average student to have more than 0.8 siblings, just as you have observed.”&lt;br /&gt;
&lt;br /&gt;
&lt;/div&gt;&lt;div class="MsoNormal"&gt;1) Who is correct? &amp;nbsp;Professor A or Professor B?&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;2) Why?&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-4127323298397205135?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/KWutSEb1FRz3Ys3EbqKYD0F-ECY/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/KWutSEb1FRz3Ys3EbqKYD0F-ECY/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/KWutSEb1FRz3Ys3EbqKYD0F-ECY/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/KWutSEb1FRz3Ys3EbqKYD0F-ECY/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/EYvNLO4Mxxg" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/4127323298397205135/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2009/12/statistical-puzzle-about-averages-i.html#comment-form" title="5 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/4127323298397205135?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/4127323298397205135?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/EYvNLO4Mxxg/statistical-puzzle-about-averages-i.html" title="A statistical puzzle about averages I" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>5</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2009/12/statistical-puzzle-about-averages-i.html</feedburner:origLink></entry><entry gd:etag="W/&quot;A0AHQHYyfip7ImA9WhdUF04.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-2830032057768498548</id><published>2009-11-17T09:43:00.016Z</published><updated>2011-10-04T15:48:51.896+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T15:48:51.896+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="philosophy" /><category scheme="http://www.blogger.com/atom/ns#" term="Bayesian statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="likelihood" /><category scheme="http://www.blogger.com/atom/ns#" term="science" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><category scheme="http://www.blogger.com/atom/ns#" term="Bayes factors" /><title>R functions for Dienes (2008) Understanding Psychology as a Science</title><content type="html">&lt;div&gt;I recently wrote a review of Understanding psychology as a science: an introduction to scientific and statistical inference by &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_0"&gt;Zoltan&lt;/span&gt; &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_1"&gt;Dienes&lt;/span&gt; (2008). &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_2"&gt;Dienes&lt;/span&gt;' book covers &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_3"&gt;Neyman&lt;/span&gt;-Pearson null hypothesis significance testing, Bayesian inference and the likelihood method of inference (inspired by Fisher and associated with A. W. F. Edwards and more recently R. Royall).&lt;/div&gt;&lt;div&gt; &lt;/div&gt;&lt;br /&gt;&lt;div&gt;One of the most useful features of the book is that &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_4"&gt;Dienes&lt;/span&gt; provides &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_5"&gt;Matlab&lt;/span&gt; code for examples of calculations in the book (e.g., for Bayes factors, likelihood intervals and so forth). This is not so useful for me because I don't use &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_6"&gt;Matlab&lt;/span&gt;. &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_7"&gt;Matlab&lt;/span&gt; licenses are also quite expensive and may not be possible for students to access it in many Psychology departments. For those without access to &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_8"&gt;Matlab&lt;/span&gt;, &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_9"&gt;Dienes&lt;/span&gt; also provides calculators for a number of functions on his own &lt;a href="http://www.lifesci.sussex.ac.uk/home/Zoltan_Dienes/inference/index.htm"&gt;web page&lt;/a&gt; for the book. (The calculators are found by following the links to the appropriate chapter, so the Bayes factor calculator is found by following the Chapter 4 link).&lt;/div&gt;&lt;br /&gt;&lt;div&gt;Danny Kaye and I thought it would be useful to write R code to compliment the &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_10"&gt;Matlab&lt;/span&gt; code for &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_11"&gt;Dienes&lt;/span&gt;' functions as a 'bonus feature' for the review. As these functions and the notes for them take up quite a bit of space we decided to include only one, for a Bayes factor, in the review itself (with some notes on how to use it). Danny did most of the work writing functions, which are more-or-less direct translations of the original &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_12"&gt;Matlab&lt;/span&gt; code (and have been checked against the web versions). The &lt;a href="http://danny-kaye.co.uk/Docs/Dienes_functions.txt"&gt;full set of functions&lt;/a&gt; is hosted on his web site along with the &lt;a href="http://danny-kaye.co.uk/Docs/Dienes_notes.pdf"&gt;notes on how to use them&lt;/a&gt;. Also included are page references for the examples in the book.&lt;/div&gt;&lt;br /&gt;&lt;div&gt;Why did we write the R functions? First, they offer convenient access to the functions for teachers and students (because R is free and runs on Windows, Mac OS or Linux operating systems). Second, it reduces the burden on &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_13"&gt;Dienes&lt;/span&gt;' web calculator (at a marginal decrease in ease of use). Third, R is open source so it is simple to see how the code works and to edit, extend or adapt it (though it is polite to acknowledge the authors of the original code). Fourth, we want to encourage more people to start using R!&lt;/div&gt;&lt;br /&gt;&lt;div&gt;As an example, I've already written some alternative functions for likelihood intervals (though as I happened I re-wrote these almost from scratch to get them to plot the likelihood function and interval and to take advantage of some built-in R functions). Those functions are intended for a the book I'm working on and so should appear in due course.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;For those who are interested Danny and I are presently working on implementing Bayesian &lt;i&gt;t&lt;/i&gt; tests in R (Bayes factors with objective priors) in a user-friendly way for researchers.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;div&gt; &lt;/div&gt;&lt;i&gt;References:&lt;/i&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_14"&gt;Baguley&lt;/span&gt;, T., &amp;amp; Kaye, W.S. (in press, 2009). Review of Understanding psychology as a science: An introduction to scientific and statistical inference.&lt;span class="Apple-style-span" style="font-style: italic; "&gt; &lt;a href="http://www.bpsjournals.co.uk/journals/bjmsp/"&gt;British Journal of Mathematical &amp;amp; Statistical Psychology&lt;/a&gt;.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/div&gt;&lt;div&gt;&lt;!--StartFragment--&gt;  &lt;p class="ReferenceStyle"&gt;&lt;span lang="EN-US"&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_15"&gt;Dienes&lt;/span&gt;, Z. (2008). &lt;i&gt;&lt;span class="Apple-style-span" style="font-style: normal; "&gt;&lt;a href="http://www.amazon.co.uk/gp/product/023054231X?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=023054231X"&gt;Understanding Psychology as a Science: An Introduction to Scientific and Statistical Inference.&lt;/a&gt;&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_16"&gt;Basingstoke&lt;/span&gt;: &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_17"&gt;Palgrave&lt;/span&gt; Macmillan.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-2830032057768498548?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/RgPHFVJHqrRFjBWuiTBdXni4VLY/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/RgPHFVJHqrRFjBWuiTBdXni4VLY/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/RgPHFVJHqrRFjBWuiTBdXni4VLY/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/RgPHFVJHqrRFjBWuiTBdXni4VLY/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/PP4FFjhm1XE" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/2830032057768498548/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2009/11/r-functions-for-dienes-2008.html#comment-form" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/2830032057768498548?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/2830032057768498548?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/PP4FFjhm1XE/r-functions-for-dienes-2008.html" title="R functions for Dienes (2008) Understanding Psychology as a Science" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>2</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2009/11/r-functions-for-dienes-2008.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0QMR3o_fSp7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-7335983259524795784</id><published>2009-10-20T10:52:00.020+01:00</published><updated>2011-10-04T16:16:26.445+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:16:26.445+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="effect size" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><title>Chi-square test of independence and the odds ratio</title><content type="html">&lt;div&gt;Every time I teach an introductory statistics course I'm struck by how difficult it is to run a simple 2 by 2 chi-square test of independence. SPSS is one of the worst culprits (but even in R it almost seems more trouble than it is worth to use a computer). One solution is calculate it by hand (and that's what I do in my introductory classes). This leaves me with a practical problem - how can I quickly and easily calculate solutions to practical exercises, check student work or calculate chi-square statistics for my own work?&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div&gt;I've mostly assumed that my solution is so obvious that it is what most statistics teachers do. I just set up an Excel spreadsheet that calculates the chi-square statistic, degrees of freedom and &lt;em&gt;p&lt;/em&gt; value for a 2 by 2 table. Thus all I have to do is type in four numbers.&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div&gt;Over the years I've included a few other features - it displays the intermediate calculations (useful for checking student work), calculates standardized residuals and various common 'effect size' estimates. These are phi, phi-squared and the odds ratio. For instance, I also added the Haldane estimator of the odds ratio (which is the odds ratio estimated after adding 0.5 to each cell). This is a useful estimator when observed counts are zero or very small.&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div&gt;My favourite application of the spreadsheet is when editing or reviewing journal submissions reporting chi-square. Often they will not consider the odds ratio. The spreadsheet means I can check calculations for accuracy (if they look dubious) and I can easily include odds ratios in my review or decision (all in under a minute). The odds ratio is useful because it strips out the base rates when comparing effects from different conditions.&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div&gt;Using the absolute difference in rates for chi-squares with different base rates can be misleading. For instance, the difference in solution rates in an easy problem solving task might be 70% - 50% = 20%. For a very hard problem it might be 20% - 10% = 10%. Comparing the absolute differences in rates (often called the ARR or &lt;a href="http://en.wikipedia.org/wiki/Absolute_risk_reduction"&gt;absolute risk reduction&lt;/a&gt; in medical settings) would be misleading. The 10% difference is probably more impressive for the very hard problem (it represents a doubling or halving of solution rates). The odds ratio allows a comparison of the probability of solution relative to the probability of non-solution (i.e., the odds) and thus strips out the base rate impact (by scaling in terms of the probability of non-solution). The odds ratio also has other nice mathematical properties (which I won't go into here).&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div&gt;Not everyone loves odds ratios. Medics often dislike them ... because they strip out base rates! For interpreting and communicating medical risks the base rates are important. It matters whether a disease has a base rate of 1 in 100 or 1 in 10. So stripping out the base rate might be misleading in this context. Even so, medics seem unreasonably biased against odds ratios. If the base rates in your sample are dodgy (i.e., don't reflect the population you are interested in) the odds ratio is probably a safer bet. If you know the odds ratio and the correct base rates you can estimate the true risks by combining the two.&lt;br /&gt;
&lt;/div&gt;&lt;div&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div&gt;(This is one reason why I think that the quest for the perfect effect size statistic is flawed. Different statistics are required for different jobs. Odds ratios, however, get a (largely undeserved) bad press. Most psychologists will probably be better off using them than other measures for 2 by 2 tables, because we rarely have samples that accurately sample the 'true' base rate. This is because our samples are either unrepresentative or, even more importantly, there may be no 'true' base rate. The problem solving example illustrates this. A problem doesn't really have a 'true' difficulty level that we are trying to generalize to or make decisions about.)&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div&gt;If you want to use the spreadsheet you can download it from &lt;a href="http://spreadsheets.google.com/ccc?key=0AsYTvx5C8MjXdGxHWG9UV1Mza1phM2lqTUhyMmhvSFE&amp;amp;hl=en_GB"&gt;here&lt;/a&gt;. Alternatively, just set up your own (it is quite a useful exercise for understanding how chi-square, odds ratios and so on work). On the other hand my spreadsheet does a few of the slightly awkward calculations for you (e.g., CIs for the odds ratios). The first sheet has very basic instructions. The second sheet has the 2 by 2 table calculations (and effect sizes). The other sheets provide basic statistics for 2 by 3 and 3 by 3 tables. No effect size metrics are included for the latter cases because I think they are not very meaningful for effects with &lt;i&gt;df&lt;/i&gt; &amp;gt; 1 (and I've never needed them for real research).&lt;br /&gt;
&lt;/div&gt;&lt;div&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div&gt;Note: for best results (to preserve formatting etc.) download the document as an Excel file.&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-7335983259524795784?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/2QKEIfqTO7-H7QDmy4YS4TD-eqs/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/2QKEIfqTO7-H7QDmy4YS4TD-eqs/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/2QKEIfqTO7-H7QDmy4YS4TD-eqs/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/2QKEIfqTO7-H7QDmy4YS4TD-eqs/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/sYeRW6u6Hvg" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/7335983259524795784/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2009/10/chi-square-test-of-independence-and.html#comment-form" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7335983259524795784?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7335983259524795784?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/sYeRW6u6Hvg/chi-square-test-of-independence-and.html" title="Chi-square test of independence and the odds ratio" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>1</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2009/10/chi-square-test-of-independence-and.html</feedburner:origLink></entry><entry gd:etag="W/&quot;CEIFQHs-eyp7ImA9WhVUEkk.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-3600043174785579580</id><published>2009-03-12T19:13:00.005Z</published><updated>2012-05-17T09:15:11.553+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-05-17T09:15:11.553+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="effect size" /><category scheme="http://www.blogger.com/atom/ns#" term="moderated multiple regression" /><category scheme="http://www.blogger.com/atom/ns#" term="moderator effects" /><title>Don't standardize interaction/moderator effects in multiple regression</title><content type="html">&lt;div&gt;
This is another quick blog entry related to a query I had today.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
In my &lt;a href="http://psychologicalstatistics.blogspot.com/2007/10/introduction-to-moderated-multiple.html"&gt;moderated multiple regression workshop&lt;/a&gt; a while back I wrote (on slide 15) "don't use standardized regression coefficients". In the talk itself I briefly mentioned why and directed people to the relevant section of the (excellent) explanation by Kris Preacher:&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div style="text-align: center;"&gt;
&lt;span class="Apple-style-span" style="color: #0000ee; text-decoration: underline;"&gt;&lt;a href="http://quantpsy.org/interact/interactions.htm"&gt;A primer on interaction effects in multiple linear regression.&lt;/a&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;
&lt;span class="Apple-style-span" style="color: #0000ee; text-decoration: underline;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;
I'd like to clarify &lt;span class="Apple-style-span" style="font-style: italic;"&gt;why&lt;/span&gt; standardization is a &lt;span class="Apple-style-span" style="font-style: italic;"&gt;particularly&lt;/span&gt; bad idea in this case. My dislike of 'standardization' is fairly well known, and it goes without saying (I hope) that one reason not to use standardized regression coefficient relates to this. I hope to write about this in detail soon (but check out my &lt;a href="http://www.ingentaconnect.com/content/bpsoc/bjp/pre-prints/310805"&gt;BJP article on effect size&lt;/a&gt; for further details).&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
However, there are additional reasons why standardizing predictors will cause trouble in moderated &amp;nbsp;multiple regression. Standardization involves centering the predictors and scaling them terms of their sample SDs. Centering is very often a useful thing to do in moderated multiple regression. However, &amp;nbsp;statistics packages such as SPSS will standardize all the predictors - including the product terms - &amp;nbsp;in moderated multiple regression. This is because they have no way of knowing that the product term is not a 'regular' predictor (similarly if anyone were foolish enough to do a stepwise regression ... the software would not know to keep in &lt;span class="Apple-style-span" style="font-style: italic;"&gt;X1&lt;/span&gt; and &lt;span class="Apple-style-span" style="font-style: italic;"&gt;X2&lt;/span&gt; for each &lt;span class="Apple-style-span" style="font-style: italic;"&gt;X1.X2&lt;/span&gt; product term). This means that the &lt;span class="Apple-style-span" style="font-style: italic;"&gt;X1.X2&lt;/span&gt; product term will be standardized along with &lt;span class="Apple-style-span" style="font-style: italic;"&gt;X1&lt;/span&gt; and &lt;span class="Apple-style-span" style="font-style: italic;"&gt;X2&lt;/span&gt; rather than being computed (correctly) as the product of the two standardized predictors (i.e., &lt;span class="Apple-style-span" style="font-style: italic;"&gt;Zx1&lt;/span&gt; and &lt;span class="Apple-style-span" style="font-style: italic;"&gt;Zx2&lt;/span&gt; multiplied together).&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
That's clearly a problem (the &lt;span class="Apple-style-span" style="font-style: italic;"&gt;t&lt;/span&gt; test should be OK but the value of the coefficient and simple slopes will be wrong). However, the clincher is that even if the correct standardization is carried out (e.g., computing the standardized predictors yourself and then taking the product of the relevant standardized predictors and entering it into the regression) the standard errors will be incorrect (which is problematic for constructing confidence and prediction intervals).&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
In summary, you can't rely on the software to get the standardization correct so use the unstandardized regression coefficients and standard errors! (Also, did I mention that standardized coefficients are generally a bad idea anyway?).&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-3600043174785579580?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/raluP6N2FzkXckHnC3V0EkdNeDo/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/raluP6N2FzkXckHnC3V0EkdNeDo/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/raluP6N2FzkXckHnC3V0EkdNeDo/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/raluP6N2FzkXckHnC3V0EkdNeDo/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/SAFMLJWlG7g" height="1" width="1"/&gt;</content><link rel="related" href="http://people.ku.edu/~preacher/interact/interactions.htm#stand" title="Don't standardize interaction/moderator effects in multiple regression" /><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/3600043174785579580/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2009/03/dont-standardize-interactionmoderator.html#comment-form" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/3600043174785579580?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/3600043174785579580?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/SAFMLJWlG7g/dont-standardize-interactionmoderator.html" title="Don't standardize interaction/moderator effects in multiple regression" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>2</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2009/03/dont-standardize-interactionmoderator.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0QHSXg-fyp7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-5565370538913067015</id><published>2009-01-12T23:07:00.004Z</published><updated>2011-10-04T16:15:38.657+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:15:38.657+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>R resources for psychologists</title><content type="html">As a quick follow-up to my previous post, I'll quickly note some R links specifically aimed at psychologists. Jonathan Baron and &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_0"&gt;Yuelin&lt;/span&gt; Li have a excellent set of &lt;a href="http://www.psych.upenn.edu/~baron/rpsych/rpsych.html"&gt;notes&lt;/a&gt; that cover ground from R basics to &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_1"&gt;ANOVA&lt;/span&gt; (including repeated measures &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_2"&gt;ANOVA&lt;/span&gt;) and beyond. Jonathan Baron also maintains a general set of &lt;a href="http://finzi.psych.upenn.edu/"&gt;R links&lt;/a&gt;. William &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_3"&gt;Revelle's&lt;/span&gt; &lt;a href="http://personality-project.org/r/r.guide.html"&gt;R guide for psychological research&lt;/a&gt; is also very good and has more emphasis on &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_4"&gt;psychometrics&lt;/span&gt; than Baron and Li.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Pilgrim's blog is an &lt;a href="http://theregressingpilgrim.blogspot.com/"&gt;R resource for the exercises&lt;/a&gt; in the classic &lt;a href="http://www.amazon.co.uk/gp/product/0805822232?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0805822232"&gt;Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences&lt;/a&gt;&lt;img src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;amp;l=as2&amp;amp;o=2&amp;amp;a=0805822232" width="1" height="1" border="0" alt="" style="border:none !important; margin:0px !important;" /&gt; text.&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-5565370538913067015?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/WvGJtSsHvJQFVKmx6jWho60lXzQ/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/WvGJtSsHvJQFVKmx6jWho60lXzQ/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/WvGJtSsHvJQFVKmx6jWho60lXzQ/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/WvGJtSsHvJQFVKmx6jWho60lXzQ/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/cnkFbKPmUMs" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/5565370538913067015/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2009/01/r-resources-for-psychologists.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/5565370538913067015?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/5565370538913067015?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/cnkFbKPmUMs/r-resources-for-psychologists.html" title="R resources for psychologists" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2009/01/r-resources-for-psychologists.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D04HQX4-cCp7ImA9WxVVGEU.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-8552169957558482645</id><published>2009-01-09T19:32:00.021Z</published><updated>2009-03-12T19:25:30.058Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-03-12T19:25:30.058Z</app:edited><title>Simulating data for inquiry based learning</title><content type="html">I've just presented a talk on at the &lt;font class="Apple-style-span" style="font-style: italic;"&gt;&lt;a href="http://www.psychology.heacademy.ac.uk/html/event_reports_2009.asp"&gt;Statistics for Psychology Students&lt;/a&gt;&lt;/font&gt; workshop for the HEA Psychology Network in York. Richard Rowe (Psychology, Sheffield) gave an interesting talk on teaching statistics via inquiry based learning. Part of the work involved using getting students to generate their own research questions in a tutorial and then analyzing data addressing these questions in a follow-up tutorial. A rather clever idea he reported was to generate suitable data via simulation in STATA. The students were first or second year undergraduates and so the experimental designs they came up with were constrained to simple cases (e.g., &lt;font class="Apple-style-span" style="font-style: italic;"&gt;t&lt;/font&gt; tests, correlations and chi-square for the first years; one-way ANOVA with 3 levels for the second years). This meant that whoever led the tutorial (e.g., a Ph.D. student) could generate tailored data to match the students' research designs.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The novel and clever aspect of this isn't simulating real data sets or getting students to come up with their own research ideas, but combining the two. This means that you can run these sessions without getting students to collect real data. (There is a place for that too, but collecting real data has massive overheads in terms of student time, staff time, ethics approval and so forth). Furthermore, students can legitimately come up with ideas for studies that can't be run by undergraduates for ethical, resource or other reasons - for example using clinical populations). I can also see other uses for this. For example, a supervisor could simulate data for a final year project and a student could use the simulated data as a dry run for the real analysis. (The real data will usually be much messier, but I think that getting familiar with the statistical software and analyses to be used could be useful for some students).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;A few members of the audience asked questions about STATA (which I'm not familar with) and I pointed out that you could do the same things in &lt;a href="http://www.r-project.org/"&gt;R&lt;/a&gt; for free. As I understand it STATA allows you to simulate data with a specified covariance matrix fairly easily. I'm sure this can be done in R too, but I'm still learning how to use R and tracking down the right package and commands would have taken a bit of time. In any case, for this to be useful it needs to be very simple and easy to run by people with relatively basic statistical computing skills (and who have never used R before). So I set myself a challenge of writing code that should run with functions from the R base package, be trivial to edit and generate usable data for two common analyses: the independent &lt;font class="Apple-style-span" style="font-style: italic;"&gt;t&lt;/font&gt; test and a bivariate correlation.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;It took me about 2 or 3 minutes to write the independent &lt;font class="Apple-style-span" style="font-style: italic;"&gt;t&lt;/font&gt; test code. (This may sound impressive but a competent R programmer could probably do it in under a minute). It took a little longer to comment it and work out how to write it to a tab delimited text file. It took me about 5 or 10 minutes to do the bivariate correlation and a little longer to fix a rather stupid error I'd made. All the code is rather clumsily written (and in some cases I've deliberately separated out steps to make it easier for someone unfamiliar with R to follow). The correlation solution (in particular) isn't very good and specifying a covariance matrix to constrain a simulation would be much more satisfactory. (I should also note that it uses a trick I picked up some time ago from &lt;a href="http://www.pbarrett.net/"&gt;Paul Barrett's web site&lt;/a&gt; in his article on &lt;a href="http://www.pbarrett.net/techpapers/likert.pdf"&gt;Correlation attenuation due to Likert categorization&lt;/a&gt;.) The key point is that it took longer to describe here than it took to write. I'm sure better solutions exist in R, but these ones work and could easily be extended to paired &lt;font class="Apple-style-span" style="font-style: italic;"&gt;t&lt;/font&gt; tests or one-way ANOVA. (I'm not sure about chi-square. I could write something, but it might be a lot easier just to specify a 2x2 or 3x2 contingency table by hand).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Running the R code is also easy. R runs on PC, Linux and Mac OS X, but if you don't fancy installing it (for some bizarre reason) it can also be run from a &lt;a href="http://www.math.montana.edu/Rweb/"&gt;web server&lt;/a&gt; (though it won't be able to write to a file). To run the code you just paste it into the R console and hit return. To tailor the code just read the comments and tweak the parameter values before pasting.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;If you want a copy to play with just &lt;a href="mailto:tsbaguley@gmail.com"&gt;email me&lt;/a&gt;.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;For anyone who wants to do some simulation in R there are lots of resources around (try googling the obvious keywords), but Andrew Gelman and Jennifer Hill's book &lt;a href="http://www.amazon.co.uk/gp/offer-listing/052168689X?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=am2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=052168689X"&gt;Data Analysis Using Regression and Multilevel/Hierarchical Models (Analytical Methods for Social Research)&lt;/a&gt; is an impressive book for a serious take on using simulation (although the stance is Bayesian this doesn't get in the way - at least for the sections I've read; I'm about 40% through the book). &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;font class="Apple-style-span" style="font-style: italic;"&gt;Postscript&lt;/font&gt;. All the talks were video recorded and Anne Trapp threatened to put video podcasts - vodcasts? - on the Psychology Network site at some point in the future - including my talk (&lt;font class="Apple-style-span" style="font-style: italic;"&gt;Effect size: why what we teach psychology students is wrong&lt;/font&gt;). These will be worth looking out for just to see Andy Field's talk. (Andy claims he looks particularly ridiculous when video recorded. My advice: just don't ever look at the recording. &lt;font class="Apple-style-span" style="font-style: italic;"&gt;Ever&lt;/font&gt;.) I'll write something about effect size in the near future - though much of the talk was based on my &lt;a href="http://www.ingentaconnect.com/content/bpsoc/bjp/pre-prints/310805"&gt;forthcoming BJP paper on effect size.&lt;/a&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-8552169957558482645?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/xW5mBPp6w3eFKBZNxi55eMtsmFI/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/xW5mBPp6w3eFKBZNxi55eMtsmFI/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/xW5mBPp6w3eFKBZNxi55eMtsmFI/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/xW5mBPp6w3eFKBZNxi55eMtsmFI/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/6wfJm_G66WI" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/8552169957558482645/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2009/01/simulating-data-for-inquiry-based.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/8552169957558482645?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/8552169957558482645?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/6wfJm_G66WI/simulating-data-for-inquiry-based.html" title="Simulating data for inquiry based learning" /><author><name>Thom Baguley</name><uri>https://profiles.google.com/110859229901482390817</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="//lh5.googleusercontent.com/-kB60GMkSCOc/AAAAAAAAAAI/AAAAAAAAAGw/lSvc7_pS3lI/s512-c/photo.jpg" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2009/01/simulating-data-for-inquiry-based.html</feedburner:origLink></entry></feed>

