<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/atom10full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><feed xmlns="http://www.w3.org/2005/Atom" xmlns:openSearch="http://a9.com/-/spec/opensearch/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:gd="http://schemas.google.com/g/2005" xmlns:thr="http://purl.org/syndication/thread/1.0" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" gd:etag="W/&quot;DUEGQXszeCp7ImA9WhRRFE4.&quot;"><id>tag:blogger.com,1999:blog-27862247</id><updated>2011-11-27T23:47:00.580Z</updated><category term="repeated measure ANOVA" /><category term="compound symmetry" /><category term="Bayesian statistics" /><category term="Loftus and Masson" /><category term="MLwin" /><category term="moderator effects" /><category term="memory" /><category term="puzzle" /><category term="displaying means" /><category term="philosophy" /><category term="sphericity" /><category term="multilevel models" /><category term="Greenhouse-Geisser" /><category term="PsyPAG" /><category term="blog" /><category term="effect size" /><category term="likelihood" /><category term="psychology" /><category term="statistical significance" /><category term="Goldstein and Healy" /><category term="interaction" /><category term="simple main effects" /><category term="ANOVA" /><category term="Bayes factors" /><category term="BPS" /><category term="moderated multiple regression" /><category term="statistics" /><category term="confidence intervals" /><category term="science" /><category term="R" /><category term="NTU" /><title>Psychological Statistics</title><subtitle type="html" /><link rel="http://schemas.google.com/g/2005#feed" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/posts/default" /><link rel="alternate" type="text/html" href="http://psychologicalstatistics.blogspot.com/" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><generator version="7.00" uri="http://www.blogger.com">Blogger</generator><openSearch:totalResults>20</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/atom+xml" href="http://feeds.feedburner.com/PsychologicalStatistics" /><feedburner:info uri="psychologicalstatistics" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><entry gd:etag="W/&quot;D0ENQ306cSp7ImA9WhdbF0U.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-5477542586888203401</id><published>2011-10-04T15:43:00.006+01:00</published><updated>2011-10-16T18:21:32.319+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-16T18:21:32.319+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="Loftus and Masson" /><category scheme="http://www.blogger.com/atom/ns#" term="displaying means" /><category scheme="http://www.blogger.com/atom/ns#" term="Goldstein and Healy" /><category scheme="http://www.blogger.com/atom/ns#" term="sphericity" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="ANOVA" /><category scheme="http://www.blogger.com/atom/ns#" term="statistical significance" /><category scheme="http://www.blogger.com/atom/ns#" term="confidence intervals" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>Calculating and graphing within-subject confidence intervals for ANOVA</title><content type="html">&lt;span class="Apple-style-span" style="line-height: 24px;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;                     &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Psychologists are gradually coming round to the view that it is a good idea to present interval estimates alongside point estimates of statistics. The most common statistic reported in psychology research is almost certainly the mean (strictly the arithmetic mean). Presenting an interval estimate for the mean of a single sample is usually quite simple. This is usually done as 95% confidence interval about the mean – and most researchers in psychology are able to calculate this by hand or get their statistical software to calculate and graph it for them.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;  &lt;/span&gt;&lt;br /&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Extending this to more than one mean introduces an additional layer of complexity. This is because the difference between two means is a different quantity, and its CI (although related to those of the individual means) is different in width from the CIs of the individual means. This creates a problem when plotting the CI because a researcher might be interested in the CI for an individual mean, the CI for their difference (or both).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;The complexity increases further if the aim is to plot a set of means (e.g., from an ANOVA design). In this case, plotting all the possible differences (as is commonly done) obscures patterns in the individual means (e.g., linear or quadratic trends).&amp;nbsp; Last, but not least, if the means are not from independent samples, there are further difficulties. This happens in within-subjects or repeated measures designs.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;In these designs the variation around each mean is correlated with the variation around the other means. This correlation arises from individual differences. Statistical procedures such as ANOVA can capitalize on these individual differences to produce more sensitive statistical inferences (i.e., to increase statistical power or obtain narrower CIs). This is done by estimating the variation due to individual differences, and removing it from the error variance (the estimate of statistical noise in the data set).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;This is a problem for graphical presentation of means because the precision of individual means is influenced by individual differences, whereas the precision of differences between means is not (because the estimate of individual differences is common to repeated samples from the same people and thus can be removed). Further complications arise when the sphericity assumption of repeated measures ANOVA is violated.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Several solutions to these problems have been proposed in the literature. The best known of these in psychology is that of Loftus and Masson (1994). Another well-known solution is that of Goldstein and Healy (1995), extended to correlated samples by Afshartous and Preston (2010).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Despite a large literature on the problems of graphing a set of correlated means, many people avoid the problems altogether by not reporting (or graphing) CIs or report CIs that are misleading in some way. Researchers are often unaware of the problems or find the solutions hard to understand and implement.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;I recently reviewed the main approaches in the literature, describe how to obtain suitable intervals for individual means and differences between means and provide R code to calculate and plot the intervals.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;The main highlights are that:&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;i) for inferences about individual means the standard approach works fairly well for between-subject (independent measures) designs, but there is a case to use CIs from a multilevel model for within-subject (repeated measures) designs&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;ii) an approach proposed by Cousineau (2005) with a correction by Morey (2008) offers advantages over the Loftus and Masson (1994) approach for within-subject ANOVA designs. It simplifies the calculations and does not assume sphericity. The Loftus-Masson approach will however usually be superior when &lt;i style="mso-bidi-font-style: normal;"&gt;n&lt;/i&gt; is small.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;iii) if you are interested in differences between means then you should probably plot a version of the Cousineau-Morey (or Loftus-Masson) interval that is adjusted so that overlap of the CIs around two individual means corresponds to overlap of the CI for their difference. This can be done by incorporating a multiplier to the width of the individual CIs. This multiplier is equal to (2^0.5)/2.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;iv) if you are interested in both precision of individual means and their differences you can use a two-tiered error bar to display both quantities (Cleveland, 1985).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: 3.9pt;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;v) the intervals (and graphical presentation of means) are useful for informal inference about a set of means. For formal inference it is better to set up precise hypotheses and test these via an a priori of contrast. This could be a traditional null hypothesis significance test, but other approaches are available. These include confidence intervals, Bayes factors, likelihood ratios and so forth (Baguley, in press; Dienes, 2008).&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;
&lt;div class="MsoNormal" style="line-height: 150%; margin-right: -38.6pt;"&gt;&lt;span lang="EN-US" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: -38.6pt;"&gt;&lt;span lang="EN-US" style="font-family: inherit;"&gt;The paper is available &lt;a href="http://nottinghamtrent.academia.edu/ThomBaguley/Papers/900446/Calculating_and_graphing_within-subject_confidence_intervals_for_ANOVA"&gt;here&lt;/a&gt;, the R code &lt;a href="https://docs.google.com/leaf?id=0B8YTvx5C8MjXNWRjMzhhMzQtZmE5YS00MWQ3LTk1NTAtMjc1MzNkOWU2MjYz&amp;amp;hl=en_GB"&gt;here&lt;/a&gt; and the data sets &lt;a href="https://docs.google.com/leaf?id=0B8YTvx5C8MjXMjc3ODY2YjgtZTAyMi00OTM3LTlkYTYtY2E0YzU4YTBjNGEy&amp;amp;hl=en_GB"&gt;here&lt;/a&gt;.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span lang="EN-US" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="line-height: 150%; margin-right: -38.6pt;"&gt;&lt;b style="mso-bidi-font-weight: normal;"&gt;&lt;i style="mso-bidi-font-style: normal;"&gt;&lt;span lang="EN-US" style="font-family: inherit;"&gt;References&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;br /&gt;
&lt;b style="mso-bidi-font-weight: normal;"&gt;&lt;i style="mso-bidi-font-style: normal;"&gt;&lt;span lang="EN-US" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/i&gt;&lt;/b&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span lang="EN-US" style="font-family: inherit;"&gt;Afshartous D., &amp;amp; Preston R. A. (2010). Confidence intervals for dependent data: equating nonoverlap with statistical significance. &lt;i&gt;Computational Statistics and Data&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;Analysis. 54&lt;/span&gt;&lt;/i&gt;&lt;span lang="EN-US"&gt;, 2296-2305.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;a href="http://www.blogger.com/goog_536825781"&gt;Baguley, T. (2011, in press).&amp;nbsp;&lt;/a&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;a href="http://nottinghamtrent.academia.edu/ThomBaguley/Papers/900446/Calculating_and_graphing_within-subject_confidence_intervals_for_ANOVA"&gt;Calculating and graphing within-subject confidence intervals for ANOVA. Behavior Research Methods.&amp;nbsp;DOI: 10.3758/s13428-011-0123-7&lt;/a&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt; Baguley, T. (2012, in press).&amp;nbsp;&lt;img alt="" border="0" height="1" src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;amp;l=as2&amp;amp;o=2&amp;amp;a=0230577180" style="border: none !important; margin: 0px !important;" width="1" /&gt;&lt;a href="http://www.amazon.co.uk/gp/product/0230577180/ref=as_li_tf_tl?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0230577180"&gt;Serious Stats: A guide to advanced statistics for the behavioral sciences&lt;/a&gt;. Basingstoke: Palgrave.&lt;/span&gt;&lt;br /&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Cleveland. W. S. (1985).&amp;nbsp;&lt;/span&gt;&lt;a href="http://www.amazon.co.uk/gp/product/0412742705/ref=as_li_tf_tl?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0412742705"&gt;Elements Of Graphing Data&lt;/a&gt;&lt;span lang="EN-US"&gt;. New York, NY: Chapman &amp;amp; Hall.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span lang="EN-US" style="font-family: inherit;"&gt;Cousineau, D. (2005). Confidence intervals in within-subject designs: A simpler solution to Loftus and Masson’s method. &lt;i&gt;Tutorials in Quantitative Methods for Psychology,&lt;/i&gt; &lt;i&gt;1&lt;/i&gt;, 42-45.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Dienes, Z. (2008).&amp;nbsp;&lt;/span&gt;&lt;a href="http://www.amazon.co.uk/gp/product/023054231X/ref=as_li_tf_tl?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=023054231X"&gt;Understanding Psychology as a Science: An Introduction to Scientific and Statistical Inference&lt;/a&gt;&lt;img alt="" border="0" height="1" src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;amp;l=as2&amp;amp;o=2&amp;amp;a=023054231X" style="border-bottom-style: none !important; border-color: initial !important; border-left-style: none !important; border-right-style: none !important; border-top-style: none !important; border-width: initial !important; cursor: move; margin-bottom: 0px !important; margin-left: 0px !important; margin-right: 0px !important; margin-top: 0px !important;" width="1" /&gt;&lt;span lang="EN-US"&gt;. Basingstoke: Palgrave Macmillan.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Goldstein, H., &amp;amp; Healy, M. J. R. (1995). &lt;/span&gt;&lt;span lang="EN-US"&gt;The graphical presentation of a collection of means. &lt;/span&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;Journal of the Royal Statistical Society. Series A (Statistics in Society), 158&lt;/span&gt;&lt;/i&gt;&lt;span lang="EN-US"&gt;, 175-177.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span lang="EN-US" style="font-family: inherit;"&gt;Loftus, G. R., &amp;amp; Masson, M. E. J. (1994). Using confidence intervals in within-subject designs. &lt;i&gt;Psychonomic Bulletin &amp;amp; Review&lt;/i&gt;, &lt;i&gt;1&lt;/i&gt;, 476-490.&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;&lt;div class="Default"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;Morey, R. D. (2008). &lt;/span&gt;&lt;span lang="EN-US"&gt;Confidence intervals from normalized data: A correction to Cousineau (2005). &lt;/span&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;Tutorials in Quantitative Methods for Psychology, 4&lt;/span&gt;&lt;/i&gt;&lt;span lang="EN-US"&gt;, 61-64.&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="MsoNormal" style="margin-bottom: .0001pt; margin-bottom: 0cm; mso-layout-grid-align: none; mso-pagination: none; text-autospace: none;"&gt;&lt;span class="Apple-style-span" style="font-family: Calibri;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-5477542586888203401?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/RLPqmnCzi2-uE9qvZZWdCs-_6gg/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/RLPqmnCzi2-uE9qvZZWdCs-_6gg/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/RLPqmnCzi2-uE9qvZZWdCs-_6gg/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/RLPqmnCzi2-uE9qvZZWdCs-_6gg/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/4y9e8wjv1z4" height="1" width="1"/&gt;</content><link rel="related" href="http://nottinghamtrent.academia.edu/ThomBaguley/Papers/900446/Calculating_and_graphing_within-subject_confidence_intervals_for_ANOVA" title="Calculating and graphing within-subject confidence intervals for ANOVA" /><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/5477542586888203401/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2011/10/calculating-and-graphing-within-subject.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/5477542586888203401?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/5477542586888203401?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/4y9e8wjv1z4/calculating-and-graphing-within-subject.html" title="Calculating and graphing within-subject confidence intervals for ANOVA" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2011/10/calculating-and-graphing-within-subject.html</feedburner:origLink></entry><entry gd:etag="W/&quot;A0AHQHYyfSp7ImA9WhdUF04.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-2280818886067466881</id><published>2011-09-15T23:27:00.005+01:00</published><updated>2011-10-04T15:48:51.895+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T15:48:51.895+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="ANOVA" /><category scheme="http://www.blogger.com/atom/ns#" term="statistical significance" /><category scheme="http://www.blogger.com/atom/ns#" term="interaction" /><title>A problem of significance</title><content type="html">Several people have drawn my attention to a recent article on a common error in published statistical analyses in neuroscience. Sander Nieuwenhuis, Birte Forstmann and Eric-Jan Wagenmakers published (in Nature Neuroscience) a critique of statistical analyses in the neuroscience literature. This paper has been written about by &lt;a href="http://www.guardian.co.uk/commentisfree/2011/sep/09/bad-science-research-error"&gt;Ben Goldacre&lt;/a&gt; and &lt;a href="http://andrewgelman.com/2011/09/the-difference-between-significant-and-not-significant/"&gt;Andrew Gelman&lt;/a&gt; (who published an article on the general problem some time ago) - so I won't go into too much detail.&lt;br /&gt;
&lt;br /&gt;
The point of interest for me is that the error concerns something that most psychologists should know all about (and hence should be expected not to make the error). It concerns the case of two differences, one statistically significant and one non-significant. For example, group 1 may show a significant difference between experimental condition and placebo (for a drug intervention), while group 2 do not. A naive interpretation is that the drug works for group 1 but not group 2. This is not necessarily true. The proper test of a difference in effects of the drug between groups is an interaction test. Psychologists tend to avoid this error because we have heavily trained in ANOVA as undergraduates (certainly in the UK and probably also in the US and most of Europe). Even if we fail to learn this, &amp;nbsp;reviewers and editors (in psychology) tend to spot the error.*&lt;br /&gt;
&lt;br /&gt;
Are psychologists then entitled to feel a little bit smug? Perhaps, but only a little. First, I think the reason we are relatively good performers on this point is because we tend to view many statistical analyses through an "ANOVA" lens. Factorial ANOVA (in which factors are orthogonal) includes the interaction term by default. The 2 by 2 factorial ANOVA is the workhorse of experimental psychology. Our familiarity with this type of design and analysis makes this easy to spot. Second, our ANOVA lens leads to other errors - notably dichotomizing continuous variables (e.g., via median split) in order to squeeze them into an ANOVA design. This always decreases statistical power, and can - albeit infrequently - produce spuriously significant effects (see MacCallum et al., 2002). These errors are sometimes less serious than the difference of differences/interaction error (but are not harmless).&lt;br /&gt;
&lt;br /&gt;
The real test then, is whether psychologists make the same (conceptual) error in a different context. The obvious context is that of association rather than difference. If males show a significant correlation between testosterone and aggression (e.g., &lt;i&gt;r&lt;/i&gt; = .5, &lt;i&gt;N&lt;/i&gt; = 25) and females don't (e.g.,&amp;nbsp;&lt;i&gt;r&lt;/i&gt; = .3, &lt;i&gt;N&lt;/i&gt; = 25), the correlation between testosterone and aggression is not significantly bigger for males than females. To confirm this you'd need to construct a test or (better still) confidence interval for the difference in correlations. This is hardly ever done - and, in my experience, psychologists frequently make this kind of claim without backing it up.** Methods for testing differences in correlations are a bit fiddly (e.g., depending on overlap or lack of overlap in the measurements), and rarely taught at undergraduate or even postgraduate level. The methods that are taught are also often a bit dodgy (see Zou, 2007, for some better alternatives).&lt;br /&gt;
&lt;br /&gt;
Also note that (in both cases) the error can work the other way. Two correlations could be non-significantly different from zero but different from each other (e.g., &lt;i&gt;r&lt;/i&gt; = .5 and &lt;i&gt;r&lt;/i&gt; = -.5 with &lt;i&gt;N&lt;/i&gt; = 10).&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;Postscript&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
There is, I think, a lesson or two here. A minor lesson is that interactions are bit more complicated than psychologists (particularly those very familiar with ANOVA) often think. I could write more on this (and do a bit in my forthcoming book). A major lesson is that this concept (the difference between significant and non-significant is not necessarily also statistically significant - see Gelman &amp;amp; Stern, 2006) is probably quite tricky. It may be worth exploring why ... I suspect it is because of several factors.&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;References&lt;/i&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Gelman, A., &amp;amp; Stern, H. (2006). The difference between “significant” and “not significant” is not itself statistically significant. &lt;i&gt;American Statistician, 60&lt;/i&gt;, 328–331.&lt;br /&gt;
&lt;br /&gt;
MacCallum, R. C., Zhang, S., Preacher, K. J., &amp;amp; Rucker, D. D. (2002). On the practice of dichotomization of quantitative variables. &lt;i&gt;Psychological Methods, 7&lt;/i&gt;, 19-40.&lt;br /&gt;
&lt;br /&gt;
Nieuwenhuis, S., Forstmann, B. U., &amp;amp; Wagenmakers, E.-J. (2011). Erroneous analyses of interactions in neuroscience: A problem of significance. &lt;i&gt;Nature Neuroscience, 14&lt;/i&gt;, 1105-1107.&lt;br /&gt;
&lt;br /&gt;
Zou, G. Y. (2007). Toward using confidence intervals to compare correlations. &lt;i&gt;Psychological Methods, 12,&lt;/i&gt; 399-413.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;*&amp;nbsp;&lt;/span&gt;&amp;nbsp;Do I have any support for this position? Yes: anecodotal support (e.g., from editing or reviewing many dozens of papers) and some support from&amp;nbsp;Nieuwenhuis et al. They found the error more prevalent in cellular and molecular neuroscience. ANOVA is core training in psychology and widely used in cognitive and behavioural neuroscience - and I'd argue that this reflects the influence of psychologists working in this area and of neuroscientists trained in and using similar methods.&lt;br /&gt;
&lt;div class="ReferenceStyle"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;div class="ReferenceStyle"&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/div&gt;&lt;br /&gt;
**&amp;nbsp;Do I have any support for this position?&amp;nbsp;A little. It is easy to find basic psychology texts with ANOVA but without tests of differences in correlations being mentioned. It is rare to find tests of CIs of differences in published papers.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-2280818886067466881?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/A9tHbwx4Okail8ZfJE66G_DSJxM/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/A9tHbwx4Okail8ZfJE66G_DSJxM/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/A9tHbwx4Okail8ZfJE66G_DSJxM/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/A9tHbwx4Okail8ZfJE66G_DSJxM/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/m4FtucwOKj4" height="1" width="1"/&gt;</content><link rel="related" href="http://www.ejwagenmakers.com/2011/NieuwenhuisEtAl2011.pdf" title="A problem of significance" /><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/2280818886067466881/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2011/09/problem-of-significance.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/2280818886067466881?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/2280818886067466881?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/m4FtucwOKj4/problem-of-significance.html" title="A problem of significance" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2011/09/problem-of-significance.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0QCQHs5fip7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-3228682685795043970</id><published>2011-01-13T13:50:00.000Z</published><updated>2011-10-04T16:16:01.526+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:16:01.526+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>R: An introduction for psychologists</title><content type="html">&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Georgia, 'Times New Roman', serif;"&gt;Here are the slides from the Introduction to R session Danny Kaye and I ran at the BPS Mathematics, Statistics &amp;amp; Computing section CPS Workshop (13 December 2010, Nottingham Trent University).&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;a href="https://docs.google.com/present/edit?id=0AcYTvx5C8MjXZHA4Nmg2dF8xMDJnY2c2eDdoYw&amp;amp;hl=en_GB&amp;amp;authkey=CMHxy54B"&gt;&lt;/a&gt;&lt;br /&gt;
&lt;iframe frameborder="0" height="451" src="https://docs.google.com/present/embed?id=dp86h6t_102gcg6x7hc&amp;amp;size=m" width="555"&gt;&lt;/iframe&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-3228682685795043970?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/eWJXVyqBzHP71er4E-Da_jXWdsU/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/eWJXVyqBzHP71er4E-Da_jXWdsU/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/eWJXVyqBzHP71er4E-Da_jXWdsU/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/eWJXVyqBzHP71er4E-Da_jXWdsU/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/qr7EtW4YEeE" height="1" width="1"/&gt;</content><link rel="related" href="https://docs.google.com/present/edit?id=0AcYTvx5C8MjXZHA4Nmg2dF8xMDJnY2c2eDdoYw&amp;hl=en_GB&amp;authkey=CMHxy54B" title="R: An introduction for psychologists" /><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/3228682685795043970/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2011/01/r-introduction-for-psychologists.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/3228682685795043970?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/3228682685795043970?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/qr7EtW4YEeE/r-introduction-for-psychologists.html" title="R: An introduction for psychologists" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2011/01/r-introduction-for-psychologists.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0QCSXk8eCp7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-1960747264016613408</id><published>2010-05-13T16:21:00.000+01:00</published><updated>2011-10-04T16:16:08.770+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:16:08.770+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>Introduction to using R in research</title><content type="html">I was recently asked to give a talk to our graduate school annual conference. I offered several titles and the one they picked was &lt;i&gt;Using R in research&lt;/i&gt;. I'm not sure if this was a good idea or not. The graduate school covers PhD students across three areas of the university: social sciences (including psychology), law and business. In addition the students tend to specialize in either qualitative or quantitive research methods, so I was talking to an audience who might know nothing about statistics or a great deal (e.g., several students have completed MSc courses in psychological research methods here or elsewhere).&lt;br /&gt;
&lt;br /&gt;
&lt;iframe src="http://docs.google.com/present/embed?id=dp86h6t_85c23cmxsp" frameborder="0" width="410" height="342"&gt;&lt;/iframe&gt;&lt;br /&gt;
&lt;br /&gt;
My solution was try and explain the advantages of R relative to alternatives such as SPSS (probably the most common statistic package in the University). I also focussed a lot on graphical methods and simulation. It seemed to go quite well, but I worry that quite a few members of the audience were overwhelmed by large chunks of it.&lt;br /&gt;
&lt;br /&gt;
I promised to put my slides on my blog - though I am not sure how useful they are to anybody who wasn't there. Without my commentary some (possibly most) of the slides won't make much sense. I spent a good deal of the time talking through exploratory plots of one data set (from Hayden, 2005). I use this example a lot in teaching and it involves a bit of class participating (guessing the origin of the data) - so I won't go into to detail here (lest I spoil it for future students), but you can google the original article if you are curious. I also spent some time on how R works (e.g., object types, assignment, basic modeling, plotting functions). My reasoning was that many of the audience have no familiarity with non-GUI interfaces in software and without explaining the basics of the interface they will not have the faintest clue how R works. For those with some familiarity (e.g., SPSS syntax) the examples were selected to show how powerful R can be for things like exploratory graphics.&lt;br /&gt;
&lt;br /&gt;
Several students ask about resources for learning R. I mentioned some in earlier blog posts, but for psychologists &lt;a href="http://www.psych.upenn.edu/~baron/rpsych/rpsych.html"&gt;Li and Baron's web resources&lt;/a&gt; are a good place to start. The other major resource is probably &lt;a href="http://www.statmethods.net/"&gt;Quick R&lt;/a&gt;, but there are hundreds of other places to look online (depending on what stuff you need most).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-1960747264016613408?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/JRhnInx5Zv29X_wO3Q6fKedgqCw/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/JRhnInx5Zv29X_wO3Q6fKedgqCw/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/JRhnInx5Zv29X_wO3Q6fKedgqCw/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/JRhnInx5Zv29X_wO3Q6fKedgqCw/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/BaAmi8ZfMqQ" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/1960747264016613408/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2010/05/introduction-to-using-r-in-research.html#comment-form" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/1960747264016613408?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/1960747264016613408?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/BaAmi8ZfMqQ/introduction-to-using-r-in-research.html" title="Introduction to using R in research" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>1</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2010/05/introduction-to-using-r-in-research.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0UBRH07fSp7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-7227981003814024457</id><published>2010-03-03T20:14:00.018Z</published><updated>2011-10-04T16:14:15.305+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:14:15.305+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="memory" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>Example of plotting a serial position curve in R</title><content type="html">&lt;div class="separator" style="clear: both; text-align: left;"&gt;A while ago I wrote a co-wrote chapter for an introductory psychology text book&amp;nbsp;&lt;a href="http://www.amazon.co.uk/gp/product/1847875386?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=1847875386"&gt;Essential Psychology: A Concise Introduction&lt;/a&gt;.&amp;nbsp;This is a book edited and written by members of the department where I work. My contribution was the chapter on human memory (cunningly titled &lt;i&gt;Memory&lt;/i&gt;).&lt;/div&gt;&lt;br /&gt;
I produced several plots for the chapter (some of which got cut due to severe space restrictions). One that stayed in was a serial position curve. For this plot I used data from Postman and Phillips (1965).&lt;br /&gt;
&lt;div style="text-align: center;"&gt;&lt;a href="http://1.bp.blogspot.com/_3fIauBYLLhY/S46_ENrgdVI/AAAAAAAAADw/I7TsDMvy6VQ/s1600-h/pp65+plot.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="400" src="http://1.bp.blogspot.com/_3fIauBYLLhY/S46_ENrgdVI/AAAAAAAAADw/I7TsDMvy6VQ/s400/pp65+plot.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;
I feel particular proud of this plot because I was just beginning to use and learn R at the time (as opposed to dabbling) and because I had had a really hard time getting hold of the data. I first tried google, but had no joy (for some reason I thought someone would have put the raw data online, as it is a classic study - though maybe I just missed it). Then I searched for alternative data sets &amp;nbsp;(as around that period there were quite a few similar studies). I was probably being too picky, but whatever the reason I had no luck.&lt;br /&gt;
&lt;br /&gt;
It would have been trivial to make up fake data, but that didn't feel right. What I eventually did (and wished I'd done straight away) was print out the original figure and measure all the points by hand. I then entered these values into a spreadsheet and tweaked and remeasured until all the summary statistics matched those in the original paper to about one decimal place. This was a lot quicker than I had thought. I cheated slightly because I only needed data from the 20 word conditions (so I could leave out the 10 and 30 word conditions).&lt;br /&gt;
&lt;br /&gt;
(I'm pretty sure I could have used computer software to capture the raw data from an image file, but I'd have had to find the software, learn how to use it and do all the checking anyway. For a single figure I'm reasonably sure measuring by hand would be faster.)&lt;br /&gt;
&lt;br /&gt;
In re-plotting it I noticed a few things that I hadn't paid much attention to before. The main one was the authors report frequency of recalls for 18 participants with 6 lists each. This means all scores are out of 108 and I suspect lots of casual readers would (like me) assume they were percentages. For re-plotting I rescaled the data as percentages.&lt;br /&gt;
&lt;br /&gt;
The plot itself just uses basic R functions. I'm writing about it because:&amp;nbsp;&amp;nbsp;i) I think it is a fairly clear illustration of how basic plot functions in R can produce what I think is a rather nice Figure. (The published version has been edited by the publisher, adding colour and making the style match figures in other chapters), ii) people may find it useful for teaching purposes. So please feel free to use and adapt the R code for non-commercial (e.g., teaching use).&lt;br /&gt;
&lt;br /&gt;
First load the data from this&amp;nbsp;&lt;a href="https://docs.google.com/leaf?id=0B8YTvx5C8MjXYzExMmUwNDctZGMwZC00MzA5LTgyOWMtYzg2MDRhMzAwMjRj&amp;amp;hl=en_GB"&gt;.csv file&lt;/a&gt;&amp;nbsp;(you will need to specify the path or change the working directory if the file is saved elsewhere).&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace; font-size: 14px;"&gt;pp65 &amp;lt;- read.csv("&lt;a href="https://docs.google.com/leaf?id=0B8YTvx5C8MjXYzExMmUwNDctZGMwZC00MzA5LTgyOWMtYzg2MDRhMzAwMjRj&amp;amp;hl=en_GB"&gt;pp65.csv&lt;/a&gt;")&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace; font-size: 14px;"&gt;&lt;span class="Apple-style-span" style="font-family: Courier, Courier, monospace;"&gt;Then paste the following:&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace; font-size: 14px;"&gt;&lt;span class="Apple-style-span" style="font-family: Courier, Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;plot(pp65$SP, pch=NA, ylim=c(0,80), xlab= "Serial position", ylab= "Mean percentage recall", main = "Postman &amp;amp; Phillips (1965)", sub = '(20 word conditions only)')&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;points(pp65$C0, pch=19, col='black', cex=.7)&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;lines(pp65$C0, lty=3)&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;points(pp65$C15, pch=24, col='black', cex=.7)&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;lines(pp65$C15, lty=2)&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;points(pp65$C30, pch=22, col='black', cex=.7)&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;lines(pp65$C30, lty=5)&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;legend(3, 80, legend=c("No delay","15 second delay","30 second delay"), lty=c(3,2,5)&lt;/span&gt;)&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;If you are new to R you can find out more about these plotting functions by using R help:&amp;nbsp;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;?par&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;,&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&amp;nbsp;?plot&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;,&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&amp;nbsp;?points&amp;nbsp;&lt;span class="Apple-style-span" style="font-family: Courier;"&gt;and so on ...&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: Courier, Courier, monospace;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-family: Courier;"&gt;&lt;span class="Apple-style-span" style="font-family: Courier; font-style: italic;"&gt;References&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;br /&gt;
&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-family: Helvetica; font-size: 12px; font-style: normal;"&gt;Baguley, T., &amp;amp; Edmonds, A. J. (2010). Memory. In P. Banyard, M. N. O. Davies, C. Norman, &amp;amp; B. Winder (Eds.) &lt;a href="http://www.amazon.co.uk/gp/product/1847875386?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=1847875386"&gt;Essential Psychology: A Concise Introduction&lt;/a&gt;&lt;img alt="" border="0" height="1" src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;amp;l=as2&amp;amp;o=2&amp;amp;a=1847875386" style="border: none !important; margin: 0px !important;" width="1" /&gt;&amp;nbsp;(pp. 65-82). London: Sage.&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;/div&gt;&lt;i&gt;&lt;/i&gt;&lt;br /&gt;
&lt;i&gt;&lt;/i&gt;&lt;br /&gt;
&lt;i&gt;&lt;/i&gt;&lt;br /&gt;
&lt;i&gt;&lt;/i&gt;&lt;br /&gt;
&lt;i&gt;&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px; min-height: 14.0px;"&gt;&lt;em&gt;&lt;span style="font-family: Arial; font-style: normal;"&gt;Postman&lt;/span&gt;&lt;/em&gt;&lt;span style="font-family: Arial;"&gt;, L. &amp;amp; &lt;em&gt;&lt;span style="font-style: normal;"&gt;Philips&lt;/span&gt;&lt;/em&gt;, L. W. (&lt;em&gt;&lt;span style="font-style: normal;"&gt;1965&lt;/span&gt;&lt;/em&gt;). Short-term temporal changes in free recall. &lt;i&gt;Quarterly Journal of Experimental Psychology, 17&lt;/i&gt;&lt;/span&gt;&lt;span style="font-family: Arial;"&gt;, 132-138.&lt;/span&gt;&lt;/div&gt;&lt;/i&gt;&lt;br /&gt;
&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: center;"&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://www.amazon.co.uk/gp/product/1847875386?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=1847875386"&gt;&lt;img border="0" src="http://4.bp.blogspot.com/_3fIauBYLLhY/S47DhQ6agBI/AAAAAAAAAD4/8sltf9y5r5Y/s320/51JNUB6mI3L._SL160_.jpg" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;img alt="" border="0" height="1" src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;amp;l=as2&amp;amp;o=2&amp;amp;a=1847875386" style="border: none !important; margin: 0px !important;" width="1" /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 14.0px Courier; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;i&gt;&lt;br /&gt;
&lt;/i&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-7227981003814024457?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/DdOM_Otyf327FyY_d3M0N09HVMw/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/DdOM_Otyf327FyY_d3M0N09HVMw/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/DdOM_Otyf327FyY_d3M0N09HVMw/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/DdOM_Otyf327FyY_d3M0N09HVMw/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/6pTxjoGZiMk" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/7227981003814024457/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2010/03/example-of-plotting-serial-position.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7227981003814024457?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7227981003814024457?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/6pTxjoGZiMk/example-of-plotting-serial-position.html" title="Example of plotting a serial position curve in R" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/_3fIauBYLLhY/S46_ENrgdVI/AAAAAAAAADw/I7TsDMvy6VQ/s72-c/pp65+plot.png" height="72" width="72" /><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2010/03/example-of-plotting-serial-position.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0UFQns5fip7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-6683967616011422003</id><published>2010-02-25T00:23:00.001Z</published><updated>2011-10-04T16:13:33.526+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:13:33.526+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="displaying means" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="ANOVA" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>Interaction plot from cell means</title><content type="html">I needed to produce a few a interaction plots for my book in R and, while the &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;interaction.plot(&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;)&lt;/span&gt; function is useful it has a couple of drawbacks. First, the default output isn't very pretty. Second, it works from the raw data, whereas I often need plots from cell means. For teaching purposes it is quite common to produce plots without raw data (for hypothetical data or from published examples).&lt;br /&gt;
&lt;br /&gt;
My first attempts at the plots involved setting them up element by element. Just going over some examples I decided to turn the basic plot (for a 2 x 2 ANOVA) into a simple function. Nothing fancy, just a regular interaction plot in black and white that I think is prettier than the SPSS, Excel or R defaults. At some point I may have a go turning it into a general I x J ANOVA plot (or maybe even add CIs, but I'll probably do that from raw data if I ever get round to it).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;plot.2by2 &amp;lt;- function(A1B1,A1B2, A2B1, A2B2, group.names, legend = TRUE, leg.loc=NULL, factor.labels=c('Factor A', 'Factor B'), swap = FALSE, ylab= NULL, main = NULL){&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;group.means &amp;lt;- c(A1B1, A2B1, A1B2, A2B2)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;if(missing(ylab)) ylab &amp;lt;- expression(italic(DV))&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;if(swap==TRUE) {&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;group.names &amp;lt;- list(group.names[[2]], group.names[[1]]) ; group.means &amp;lt;- c(A1B1, A1B2, A2B1, A2B2); factor.labels &amp;lt;- c(factor.labels[2], factor.labels[1])&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;}&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;plot(group.means, pch=NA, ylim=c(min(group.means)*.95, max(group.means)*1.025), xlim=c(0.8,2.2), ylab=ylab, xaxt='n', xlab=factor.labels[1], main=main)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;points(group.means[1:2], pch = 21)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;points(group.means[3:4], pch = 19)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;axis(side = 1, at = c(1:2), labels = group.names[[1]])&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;lines(group.means[1:2], lwd = .6, lty = 2)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;lines(group.means[3:4], lwd = .6)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;if(missing(leg.loc)) leg.loc &amp;lt;- c(1,max(group.means))&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;if(legend ==TRUE) legend(leg.loc[1], leg.loc[2],legend = group.names[[2]], &amp;nbsp;title = factor.labels[2], lty = c(3,1))&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;}&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 12.0px Helvetica; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Call the function by entering the four cell means in conventional order: A1B1, A1B2 and so on where A1B1 is the mean of level 1 of factor A at level 1 of factor B. You also need a two item list containing text strings of the two level names of each factor. For instance:&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;lev.names &amp;lt;- list(c('A1', 'A2'), c('B1', 'B2'))&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;plot.2by2(5,15,10,20, lev.names)&lt;/span&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_3fIauBYLLhY/S4W72RriWwI/AAAAAAAAADY/cxsvT33MVYU/s1600-h/tempA.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="400" src="http://3.bp.blogspot.com/_3fIauBYLLhY/S4W72RriWwI/AAAAAAAAADY/cxsvT33MVYU/s400/tempA.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;You can swap the axes by adding the argument &lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;swap = TRUE&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;:&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;plot.2by2(5,15,10,20, lev.names, swap = TRUE)&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_3fIauBYLLhY/TNvkiC5AeXI/AAAAAAAAAEg/V9Zmu23zz2Y/s1600/swap.plot.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="400" src="http://3.bp.blogspot.com/_3fIauBYLLhY/TNvkiC5AeXI/AAAAAAAAAEg/V9Zmu23zz2Y/s400/swap.plot.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;/div&gt;The default factor names are 'Factor A' and 'Factor B', but these are over-ridden&amp;nbsp;in the call:&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;plot.2by2(5,15,10,20,lev.names, swap = TRUE, factor.labels= c('Factor 1','Factor 2'))&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;You can also change the &lt;/span&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;y-&lt;/span&gt;&lt;/i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;axis label with &lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;ylab&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt; or add a main title with &lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;main&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;. &amp;nbsp;The legend can be dropped (&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;legend = FALSE&lt;/span&gt;) if you don't want one or need it to be located outside the plot. To move the legend just specify coordinates with an argument such as &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;leg.loc = c(1,10&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;)&lt;/span&gt;. You can also edit the source code directly.&amp;nbsp;Here is an example with title and meaningful labels:&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;group.names &amp;lt;- list(c('placebo','drug'), c('male', 'female'))&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;&lt;div class="RcodeStyle"&gt;&lt;span lang="EN-US"&gt;  &lt;!--EndFragment--&gt;   &lt;/span&gt;&lt;/div&gt;&lt;div class="RcodeStyle"&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/div&gt;plot.2by2(10,10,15,20, group.names, factor.labels=c('Drug', 'Sex'), swap = FALSE)&lt;o:p&gt;&lt;/o:p&gt;&lt;br /&gt;
&lt;!--EndFragment--&gt;   &lt;/span&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://3.bp.blogspot.com/_3fIauBYLLhY/S4W--6-oydI/AAAAAAAAADo/ZZ8YsWrZX2E/s1600-h/example.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="400" src="http://3.bp.blogspot.com/_3fIauBYLLhY/S4W--6-oydI/AAAAAAAAADo/ZZ8YsWrZX2E/s400/example.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;As this just uses basic plotting functions in R you can also manipulate the plot in other ways: adding lines with &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;segments()&lt;/span&gt;, adding text with &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;text()&lt;/span&gt; changing graphical parameters with &lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;par(&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;"&gt;)&lt;/span&gt; and so on.&amp;nbsp;Depending on your platform it is also easy to extract the plot as a .pdf or .jpg file. On a mac I save it as a .pdf file and open it in preview which allows me to save it as .png, .gif or whatever I need.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-6683967616011422003?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/0i5cSdcSNnUzshoV9eqYpodVBUc/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/0i5cSdcSNnUzshoV9eqYpodVBUc/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/0i5cSdcSNnUzshoV9eqYpodVBUc/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/0i5cSdcSNnUzshoV9eqYpodVBUc/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/ntE9PjYdk60" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/6683967616011422003/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2010/02/interaction-plot-from-cell-means.html#comment-form" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/6683967616011422003?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/6683967616011422003?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/ntE9PjYdk60/interaction-plot-from-cell-means.html" title="Interaction plot from cell means" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://3.bp.blogspot.com/_3fIauBYLLhY/S4W72RriWwI/AAAAAAAAADY/cxsvT33MVYU/s72-c/tempA.png" height="72" width="72" /><thr:total>2</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2010/02/interaction-plot-from-cell-means.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0YNRnwyeCp7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-4284625626024500904</id><published>2010-01-29T16:48:00.035Z</published><updated>2011-10-04T16:13:17.290+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:13:17.290+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="effect size" /><title>When correlations go bad ... (Or, I always wanted to write about the Society for the Suppression of the Correlation Coefficient)</title><content type="html">&lt;div style="text-align: left;"&gt;&lt;div style="text-align: left;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;A while ago I promised a longer post on standardized effect size. This isn't it. Instead it is a link to &lt;/span&gt;&lt;a href="http://www.thepsychologist.org.uk/archive/archive_home.cfm?volumeID=23&amp;amp;editionID=185&amp;amp;ArticleID=1633"&gt;&lt;span class="Apple-style-span" style="color: black;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;my piece in the February 2010 issue of &lt;/span&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;The Psychologist&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;/a&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;.&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;div style="text-align: left;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;br /&gt;
&lt;/span&gt; &lt;/div&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;div style="text-align: auto;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;I had intended to write a short summary of my 2009 BJP paper, but that didn't really get off the ground. However, I had for a while wanted to write about the Society for the Suppression of the Correlation Coefficient. I had first read about this in Tukey's (1954) chapter on regression and path analysis. This is one of the earlier papers in the literature criticizing the preference for (standardized) correlation coefficients over (simple, unstandardized) regression coefficients. Read the article for an earlier example! I was reminded of its existence by Brillinger's (2001) paper.&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;div style="text-align: left;"&gt;&lt;span class="Apple-style-span" style="-webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;As soon as I tried writing about Society for the Suppression of the Correlation Coefficient things (I think) fell into place. The result is a piece that is one part history of statistics trivia, one part mini-tutorial and one part a summary of my 2009 paper.&lt;br /&gt;
&lt;br /&gt;
For non-members of the BPS the link below contains a pre-publication version of 'When correlations go bad ...'.&lt;br /&gt;
&lt;br /&gt;
&lt;div&gt;I may also get around to the web site one day.&lt;br /&gt;
&lt;br /&gt;
P.S. I adapted data and R code from Gelman and Hill (2007) for the example. I chose it because it a nice simple example of regression and because it is also a pointer to someone who argues in favour of standardization )at least in some situations). Both my 2009 paper and the 'When correlations go bad ...' are my attempts at getting people to rethink the use of correlation and standardization. The tone is deliberately (slightly?) polemic. There are other views so don't just take my word for it ... think about what you are trying to do and decide whether a correlation coefficient (or a standardized mean d&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;ifference) is the right option.&lt;/span&gt;&lt;br /&gt;
&lt;div style="text-align: left;"&gt;&lt;div style="text-align: left;"&gt;&lt;span class="Apple-style-span" style="-webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; white-space: pre;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;div style="text-align: left;"&gt;&lt;span class="Apple-style-span" style="-webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; white-space: pre;"&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;References&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;div style="text-align: left;"&gt;&lt;span class="Apple-style-span" style="-webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; white-space: pre;"&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt; &lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;div style="text-align: left;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;a href="http://dx.doi.org/10.1348/000712608X377117"&gt;Baguley, T. (2009). Standardized or simple effect size: What should be reported? British Journal of Psychology. 100, 603–617.&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;a href="http://dx.doi.org/10.1348/000712608X377117"&gt;&lt;/a&gt;&lt;span lang="EN-US"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Baguley, T. (2010). &lt;a href="http://nottinghamtrent.academia.edu/ThomBaguley/Papers/138898/"&gt;When correlations go bad&lt;/a&gt; … &lt;/span&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;The Psychologist, 23&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;, 122-3.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span lang="EN-US"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;a href="http://nottinghamtrent.academia.edu/ThomBaguley/Papers/138898/"&gt;&lt;/a&gt;Brillinger, D.R. (2001). John Tukey and the correlation coefficient, Computing Science and Statistics, 33, 204–218.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Gelman, A. &amp;amp; Hill, J. (2007). &lt;/span&gt;&lt;a href="http://www.amazon.co.uk/gp/product/052168689X?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=052168689X"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Data Analysis Using Regression and Multilevel/Hierarchical Models (Analytical Methods for Social Research)&lt;/span&gt;&lt;/a&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;img alt="" border="0" height="1" src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;amp;l=as2&amp;amp;o=2&amp;amp;a=052168689X" style="border: none !important; margin: 0px !important;" width="1" /&gt;. Cambridge: Cambridge University Press.&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;Tukey, J.W. (1954). Causation, regression and path analysis. In O. Kempthorne, T.A. Bancroft, J.W. Gowen &amp;amp; J.L. Lush (Eds.) Statistics and mathematics in biology (pp.35–66). Ames, IA: Iowa State College Press.&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-4284625626024500904?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/g96jbZt5CL4sZoz7VLjZVYihTUo/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/g96jbZt5CL4sZoz7VLjZVYihTUo/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/g96jbZt5CL4sZoz7VLjZVYihTUo/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/g96jbZt5CL4sZoz7VLjZVYihTUo/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/sRdNPDxFf6I" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/4284625626024500904/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2010/01/when-correlations-go-bad-or-i-always.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/4284625626024500904?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/4284625626024500904?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/sRdNPDxFf6I/when-correlations-go-bad-or-i-always.html" title="When correlations go bad ... (Or, I always wanted to write about the Society for the Suppression of the Correlation Coefficient)" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2010/01/when-correlations-go-bad-or-i-always.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0QHSXg-fip7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-1846690160708647315</id><published>2010-01-25T09:40:00.003Z</published><updated>2011-10-04T16:15:38.656+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:15:38.656+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="multilevel models" /><category scheme="http://www.blogger.com/atom/ns#" term="sphericity" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="philosophy" /><title>Strathclyde multilevel modeling talk</title><content type="html">I recently gave an introduction to &lt;a href="http://bps-msc.blogspot.com/2010/01/psypag-statistics-workshop-in.html"&gt;multilevel modeling talk at Strathclyde University&lt;/a&gt; for &lt;a href="http://www.psypag.co.uk/"&gt;PsyPAG&lt;/a&gt;. I promised to make my slides available. I'm still using powerpoint (and regretting it) so some of the symbols may be garbled. (I noticed that powerpoint turned all my tilda symbols to colons during the talk). So I hope it is readable:&lt;br /&gt;
&lt;br /&gt;
Here is the link:&lt;br /&gt;
&lt;br /&gt;
&lt;iframe src="http://docs.google.com/present/embed?id=dp86h6t_49dt38wwdx&amp;interval=15" frameborder="0" width="410" height="342"&gt;&lt;/iframe&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-1846690160708647315?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/IEvLurkuQ79b-gVb793k3ibxlR4/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/IEvLurkuQ79b-gVb793k3ibxlR4/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/IEvLurkuQ79b-gVb793k3ibxlR4/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/IEvLurkuQ79b-gVb793k3ibxlR4/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/wxpeyMHsvL4" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/1846690160708647315/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2010/01/strathclyde-multilevel-modeling-talk.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/1846690160708647315?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/1846690160708647315?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/wxpeyMHsvL4/strathclyde-multilevel-modeling-talk.html" title="Strathclyde multilevel modeling talk" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2010/01/strathclyde-multilevel-modeling-talk.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0UNQn49fip7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-6707489846455399931</id><published>2009-12-13T10:26:00.119Z</published><updated>2011-10-04T16:14:53.066+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:14:53.066+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="multilevel models" /><category scheme="http://www.blogger.com/atom/ns#" term="puzzle" /><title>A statistical puzzle about averages II</title><content type="html">&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;div style="font: normal normal normal 16px/normal 'Times New Roman'; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;span style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;1. Who is correct?&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &amp;nbsp;Professor B is correct. If the average family has 1.8 children then the average child would be expected to have more than 0.8 siblings.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;2. Why?&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &amp;nbsp;The average child is not from the average family. The concepts of average child and average family are different. For this reason there should no expectation that the average child should be from a family with an average number of children. Although there are restricted circumstances under which this can happen, they are sufficiently implausible to be discounted in any real world application (e.g., if all families have exactly the same number of children).&lt;br /&gt;
&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt;  &lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/div&gt;&lt;div style="font: normal normal normal 16px/normal 'Times New Roman'; margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;"&gt;&lt;/div&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;In one case the unit of analysis of the family and in the other it is the child. A concrete example may help. (I'll stick to defining average as arithmetic mean throughout, but the same logic extends to other averages such as the median - see the quotation from Kish below).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Assuming that the number of children varies between families (which it must do if the mean number of children per family is 1.8) then the average child will be from a family with a larger number of children than average. For example, imagine there are only four families:&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;table cellpadding="0" cellspacing="0" style="border-collapse: collapse;"&gt;&lt;tbody&gt;
&lt;tr&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 63.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 147.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td colspan="4" style="padding: 0.0px 7.0px 0.0px 7.0px; width: 332.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Number of siblings per child&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;/tr&gt;
&lt;tr&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 63.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Family&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 147.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Number of Children&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 70.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;1&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font: normal normal normal 13px/normal 'Times New Roman';"&gt;&lt;i&gt;&lt;sup&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;st&lt;/span&gt;&lt;/span&gt;&lt;/sup&gt;&lt;/i&gt;&lt;/span&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; child&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 73.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;2&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font: normal normal normal 13px/normal 'Times New Roman';"&gt;&lt;i&gt;&lt;sup&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;nd&lt;/span&gt;&lt;/span&gt;&lt;/sup&gt;&lt;/i&gt;&lt;/span&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; child&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 75.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;3&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font: normal normal normal 13px/normal 'Times New Roman';"&gt;&lt;i&gt;&lt;sup&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;rd&lt;/span&gt;&lt;/span&gt;&lt;/sup&gt;&lt;/i&gt;&lt;/span&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; child&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 72.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;4&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font: normal normal normal 13px/normal 'Times New Roman';"&gt;&lt;i&gt;&lt;sup&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;th&lt;/span&gt;&lt;/span&gt;&lt;/sup&gt;&lt;/i&gt;&lt;/span&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; child&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;/tr&gt;
&lt;tr&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 63.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;b&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;A&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 147.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;1&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 70.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 73.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 75.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 72.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;/tr&gt;
&lt;tr&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 63.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;b&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;B&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 147.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;1&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 70.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 73.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 75.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 72.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;/tr&gt;
&lt;tr&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 63.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;b&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;C&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 147.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;2&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 70.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;1&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 73.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;1&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 75.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 72.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;-&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;/tr&gt;
&lt;tr&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 63.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;b&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;D&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 147.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;4&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 70.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;3&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 73.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;3&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 75.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;3&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;td style="padding: 0.0px 7.0px 0.0px 7.0px; width: 72.0px;" valign="top"&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px; text-align: center;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;3&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;/td&gt; &lt;/tr&gt;
&lt;/tbody&gt; &lt;/table&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;There are (1+1+2+4) = 8 children in the four families, thus the mean number of children per family is: 8/4 = 2.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;It follows that the mean number of siblings per child is therefore:&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;(0+0+1+1+3+3+3+3)/8 = 14/8 = 1.75&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;So although each family has only two children (on average) each child has 1.75 siblings (not 1 sibling).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Note is that there are &lt;/span&gt;&lt;/span&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;N&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; = 4 families and &lt;/span&gt;&lt;/span&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;n&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; = 8 children. So switching the unit of analysis changes the denominator. Also note that while the families can plausibly be considered independent of each other the children can't (all children in the same family have the same number of siblings in this example, and more generally the number of siblings will be correlated).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;What about zeroes?&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&amp;nbsp;In the calculations above I excluded childless households as families. If you include only households without children as families the discrepancy would be larger.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Does it matte&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;r? &amp;nbsp;Yes. Much real world data is clustered in this way. It is important to realize that the average &lt;/span&gt;&lt;/span&gt;&lt;category member=""&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;category memory=""&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;is not going to be from the average &lt;/span&gt;&lt;/span&gt;&lt;category&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; (e.g., the average worker isn't from the average firm)&lt;/span&gt;&lt;/span&gt;&lt;category&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;.&lt;/span&gt;&lt;/span&gt;&lt;/category&gt;&lt;/category&gt;&lt;/category&gt;&lt;/category&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;category memory=""&gt;&lt;category&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;In practical terms this means that careful attention needs to be paid to sampling from families, workplaces, schools and so forth. A random sample of children will disproportionately sample children from large families. There are also social policy implications (e.g., if you are interested in reducing child poverty). Another example is that a&amp;nbsp;random sample of schools will disproportionately sample small schools.&lt;/span&gt;&lt;/span&gt;&lt;/category&gt;&lt;/category&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;category memory=""&gt;&lt;category&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;There are also other problems with analysis of clustered data. For this reason anyone working with clustered data needs to seriously consider using multilevel modeling or other methods that i) &amp;nbsp;take into the clustering and ii) allow hypotheses about different levels of the model (e.g., children and families) to be explored.&lt;/span&gt;&lt;/span&gt;&lt;/category&gt;&lt;/category&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;i&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Postscript&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;This is quite an old puzzle. I first came across this puzzle in the article by Jenkins and Tuten (1992). They include formulae for deriving one average from the other an&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;d cite&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;Huntington (1924) and other later observations of the same phenomenon.&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;I made the connection to multilevel models a little later. For a good (if slightly out of date) introduction to multilevel models see Snijders and Bosker (1999). Recently I noticed that Kish (1965) discusses the same phenomenon in passing.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;This quote from Kish sets out the problem quite clearly&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;:&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;blockquote&gt;&lt;span style="font-family: 'Trebuchet MS', sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;Although the mean number of adults per household is only 2.02, the mean number of household members is 2.24 for the average adult. The greater size ranges of large organizations produce more striking effects. In 1960, 50 million people lived in 130 U. S. cities that had 100,000 or more population; in this population, the average city size was 0.39 million, but the size of the city in which the average person lived was 2.0 millions. Using medians does not help: the median city size was 0.19 million, but the median person lived in a city of 0.62 million.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/blockquote&gt;&lt;div style="text-align: right;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;Kish (1965, p. 571).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-style: italic;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;References&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div style="font: 16.0px Times New Roman; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;Jenkins, J. J., &amp;amp; Tuten, J. T. (1992). Why isn’t the average child from the average family? – and similar puzzles. &lt;/span&gt;&lt;/span&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;American Journal of Psychology, 105&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;, 517-526.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;br /&gt;
&lt;em style="font-style: normal;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;Kish&lt;/span&gt;&lt;/span&gt;&lt;/em&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;, L. (&lt;/span&gt;&lt;/span&gt;&lt;em style="font-style: normal;"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;1965&lt;/span&gt;&lt;/span&gt;&lt;/em&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;). Sampling Organizations and Groups of Unequal Sizes,&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;American Sociological Review, 20&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;, 564-72&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&lt;br /&gt;
&lt;/span&gt;&lt;/span&gt; &lt;br /&gt;
&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;Sniders, T. &amp;amp; Boskers, R. (1999).&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;a href="http://www.amazon.co.uk/gp/product/0761958908?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0761958908"&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;Multilevel Analysis: An Introduction to Basic and Advanced Multilevel Modeling.&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-size: medium;"&gt;&lt;span style="font-family: Times, 'Times New Roman', serif;"&gt;London: Sage.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-6707489846455399931?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/dnd7GYzKBHgtr4TwMKE8bsKN-To/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/dnd7GYzKBHgtr4TwMKE8bsKN-To/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/dnd7GYzKBHgtr4TwMKE8bsKN-To/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/dnd7GYzKBHgtr4TwMKE8bsKN-To/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/HDkW-cL8cbc" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/6707489846455399931/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2009/12/answer-to-statistical-puzzle-about.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/6707489846455399931?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/6707489846455399931?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/HDkW-cL8cbc/answer-to-statistical-puzzle-about.html" title="A statistical puzzle about averages II" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2009/12/answer-to-statistical-puzzle-about.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0UNQn49fyp7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-4127323298397205135</id><published>2009-12-09T22:46:00.002Z</published><updated>2011-10-04T16:14:53.067+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:14:53.067+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="multilevel models" /><category scheme="http://www.blogger.com/atom/ns#" term="puzzle" /><title>A statistical puzzle about averages I</title><content type="html">&lt;span style="font-style: italic;"&gt;I wrote this a few years ago for a departmental newsletter. For some reason the second part (with the answer) never got published. I stumbled across it almost by accident the other day and thought I'd share it. I'll publish the canonical answer in due course.&lt;/span&gt;&lt;br /&gt;
&lt;div class="Section1"&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;Professor Quack knows (from the UK census) that the average family&amp;nbsp;has an arithmetic mean of 1.8 children. He also knows that (due to a bizarre&amp;nbsp;mix-up in enrolment) that his &lt;i&gt;Psychology For Everyone&lt;/i&gt;&lt;span style="font-style: normal;"&gt; class is attended by a random sample of 50 people from the UK population. As&amp;nbsp;part of an in-class demonstration of&amp;nbsp;sampling theory he records the number&amp;nbsp;of siblings of each student and calculates the average (using the standard formula for the arithmetic mean).&amp;nbsp;&lt;/span&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;span style="font-style: normal;"&gt;To his dismay he&amp;nbsp;discovers that the students in his class have an arithmetic mean of 1.2&amp;nbsp;siblings. He later repeats the demonstration with two more random samples of&amp;nbsp;the UK population (again with &lt;/span&gt;&lt;i&gt;n&lt;/i&gt;&lt;span style="font-style: normal;"&gt; = 50) and obtains values of 1.1 and 1.3 siblings.&lt;/span&gt;&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;Profesor Quack consults two of his colleagues: Professor A&amp;nbsp;and Professor B.&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;Professor A replies thus:&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;“You are right to be dismayed. Bias has somehow entered&amp;nbsp;either your sampling procedure or your calculation of the mean. The true mean&amp;nbsp;number of siblings should be 1.8 - 1 = 0.8.”&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;Professor B interjects thus:&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;“Nonsense! All is as it should be. The expected number of&amp;nbsp;siblings in a random sample is most certainly not 0.8. Rather, one would expect&amp;nbsp;the average student to have more than 0.8 siblings, just as you have observed.”&lt;br /&gt;
&lt;br /&gt;
&lt;/div&gt;&lt;div class="MsoNormal"&gt;1) Who is correct? &amp;nbsp;Professor A or Professor B?&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div class="MsoNormal"&gt;2) Why?&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-4127323298397205135?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/KWutSEb1FRz3Ys3EbqKYD0F-ECY/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/KWutSEb1FRz3Ys3EbqKYD0F-ECY/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/KWutSEb1FRz3Ys3EbqKYD0F-ECY/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/KWutSEb1FRz3Ys3EbqKYD0F-ECY/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/EYvNLO4Mxxg" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/4127323298397205135/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2009/12/statistical-puzzle-about-averages-i.html#comment-form" title="5 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/4127323298397205135?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/4127323298397205135?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/EYvNLO4Mxxg/statistical-puzzle-about-averages-i.html" title="A statistical puzzle about averages I" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>5</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2009/12/statistical-puzzle-about-averages-i.html</feedburner:origLink></entry><entry gd:etag="W/&quot;A0AHQHYyfip7ImA9WhdUF04.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-2830032057768498548</id><published>2009-11-17T09:43:00.016Z</published><updated>2011-10-04T15:48:51.896+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T15:48:51.896+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="philosophy" /><category scheme="http://www.blogger.com/atom/ns#" term="Bayesian statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="likelihood" /><category scheme="http://www.blogger.com/atom/ns#" term="science" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><category scheme="http://www.blogger.com/atom/ns#" term="Bayes factors" /><title>R functions for Dienes (2008) Understanding Psychology as a Science</title><content type="html">&lt;div&gt;I recently wrote a review of Understanding psychology as a science: an introduction to scientific and statistical inference by &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_0"&gt;Zoltan&lt;/span&gt; &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_1"&gt;Dienes&lt;/span&gt; (2008). &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_2"&gt;Dienes&lt;/span&gt;' book covers &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_3"&gt;Neyman&lt;/span&gt;-Pearson null hypothesis significance testing, Bayesian inference and the likelihood method of inference (inspired by Fisher and associated with A. W. F. Edwards and more recently R. Royall).&lt;/div&gt;&lt;div&gt; &lt;/div&gt;&lt;br /&gt;&lt;div&gt;One of the most useful features of the book is that &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_4"&gt;Dienes&lt;/span&gt; provides &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_5"&gt;Matlab&lt;/span&gt; code for examples of calculations in the book (e.g., for Bayes factors, likelihood intervals and so forth). This is not so useful for me because I don't use &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_6"&gt;Matlab&lt;/span&gt;. &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_7"&gt;Matlab&lt;/span&gt; licenses are also quite expensive and may not be possible for students to access it in many Psychology departments. For those without access to &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_8"&gt;Matlab&lt;/span&gt;, &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_9"&gt;Dienes&lt;/span&gt; also provides calculators for a number of functions on his own &lt;a href="http://www.lifesci.sussex.ac.uk/home/Zoltan_Dienes/inference/index.htm"&gt;web page&lt;/a&gt; for the book. (The calculators are found by following the links to the appropriate chapter, so the Bayes factor calculator is found by following the Chapter 4 link).&lt;/div&gt;&lt;br /&gt;&lt;div&gt;Danny Kaye and I thought it would be useful to write R code to compliment the &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_10"&gt;Matlab&lt;/span&gt; code for &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_11"&gt;Dienes&lt;/span&gt;' functions as a 'bonus feature' for the review. As these functions and the notes for them take up quite a bit of space we decided to include only one, for a Bayes factor, in the review itself (with some notes on how to use it). Danny did most of the work writing functions, which are more-or-less direct translations of the original &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_12"&gt;Matlab&lt;/span&gt; code (and have been checked against the web versions). The &lt;a href="http://danny-kaye.co.uk/Docs/Dienes_functions.txt"&gt;full set of functions&lt;/a&gt; is hosted on his web site along with the &lt;a href="http://danny-kaye.co.uk/Docs/Dienes_notes.pdf"&gt;notes on how to use them&lt;/a&gt;. Also included are page references for the examples in the book.&lt;/div&gt;&lt;br /&gt;&lt;div&gt;Why did we write the R functions? First, they offer convenient access to the functions for teachers and students (because R is free and runs on Windows, Mac OS or Linux operating systems). Second, it reduces the burden on &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_13"&gt;Dienes&lt;/span&gt;' web calculator (at a marginal decrease in ease of use). Third, R is open source so it is simple to see how the code works and to edit, extend or adapt it (though it is polite to acknowledge the authors of the original code). Fourth, we want to encourage more people to start using R!&lt;/div&gt;&lt;br /&gt;&lt;div&gt;As an example, I've already written some alternative functions for likelihood intervals (though as I happened I re-wrote these almost from scratch to get them to plot the likelihood function and interval and to take advantage of some built-in R functions). Those functions are intended for a the book I'm working on and so should appear in due course.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;For those who are interested Danny and I are presently working on implementing Bayesian &lt;i&gt;t&lt;/i&gt; tests in R (Bayes factors with objective priors) in a user-friendly way for researchers.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;div&gt; &lt;/div&gt;&lt;i&gt;References:&lt;/i&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_14"&gt;Baguley&lt;/span&gt;, T., &amp;amp; Kaye, W.S. (in press, 2009). Review of Understanding psychology as a science: An introduction to scientific and statistical inference.&lt;span class="Apple-style-span" style="font-style: italic; "&gt; &lt;a href="http://www.bpsjournals.co.uk/journals/bjmsp/"&gt;British Journal of Mathematical &amp;amp; Statistical Psychology&lt;/a&gt;.&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;i&gt;&lt;br /&gt;&lt;/i&gt;&lt;/div&gt;&lt;div&gt;&lt;!--StartFragment--&gt;  &lt;p class="ReferenceStyle"&gt;&lt;span lang="EN-US"&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_15"&gt;Dienes&lt;/span&gt;, Z. (2008). &lt;i&gt;&lt;span class="Apple-style-span" style="font-style: normal; "&gt;&lt;a href="http://www.amazon.co.uk/gp/product/023054231X?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=023054231X"&gt;Understanding Psychology as a Science: An Introduction to Scientific and Statistical Inference.&lt;/a&gt;&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_16"&gt;Basingstoke&lt;/span&gt;: &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_17"&gt;Palgrave&lt;/span&gt; Macmillan.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-2830032057768498548?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/RgPHFVJHqrRFjBWuiTBdXni4VLY/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/RgPHFVJHqrRFjBWuiTBdXni4VLY/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/RgPHFVJHqrRFjBWuiTBdXni4VLY/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/RgPHFVJHqrRFjBWuiTBdXni4VLY/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/PP4FFjhm1XE" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/2830032057768498548/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2009/11/r-functions-for-dienes-2008.html#comment-form" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/2830032057768498548?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/2830032057768498548?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/PP4FFjhm1XE/r-functions-for-dienes-2008.html" title="R functions for Dienes (2008) Understanding Psychology as a Science" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>2</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2009/11/r-functions-for-dienes-2008.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0QMR3o_fSp7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-7335983259524795784</id><published>2009-10-20T10:52:00.020+01:00</published><updated>2011-10-04T16:16:26.445+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:16:26.445+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="effect size" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><title>Chi-square test of independence and the odds ratio</title><content type="html">&lt;div&gt;Every time I teach an introductory statistics course I'm struck by how difficult it is to run a simple 2 by 2 chi-square test of independence. SPSS is one of the worst culprits (but even in R it almost seems more trouble than it is worth to use a computer). One solution is calculate it by hand (and that's what I do in my introductory classes). This leaves me with a practical problem - how can I quickly and easily calculate solutions to practical exercises, check student work or calculate chi-square statistics for my own work?&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div&gt;I've mostly assumed that my solution is so obvious that it is what most statistics teachers do. I just set up an Excel spreadsheet that calculates the chi-square statistic, degrees of freedom and &lt;em&gt;p&lt;/em&gt; value for a 2 by 2 table. Thus all I have to do is type in four numbers.&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div&gt;Over the years I've included a few other features - it displays the intermediate calculations (useful for checking student work), calculates standardized residuals and various common 'effect size' estimates. These are phi, phi-squared and the odds ratio. For instance, I also added the Haldane estimator of the odds ratio (which is the odds ratio estimated after adding 0.5 to each cell). This is a useful estimator when observed counts are zero or very small.&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div&gt;My favourite application of the spreadsheet is when editing or reviewing journal submissions reporting chi-square. Often they will not consider the odds ratio. The spreadsheet means I can check calculations for accuracy (if they look dubious) and I can easily include odds ratios in my review or decision (all in under a minute). The odds ratio is useful because it strips out the base rates when comparing effects from different conditions.&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div&gt;Using the absolute difference in rates for chi-squares with different base rates can be misleading. For instance, the difference in solution rates in an easy problem solving task might be 70% - 50% = 20%. For a very hard problem it might be 20% - 10% = 10%. Comparing the absolute differences in rates (often called the ARR or &lt;a href="http://en.wikipedia.org/wiki/Absolute_risk_reduction"&gt;absolute risk reduction&lt;/a&gt; in medical settings) would be misleading. The 10% difference is probably more impressive for the very hard problem (it represents a doubling or halving of solution rates). The odds ratio allows a comparison of the probability of solution relative to the probability of non-solution (i.e., the odds) and thus strips out the base rate impact (by scaling in terms of the probability of non-solution). The odds ratio also has other nice mathematical properties (which I won't go into here).&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div&gt;Not everyone loves odds ratios. Medics often dislike them ... because they strip out base rates! For interpreting and communicating medical risks the base rates are important. It matters whether a disease has a base rate of 1 in 100 or 1 in 10. So stripping out the base rate might be misleading in this context. Even so, medics seem unreasonably biased against odds ratios. If the base rates in your sample are dodgy (i.e., don't reflect the population you are interested in) the odds ratio is probably a safer bet. If you know the odds ratio and the correct base rates you can estimate the true risks by combining the two.&lt;br /&gt;
&lt;/div&gt;&lt;div&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div&gt;(This is one reason why I think that the quest for the perfect effect size statistic is flawed. Different statistics are required for different jobs. Odds ratios, however, get a (largely undeserved) bad press. Most psychologists will probably be better off using them than other measures for 2 by 2 tables, because we rarely have samples that accurately sample the 'true' base rate. This is because our samples are either unrepresentative or, even more importantly, there may be no 'true' base rate. The problem solving example illustrates this. A problem doesn't really have a 'true' difficulty level that we are trying to generalize to or make decisions about.)&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div&gt;If you want to use the spreadsheet you can download it from &lt;a href="http://spreadsheets.google.com/ccc?key=0AsYTvx5C8MjXdGxHWG9UV1Mza1phM2lqTUhyMmhvSFE&amp;amp;hl=en_GB"&gt;here&lt;/a&gt;. Alternatively, just set up your own (it is quite a useful exercise for understanding how chi-square, odds ratios and so on work). On the other hand my spreadsheet does a few of the slightly awkward calculations for you (e.g., CIs for the odds ratios). The first sheet has very basic instructions. The second sheet has the 2 by 2 table calculations (and effect sizes). The other sheets provide basic statistics for 2 by 3 and 3 by 3 tables. No effect size metrics are included for the latter cases because I think they are not very meaningful for effects with &lt;i&gt;df&lt;/i&gt; &amp;gt; 1 (and I've never needed them for real research).&lt;br /&gt;
&lt;/div&gt;&lt;div&gt;&lt;br /&gt;
&lt;/div&gt;&lt;div&gt;Note: for best results (to preserve formatting etc.) download the document as an Excel file.&lt;br /&gt;
&lt;/div&gt;&lt;br /&gt;
&lt;div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-7335983259524795784?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/2QKEIfqTO7-H7QDmy4YS4TD-eqs/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/2QKEIfqTO7-H7QDmy4YS4TD-eqs/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/2QKEIfqTO7-H7QDmy4YS4TD-eqs/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/2QKEIfqTO7-H7QDmy4YS4TD-eqs/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/sYeRW6u6Hvg" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/7335983259524795784/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2009/10/chi-square-test-of-independence-and.html#comment-form" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7335983259524795784?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7335983259524795784?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/sYeRW6u6Hvg/chi-square-test-of-independence-and.html" title="Chi-square test of independence and the odds ratio" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>1</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2009/10/chi-square-test-of-independence-and.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0UGRno9cCp7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-3600043174785579580</id><published>2009-03-12T19:13:00.005Z</published><updated>2011-10-04T16:13:47.468+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:13:47.468+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="effect size" /><category scheme="http://www.blogger.com/atom/ns#" term="moderated multiple regression" /><category scheme="http://www.blogger.com/atom/ns#" term="moderator effects" /><title>Don't standardize interaction/moderator effects in multiple regression</title><content type="html">&lt;div&gt;This is another quick blog entry related to a query I had today.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;In my &lt;a href="http://psychologicalstatistics.blogspot.com/2007/10/introduction-to-moderated-multiple.html"&gt;moderated multiple regression workshop&lt;/a&gt; a while back I wrote (on slide 15) "don't use standardized regression coefficients". In the talk itself I briefly mentioned why and directed people to the relevant section of the (excellent) explanation by Kris Preacher:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;&lt;span class="Apple-style-span" style="color: rgb(0, 0, 238); text-decoration: underline;"&gt;&lt;a href="http://people.ku.edu/~preacher/interact/interactions.htm#stand"&gt;A primer on interaction effects in multiple linear regression.&lt;/a&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="color: rgb(0, 0, 238); text-decoration: underline;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;I'd like to clarify &lt;span class="Apple-style-span" style="font-style: italic;"&gt;why&lt;/span&gt; standardization is a &lt;span class="Apple-style-span" style="font-style: italic;"&gt;particularly&lt;/span&gt; bad idea in this case. My dislike of 'standardization' is fairly well known, and it goes without saying (I hope) that one reason not to use standardized regression coefficient relates to this. I hope to write about this in detail soon (but check out my &lt;a href="http://www.ingentaconnect.com/content/bpsoc/bjp/pre-prints/310805"&gt;BJP article on effect size&lt;/a&gt; for further details).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;However, there are additional reasons why standardizing predictors will cause trouble in moderated  multiple regression. Standardization involves centering the predictors and scaling them terms of their sample SDs. Centering is very often a useful thing to do in moderated multiple regression. However,  statistics packages such as SPSS will standardize all the predictors - including the product terms -  in moderated multiple regression. This is because they have no way of knowing that the product term is not a 'regular' predictor (similarly if anyone were foolish enough to do a stepwise regression ... the software would not know to keep in &lt;span class="Apple-style-span" style="font-style: italic;"&gt;X1&lt;/span&gt; and &lt;span class="Apple-style-span" style="font-style: italic;"&gt;X2&lt;/span&gt; for each &lt;span class="Apple-style-span" style="font-style: italic;"&gt;X1.X2&lt;/span&gt; product term). This means that the &lt;span class="Apple-style-span" style="font-style: italic;"&gt;X1.X2&lt;/span&gt; product term will be standardized along with &lt;span class="Apple-style-span" style="font-style: italic;"&gt;X1&lt;/span&gt; and &lt;span class="Apple-style-span" style="font-style: italic;"&gt;X2&lt;/span&gt; rather than being computed (correctly) as the product of the two standardized predictors (i.e., &lt;span class="Apple-style-span" style="font-style: italic;"&gt;Zx1&lt;/span&gt; and &lt;span class="Apple-style-span" style="font-style: italic;"&gt;Zx2&lt;/span&gt; multiplied together).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;That's clearly a problem (the &lt;span class="Apple-style-span" style="font-style: italic;"&gt;t&lt;/span&gt; test should be OK but the value of the coefficient and simple slopes will be wrong). However, the clincher is that even if the correct standardization is carried out (e.g., computing the standardized predictors yourself and then taking the product of the relevant standardized predictors and entering it into the regression) the standard errors will be incorrect (which is problematic for constructing confidence and prediction intervals).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;In summary, you can't rely on the software to get the standardization correct so use the unstandardized regression coefficients and standard errors! (Also, did I mention that standardized coefficients are generally a bad idea anyway?).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-3600043174785579580?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/Kt8DX6eL73ZZ4prxbqzL3QbXQkQ/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/Kt8DX6eL73ZZ4prxbqzL3QbXQkQ/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/Kt8DX6eL73ZZ4prxbqzL3QbXQkQ/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/Kt8DX6eL73ZZ4prxbqzL3QbXQkQ/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/SAFMLJWlG7g" height="1" width="1"/&gt;</content><link rel="related" href="http://people.ku.edu/~preacher/interact/interactions.htm#stand" title="Don't standardize interaction/moderator effects in multiple regression" /><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/3600043174785579580/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2009/03/dont-standardize-interactionmoderator.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/3600043174785579580?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/3600043174785579580?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/SAFMLJWlG7g/dont-standardize-interactionmoderator.html" title="Don't standardize interaction/moderator effects in multiple regression" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2009/03/dont-standardize-interactionmoderator.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C0QHSXg-fyp7ImA9WhdUF0k.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-5565370538913067015</id><published>2009-01-12T23:07:00.004Z</published><updated>2011-10-04T16:15:38.657+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-04T16:15:38.657+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><category scheme="http://www.blogger.com/atom/ns#" term="R" /><title>R resources for psychologists</title><content type="html">As a quick follow-up to my previous post, I'll quickly note some R links specifically aimed at psychologists. Jonathan Baron and &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_0"&gt;Yuelin&lt;/span&gt; Li have a excellent set of &lt;a href="http://www.psych.upenn.edu/~baron/rpsych/rpsych.html"&gt;notes&lt;/a&gt; that cover ground from R basics to &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_1"&gt;ANOVA&lt;/span&gt; (including repeated measures &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_2"&gt;ANOVA&lt;/span&gt;) and beyond. Jonathan Baron also maintains a general set of &lt;a href="http://finzi.psych.upenn.edu/"&gt;R links&lt;/a&gt;. William &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_3"&gt;Revelle's&lt;/span&gt; &lt;a href="http://personality-project.org/r/r.guide.html"&gt;R guide for psychological research&lt;/a&gt; is also very good and has more emphasis on &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_4"&gt;psychometrics&lt;/span&gt; than Baron and Li.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Pilgrim's blog is an &lt;a href="http://theregressingpilgrim.blogspot.com/"&gt;R resource for the exercises&lt;/a&gt; in the classic &lt;a href="http://www.amazon.co.uk/gp/product/0805822232?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0805822232"&gt;Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences&lt;/a&gt;&lt;img src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;amp;l=as2&amp;amp;o=2&amp;amp;a=0805822232" width="1" height="1" border="0" alt="" style="border:none !important; margin:0px !important;" /&gt; text.&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-5565370538913067015?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/WvGJtSsHvJQFVKmx6jWho60lXzQ/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/WvGJtSsHvJQFVKmx6jWho60lXzQ/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/WvGJtSsHvJQFVKmx6jWho60lXzQ/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/WvGJtSsHvJQFVKmx6jWho60lXzQ/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/cnkFbKPmUMs" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/5565370538913067015/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2009/01/r-resources-for-psychologists.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/5565370538913067015?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/5565370538913067015?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/cnkFbKPmUMs/r-resources-for-psychologists.html" title="R resources for psychologists" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2009/01/r-resources-for-psychologists.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D04HQX4-cCp7ImA9WxVVGEU.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-8552169957558482645</id><published>2009-01-09T19:32:00.021Z</published><updated>2009-03-12T19:25:30.058Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-03-12T19:25:30.058Z</app:edited><title>Simulating data for inquiry based learning</title><content type="html">I've just presented a talk on at the &lt;font class="Apple-style-span" style="font-style: italic;"&gt;&lt;a href="http://www.psychology.heacademy.ac.uk/html/event_reports_2009.asp"&gt;Statistics for Psychology Students&lt;/a&gt;&lt;/font&gt; workshop for the HEA Psychology Network in York. Richard Rowe (Psychology, Sheffield) gave an interesting talk on teaching statistics via inquiry based learning. Part of the work involved using getting students to generate their own research questions in a tutorial and then analyzing data addressing these questions in a follow-up tutorial. A rather clever idea he reported was to generate suitable data via simulation in STATA. The students were first or second year undergraduates and so the experimental designs they came up with were constrained to simple cases (e.g., &lt;font class="Apple-style-span" style="font-style: italic;"&gt;t&lt;/font&gt; tests, correlations and chi-square for the first years; one-way ANOVA with 3 levels for the second years). This meant that whoever led the tutorial (e.g., a Ph.D. student) could generate tailored data to match the students' research designs.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The novel and clever aspect of this isn't simulating real data sets or getting students to come up with their own research ideas, but combining the two. This means that you can run these sessions without getting students to collect real data. (There is a place for that too, but collecting real data has massive overheads in terms of student time, staff time, ethics approval and so forth). Furthermore, students can legitimately come up with ideas for studies that can't be run by undergraduates for ethical, resource or other reasons - for example using clinical populations). I can also see other uses for this. For example, a supervisor could simulate data for a final year project and a student could use the simulated data as a dry run for the real analysis. (The real data will usually be much messier, but I think that getting familiar with the statistical software and analyses to be used could be useful for some students).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;A few members of the audience asked questions about STATA (which I'm not familar with) and I pointed out that you could do the same things in &lt;a href="http://www.r-project.org/"&gt;R&lt;/a&gt; for free. As I understand it STATA allows you to simulate data with a specified covariance matrix fairly easily. I'm sure this can be done in R too, but I'm still learning how to use R and tracking down the right package and commands would have taken a bit of time. In any case, for this to be useful it needs to be very simple and easy to run by people with relatively basic statistical computing skills (and who have never used R before). So I set myself a challenge of writing code that should run with functions from the R base package, be trivial to edit and generate usable data for two common analyses: the independent &lt;font class="Apple-style-span" style="font-style: italic;"&gt;t&lt;/font&gt; test and a bivariate correlation.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;It took me about 2 or 3 minutes to write the independent &lt;font class="Apple-style-span" style="font-style: italic;"&gt;t&lt;/font&gt; test code. (This may sound impressive but a competent R programmer could probably do it in under a minute). It took a little longer to comment it and work out how to write it to a tab delimited text file. It took me about 5 or 10 minutes to do the bivariate correlation and a little longer to fix a rather stupid error I'd made. All the code is rather clumsily written (and in some cases I've deliberately separated out steps to make it easier for someone unfamiliar with R to follow). The correlation solution (in particular) isn't very good and specifying a covariance matrix to constrain a simulation would be much more satisfactory. (I should also note that it uses a trick I picked up some time ago from &lt;a href="http://www.pbarrett.net/"&gt;Paul Barrett's web site&lt;/a&gt; in his article on &lt;a href="http://www.pbarrett.net/techpapers/likert.pdf"&gt;Correlation attenuation due to Likert categorization&lt;/a&gt;.) The key point is that it took longer to describe here than it took to write. I'm sure better solutions exist in R, but these ones work and could easily be extended to paired &lt;font class="Apple-style-span" style="font-style: italic;"&gt;t&lt;/font&gt; tests or one-way ANOVA. (I'm not sure about chi-square. I could write something, but it might be a lot easier just to specify a 2x2 or 3x2 contingency table by hand).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Running the R code is also easy. R runs on PC, Linux and Mac OS X, but if you don't fancy installing it (for some bizarre reason) it can also be run from a &lt;a href="http://www.math.montana.edu/Rweb/"&gt;web server&lt;/a&gt; (though it won't be able to write to a file). To run the code you just paste it into the R console and hit return. To tailor the code just read the comments and tweak the parameter values before pasting.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;If you want a copy to play with just &lt;a href="mailto:tsbaguley@gmail.com"&gt;email me&lt;/a&gt;.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;For anyone who wants to do some simulation in R there are lots of resources around (try googling the obvious keywords), but Andrew Gelman and Jennifer Hill's book &lt;a href="http://www.amazon.co.uk/gp/offer-listing/052168689X?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=am2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=052168689X"&gt;Data Analysis Using Regression and Multilevel/Hierarchical Models (Analytical Methods for Social Research)&lt;/a&gt; is an impressive book for a serious take on using simulation (although the stance is Bayesian this doesn't get in the way - at least for the sections I've read; I'm about 40% through the book). &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;font class="Apple-style-span" style="font-style: italic;"&gt;Postscript&lt;/font&gt;. All the talks were video recorded and Anne Trapp threatened to put video podcasts - vodcasts? - on the Psychology Network site at some point in the future - including my talk (&lt;font class="Apple-style-span" style="font-style: italic;"&gt;Effect size: why what we teach psychology students is wrong&lt;/font&gt;). These will be worth looking out for just to see Andy Field's talk. (Andy claims he looks particularly ridiculous when video recorded. My advice: just don't ever look at the recording. &lt;font class="Apple-style-span" style="font-style: italic;"&gt;Ever&lt;/font&gt;.) I'll write something about effect size in the near future - though much of the talk was based on my &lt;a href="http://www.ingentaconnect.com/content/bpsoc/bjp/pre-prints/310805"&gt;forthcoming BJP paper on effect size.&lt;/a&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-8552169957558482645?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/xW5mBPp6w3eFKBZNxi55eMtsmFI/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/xW5mBPp6w3eFKBZNxi55eMtsmFI/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/xW5mBPp6w3eFKBZNxi55eMtsmFI/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/xW5mBPp6w3eFKBZNxi55eMtsmFI/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/6wfJm_G66WI" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/8552169957558482645/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2009/01/simulating-data-for-inquiry-based.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/8552169957558482645?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/8552169957558482645?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/6wfJm_G66WI/simulating-data-for-inquiry-based.html" title="Simulating data for inquiry based learning" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2009/01/simulating-data-for-inquiry-based.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D04GR3w8eSp7ImA9WxVVGEU.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-5682827777697501484</id><published>2008-04-09T20:09:00.003+01:00</published><updated>2009-03-12T19:25:26.271Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-03-12T19:25:26.271Z</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="multilevel models" /><category scheme="http://www.blogger.com/atom/ns#" term="NTU" /><category scheme="http://www.blogger.com/atom/ns#" term="PsyPAG" /><category scheme="http://www.blogger.com/atom/ns#" term="BPS" /><category scheme="http://www.blogger.com/atom/ns#" term="MLwin" /><category scheme="http://www.blogger.com/atom/ns#" term="ANOVA" /><title>Beyond ANOVA: from repeated measures to multilevel models</title><content type="html">I have just finished giving a talk on repeated measures ANOVA and multilevel models at the PsyPAG Postgraduate Mathematics, Statistics and Computing workshop. I promised I would upload a link to my presentation. Here it is:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;iframe src="http://docs.google.com/EmbedSlideshow?docid=dp86h6t_13fw3cgmhk" frameborder="0" height="342" width="410"&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;br /&gt;I managed to correct a couple of minor errors - though the transfer online seems to have added several more (though these disappear - or rather the errors mutate - if I swap browsers). I'll add further links and corrections if I get time.&lt;br /&gt;&lt;br /&gt;This is run every two years or so in conjunction with the Mathematics, Statistics and Computing section of the British Psychological Society (BPS). The 2008 workshop was at Nottingham Trent University. The full schedule is here:&lt;br /&gt;&lt;br /&gt;http://www.psypag.co.uk/msprogramme.html&lt;br /&gt;&lt;br /&gt;The statistics clinic was fun - as was my confrontation with Mark Shevlin (of which more later) - but I missed the signed statistics text book raffle (proceeds to charity). I was very interested in John Reidy's talk on moderated multiple regression. Unlike my talk he covered mediation as well, but possibly covered moderation in less detail as a consequence. I think anyone who found John's talk useful will probably want to look at my &lt;a href="http://psychologicalstatistics.blogspot.com/2007/10/introduction-to-moderated-multiple.html"&gt;moderated multiple regression&lt;/a&gt; blog entry if only for the link to &lt;a href="http://www.psych.ku.edu/preacher/interact/interactions.htm"&gt;Kris Preacher's web resources.&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Mark Shevlin and I had a (friendly) dispute over the relative merits of SEM and multilevel models and MPLUS versus MLwiN for the latter. For experimental psychologists I think multilevel models are definitely the way to go - it being easy to extend classical experimental designs to multilevel models. For non-experimentalists who will have messier studies to analyze SEM is probably more useful. Although all multilevel models are probably just SEM in disguise I think SEM is harder to pick up and (at least until recently) was not great for repeated measures and clustered data. As for software ... because &lt;a href="http://www.cmm.bristol.ac.uk/MLwiN/ordering/ac-uk.shtml"&gt;MLwiN is free for UK academic users&lt;/a&gt; I think it would be my top tip for anyone in the UK learning multilevel modeling (with R a very close runner up).&lt;br /&gt;&lt;br /&gt;&lt;span style="font-style: italic;"&gt;Postscript. &lt;/span&gt; The Centre for Multilevel Modeling have just launched their &lt;a href="http://www.cmm.bristol.ac.uk/learning-training/course.shtml"&gt;online multilevel modeling course&lt;/a&gt;. At first glance it looks very useful, as it starts with simpler statistical techniques (e.g., multiple regression) and builds up to multilevel models. I'm hoping to audit all the modules at some point ... They are released under a &lt;a href="http://creativecommons.org/licenses/by-nc-sa/2.0/uk/"&gt;creative commons license&lt;/a&gt; so it should be possible to use them in Education.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-5682827777697501484?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/9Q5GWDWELE663R7DqkzrsVsPOJw/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/9Q5GWDWELE663R7DqkzrsVsPOJw/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/9Q5GWDWELE663R7DqkzrsVsPOJw/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/9Q5GWDWELE663R7DqkzrsVsPOJw/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/8t9Te8ZvSFk" height="1" width="1"/&gt;</content><link rel="related" href="http://docs.google.com/Present?docid=dp86h6t_13fw3cgmhk" title="Beyond ANOVA: from repeated measures to multilevel models" /><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/5682827777697501484/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2008/04/beyond-anova-from-repeated-measures-to.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/5682827777697501484?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/5682827777697501484?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/8t9Te8ZvSFk/beyond-anova-from-repeated-measures-to.html" title="Beyond ANOVA: from repeated measures to multilevel models" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2008/04/beyond-anova-from-repeated-measures-to.html</feedburner:origLink></entry><entry gd:etag="W/&quot;D04GQHo4eip7ImA9WxVVGEU.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-7916929848444882923</id><published>2007-10-01T11:24:00.001+01:00</published><updated>2009-03-12T19:25:21.432Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-03-12T19:25:21.432Z</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="moderated multiple regression" /><category scheme="http://www.blogger.com/atom/ns#" term="interaction" /><category scheme="http://www.blogger.com/atom/ns#" term="moderator effects" /><title>An introduction to moderated multiple regression</title><content type="html">I've just given a workshop on moderated multiple regression as part of a new Psychology Research Methods series at work. Here is the abstract:&lt;br /&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size:85%;"&gt;Most psychologists are highly familiar with the analysis and interpretation of interaction effects in factorial analysis of variance (ANOVA). In ANOVA these interactions are between categorical independent variables (e.g., experimental conditions). What many researchers do not realize is that interaction effects in ANOVA are a special case of interaction effects in multiple regression which can be computed in much the same way. A common, but unnecessary and inappropriate, strategy for dealing with interaction effects between continuous independent variables is to turn them into categorical ones (e.g., by a median split). A better approach - one with greater statistical power and which is more informative - is to add interaction effects to a multiple regression model. This approach is often known as moderated multiple regression. &lt;/span&gt;&lt;span style="font-size:85%;"&gt;This workshop will: &lt;/span&gt;&lt;span style="font-size:85%;"&gt;introduce moderated multiple regression, explain how to compute an interaction term for moderated multiple regression, discuss the role of centring in making the results easier to interpret, and demonstrate how to run and interpret such a moderator analysis in SPSS.&lt;/span&gt;&lt;/p&gt;You can view the presentation on g&lt;em&gt;oogle docs&lt;/em&gt;:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://docs.google.com/TeamPresent?fs=true&amp;amp;docid=dp86h6t_3wmr5ft"&gt;http://docs.google.com/TeamPresent?fs=true&amp;amp;docid=dp86h6t_3wmr5ft&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The workshop is supposed to be a basic introduction. For a more advanced introduction I'd recommend Kris Preacher's web page:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://people.ku.edu/~preacher/interact/interactions.htm"&gt;A primer on interaction effects in moderated multiple regression.&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Kris Preacher also has links to excellent tools for investigating and plotting moderator effects. (These generate &lt;em&gt;R&lt;/em&gt; code that you can plot online with one mouse click or paste into your &lt;em&gt;R&lt;/em&gt; workspace to edit for publication quality graphics). Although, for initial exploration it may be quicker to use the regression equation to plug in different &lt;em&gt;X&lt;/em&gt; values and see how the expected &lt;em&gt;Y&lt;/em&gt; value changes. I'd probably do this by hand (or maybe Excel) and sketch out basic plots by hand.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-7916929848444882923?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/bFOxgFc7IHrAg_ai-M0gIM4p93I/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/bFOxgFc7IHrAg_ai-M0gIM4p93I/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/bFOxgFc7IHrAg_ai-M0gIM4p93I/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/bFOxgFc7IHrAg_ai-M0gIM4p93I/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/syR2AqiokfU" height="1" width="1"/&gt;</content><link rel="related" href="http://docs.google.com/TeamPresent?fs=true&amp;docid=dp86h6t_3wmr5ft" title="An introduction to moderated multiple regression" /><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/7916929848444882923/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2007/10/introduction-to-moderated-multiple.html#comment-form" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7916929848444882923?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/7916929848444882923?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/syR2AqiokfU/introduction-to-moderated-multiple.html" title="An introduction to moderated multiple regression" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>2</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2007/10/introduction-to-moderated-multiple.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DUUGQXY4fip7ImA9WxNVEk4.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-115253297116113742</id><published>2006-07-10T12:42:00.008+01:00</published><updated>2009-10-22T19:47:00.836+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2009-10-22T19:47:00.836+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="simple main effects" /><category scheme="http://www.blogger.com/atom/ns#" term="ANOVA" /><title>How to calculate simple main effects using generic ANOVA software</title><content type="html">&lt;p&gt;&lt;span style="font-family:arial;font-size:85%;"&gt;Some time ago a PhD student of mine needed to conduct simple main effects for several different ANOVA designs. As the explanation of how to do this is a bit fiddly (and it is the kind of thing I get asked about quite a lot) I decided to write the explanation down. This later turned into a web page and now I'm updating it for this blog. The main focus is on explaining how to do the calculations based on output from generic ANOVA software. The calculation methods I describe seem obvious to me, but intended to be the best or most efficient method. Rather, their aim is to make clear what the calculation is doing and to be sufficiently robust that one can adapt them to most (if not all) standard ANOVA software. For any given software there will probably be more efficient solutions (e.g., via syntax in SPSS).&lt;/span&gt;&lt;/p&gt;&lt;p&gt;This post aims to set out how to calculate simple main effects for common ANOVA designs. Some statistics packages will calculate &lt;i&gt;simple main effects&lt;/i&gt; for you, but many will not do so or will only do so for certain designs. There is also some confusion as to how to calculate simple main effects for some designs. For this reason I aim to set out generic methods to calculate simple main effects that should work for any software package that can cope with common ANOVA designs (even if they don't support direct calculation of simple main effects).&lt;/p&gt;&lt;p&gt;&lt;span style="font-size:180%;"&gt;&lt;span style="font-size:100%;"&gt;&lt;b&gt;What is a &lt;/b&gt;&lt;b&gt;&lt;i&gt;simple main effect&lt;/i&gt;&lt;/b&gt;&lt;b&gt;?&lt;/b&gt;&lt;/span&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;In analysis of variance (ANOVA) we are often interested only in the effects of a single factor. These effects are termed &lt;i&gt;main effects&lt;/i&gt;. A one-way (one factor) ANOVA therefore has only a single &lt;i&gt;main effect&lt;/i&gt;. The &lt;i&gt;F&lt;/i&gt; test for a main effect tests the hypothesis that the means differ. &lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#footnote1" name="1"&gt;&lt;sup&gt;1&lt;/sup&gt;&lt;/a&gt; In a design with more than one factor (e.g., a two-way design with two factors) we can also look at &lt;i&gt;interaction&lt;/i&gt; effects. An interaction between two factors (a two-way interaction) is present if the effects of the factors are not independent. In general researchers will ignore a non-significant&lt;br /&gt;two-way interaction and interpret the two main effects (which are considered to be independent of each other).&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#footnote2" name="2"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;If a two-way interaction is significant it may not make sense to interpret the main effects on their own (because they represent “average” effects of a factor which are known to vary between levels of the other factor).&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#footnote3" name="3"&gt;&lt;sup&gt;3&lt;/sup&gt;&lt;/a&gt; Instead, it sometimes a good idea to look at the effect of one factor separately for each level of the other factor. These separate analyses are called &lt;i&gt;simple main effects&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;For example, consider a 2 x 2 ANOVA design with &lt;em&gt;gender&lt;/em&gt; and &lt;em&gt;age&lt;/em&gt; as the factors and anxiety prior to a maths test as the dependent variable. Imagine the means look like this:&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;table bordercolordark="#008000" cellpadding="2" border="2"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td&gt;&lt;/td&gt;&lt;td&gt;&lt;p align="center"&gt;&lt;span style="font-family:arial;"&gt;Young &lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td&gt;&lt;p align="center"&gt;&lt;span style="font-family:arial;"&gt;Old &lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;&lt;p align="center"&gt;&lt;span style="font-family:arial;"&gt;Male &lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td&gt;&lt;p align="center"&gt;&lt;span style="font-family:arial;"&gt;2.2&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td&gt;&lt;p align="center"&gt;&lt;span style="font-family:arial;"&gt;2.1 &lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td&gt;&lt;p align="center"&gt;&lt;span style="font-family:arial;"&gt;Female &lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td&gt;&lt;p align="center"&gt;&lt;span style="font-family:arial;"&gt;3.3&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td&gt;&lt;p align="center"&gt;&lt;span style="font-family:arial;"&gt;4.5&lt;/span&gt; &lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;p&gt;&lt;/p&gt;&lt;center&gt;&lt;/center&gt;&lt;br /&gt;&lt;p&gt;This pattern shows a fairly clear interaction and also suggests there might be main effects of both age and gender (assuming equal cell sizes).&lt;/p&gt;&lt;p&gt;Let us assume that all three effects are significant. In this example it makes little sense to pay much attention to the &lt;i&gt;main effect&lt;/i&gt; of age (because while age does increase anxiety &lt;em&gt;on average,&lt;/em&gt; it would be slightly misleading to interpret this as a &lt;em&gt;general&lt;/em&gt; effect). On the other hand it doesn't seem unreasonable to interpret the main effect of gender as a general effect (as anxiety seems quite a bit higher for females regardless of age). Nevertheless, to be sure, it seems sensible to look at the effects of age separately for males and females (and possibly also the the effects of gender for young and old participants).&lt;/p&gt;&lt;p&gt;The test of the effect of age on anxiety for male participants is sometimes called the simple main effect of age at "male". The test of the effect of age on anxiety for female participants is called the simple main effect of age at "female".&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#footnote4" name="4"&gt;&lt;sup&gt;4&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;Note that there are are 4 simple main effects for a 2x2 design, 5 for a 3x2 design and so forth.&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#footnote5" name="5"&gt;&lt;sup&gt;5&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;The &lt;/b&gt;&lt;b&gt;&lt;i&gt;F&lt;/i&gt;&lt;/b&gt;&lt;b&gt; ratio&lt;/b&gt; &lt;/p&gt;&lt;p&gt;As with any ANOVA test we'll want to come up with an &lt;i&gt;F&lt;/i&gt; ratio to test the effect.&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#footnote6" name="6"&gt;&lt;sup&gt;6&lt;/sup&gt;&lt;/a&gt; The &lt;i&gt;F&lt;/i&gt; ratio is the ratio of the variance estimate for the treatment effect to the variance estimate of the error. In ANOVA these variance estimates are called mean squares and are calculated by dividing the sums of squares for a source of variation by its degrees of freedom. &lt;/p&gt;&lt;p&gt;The mean square error will depend on whether the factor which is being looked at is independent measures (between subjects) or repeated measures (within subjects). For a mixed design the mean square error depends on which factor is being looked at. For an independent measures factor a pooled error term is used, while for a repeated measures design a non-pooled error term is used. It turns out that obtaining a pooled error term from a mixed ANOVA printout is not trivial so I will return to this later on (&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#MixedIndependentPooledError"&gt;see the discussion of pooled errors for mixed ANOVA designs below&lt;/a&gt;).&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;i&gt;&lt;/i&gt;&lt;/p&gt;&lt;p&gt;&lt;i&gt;Obtaining the treatment mean square:&lt;/i&gt; &lt;/p&gt;&lt;p&gt;One easy way to calculate the treatment mean square is to run separate one-way ANOVAs for each level of the second factor. We can simply read off the mean square value from the print out. &lt;/p&gt;&lt;p&gt;For example, for the simple main effect of gender at young we would merely run a one-way ANOVA with gender as the factor (taking care to exclude all the old participants!). &lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size:85%;"&gt;&lt;i&gt;&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size:85%;"&gt;&lt;i&gt;&lt;span style="font-size:100%;"&gt;Obtaining the error mean square (independent measures):&lt;/span&gt;&lt;/i&gt; &lt;/span&gt;&lt;/p&gt;&lt;p&gt;I think the easiest way to obtain the pooled error term used for a pure independent measures design is to read it off the output from the original ANOVA (i.e., the ANOVA with all factors in the analysis). In an independent measures design the pooled error term averages over all the participants and therefore minimizes the effect of individual differences and thus produces the most accurate test.&lt;br /&gt;&lt;br /&gt;The appropriate &lt;i&gt;F&lt;/i&gt; ratio can be calculated by dividing the treatment mean square by the error mean square in the normal way. This can be checked for significance using tables or by exact methods using software such as Excel or SPSS (both of which have look-up functions for the significance of the central &lt;i&gt;F&lt;/i&gt; distribution).&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;i&gt;&lt;/i&gt;&lt;/p&gt;&lt;p&gt;&lt;i&gt;Obtaining the error mean square (repeated measures):&lt;/i&gt; &lt;/p&gt;&lt;p&gt;The easiest way to obtain the non-pooled error term required for repeated measures factors is to run separate one-way ANOVAs for each level of the second factor. . A big advantage of this method is that that as the mean square treatment is also obtained this way, the output will give you the correct &lt;i&gt;F&lt;/i&gt; ratio and observed significance (&lt;i&gt;p &lt;/i&gt;value). &lt;/p&gt;(In some packages, notably SPSS, the simplest ways to do this is are &lt;em&gt;i)&lt;/em&gt; copy and paste these data into new columns, or &lt;em&gt;ii)&lt;/em&gt; to save the data file under &lt;em&gt;k&lt;/em&gt; different names - where &lt;em&gt;k&lt;/em&gt; is the number of levels, and delete unwanted levels before running the one-way repeated measures ANOVA. The latter method avoids you having to re-type variable names and other information).&lt;br /&gt;&lt;br /&gt;&lt;p&gt;The choice of the non-pooled error term in within-subjects simple effects mirrors the choice in standard repeated measures designs and is now generally accepted as the appropriate procedure.&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#footnote7" name="7"&gt;&lt;sup&gt;7&lt;/sup&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;a name="MixedIndependentPooledError"&gt;&lt;em&gt;Obtaining a pooled error term for an independent measures factor from mixed measures ANOVA output:&lt;/em&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;In a mixed ANOVA design there is at least one repeated measures factor and at least one independent measures factor. Simple main effects for the repeated measures (within subjects)factors should use a non-pooled error term (just as with standard repeated measures design outlined above).&lt;/p&gt;&lt;p&gt;For the independent measures factors we want to use the pooled error estimate from our main analysis. This is problematic because there isn't a single pooled error term in the mixed ANOVA output. Instead, we must take the error terms from the mixed ANOVA output and calculate the pooled error term 'by hand'. Howell (2002, pp. 490-493) describes a (relatively) simple procedure for this.&lt;/p&gt;&lt;p&gt;Calculating the pooled (within cells) SS from the mixed ANOVA output is fairly easy. The first step is to identify the &lt;em&gt;two&lt;/em&gt; error terms from the original mixed ANOVA output. These will usually be clearly labelled as error terms.&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#footnote8" name="8"&gt;&lt;sup&gt;8&lt;/sup&gt;&lt;/a&gt; &lt;/p&gt;&lt;p&gt;Designate one as error &lt;em&gt;v&lt;/em&gt; and one as error &lt;em&gt;u&lt;/em&gt; and combine the sums of squares and mean square as follows:&lt;/p&gt;&lt;p align="center"&gt;Pooled &lt;em&gt;SS&lt;/em&gt; = &lt;em&gt;SS&lt;/em&gt;&lt;sub&gt;u&lt;/sub&gt; + &lt;em&gt;SS&lt;/em&gt;&lt;sub&gt;v&lt;/sub&gt;&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;p align="center"&gt;Pooled &lt;em&gt;MS&lt;/em&gt; = Pooled &lt;em&gt;SS&lt;/em&gt; / (df&lt;sub&gt;u&lt;/sub&gt; + df&lt;sub&gt;v&lt;/sub&gt;)&lt;/p&gt;&lt;p&gt;(Where &lt;em&gt;SS&lt;/em&gt; stands for sums of squares, &lt;em&gt;MS&lt;/em&gt; for mean square and df for degrees of freedom for &lt;em&gt;u&lt;/em&gt; or &lt;em&gt;v &lt;/em&gt;as appropriate).&lt;br /&gt;&lt;br /&gt;So far so good ... &lt;/p&gt;&lt;p&gt;Pooling the sums of squares in this way is fine except that the pooled Mean Square is derived from two different variance estimates. This makes it tricky to work out the correct &lt;em&gt;F&lt;/em&gt; distribution for the pooled mean square error. This in turn means that calculating a significance (&lt;em&gt;p)&lt;/em&gt; value is tricky. It turns out that we can use the following formula to derive an error d.f. (called &lt;em&gt;f&lt;/em&gt;') that gives us the correct &lt;em&gt;p&lt;/em&gt; value:&lt;/p&gt;&lt;p align="center"&gt;&lt;em&gt;f&lt;/em&gt;' = (&lt;em&gt;u&lt;/em&gt; + &lt;em&gt;v&lt;/em&gt;)&lt;sup&gt;2&lt;/sup&gt; / (&lt;em&gt;u&lt;/em&gt;&lt;sup&gt;2&lt;/sup&gt;/df&lt;sub&gt;u&lt;/sub&gt; + &lt;em&gt;v&lt;/em&gt;&lt;sup&gt;2&lt;/sup&gt;/df&lt;sub&gt;v&lt;/sub&gt;)&lt;/p&gt;&lt;p align="center"&gt;[Note: &lt;span class="Apple-style-span" style="font-style: italic;"&gt;u&lt;/span&gt; and &lt;span class="Apple-style-span" style="font-style: italic;"&gt;v&lt;/span&gt; are the SS for the two error terms]&lt;/p&gt;&lt;p&gt;Once you have obtained the pooled MS error simply calculate your simple main effect ratio as normal (&lt;em&gt;F&lt;/em&gt; = MS&lt;sub&gt;treatment&lt;/sub&gt;/MS&lt;sub&gt;error&lt;/sub&gt;) and evaluate against the usual treatment d.f. and using &lt;em&gt;f'&lt;/em&gt; as the error d.f.&lt;/p&gt;&lt;p&gt;While this isn't too difficult it can be a bit fiddly. My preferred solution is to set up a spreadsheet in Excel or a similar program to calculate all these for you (see Resources below).&lt;/p&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;em&gt;Looking up significance of F:&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;Provided you know the &lt;em&gt;F &lt;/em&gt;ratio, the treatment &lt;i&gt;df&lt;/i&gt; and error&lt;i&gt; df&lt;/i&gt; looking up significance is easy. The traditional option is to use tables of significance. These allow you to look up the critical value of &lt;em&gt;F&lt;/em&gt; or &lt;em&gt;F&lt;/em&gt;&lt;sub&gt;crit&lt;/sub&gt; that is required to reach significance for a given alpha level. If the observed level of &lt;em&gt;F&lt;/em&gt; equals or exceeds&lt;em&gt; F&lt;/em&gt;&lt;sub&gt;crit&lt;/sub&gt; then the test is signifiant at that alpha level. The main drawback of this procedure is that exact significance (&lt;em&gt;p&lt;/em&gt; values) can not be obtained (though they can be estimated in some cases).&lt;/p&gt;&lt;p&gt;An alternative strategy is to use a computer program to look up the exact significance (&lt;em&gt;p&lt;/em&gt;) value. For example, in Excel one can use the FDIST function.&lt;/p&gt;&lt;p&gt;e.g., FDIST(3.29,1,42) returns the value 0.076851736&lt;/p&gt;&lt;blockquote&gt;&lt;p&gt;Here 3.29 is the observed &lt;em&gt;F&lt;/em&gt; ratio, 1 is the treatment d.f. and 42 the error d.f. The &lt;em&gt;p&lt;/em&gt; value for this test is therefore marginally non-significant. We could thus report the test as &lt;em&gt;F&lt;/em&gt;&lt;sub&gt;1,42&lt;/sub&gt; = 3.29, &lt;em&gt;p&lt;/em&gt; = .0769.&lt;br /&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;strong&gt;Resources:&lt;/strong&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;Here is a very basic (and certainly not very pretty) Excel spreadsheet that calculates the pooled MS error used for independent factors in a mixed ANOVA design. The spreadsheet takes as input six numbers (labelled in bold in the spreadsheet). These are &lt;em&gt;SS&lt;/em&gt;&lt;sub&gt;u&lt;/sub&gt;, &lt;em&gt;SS&lt;/em&gt;&lt;sub&gt;v&lt;/sub&gt;, df&lt;sub&gt;u&lt;/sub&gt;, df&lt;sub&gt;v&lt;/sub&gt; (the SS and d.f. for the two error terms in the mixed design), the SS&lt;sub&gt;treatment&lt;/sub&gt; and the df&lt;sub&gt;treatment&lt;/sub&gt;. The latter two are obtained as normal for simple main effects (e.g., they can be readily taken from the output of separate one-way ANOVAs for each level of a second factor.&lt;/p&gt;&lt;p style="text-align: center;"&gt;&lt;a href="http://spreadsheets.google.com/ccc?key=pHGqhOm07qOHkVgx-N6ahLw&amp;amp;hl=en_GB"&gt;Mixed_Pooled_Error.xls&lt;/a&gt;&lt;/p&gt;&lt;p&gt;For convenience the spreadsheet also uses the FDIST function to calculate the &lt;em&gt;p&lt;/em&gt; value for the simple main effect (using &lt;em&gt;f'&lt;/em&gt; as the error d.f. as described above). This function doesn't work in &lt;span class="Apple-style-span" style="font-style: italic;"&gt;Google Docs&lt;/span&gt; but just work just fine in &lt;span class="Apple-style-span" style="font-style: italic;"&gt;Excel&lt;/span&gt; (just select &lt;span class="Apple-style-span" style="font-style: italic;"&gt;Export&lt;/span&gt; from the &lt;span class="Apple-style-span" style="font-style: italic;"&gt;File&lt;/span&gt; menu and save it as an .xls file).&lt;/p&gt;&lt;p&gt;The default values in the spreadsheet are taken from Howell (2002, pp. 492-3).&lt;/p&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;strong&gt;Bibliography&lt;/strong&gt;&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;Howell, D. C. (2002). &lt;a href="http://www.amazon.co.uk/gp/offer-listing/0495093610?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=am2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0495093610"&gt;Statistical Methods for Psychology&lt;/a&gt;&lt;img src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;amp;l=as2&amp;amp;o=2&amp;amp;a=0495093610" width="1" height="1" border="0" alt="" style="border:none !important; margin:0px !important;" /&gt;&lt;br /&gt;(5th. ed.). Pacific Grove, CA: Duxberry. &lt;/p&gt;&lt;p align="right"&gt;&lt;span style="font-family:arial;font-size:85%;"&gt;[David Howell has just released a seventh edition which looks excellent. At first glance the main changes are in layout and use of software examples.]&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;strong&gt;Footnotes:&lt;/strong&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#1" name="footnote1"&gt;1&lt;/a&gt; Strictly, the usual interpretation is that if the &lt;em&gt;F&lt;/em&gt; ratio is significant then there is evidence against the &lt;em&gt;null hypothesis&lt;/em&gt; that the means are all equal. &lt;/p&gt;&lt;p&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#2" name="footnote2"&gt;2&lt;/a&gt; I have glossed over the finer points of interaction analysis for these pages. There are occasions when non-significant interactions should probably be looked at more closely and occasions when significant interactions should probably be ignored. This relates to the power of the study and the size of the interaction effect. A powerful study may find detect interactions that are negligible in size and a study that lacks power may fail to show significance for an important or large interaction effect.&lt;/p&gt;&lt;p&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#3" name="footnote3"&gt;3&lt;/a&gt; Interpreting main effects in the light of an interaction effect is more of an art than a science. In a mathematical sense interaction effects are independent of main effects and can be interpreted separately. However, verbal interpretations of analyses aren't always sensitive to the mathematics of the situation. The context of the study is also important in the interpretation of the results. If in doubt, always interpret the interaction effect first. Also, if the simple main effects show a consistent pattern (e.g., significant differences or close to significant at all levels of a factor) then the main effects are probably straight-forward to report. In the example given here it probably makes sense to report and interpret the main effect of gender (females are generally more anxious than males in this context), but not age (as it seems misleading to interpret a significant main effect of gender as indicating that older people are generally more anxious in this context).&lt;br /&gt;&lt;/p&gt;&lt;p&gt;This decision might differ in another situation (with identical numbers). For example, if the study looked at reading preferences and the factors were gender and font type (serif or sans serif) we might conclude that the main effect of font type was worth interpreting as, in general, people do prefer the serif font. The mathematics hasn't changed, but the context has – in this case the serif font seems like a sensible compromise that pleases most people most of the time (though even in this case the simple main effects might be useful for deciding what font to use for a motoring magazine aimed at men or a lifestyle magazine aimed at women). If there is a moral to be learned from this digression (other than that the author tends to ramble on a bit) it is that applied statistics is much more than mathematics and needs to take account of the context and the goals of the research.&lt;/p&gt;&lt;p&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#4" name="footnote4"&gt;4&lt;/a&gt; In this case the terminology is a bit awkward and most authors would say "the simple main effect of age for the male participants" or similar. In other cases the terminology is quite handy.&lt;/p&gt;&lt;p&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#5" name="footnote5"&gt;5&lt;/a&gt; This should be fairly obvious to work out if you think about it. With a 3x2 design there are 2 simple main effects for the first factor (one for each level of the other factor) plus 3 for the second factor (one for each level of the first factor).&lt;/p&gt;&lt;p&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#6" name="footnote6"&gt;6&lt;/a&gt; Some readers will note that we could also calculate a &lt;i&gt;t&lt;/i&gt; ratio if simple main effect has 1 d.f. for the treatment (as is the case for my example). This this &lt;i&gt;t&lt;/i&gt; ratio would merely be the square root of the &lt;i&gt;F&lt;/i&gt; ratio. In some cases the software one has available makes &lt;i&gt;t &lt;/i&gt;easier or quicker to calculate than &lt;i&gt;F&lt;/i&gt; so this relationship will prove useful.&lt;/p&gt;&lt;p&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#7" name="footnote7"&gt;7&lt;/a&gt; Pooling the error term would also increase the potential for problems with the repeated measures sphericity assumption. If the simple main effect has 1 treatment d.f. then using a non-pooled error term will mean any problems with spheicity are avoided. If the simple main effect has more than 1 treatment d.f. and sphericity is considered to be untenable the researcher should probably consider whether the substantive hypothesis being investigated can be tested with a specific contrast of means. &lt;a href="http://psychologicalstatistics.blogspot.com/2006/05/what-is-all-this-stuff-about.html"&gt;See also my guidance on checking and correcting sphericity violations&lt;/a&gt;.&lt;/p&gt;&lt;p&gt;&lt;a href="http://www.blogger.com/post-edit.g?blogID=27862247&amp;amp;postID=115253297116113742#8" name="footnote8"&gt;8&lt;/a&gt; In any case the terms to look for are the "Subjects within ..." term and the "x [&lt;em&gt;or&lt;/em&gt; by] Subjects within ..." interaction term if they are not labelled as error terms. For example, a recent version of SPSS labels these simply "Error" in the "Tests of Between-Subjects Effects" table and "Error(&lt;em&gt;Repeated factor A x Repeated factor B x ...&lt;/em&gt;)" in "Tests of Within-Subjects Effects".&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-115253297116113742?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/zLIGlkKbl3mcyp-5if1goCf6imk/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/zLIGlkKbl3mcyp-5if1goCf6imk/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/zLIGlkKbl3mcyp-5if1goCf6imk/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/zLIGlkKbl3mcyp-5if1goCf6imk/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/ZkRV7CQCW0s" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/115253297116113742/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2006/07/how-to-calculate-simple-main-effects.html#comment-form" title="7 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/115253297116113742?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/115253297116113742?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/ZkRV7CQCW0s/how-to-calculate-simple-main-effects.html" title="How to calculate simple main effects using generic ANOVA software" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>7</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2006/07/how-to-calculate-simple-main-effects.html</feedburner:origLink></entry><entry gd:etag="W/&quot;DUUDRXw6cCp7ImA9WxRWE0w.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-114831327926138278</id><published>2006-05-22T16:54:00.004+01:00</published><updated>2008-10-29T21:54:34.218Z</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-10-29T21:54:34.218Z</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="sphericity" /><category scheme="http://www.blogger.com/atom/ns#" term="Greenhouse-Geisser" /><category scheme="http://www.blogger.com/atom/ns#" term="repeated measure ANOVA" /><category scheme="http://www.blogger.com/atom/ns#" term="compound symmetry" /><title>What is all this stuff about sphericity in my repeated measures ANOVA output?</title><content type="html">&lt;p&gt;&lt;span lang="EN-US"   style="font-family:georgia;font-size:85%;"&gt;Until relatively recently most psychologists had never heard of sphericity. Most text books didn't mention it (and even now some of them confuse it with &lt;em&gt;compound symmetry&lt;/em&gt;). The big change happened a few years ago when SPSS started including information about sphericity in its repeated measures ANOVA output. Of course, because most undergarduate text books didn't mention it people started looking for information on it. Around that time I taught an advanced course on psychological statistics and produced this resource for my tutees. It turned out that they passed the web link on to other students on the course. Once I realized it was widely used I tried to keep it fairly up-to-date until I moved jobs and my web pages became homeless. I keep meaning to improve the layout and a little more detail in places, but in the mean time I hope a few people find it useful ...&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:georgia;font-size:85%;"&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;span style="font-family:arial;font-size:130%;"&gt;&lt;strong&gt;An Introduction to Sphericity in Repeated Measures ANOVA&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:arial;"&gt;The article is written for a general audience of post-graduate and graduate researchers. The technical material goes slightly beyond what is covered in most text books, although there is still some simplification (which is usually indicated in the text). The aim to give advice about best practice for checking and dealing with sphericity in repeated measures ANOVA. Some of the content is personal opinion (which I have tried to indicate in the text). I include a short bibliography of my sources at the end for readers who want to explore the topic in more detail.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:arial;"&gt;&lt;strong&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:arial;"&gt;&lt;strong&gt;Some background&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:arial;"&gt;Sphericity is a mathematical assumption in repeated measures ANOVA designs. Let's start by considering a simpler ANOVA design (e.g., one-way independent measures ANOVA).&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:arial;"&gt;In independent measures ANOVA one of the mathematical assumptions is that the variances of the populations that groups are sampled from are equal. This homogeneity of variance assumption more-or-less follows from the null hypothesis being tested in ANOVA - if the treatment has no effect on the thing being measured (the DV) then we can consider all the groups to be sampled from the same population.&lt;sup&gt;&lt;span style="font-size:85%;"&gt;1&lt;/span&gt; &lt;/sup&gt;&lt;/span&gt;&lt;span style="font-family:arial;"&gt;However, because we're taking samples we'd be very lucky to observe exactly equal variances (even if the assumption were perfectly met). Real data is rarely that neat! What we'd expect to get (most of the time) is groups with similar variances.&lt;sup&gt;&lt;span style="font-size:85%;"&gt;2&lt;/span&gt; &lt;/sup&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:arial;"&gt;The sphericity assumption can be thought of as an extension of the homogeneity of variance assumption in independent measures ANOVA. Why does the assumption need to be extended? To understand this we need to introduce the ANOVA covariance matrix.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:arial;"&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;The covariance matrix&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:arial;"&gt;What is a covariance matrix? In a nutshell, it is a matrix that contains the covariances between levels of a factor in an ANOVA design.&lt;/span&gt;&lt;sup&gt;&lt;span style="font-family:arial;font-size:85%;"&gt;3&lt;/span&gt;&lt;/sup&gt;&lt;span style="font-family:arial;"&gt; A covariance is the shared or overlapping variance between two things (sometimes called &lt;i&gt;variance in common&lt;/i&gt;). Let's look at an example of the layout for a one-factor ANOVA design with four levels and therefore four samples (called &lt;span style="font-size:85%;"&gt;A&lt;sub&gt;1&lt;/sub&gt;&lt;/span&gt;, &lt;span style="font-size:85%;"&gt;A&lt;sub&gt;2&lt;/sub&gt;&lt;/span&gt;, &lt;span style="font-size:85%;"&gt;A&lt;sub&gt;3&lt;/sub&gt;&lt;/span&gt; and &lt;span style="font-size:85%;"&gt;A&lt;sub&gt;4&lt;/sub&gt;&lt;/span&gt;):&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;br /&gt;&lt;table cellpadding="10" border="0"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;font-size:130%;"&gt;Samples:&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;1 &lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;2&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;3 &lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;4 &lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;&lt;strong&gt;A&lt;sub&gt;1&lt;/sub&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;span style="font-size:85%;"&gt;&lt;sub&gt;1&lt;/sub&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:100%;"&gt;12&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:100%;"&gt;13&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:100%;"&gt;14&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;font-size:130%;"&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;2&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:100%;"&gt;21&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;span style="font-size:85%;"&gt;&lt;sub&gt;2&lt;/sub&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class=""&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:100%;"&gt;23&lt;/span&gt;&lt;/sub&gt; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/td&gt;&lt;td class=""&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:100%;"&gt;24&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;font-size:130%;"&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;3&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:100%;"&gt;31&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:100%;"&gt;32&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;span style="font-size:85%;"&gt;&lt;sub&gt;3&lt;/sub&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:100%;"&gt;34&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;4&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:100%;"&gt;41&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:100%;"&gt;42&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:100%;"&gt;43&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;span style="font-size:85%;"&gt;&lt;sub&gt;4&lt;/sub&gt;&lt;/span&gt;&lt;span style="font-size:78%;"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;The first thing to notice is the main diagonal cells in the matrix (running top left to bottom right)contain the variances of the four levels (e.g., &lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;strong&gt;&lt;em&gt;S&lt;/em&gt;&lt;/strong&gt;&lt;span style="font-size:85%;"&gt;&lt;sub&gt;1&lt;/sub&gt;&lt;/span&gt;&lt;span style="font-size:78%;"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;em&gt; &lt;/em&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;is the variance of A&lt;sub&gt;&lt;span style="font-size:85%;"&gt;1&lt;/span&gt;&lt;/sub&gt;).&lt;sup&gt;&lt;span style="font-size:85%;"&gt;4&lt;/span&gt;&lt;strong&gt; &lt;/strong&gt;&lt;/sup&gt;The second thing to notice is that the covariances are therefore in the cells off the main diagonals (called the off-diagonal cells and greyed out in the table above). The third thing to notice is that the covariances are mirrored above and below the main diagonal. (The term &lt;/span&gt;&lt;span lang="EN-US"  style="font-size:18;"&gt;&lt;span style="font-size:100%;"&gt;&lt;strong&gt;&lt;em&gt;S&lt;/em&gt;&lt;/strong&gt;&lt;span style="font-size:85%;"&gt;&lt;sub&gt;14&lt;/sub&gt;&lt;/span&gt;&lt;em&gt; &lt;/em&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;is the covariance between samples A&lt;sub&gt;&lt;span style="font-size:85%;"&gt;1&lt;/span&gt;&lt;/sub&gt; and A&lt;sub&gt;&lt;span style="font-size:85%;"&gt;4&lt;/span&gt;&lt;/sub&gt;, while &lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;strong&gt;&lt;em&gt;S&lt;/em&gt;&lt;/strong&gt;&lt;sub&gt;41&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"  style="font-size:12;"&gt;&lt;sup&gt;&lt;span style="font-size:100%;"&gt; &lt;/span&gt;&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;is the covariance between samples &lt;span style="font-size:85%;"&gt;&lt;span style="font-size:100%;"&gt;A&lt;/span&gt;&lt;sub&gt;4&lt;/sub&gt;&lt;/span&gt; and A&lt;sub&gt;&lt;span style="font-size:85%;"&gt;1&lt;/span&gt;&lt;/sub&gt;. As this is the variance they have in common, &lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;&lt;strong&gt;S&lt;/strong&gt;&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sub&gt;14&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"  style="font-size:12;"&gt;&lt;sup&gt; &lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;= &lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;&lt;strong&gt;S&lt;/strong&gt;&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;&lt;span style="font-size:85%;"&gt;41&lt;/span&gt;&lt;/sub&gt;.)&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;What does the covariance matrix look like for independent measures ANOVA? Here an example for a one-way independent measures ANOVA design with 4 levels (and hence four groups):&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;br /&gt;&lt;table cellpadding="10" border="0"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;font-size:130%;"&gt;Samples: &lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;1 &lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;2&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;3 &lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;4 &lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;&lt;strong&gt;A&lt;sub&gt;1&lt;/sub&gt;&lt;/strong&gt; &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;span style="font-size:85%;"&gt;&lt;sub&gt;1&lt;/sub&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;font-size:130%;"&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;2&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;span style="font-size:85%;"&gt;&lt;sub&gt;2&lt;/sub&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class=""&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;0 &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/td&gt;&lt;td class=""&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;font-size:130%;"&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;3&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;span style="font-size:85%;"&gt;&lt;sub&gt;3&lt;/sub&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;4&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;0&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;span style="font-size:85%;"&gt;&lt;sub&gt;4&lt;/sub&gt;&lt;/span&gt;&lt;span style="font-size:78%;"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;The most striking observation is that all the covariances are zero. Why? The answer is fairly straight-forward. In an independent measures design the observations should be independent and therefore uncorrelated with each other.&lt;sup&gt;&lt;span style="font-size:85%;"&gt;5&lt;/span&gt;&lt;/sup&gt; Two samples that are uncorrelated will share no variance (and the covariance will be zero). So in this relatively simple case we only have to worry about &lt;em&gt;homogeneity of variance&lt;/em&gt; - which would lead us to expect that the observed variances on the main diagonal should be similar.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;Reminder: &lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;Assumptions such as homogeneity of variance, sphericity and so forth are assumptions about the populations we are sampling from. I'll try and indicate this as I go through, but it sometimes gets clumsy to keep repeating "in the population being sampled" all the time! We expect samples to have similar characteristics to the populations being sampled, but only in rare cases will the samples show &lt;em&gt;exactly&lt;/em&gt; the same pattern of variance (or whatever) as the population. It is also worth adding that large samples are more similar to the populations they are sampled from than small samples.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Finally, please note that the statistical term "population" is an abstract one. We are referring to a population of data points that we might potentially be sampling not a fixed entity such as the population of a country. (In other contexts, such as market research, people sometimes deal with such fixed populations, but this requires slightly different methods from those in most scientific research).&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:180%;"&gt;What is the sphericity assumption?&lt;/span&gt;&lt;/strong&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;Compound symmetry&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;The &lt;em&gt;sphericity&lt;/em&gt; assumption is an assumption about the structure of the covariance matrix in a repeated measures design. Before we describe it in detail lets consider a simpler (but stricter) condition. This one is called &lt;em&gt;compound symmetry&lt;/em&gt;. Compound symmetry is met if all the covariances (the off-diagonal elements of the covariance matrix) are equal and all the variances are equal in the populations being sampled. (Note that the variances don't have to equal the covariances.) Just as with the &lt;em&gt;homogeneity of variance&lt;/em&gt; assumption we'd only rarely expect a real data set to meet &lt;em&gt;compound symmetry&lt;/em&gt; exactly, but provided the observed covariances are roughly equal in our samples (and the variances are OK too) we can be pretty confident that compound symmetry is not violated.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;The good news about compound symmetry&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;If compound symmetry is met then sphericity is also met. So if you take a look at the covariance matrix and the covariances are similar and the variances are similar then we know that sphericity is not going to be a problem.&lt;sup&gt;&lt;span style="font-size:85%;"&gt;6&lt;/span&gt;&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;The bad news about compound symmetry&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;As &lt;em&gt;compound symmetry&lt;/em&gt; is a stricter requirement than &lt;em&gt;sphericity&lt;/em&gt; we still need to check sphericity if compound symmetry isn't met. This is where it gets technical (or should that be ... &lt;em&gt;even&lt;/em&gt; more technical).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;The sphericity assumption&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Lets take a look at some raw data. Imagine that the first few observations of A&lt;sub&gt;&lt;span style="font-size:85%;"&gt;1&lt;/span&gt;&lt;/sub&gt;, A&lt;sub&gt;&lt;span style="font-size:85%;"&gt;2&lt;/span&gt;&lt;/sub&gt;, A&lt;sub&gt;&lt;span style="font-size:85%;"&gt;3&lt;/span&gt;&lt;/sub&gt; and A&lt;sub&gt;&lt;span style="font-size:85%;"&gt;4&lt;/span&gt; &lt;/sub&gt;are as follows:&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial;"&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial;"&gt;&lt;/span&gt;&lt;/p&gt;&lt;table cellpadding="0" border="0" dwcopytype="CopyTableRow"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"&gt;&lt;span style="font-family:arial;"&gt;&lt;strong&gt;&lt;span style="font-size:130%;"&gt;A&lt;/span&gt;&lt;/strong&gt;&lt;sub&gt;1&lt;strong&gt; &lt;/strong&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"&gt;&lt;span style="font-family:arial;"&gt;&lt;strong&gt;&lt;span style="font-size:130%;"&gt;A&lt;/span&gt;&lt;/strong&gt;&lt;sub&gt;2&lt;strong&gt; &lt;/strong&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"&gt;&lt;span style="font-family:arial;"&gt;&lt;strong&gt;&lt;span style="font-size:130%;"&gt;A&lt;/span&gt;&lt;/strong&gt;&lt;sub&gt;3&lt;strong&gt; &lt;/strong&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;strong&gt;A&lt;/strong&gt;&lt;sub&gt;&lt;span style="font-size:100%;"&gt;4&lt;/span&gt;&lt;strong&gt; &lt;/strong&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;&lt;sub&gt;&lt;span style="font-family:arial;"&gt;Participant &lt;span style="font-size:100%;"&gt;1&lt;/span&gt;&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;8&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;9&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;12&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;4&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;&lt;sub&gt;&lt;span style="font-family:arial;"&gt;Participant &lt;span style="font-size:100%;"&gt;2&lt;/span&gt;&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;6&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;11&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;16&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;3&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;&lt;sub&gt;&lt;span style="font-family:arial;"&gt;Participant &lt;span style="font-size:100%;"&gt;3&lt;/span&gt;&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;9&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;8&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;12&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;5&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span lang="EN-US"   style="font-family:arial;font-size:130%;"&gt;&lt;sub&gt;etc.&lt;/sub&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"   style="font-family:arial;font-size:130%;"&gt;...&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"   style="font-family:arial;font-size:130%;"&gt;...&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"   style="font-family:arial;font-size:130%;"&gt;...&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"   style="font-family:arial;font-size:130%;"&gt;...&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;For each possible pair of levels of factor A (e.g., A&lt;sub&gt;&lt;span style="font-size:85%;"&gt;1&lt;/span&gt;&lt;/sub&gt; and A&lt;sub&gt;&lt;span style="font-size:85%;"&gt;2 &lt;/span&gt;&lt;/sub&gt;&lt;em&gt;or&lt;/em&gt; A&lt;sub&gt;&lt;span style="font-size:85%;"&gt;2&lt;/span&gt;&lt;/sub&gt; and A&lt;sub&gt;&lt;span style="font-size:85%;"&gt;3&lt;/span&gt;&lt;/sub&gt;) we can calculate the difference between the observations. For example:&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;table cellpadding="0" border="0" dwcopytype="CopyTableRow"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:85%;"&gt;&lt;strong&gt;A&lt;sub&gt;1&lt;/sub&gt;&lt;/strong&gt;- &lt;/span&gt;&lt;strong&gt;&lt;span style="font-size:85%;"&gt;A&lt;/span&gt;&lt;sub&gt;&lt;span style="font-size:85%;"&gt;2 &lt;/span&gt;&lt;/sub&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:85%;"&gt;&lt;strong&gt;A&lt;sub&gt;1&lt;/sub&gt;&lt;/strong&gt;- &lt;/span&gt;&lt;strong&gt;&lt;span style="font-size:85%;"&gt;A&lt;/span&gt;&lt;sub&gt;&lt;span style="font-size:85%;"&gt;3 &lt;/span&gt;&lt;/sub&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:85%;"&gt;&lt;strong&gt;A&lt;sub&gt;1&lt;/sub&gt;&lt;/strong&gt;- &lt;strong&gt;A&lt;/strong&gt;&lt;/span&gt;&lt;sub&gt;&lt;span style="font-size:85%;"&gt;4 &lt;/span&gt;&lt;strong&gt;&lt;/strong&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;strong&gt;&lt;span lang="EN-US"&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:100%;"&gt;etc.&lt;/span&gt;&lt;sub&gt; &lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;&lt;sub&gt;&lt;span style="font-family:arial;"&gt;Participant &lt;span style="font-size:100%;"&gt;1&lt;/span&gt;&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;-1&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;-4&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;+4&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;&lt;strong&gt;...&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;&lt;sub&gt;&lt;span style="font-family:arial;"&gt;Participant &lt;span style="font-size:100%;"&gt;2&lt;/span&gt;&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;-5&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;-10&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;+3&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;&lt;strong&gt;...&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;&lt;sub&gt;&lt;span style="font-family:arial;"&gt;Participant &lt;span style="font-size:100%;"&gt;3&lt;/span&gt;&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;+1&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;-3&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;+4&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"  style="font-family:arial;"&gt;&lt;strong&gt;...&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span lang="EN-US"   style="font-family:arial;font-size:130%;"&gt;&lt;sub&gt;etc.&lt;/sub&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"   style="font-family:arial;font-size:130%;"&gt;...&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"   style="font-family:arial;font-size:130%;"&gt;...&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"   style="font-family:arial;font-size:130%;"&gt;...&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p style="TEXT-ALIGN: center" align="center"&gt;&lt;span lang="EN-US"   style="font-family:arial;font-size:130%;"&gt;&lt;strong&gt;...&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;We could then calculate variances for each of these differences (e.g., &lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;S&lt;/span&gt;&lt;/em&gt;&lt;span style="font-size:85%;"&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;1-4&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;). &lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;The &lt;em&gt;sphericity assumption&lt;/em&gt; is that the all the variances of the differences are equal (in the population sampled). In practice, we'd expect the observed sample variances of the differences to be similar if the sphericity assumption was met.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;Using the covariance matrix to check the sphericity assumption&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;We can check sphericity assumption using the covariance matrix, but it turns out to be fairly laborious. (Later on I'll discuss some simpler ways to check sphericity using output from SPSS and similar statistics packages). Variance of differences can be computed using a version of the &lt;em&gt;variance sum law:&lt;/em&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p align="center"&gt;&lt;span style="font-family:georgia;"&gt;&lt;span style="font-size:130%;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;x-y&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; &lt;/span&gt;&lt;span lang="EN-US"&gt;= &lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:180%;"&gt;s&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;x&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; + &lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:180%;"&gt;s&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;y&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; - 2(&lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:180%;"&gt;s&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;xy&lt;/sub&gt;)&lt;/span&gt;&lt;/span&gt; &lt;/p&gt;&lt;p align="center"&gt;&lt;span style="font-size:78%;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;&lt;/span&gt;&lt;/span&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;In other words the variance of a difference is the sum of the two variances minus twice their covariance. A simple arithmetic check will show that this works out as zero if the two variances share all their variance.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;&lt;span style="font-size:85%;"&gt;(Note that we could also calculate the variances of the differences directly from the raw data. We'd simply calculate the differences between all the possible pairs of levels of a factor. For example, using Excel or SPSS we could define a new column value as one level minus another level and then calculate the variances of each column using the built in descriptive statistics of the program. This would get very tedious if we had lots of levels, so I'd recommend the above method if you really want to calculate the variances of the differences and you already have a covariance matrix. Fortunately this isn't necessary in most cases - as I'll discuss later.)&lt;/span&gt; &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;An example&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;This example is adapted from Kirk (1995). Imagine the observed covariance matrix for a one-way repeated measures ANOVA design is this:&lt;/span&gt; &lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;table cellpadding="10" border="0"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;font-size:130%;"&gt;Samples:&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;1 &lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;2&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;3 &lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;4 &lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;&lt;strong&gt;A&lt;sub&gt;1&lt;/sub&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;10&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;5&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;10&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;15&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;font-size:130%;"&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;2&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;5&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;20&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class=""&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;15 &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/td&gt;&lt;td class=""&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;20&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;font-size:130%;"&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;3&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;10&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;15&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;30&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;25&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;4&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;15&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;20&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;25&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;40&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:arial;"&gt;The variances of the differences are: &lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:georgia;"&gt;&lt;em&gt;&lt;strong&gt;S&lt;/strong&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;x-y&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; &lt;/span&gt;&lt;span lang="EN-US"&gt;= &lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;S&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;x&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; + &lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;&lt;strong&gt;S&lt;/strong&gt;&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;y&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; - 2(&lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;&lt;strong&gt;S&lt;/strong&gt;&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;xy&lt;/sub&gt;)&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:arial;"&gt;&lt;span lang="EN-US"&gt;So ...&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial;"&gt;&lt;/span&gt;&lt;/p&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-family:arial;"&gt;&lt;em&gt;&lt;strong&gt;&lt;/strong&gt;&lt;/em&gt;&lt;/span&gt;&lt;/span&gt;&lt;p&gt;&lt;span style="font-family:georgia;"&gt;&lt;em&gt;&lt;strong&gt;S&lt;/strong&gt;&lt;/em&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sub&gt;1-2&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; &lt;/span&gt;&lt;span lang="EN-US"&gt;= &lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;&lt;strong&gt;S&lt;/strong&gt;&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;1&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; + &lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;&lt;strong&gt;S&lt;/strong&gt;&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;2&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; - 2(&lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;&lt;strong&gt;S&lt;/strong&gt;&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;12&lt;/sub&gt;)&lt;/span&gt;&lt;span lang="EN-US"&gt; = &lt;/span&gt;&lt;span lang="EN-US"&gt;10&lt;/span&gt;&lt;span lang="EN-US"&gt; + &lt;/span&gt;&lt;span lang="EN-US"&gt;20&lt;/span&gt;&lt;span lang="EN-US"&gt; - 2(&lt;/span&gt;&lt;span lang="EN-US"&gt;5&lt;/span&gt;&lt;span lang="EN-US"&gt;)&lt;/span&gt;&lt;span lang="EN-US"&gt; = 30 - 10 = 20&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:georgia;"&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:georgia;"&gt;&lt;em&gt;&lt;strong&gt;S&lt;/strong&gt;&lt;/em&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sub&gt;1-3&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; &lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;em&gt;=&lt;/em&gt; &lt;/span&gt;&lt;span lang="EN-US"&gt;10 + 30 - 2(10) = 20&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:georgia;"&gt;&lt;em&gt;&lt;strong&gt;S&lt;/strong&gt;&lt;/em&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sub&gt;1-4&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; &lt;/span&gt;&lt;span lang="EN-US"&gt;= &lt;/span&gt;&lt;span lang="EN-US"&gt;10&lt;/span&gt;&lt;span lang="EN-US"&gt; + &lt;/span&gt;&lt;span lang="EN-US"&gt;40&lt;/span&gt;&lt;span lang="EN-US"&gt; - 2(&lt;/span&gt;&lt;span lang="EN-US"&gt;15&lt;/span&gt;&lt;span lang="EN-US"&gt;)&lt;/span&gt;&lt;span lang="EN-US"&gt; = 20&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:georgia;"&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:georgia;"&gt;&lt;em&gt;&lt;strong&gt;S&lt;/strong&gt;&lt;/em&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sub&gt;2-3&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; &lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;em&gt;=&lt;/em&gt; &lt;/span&gt;&lt;span lang="EN-US"&gt;20 + 30 - 2(15) = 20&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:georgia;"&gt;&lt;em&gt;&lt;strong&gt;S&lt;/strong&gt;&lt;/em&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sub&gt;2-4&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; &lt;/span&gt;&lt;span lang="EN-US"&gt;= &lt;/span&gt;&lt;span lang="EN-US"&gt;20&lt;/span&gt;&lt;span lang="EN-US"&gt; + &lt;/span&gt;&lt;span lang="EN-US"&gt;40&lt;/span&gt;&lt;span lang="EN-US"&gt; - 2(&lt;/span&gt;&lt;span lang="EN-US"&gt;20&lt;/span&gt;&lt;span lang="EN-US"&gt;)&lt;/span&gt;&lt;span lang="EN-US"&gt; = 20&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;span style="font-family:georgia;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:georgia;"&gt;&lt;em&gt;&lt;strong&gt;S&lt;/strong&gt;&lt;/em&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sub&gt;3-4&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt; &lt;/span&gt;&lt;span lang="EN-US"&gt;= &lt;/span&gt;&lt;span lang="EN-US"&gt;30&lt;/span&gt;&lt;span lang="EN-US"&gt; + &lt;/span&gt;&lt;span lang="EN-US"&gt;40&lt;/span&gt;&lt;span lang="EN-US"&gt; - 2(2&lt;/span&gt;&lt;span lang="EN-US"&gt;5&lt;/span&gt;&lt;span lang="EN-US"&gt;)&lt;/span&gt;&lt;span lang="EN-US"&gt; = 20&lt;/span&gt;&lt;/span&gt; &lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;This example has been contrived so that variances of the differences are exactly equal (which would be extremely unlikely in real data), but it does demonstrate that lack of &lt;em&gt;compound symmetry&lt;/em&gt; does not necessarily mean that sphericity is violated. (Compound symmetry is a sufficient, but not necessary requirement for sphericity to be met.)&lt;sup&gt;&lt;span style="font-size:85%;"&gt;7&lt;/span&gt;&lt;/sup&gt; In this example, &lt;em&gt;compound symmetry&lt;/em&gt; is clearly not met (the largest variances and covariances are 4 or 5 times bigger than the smallest), but &lt;em&gt;sphericity&lt;/em&gt; holds.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;What to do if sphericity is violated in repeated measures ANOVA&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;There are two broad approaches to dealing with violations of sphericity. The first is to use a &lt;em&gt;correction&lt;/em&gt; to the standard ANOVA tests. The second is to use a different test (i.e., one that doesn't assume sphericity).&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;In the following sub-sections I give general advice on what to do if sphericity is violated, this advice tends to hold well in most cases for factorial repeated measures designs but may be problematic for mixed ANOVA designs (discussed later under &lt;strong&gt;&lt;i&gt;Complications&lt;/i&gt;&lt;/strong&gt;).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;Correcting for violations of sphericity&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;The best known corrections are those developed by Greenhouse and Geisser (the Greenhouse-Geisser correction) and Huynh and Feldt (the Huynh-Feldt correction).&lt;sup&gt;&lt;span style="font-size:85%;"&gt;8&lt;/span&gt;&lt;/sup&gt; Each of these corrections works roughly in the same way. They all attempt to adjust the degrees of freedom in the ANOVA test in order to produce a more accurate significance (&lt;em&gt;p&lt;/em&gt;) value. If sphericity is violated the &lt;em&gt;p&lt;/em&gt; values&lt;br /&gt;need to be adjusted upwards (and this can be accomplished by adjusting the degrees of freedom downwards).&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;The first step in each test is to estimate something called &lt;em&gt;epsilon&lt;/em&gt;.&lt;sup&gt;&lt;span style="font-size:85%;"&gt;9&lt;/span&gt;&lt;strong&gt; &lt;/strong&gt;&lt;/sup&gt;For our purposes we can consider &lt;em&gt;epsilon&lt;/em&gt; to be a descriptive statistic indicating the degree to which sphericity has been violated. If sphericity is met perfectly then epsilon will be exactly 1. If epsilon is below 1 then&lt;br /&gt;sphericity is violated. The further epsilon gets away from 1 the worse the violation.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;How bad can epsilon get? Well, it depends on the number of levels (&lt;em&gt;k&lt;/em&gt;) on the repeated measures factor:&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p align="center"&gt;&lt;span style="font-family:arial;"&gt;Lower bound of &lt;em&gt;epsilon&lt;/em&gt; =&lt;/span&gt;&lt;span style="font-family:georgia;"&gt; 1/(&lt;em&gt;k&lt;/em&gt;-1)&lt;/span&gt;&lt;/p&gt;&lt;span style="font-family:Arial;"&gt;&lt;/span&gt;&lt;p&gt;&lt;br /&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Therefore, epsilon can go as low as &lt;span style="font-family:arial;"&gt;1/(3-2) = 0.5&lt;/span&gt; for three levels, as low as 0.2 for six levels and so forth. The more levels on the repeated measures factor the worse the potential for violations of sphericity.&lt;sup&gt;&lt;span style="font-size:85%;"&gt;10&lt;/span&gt;&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;The three common corrections fall into a range of most to least strict. First consider the most strict. We could use the &lt;em&gt;lower bound&lt;/em&gt; value of epsilon and correct for the worst possible case. Fortunately, there is a &lt;em&gt;much&lt;/em&gt; better option. The Greenhouse-Geisser correction is a conservative correction (it tends to underestimate epsilon when epsilon is close to 1 and therefore tends to over-correct). Huynh-Feldt produced a modified version for use when the true value of epsilon is thought to be near or above 0.75.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;The Huynh-Feldt correction tends to overestimate sphericity, so some statisticians have suggested using the average of the Greenhouse-Geisser and Huynh-Feldt corrections. My advice would be to consider the aims of the research and the relative cost of Type I and II errors. If Type I errors are considered more costly (especially if the estimates of epsilon fall below 0.75) then stick to the more conservative Greenhouse-Geisser correction.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Using the correction is fairly simple. Replace the treatment and error d.f. by (&lt;em&gt;epsilon&lt;/em&gt;*d.f.). So an epsilon value of 0.6 would turn an &lt;em&gt;F&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:85%;"&gt;1,50&lt;/span&gt;&lt;/sub&gt; test into an &lt;em&gt;F&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:85%;"&gt;0.6,30&lt;/span&gt;&lt;/sub&gt; test.&lt;sup&gt;&lt;span style="font-size:85%;"&gt;11&lt;/span&gt;&lt;strong&gt; &lt;/strong&gt;&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Using the these corrections seems to work well for relatively modest departures from 1 by epsilon or when sample sizes are small.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;Using MANOVA&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;An alternative approach is to use a test that doesn't assume sphericity. In the case of repeated measures ANOVA this usually means switching to &lt;em&gt;multivariate ANOVA&lt;/em&gt; (MANOVA for short). Some computer programs print out MANOVA automatically alongside repeated measures ANOVA (SPSS is one of these). While this can be confusing, it does make it easy to compare results for different tests and corrections. If sphericity is met (i.e., epsilon = 1) all the &lt;em&gt;p&lt;/em&gt; values for a given test should be identical. The degree to which they differ can be informative. If there is a wide discrepancy between different tests or correction then this suggests that the sphericity assumption may be severely violated and that one of the more conservative tests should be reported (e.g., Greenhouse-Geisser corrected &lt;em&gt;F&lt;/em&gt; or MANOVA).&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;In general MANOVA is probably less powerful than repeated measures ANOVA and therefore should probably be avoided. However, when some experts suggest sample sizes are reasonably large (&lt;em&gt;n&lt;/em&gt; exceeds 10 + &lt;em&gt;k&lt;/em&gt;) &lt;em&gt;and&lt;/em&gt; epsilon is low (less than 0.7) MANOVA will typically be more powerful and should probably be preferred. The most up-to-date evidence suggests that the precise circumstances under which MANOVA is more powerful depend on the properties of the covariance matrix and &lt;em&gt;k&lt;/em&gt; in ways that are hard to predict.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;My view is that most psychological research should avoid MANOVA and prefer sphericity assumed or corrections based on Huynh-Feldt or Greenhouse-Geisser epsilon estimates (especially when multisample sphericity - see below - is an issue). This assumes two things. First, that the important tests can not be reduced to a 1 effect d.f. contrasts with a non-pooled error term (see &lt;strong&gt;Multiple comparisons and contrasts&lt;/strong&gt; below). Second, that psychologists tend to deal with relatively small &lt;em&gt;k&lt;/em&gt; and small &lt;em&gt;N&lt;/em&gt; situations. I don't think that there is much evidence that where MANOVA is more powerful that it is much more powerful than ANOVA for the modest values of &lt;em&gt;n&lt;/em&gt; or &lt;em&gt;k&lt;/em&gt; that appear in most of the psychological literature. I suspect that greater gains in power would be achieved by looking at other aspects of the study in most cases (e.g., see Baguley, 2004). If high quality pilot data were available it would be possible to estimate the the power of MANOVA or ANOVA in this situation (e.g., see Miles, 2003). However, I suspect that a small pilot sample would probably not estimate the population covariance with sufficient precision to be confident in this approach (this is not a flaw in the approach, but merely the observation that there is error in the estimates of all parameters in a sample - not just for the mean).&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;How to check sphericity&lt;/span&gt;&lt;/strong&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;In this section I will focus on information readily available in SPSS (and most good statistics packages).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;Factorial repeated measures ANOVA&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;If there is more than one repeated measures factor consider each factor separately. (See also &lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;Special cases: factors with 2 levels&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;.)&lt;/span&gt;&lt;/span&gt; &lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;A warning about Mauchly's sphericity test&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Many text books recommend using significance tests such as &lt;em&gt;Mauchly's&lt;/em&gt; to test sphericity. In general this is a &lt;em&gt;very bad idea&lt;/em&gt;. &lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Why? First, tests of statistical assumptions - and Mauchly's is no exception - tend to lack statistical power (they tend to be bad at spotting violations of assumptions when &lt;em&gt;N&lt;/em&gt; is small). Second, tests of statistical assumptions - and, again, Mauchly's is no exception - tend not to be very robust (unlike ANOVA and MANOVA they are poor at coping with violations of assumptions such as normality). Third, significance tests don't reveal the degree of violation (e.g., with very large &lt;em&gt;N&lt;/em&gt; even a poor test like Mauchly's will show significance if there are very minor violations of sphericity; with low &lt;em&gt;N&lt;/em&gt; the poor power means that even severe violations may not be detected). Fourth, significance tests of assumptions tend to be used as substitutes for looking at the data - if you followed the advice of many popular introductory texts you'd never look at the descriptive statistics at all (e.g., the variances, the covariance matrix, estimates of epsilon and so forth). Fifth, I just don't like them.&lt;sup&gt;&lt;span style="font-size:85%;"&gt;&lt;strong&gt;12&lt;/strong&gt;&lt;/span&gt;&lt;/sup&gt;&lt;sup&gt;&lt;br /&gt;&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;Using Mauchly's sphericity test&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;The test principle is fairly simple. The null hypothesis is that sphericity holds (I like to think of it as a test that the true value of epsilon = 1). A significant result indicates evidence that sphericity is violated (i.e., evidence that the true value of episilon is below 1).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;Epsilon&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;I would recommend using estimates of epsilon to decide whether sphericity is violated. If epsilon is close to 1 then it is likely that sphericity is intact (or that any violation is very minor). If epsilon is close to the lower bound (see &lt;em&gt;Correcting for violations of sphericity&lt;/em&gt; above) then a correction or alternative procedure is likely to be necessary. Exactly where to draw the line is a matter of personal judgement, but it is often instructive to compare &lt;em&gt;p&lt;/em&gt; values for the corrected and uncorrected tests. If they are fairly similar then there is little indication that sphericity is violated. If the the discrepancy is large then one of the corrections (or possibly MANOVA) should probably be used.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;The covariance matrix&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;If estimates of epsilon are not readily available then lower-bound procedures can be used (see above) or the covariance matrix can be consulted. If &lt;em&gt;compound symmetry&lt;/em&gt; holds then it is safe to proceed with repeated measures ANOVA. If compound symmetry does not hold it is relatively simple (if time-consuming) to calculate the variances of the differences for each factor from the covariance matrix.&lt;/span&gt;&lt;/span&gt; &lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;Complications&lt;/span&gt;&lt;/strong&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"&gt;Special cases: factors with 2 levels (and the paired t test)&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;If &lt;em&gt;k&lt;/em&gt; = 2 (a repeated measures factor with only two levels) then the sphericity assumption is always met. Using the lower-bound formula one can see that when &lt;em&gt;k&lt;/em&gt; = 2 epsilon can't be lower than 1/(&lt;em&gt;k&lt;/em&gt;-1) = 1/(&lt;em&gt;2&lt;/em&gt;-1) = 1. This is also true for the paired &lt;em&gt;t&lt;/em&gt; test (in effect a one-way repeated measures ANOVA where &lt;em&gt;k&lt;/em&gt; = 2).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Why isn't sphericity a problem when there are only two levels? Well, think about the covariance matrix:&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;table cellpadding="10" border="0"&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;font-size:130%;"&gt;Samples:&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;sub&gt;&lt;span style="font-size:100%;"&gt;1&lt;/span&gt; &lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;/span&gt;&lt;span style="font-size:100%;"&gt;&lt;sub&gt;2&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;&lt;strong&gt;A&lt;sub&gt;&lt;span style="font-size:100%;"&gt;1&lt;/span&gt;&lt;/sub&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;span style="font-size:85%;"&gt;&lt;sub&gt;1&lt;/sub&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:85%;"&gt;12&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;&lt;/em&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;&lt;/em&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;font-size:130%;"&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="font-size:130%;"&gt;A&lt;/span&gt;&lt;span style="font-size:100%;"&gt;&lt;sub&gt;2&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;span style="color:#999999;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;sub&gt;&lt;span style="font-size:85%;"&gt;21&lt;/span&gt;&lt;/sub&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;td class="Normal"&gt;&lt;p&gt;&lt;span style="font-size:130%;"&gt;&lt;span style="font-family:arial;"&gt;&lt;em&gt;S&lt;/em&gt;&lt;span style="font-size:85%;"&gt;&lt;sub&gt;2&lt;/sub&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;There are two covariances &lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;s&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;&lt;span style="font-size:85%;"&gt;21&lt;/span&gt;&lt;/sub&gt; and &lt;/span&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;s&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;12&lt;/sub&gt;. The covariances above and below the main diagonal are constrained to be equal (because the shared variance between level 1 and level 2 is the same thing as the shared variance between level 2 and level 1). In effect there is only one covariance. Similarly, if we calculated &lt;/span&gt;&lt;span style="font-size:85%;"&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;s&lt;/span&gt;&lt;/em&gt;&lt;span lang="EN-US"&gt;&lt;sub&gt;1-2&lt;/sub&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;2 &lt;/sup&gt;&lt;/span&gt;&lt;/span&gt;&lt;span lang="EN-US"&gt;(the variance of the difference) we should realize there is, in effect, only one such variance. Sphericity is met if all the variances of the differences are equal. As there is only one, it can't not be equal to itself. For this reason Mauchly's sphericity test can't be computed if d.f. = 1 (i.e, if &lt;em&gt;k&lt;/em&gt; = 2) and some computer programs give confusing messages or printouts if you try. This sometimes leads people to conclude that sphericity is violated when sucah a violation would be impossible.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Note that sphericity subsumes the standard &lt;em&gt;homogeneity of variance&lt;/em&gt; assumption. In effect, we are only interested in the variances of the differences. When &lt;em&gt;k&lt;/em&gt; = 2 there is only one variance of the difference between levels and we can ignore differences in the 'raw' level variances themselves.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;Multiple comparisons and contrasts&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Bonferroni &lt;em&gt;t &lt;/em&gt;tests are frequently recommended for repeated measures ANOVA (whether or not sphericity is violated). The Bonferroni correction relies on a general probability inequality and therefore isn't dependent on specific ANOVA assumptions. As Bonferroni corrections tend to be conservative, a number of modified Bonferroni procedures have been proposed. Some are specific to certain patterns of hypothesis-testing, but others such as Holm's test (or the equivalent Larzelere and Mulaik test) are more powerful than standard Bonferroni corrections and should be used more widely (and not just for ANOVA). The superficially similar Hochberg procedure is usually much more powerful than Bonferroni and modified Bonferroni procedures and might also be considered. The Hochberg procedure is radically different in philosophy from standard multiple comparison procedures (though similar to the Holm procedure in its computation) and should be considered carefully before adopted).&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Experts recommend specific (rather than pooled) error terms for repeated measures factors (i.e., calculate the &lt;em&gt;SE &lt;/em&gt;for &lt;em&gt;t&lt;/em&gt; using only the conditions being compared, rather than using the square root of the &lt;em&gt;MSE&lt;/em&gt; term from the ANOVA table to derive the pooled SD and hence compute a pooled SE).&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;This advice also extends to contrasts which can be easily calculated by performing paired &lt;em&gt;t&lt;/em&gt; tests on weighted averages of the appropriate means. Using a specific error term should avoid problems with sphericity (e.g., see Judd et al., 1995) for the same reason that sphericity is not a problem for factors with only 2 levels.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial;"&gt;In complex ANOVA or MANOVA designs an approach involving &lt;em&gt;a priori&lt;/em&gt; contrasts is also preferable for other reasons. Such designs leads to multiple tests of theoretically uninteresting hypotheses whereas selecting a small number of contrasts on the basis of prior theory is to be preferred on the basis of clarity, statistical robustness, control of Type I error and statistical power.&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;em&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;Mixed designs&lt;/span&gt;&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Mixed designs (combining independent and repeated measures factors) muddy the waters somewhat. Mixed measures ANOVA requires that &lt;em&gt;multisample sphericity&lt;/em&gt; holds. This more-or-less means that the covariance matrices should be similar between groups (i.e., across the levels of the independent measures factors). Provided group sizes are equal (or at least roughly equal) the Greenhouse-Geisser and Huynh-Feldt corrections perform well when &lt;em&gt;multisample sphericity&lt;/em&gt; doesn't hold and can therefore still be used. If these corrections are inappropriate, or if group sizes are markedly unequal then more sophisticated methods are required (Keselman, Algina &amp;amp; Kowalchuk, 2001). A description of these methods is beyond the scope of this summary (possible solutions include multilevel methods found in SAS PROC MIXED, MlWin and HLM, though Keselman et al. also discuss a number of other options). For most psychological research best advice is, if at all possible, to keep group sizes in mixed ANOVA equal (or as close to equal as feasible).&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;Bibliography&lt;/span&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;&lt;a href="http://dx.doi.org/doi:10.1016/j.apergo.2004.01.002"&gt;Baguley, T. (2004). Understanding statistical power in the context of applied research. Applied Ergonomics, 35, 73-80.&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;&lt;span lang="EN-US"&gt;Field, A. (1998). A bluffer's guide to ... sphericity. &lt;i&gt;The British Psychological Society: Mathematical, Statistical &amp;amp; Computing Section Newsletter, 6&lt;/i&gt;, 13-22. &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p align="left"&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Howell, D. C. (2002). &lt;a href="http://www.amazon.co.uk/gp/offer-listing/0495093610?ie=UTF8&amp;tag=psychologic05-21&amp;linkCode=am2&amp;camp=1634&amp;creative=6738&amp;creativeASIN=0495093610"&gt;Statistical Methods for Psychology&lt;/a&gt;&lt;img src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;l=as2&amp;o=2&amp;a=0495093610" width="1" height="1" border="0" alt="" style="border:none !important; margin:0px !important;" /&gt;. (5th. ed.). Belmont, CA: Duxberry Press.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p align="right"&gt;&lt;span style="font-family:Arial;font-size:85%;"&gt;[David Howell has just released a sixth edition which looks excellent. At first glance the main changes are in layout and use of software examples. Once I'm sure of the the content I'll update this reference.]&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Judd, C. M., McClelland, G. H., &amp;amp; Culhane, S. E. (1995). Data analysis: continuing issues in everday analysis of psychological data. &lt;i&gt;Annual Review of Psychology, 46&lt;/i&gt;, 433-465. &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Keselman, H. J., Algina, J., &amp;amp; Kowalchuk, R. K. (2001). The analysis of repeated measures designs: a review. &lt;i&gt;British Journal of Mathematical and Statistical Psychology, 54&lt;/i&gt;, 1-20. &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;span lang="EN-US"&gt;Kirk, R. E. (1995). &lt;a href="http://www.amazon.co.uk/gp/product/0534250920?ie=UTF8&amp;amp;tag=psychologic05-21&amp;amp;linkCode=as2&amp;amp;camp=1634&amp;amp;creative=6738&amp;amp;creativeASIN=0534250920"&gt;Experimental Design: Procedures for the Behavioral Sciences&lt;/a&gt;&lt;img src="http://www.assoc-amazon.co.uk/e/ir?t=psychologic05-21&amp;amp;l=as2&amp;amp;o=2&amp;amp;a=0534250920" width="1" height="1" border="0" alt="" style="border:none !important; margin:0px !important;" /&gt;. (3rd ed.). Pacific Grove: Brooks/Cole.&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family:arial;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.biomedcentral.com/1471-2288/3/27"&gt;Miles, J. (2003). A framework for power analysis using a structural equation modelling procedure. &lt;em&gt;BMC Medical Research Methodology, 3: 27&lt;/em&gt;. Published online 2003 December 11. doi: 10.1186/1471-2288-3-27.&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:130%;"&gt;&lt;i&gt;Footnotes:&lt;/i&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;a name="1"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sup&gt;1&lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/a&gt;&lt;span lang="EN-US"&gt;&lt;span style="font-size:85%;"&gt; &lt;/span&gt;As long as the treatment only has the effect of adding or subtracting to the group means (and doesn't influence their variances) the &lt;em&gt;homogeneity of variance&lt;/em&gt; assumption isn't a problem. This special case is known as &lt;em&gt;unit-treatment-additivity&lt;/em&gt;. Unfortunately life isn't always that simple: there are good reasons why treatments might be expected to influence both means an variances. For this reason it is always sensible to check the group variances in independent measures designs.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;a name="2"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sup&gt;2&lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/a&gt;&lt;span lang="EN-US"&gt; As a rule of thumb the largest group variance should be no more than three or four times as large as the smallest group variance.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;a name="3"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sup&gt;3&lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/a&gt;&lt;span lang="EN-US"&gt; Covariance matrices also crop up in all sorts of other statistics, but we can forget about that for now.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;a name="4"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sup&gt;4&lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/a&gt;&lt;strong&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt; &lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;span lang="EN-US"&gt;The diagonals contain the variances because samples share all of their variance with themselves. I've used &lt;strong&gt;&lt;i&gt;&lt;span style="font-size:130%;"&gt;s&lt;/span&gt;&lt;/i&gt;&lt;/strong&gt; rather than the Greek sigma symbol because it turns out better when browsers with different fonts are used. &lt;strong&gt;&lt;i&gt;&lt;span style="font-size:130%;"&gt;s&lt;/span&gt;&lt;/i&gt;&lt;/strong&gt; is normally used for samples and sigma for populations, but I'm using &lt;strong&gt;&lt;i&gt;&lt;span style="font-size:130%;"&gt;s &lt;/span&gt;&lt;/i&gt;&lt;/strong&gt;throughout for convenience (sorry!). You can generate Greek letters by using the "symbol" font in many word processors (e.g., 's' for sigma, m for mu and so forth).&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sup&gt;5 &lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;span lang="EN-US"&gt;The covariance between groups will rarely be exactly zero in the samples. However, provided people are randomly assigned to groups, and each person contributes only one data point then we can be pretty certain that the covariances in the populations being sampled are zero (and therefore the independence assumption is met). Even if random assignment to groups doesn't occur the independence assumption is often reasonable. Any time we know or believe that the measures will be correlated (e.g., in matched designs) a repeated measures analysis should be used. (I use repeated measures in a non-technical rather than just to mean a type of ANOVA or general linear model).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;a name="6"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sup&gt;6&lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/a&gt;&lt;strong&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt; &lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;span lang="EN-US"&gt;I probably should have mentioned this earlier, but covariances (like correlations) can be both negative and positive (unlike variances which are always positive). Positive covariances occur between samples when two samples are positively correlated. Negative covariances occur between samples when two samples are negatively correlated. The idea of a negative covariance is often tricky to grasp - but it just means that as one group tends to vary upwards in value the other tends to vary downwards. So when checking covariances to see if they are similar bear in mind the sign of the covariance as well as its magnitude (e.g., a covariance of -124.3 is very different from +124.3).&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;a name="7"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sup&gt;7&lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/a&gt;&lt;strong&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt; &lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;span lang="EN-US"&gt;By now you can probably appreciate why many text books focus on compound symmetry and don't cover sphericity in detail.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;a name="8"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sup&gt;8&lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/a&gt;&lt;strong&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt; &lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;span lang="EN-US"&gt;One of the nice things about this topic is that the tests have nice, impressive-sounding names. &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;a name="9"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sup&gt;9&lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/a&gt;&lt;strong&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt; &lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;span lang="EN-US"&gt;The Greek letter epsilon is usually used. Greenhouse-Geisser estimates of epsilon have a little hat on top (^). Huynh-Feldt estimates have a little squiggle on top (~). You can generate Greek letters by using the "symbol" font in many word processors (e.g., 'e' for epsilon).&lt;/span&gt;&lt;/span&gt; &lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;a name="10"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sup&gt;10&lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/a&gt;&lt;strong&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt; &lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;span lang="EN-US"&gt;Later on we discuss the special case of &lt;em&gt;k&lt;/em&gt; = 2 and the analagous case of paired &lt;em&gt;t&lt;/em&gt; tests. Feel free to jump ahead if you wish.&lt;/span&gt;&lt;/span&gt; &lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;strong&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt;&lt;span style="font-size:85%;"&gt;11&lt;/span&gt; &lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;span lang="EN-US"&gt;You won't find tables for fractional d.f., but exact &lt;em&gt;p&lt;/em&gt; values can be calculated if d.f. are fractional (most good computer packages do this automatically these days).&lt;/span&gt;&lt;/span&gt; &lt;/p&gt;&lt;p&gt;&lt;span style="font-family:Arial, Helvetica, sans-serif;"&gt;&lt;a name="12"&gt;&lt;strong&gt;&lt;span lang="EN-US"  style="font-size:85%;"&gt;&lt;sup&gt;12&lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/a&gt;&lt;strong&gt;&lt;span lang="EN-US"&gt;&lt;sup&gt; &lt;/sup&gt;&lt;/span&gt;&lt;/strong&gt;&lt;span lang="EN-US"&gt;Why don't I like them? Apart from all the above reasons, I don't like the idea of using a significance test to test the assumptions of a significance test. If that was a good idea, why don't we use significance tests to test the assumptions of &lt;em&gt;Mauchly's&lt;/em&gt; sphericity test or &lt;em&gt;Levine's &lt;/em&gt;test&lt;em&gt; &lt;/em&gt;of homogeneity of variances? At some point you've got look at the data (using graphical methods, descriptive statistics and so forth) and make a considered judgement about what procedures to use. My view is that the earlier this is done in the process the better ...&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;br /&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-114831327926138278?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/KODkHR48GKJYT4YngSn0yIl1mvo/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/KODkHR48GKJYT4YngSn0yIl1mvo/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/KODkHR48GKJYT4YngSn0yIl1mvo/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/KODkHR48GKJYT4YngSn0yIl1mvo/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/MR9pTf_kR6Y" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/114831327926138278/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2006/05/what-is-all-this-stuff-about.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/114831327926138278?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/114831327926138278?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/MR9pTf_kR6Y/what-is-all-this-stuff-about.html" title="What is all this stuff about sphericity in my repeated measures ANOVA output?" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2006/05/what-is-all-this-stuff-about.html</feedburner:origLink></entry><entry gd:etag="W/&quot;C04BRXo_fSp7ImA9WxZbEUo.&quot;"><id>tag:blogger.com,1999:blog-27862247.post-114726392006825136</id><published>2006-05-10T13:03:00.001+01:00</published><updated>2008-04-14T12:19:14.445+01:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2008-04-14T12:19:14.445+01:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="statistics" /><category scheme="http://www.blogger.com/atom/ns#" term="blog" /><category scheme="http://www.blogger.com/atom/ns#" term="psychology" /><title>What is a psychological statistics blog?</title><content type="html">&lt;span style="font-family:times new roman;"&gt;I've no idea.&lt;br /&gt;&lt;br /&gt;Some time ago I had the idea of a blog about the kinds of statistics I use in my work. I didn't really think much about it until I noticed a couple of blogs by a friend of mine (&lt;em&gt;&lt;a href="http://www.jeremymiles.co.uk/learningstats/index.html"&gt;Learning Statistics&lt;/a&gt;&lt;/em&gt; and &lt;/span&gt;&lt;a href="http://www.jeremymiles.co.uk/regressionbook/regressionblog.html"&gt;&lt;span style="font-family:times new roman;"&gt;&lt;em&gt;Applying Regression&lt;/em&gt; &lt;/span&gt;&lt;/a&gt;&lt;span style="font-family:times new roman;"&gt;by Jeremy Miles). Jeremy's blogs pointed me to the excellent &lt;/span&gt;&lt;a href="http://www.stat.columbia.edu/~cook/movabletype/mlm/"&gt;&lt;span style="font-family:times new roman;"&gt;&lt;em&gt;Statistical Modeling, Causal Inference and Social Science&lt;/em&gt; &lt;/span&gt;&lt;/a&gt;&lt;span style="font-family:times new roman;"&gt;by Andrew Gelman and Samantha Cook. As a consequence I'm not sure if this means that a psychological statistics blog is where all the cool academics hang out (though ... on reflection it seems unlikely) or whether I'm just slow to catch on (as usual).&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family:times new roman;"&gt;&lt;br /&gt;The reason the blog is appearing now is really just to make a home for all sorts of odds and ends I've written or (half-written) that have never seen the light or day, or were published but have since become homeless. What this blog is about might never become clear, but then (as a newcomer to blogging) I suspect if I did have a clear idea what I was doing I'd write a book instead.&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/27862247-114726392006825136?l=psychologicalstatistics.blogspot.com' alt='' /&gt;&lt;/div&gt;
&lt;p&gt;&lt;a href="http://feedads.g.doubleclick.net/~a/9nXx7Ew4CcmrM-Ly15XSLj9P8vA/0/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/9nXx7Ew4CcmrM-Ly15XSLj9P8vA/0/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;br/&gt;
&lt;a href="http://feedads.g.doubleclick.net/~a/9nXx7Ew4CcmrM-Ly15XSLj9P8vA/1/da"&gt;&lt;img src="http://feedads.g.doubleclick.net/~a/9nXx7Ew4CcmrM-Ly15XSLj9P8vA/1/di" border="0" ismap="true"&gt;&lt;/img&gt;&lt;/a&gt;&lt;/p&gt;&lt;img src="http://feeds.feedburner.com/~r/PsychologicalStatistics/~4/18tZrdLUCTU" height="1" width="1"/&gt;</content><link rel="replies" type="application/atom+xml" href="http://psychologicalstatistics.blogspot.com/feeds/114726392006825136/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://psychologicalstatistics.blogspot.com/2006/05/what-is-psychological-statistics-blog.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/114726392006825136?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/27862247/posts/default/114726392006825136?v=2" /><link rel="alternate" type="text/html" href="http://feedproxy.google.com/~r/PsychologicalStatistics/~3/18tZrdLUCTU/what-is-psychological-statistics-blog.html" title="What is a psychological statistics blog?" /><author><name>Palinurus</name><uri>http://www.blogger.com/profile/00392478801981388165</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>0</thr:total><feedburner:origLink>http://psychologicalstatistics.blogspot.com/2006/05/what-is-psychological-statistics-blog.html</feedburner:origLink></entry></feed>

