<?xml version='1.0' encoding='UTF-8'?><rss xmlns:atom='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' version='2.0'><channel><atom:id>tag:blogger.com,1999:blog-15703134</atom:id><lastBuildDate>Mon, 26 Oct 2009 17:00:58 +0000</lastBuildDate><title>Learning Statistics</title><description>A blog about learning statistics in psychology, health and social sciences.</description><link>http://www.jeremymiles.co.uk/learningstats/</link><managingEditor>noreply@blogger.com (J)</managingEditor><generator>Blogger</generator><openSearch:totalResults>52</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-3487012686344408604</guid><pubDate>Mon, 04 Aug 2008 05:03:00 +0000</pubDate><atom:updated>2008-08-04T06:05:23.336+01:00</atom:updated><title>Retiring</title><description>I've decided to retire this blog.  Well, I've decided to accept that I've almost run out of things to say on it.  I've decided to merge it with my &lt;a href="http://www.jeremymiles.co.uk/regressionbook/regressionblog.html"&gt;other blog&lt;/a&gt;, into a more general set of ramblings, and start a new blog which you can find &lt;a href="http://www.jeremymiles.co.uk/randomness/index.html"&gt;here&lt;/a&gt; (if you so care).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-3487012686344408604?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2008/08/retiring.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>2</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-6077434348570573297</guid><pubDate>Mon, 07 Apr 2008 17:05:00 +0000</pubDate><atom:updated>2008-04-07T18:08:11.700+01:00</atom:updated><title>Measurement vs Statistics</title><description>Bad Science has an interesting &lt;a href="http://www.badscience.net/?p=653"&gt;article about issues of measurement, before you even get to issues of analysis&lt;/a&gt;.  Here's the first paragraph:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;"There's this vague idea - which has been going around for the past few centuries - that statistics is quite difficult. But in reality the maths is often the least of your problems: the tricky bit comes way before the number crunching, when you are deciding what to measure, how to measure it, and what those measurements mean."&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-6077434348570573297?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2008/04/measurement-vs-statistics_07.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>3</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-21239454671496262</guid><pubDate>Mon, 07 Apr 2008 17:05:00 +0000</pubDate><atom:updated>2008-04-07T18:06:32.258+01:00</atom:updated><title>Measurement vs Statistics</title><description>&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-21239454671496262?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2008/04/measurement-vs-statistics.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-940263946107781445</guid><pubDate>Sat, 29 Mar 2008 10:05:00 +0000</pubDate><atom:updated>2008-03-29T10:05:35.808Z</atom:updated><title>Zehn Mark I</title><description>&lt;div style="float: right; margin-left: 10px; margin-bottom: 10px;"&gt; &lt;a href="http://www.flickr.com/photos/arenamontanus/2359186022/" title="photo sharing"&gt;&lt;img src="http://farm3.static.flickr.com/2200/2359186022_e90989dc70_m.jpg" alt="" style="border: solid 2px #000000;" /&gt;&lt;/a&gt; &lt;br /&gt; &lt;span style="font-size: 0.9em; margin-top: 0px;"&gt;  &lt;a href="http://www.flickr.com/photos/arenamontanus/2359186022/"&gt;Zehn Mark I&lt;/a&gt;  &lt;br /&gt;  Originally uploaded by &lt;a href="http://www.flickr.com/people/arenamontanus/"&gt;Arenamontanus&lt;/a&gt; &lt;/span&gt;&lt;/div&gt;Here's an old German 10 mark note, with a picture of Carl Gauss on it (as in Gaussian, as in normal distribution).  To the left of the picture of Gauss is a normal distribution, with the formula.  &lt;br /&gt;&lt;br /&gt;You can read more about the distribution (and see a larger version of the formula) at Wikipedia http://en.wikipedia.org/wiki/Gaussian_distribution.&lt;br clear="all" /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-940263946107781445?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2008/03/zehn-mark-i.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>2</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-6918969336509974926</guid><pubDate>Mon, 21 Jan 2008 13:02:00 +0000</pubDate><atom:updated>2008-01-21T13:04:40.166Z</atom:updated><title>Another LOLScience Picture</title><description>&lt;div style="float: right; margin-left: 10px; margin-bottom: 10px;"&gt; &lt;a href="http://www.flickr.com/photos/22909689@N04/2204443031/" title="photo sharing"&gt;&lt;img src="http://farm3.static.flickr.com/2318/2204443031_c6c97de5eb_m.jpg" alt="" style="border: 2px solid rgb(0, 0, 0);" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;span style="margin-top: 0px;font-size:0;" &gt;  &lt;a href="http://www.flickr.com/photos/22909689@N04/2204443031/"&gt;icanhaspaic128452639380076250&lt;/a&gt; &lt;br /&gt; Originally uploaded by &lt;a href="http://www.flickr.com/people/22909689@N04/"&gt;increpare&lt;/a&gt; &lt;/span&gt;&lt;/div&gt;It's Florence Nightingale who invented the pie chart (or something very similar she actually called them polar axis charts, but they led to pie charts) and was the first female member of the &lt;a href="http://www.rss.org.uk"&gt;Royal Statistical Society&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-6918969336509974926?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2008/01/another-lolscience-picture.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-3036847655860211079</guid><pubDate>Fri, 21 Dec 2007 16:36:00 +0000</pubDate><atom:updated>2007-12-21T16:57:12.596Z</atom:updated><title>Swanz</title><description>&lt;div style="float: right; margin-left: 10px; margin-bottom: 10px;"&gt; &lt;a href="http://www.flickr.com/photos/prawnwarp/530746651/" title="photo sharing"&gt;&lt;img src="http://farm2.static.flickr.com/1208/530746651_461072b105_m.jpg" alt="" style="border: 2px solid rgb(0, 0, 0);" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;span style="margin-top: 0px;font-size:0;" &gt;  &lt;a href="http://www.flickr.com/photos/prawnwarp/530746651/"&gt;swanz&lt;/a&gt;&lt;br /&gt;Originally uploaded by &lt;a href="http://www.flickr.com/people/prawnwarp/"&gt;mr lynch&lt;/a&gt; &lt;/span&gt;&lt;/div&gt;Fickr has a group called &lt;a href="http://www.flickr.com/groups/lolscience/pool/"&gt;LOL Science&lt;/a&gt;, here's one of the pictures.  It's a combination of a hideous number of in jokes.  If you get it, you can feel smug now.&lt;br /&gt;&lt;br /&gt;If you don't get it, here's some explanation.  We'll take them in order of relevance to this blog.&lt;br /&gt;&lt;br /&gt;&lt;ol&gt;&lt;li&gt;It's about Popper, and his philosophy of science.  Popper is important, because his philosophy defined the way that we think about science, and statistics.  Popper's important idea was that you couldn't prove a theory, you can only disprove it.  If you believe that all swans are white, then you can never prove this by finding white swans.  You have to go out and try to disprove your theory - you have to try to find a black swan.  If you try really hard to find more white swans, that doesn't tell us anything.  If you try really hard to find a black swan, and fail, that supports your theory (but doesn't prove it).  If you succeed, you've disproved your theory. That's why we focus on the null hypothesis in statistic. The old approach of collecting data to try to prove your theory, is the inductive method, the new approach, of trying hard to disprove your theory, is called the hypothetico-deductive method.  Now we know that black swans live in Australia (or the antipodes).&lt;br /&gt;Note for the enthusiastic - more recently it's been suggested that an &lt;a href="http://books.google.com/books?id=lZ2ybZzrnroC&amp;amp;pg=PA335&amp;amp;lpg=PA335&amp;amp;dq=abductive+mulaik&amp;amp;source=web&amp;amp;ots=06bUgKO1ab&amp;amp;sig=pE36Dmi0vdGAIRu7gmnJoMRLp9w"&gt;abductive &lt;/a&gt;approach should be employed, rather than a hypothetico-deductive approach.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;I'm in ur (X), (Y)ing ur Zs, is a &lt;a href="http://snowclones.org/2007/10/19/im-in-ur-x-ying-ur-z/"&gt;snowclone&lt;/a&gt;.  The term snowclone originated on the &lt;a href="http://itre.cis.upenn.edu/%7Emyl/languagelog/"&gt;Language Log&lt;/a&gt; blog, a snowclone (originally) takes the form "&lt;a href="http://itre.cis.upenn.edu/%7Emyl/languagelog/archives/000350.html"&gt;If eskimos have 94 words for snow, then X must have Y words for Z&lt;/a&gt;" (eskimos, of course, don't have 94 words for snow, or 16 or 138).  The original phrase was "I'm in ur base, killing ur d00dz".&lt;br /&gt;&lt;/li&gt;&lt;li&gt;The third link is about &lt;a href="http://icanhascheezburger.com/"&gt;LOLcats&lt;/a&gt; - this one's not about psychology or statistics, but it is about pictures of cats (or other animals) with amusing captions, usually with grammatical and spelling errors (think how your cat would write if it was aged 14 and texting its friends).  The I'm in ur X... meme is a very popular one to use for LOLcats pictures.  &lt;a href="http://online.wsj.com/article/SB118798557326508182.html"&gt;Here's an article that explains more&lt;/a&gt;.  (That last one didn't have anything to do with statistics or psychology).&lt;/li&gt;&lt;/ol&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-3036847655860211079?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/12/swanz.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-4321428129143430252</guid><pubDate>Sat, 04 Aug 2007 05:40:00 +0000</pubDate><atom:updated>2007-08-04T06:41:25.826+01:00</atom:updated><title>New email subscribe thing</title><description>If you want to get an email when a new post is added to this page, enter your email address in the box on the right.  (It's organized by &lt;a href="http://www.feedburner.com"&gt;Feedburner&lt;/a&gt;.)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-4321428129143430252?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/08/new-email-subscribe-thing.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-5201367297798793056</guid><pubDate>Wed, 01 Aug 2007 04:07:00 +0000</pubDate><atom:updated>2007-08-01T05:21:12.996+01:00</atom:updated><title>e</title><description>&lt;a href="http://www.newscientist.com/"&gt;New Scientist&lt;/a&gt; recently had an article about &lt;a href="http://www.newscientist.com/channel/fundamentals/mg19526131.600-iei-the-mystery-number.html"&gt;&lt;span style="font-style: italic;"&gt;e&lt;/span&gt;&lt;/a&gt; [subscription of some sort required for full article], everybody's second favourite mathematical constant.  (&lt;span style="font-style: italic;font-family:symbol;" &gt;&lt;/span&gt;&lt;span style="font-style: italic;"&gt;pi&lt;/span&gt; is everyone's favourite mathematical constant, &lt;span style="font-style: italic;"&gt;obviously&lt;/span&gt;).  It talked, in a relatively non-technical manner, about the reasons why &lt;span style="font-style: italic;"&gt;e&lt;/span&gt; was exciting.  It didn't talk about its role in statistics, in, for example, the &lt;a href="http://en.wikipedia.org/wiki/Normal_distribution"&gt;normal distribution&lt;/a&gt;, but it's probably too difficult to do that without assuming some prior knowledge.  [Did someone say too boring?  Out of the room!  NOW!]&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-5201367297798793056?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/08/e.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-3071099651981457218</guid><pubDate>Wed, 01 Aug 2007 03:33:00 +0000</pubDate><atom:updated>2007-08-01T04:57:05.717+01:00</atom:updated><title>Language Log on Odds Ratios</title><description>There was an &lt;a href="http://itre.cis.upenn.edu/%7Emyl/languagelog/archives/004767.html"&gt;interesting post on the presentation of odds ratios on the Language Log Blog&lt;/a&gt;, the other day.  They give some examples of odds ratios being deceptive, confusing and misunderstood.  It's been said plenty of times, by plenty of people (&lt;a href="http://www.amazon.co.uk/Understanding-Using-Statistics-Psychology-Introduction/dp/0761943978/r"&gt;including me&lt;/a&gt;) but it's interesting that linguists are saying it too.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-3071099651981457218?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/08/language-log-on-odds-ratios.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>2</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-5538341633871982881</guid><pubDate>Fri, 29 Jun 2007 23:21:00 +0000</pubDate><atom:updated>2007-06-30T00:52:02.937+01:00</atom:updated><title>You're a Bayesian!</title><description>I've written a bit before about Bayesian statistics, &lt;a href="http://www.jeremymiles.co.uk/learningstats/2006/07/what-is-probability.html"&gt;here&lt;/a&gt;, &lt;a href="http://www.jeremymiles.co.uk/learningstats/2006/12/example-of-bayesian-analysis.html"&gt;here&lt;/a&gt; and &lt;a href="http://www.jeremymiles.co.uk/learningstats/2007/02/types-of-errors.html"&gt;here&lt;/a&gt; (that last one where I stole a line from Brad Efron, who said "We can all be Bayesians when we need to be," and also in a &lt;a href="http://www.jeremymiles.co.uk/usingstatistics/index.html"&gt;recently published book&lt;/a&gt;.  I'm kind of sympathetic towards Bayesian analysis, but I very rarely do it.  The basis for Bayesian analysis is that we incorporate the prior probability of a result into our analysis.  Some people are positively antagonistic towards Bayesian thinking - denying that there is ever a use for it - the selection of the prior probability being something of a sticking point.  (Actually, Bayesian analysis is lots more complex than that, and doesn't always require what are called 'informative priors', but we won't worry about that for a minute.&lt;br /&gt;&lt;br /&gt;However, the most recent issue of &lt;a href="http://www.blackwellpublishing.com/journal.asp?ref=1740-9705"&gt;Significance&lt;/a&gt; had a very interesting article by &lt;a href="http://www.senns.demon.co.uk/home.html"&gt;Stephen Senn&lt;/a&gt;, in which he wrote about the &lt;a href="http://en.wikipedia.org/wiki/TGN1412"&gt;TeGenero tgn1412 drug trial catastrophe&lt;/a&gt; which occurred in March 2006, when 6 volunteers received the drug, and two received a placebo.  The 6 volunteers almost immediately had massive immune system reactions - specifically a cytokine storm, and were hospitalised for at least a month.&lt;br /&gt;&lt;br /&gt;What we have here, is the potential of a statistical analysis  We've got a 2x2 table, so let's do the stats.&lt;br /&gt;&lt;pre&gt;                       Placebo    Drug&lt;br /&gt;                Yes   0          6&lt;br /&gt;Cytokine Storm&lt;br /&gt;                No    2          0&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/pre&gt;A 2x2 table.  We obviously can't do a chi-square test, as the sample is too small.  But we can do a Fisher's exact test. If we do that we get a one-tailed p of 0.036.  It's a one-tailed test, so our p-value cut off is 0.025, so we don't have evidence that the drug caused the cytokine storm, and all the subsequent ills.&lt;br /&gt;&lt;br /&gt;But that's got to be a silly thing to say.  It's obvious that the drug &lt;span style="font-style: italic;"&gt;did&lt;/span&gt; cause the cytokine storm.  It's not just barely significant; it's really, really obvious.  Why is it so obvious? It's obvious because people don't have cytokine storms every day.  In fact, if you haven't got the &lt;a href="http://en.wikipedia.org/wiki/Spanish_flu"&gt;Spanish Flu&lt;/a&gt; we're pretty safe saying that you will never have a cytokine storm.   In other words, it's not just the data that we have obtained here that we need to take into account.  We need to take into account the probability of having a cytokine storm &lt;span style="font-style: italic;"&gt;ever&lt;/span&gt; is very low.  In other words, we need to take into account the prior probability.  And so we have just done a Bayesian analysis.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-5538341633871982881?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/06/youre-bayesian.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-115559461175063566</guid><pubDate>Tue, 26 Jun 2007 03:34:00 +0000</pubDate><atom:updated>2007-06-30T00:53:54.991+01:00</atom:updated><title></title><description>&lt;a href="http://www.york.ac.uk/inst/spru/"&gt;Tricia&lt;/a&gt; sent me an email:&lt;br /&gt;&lt;blockquote&gt;Greetings and a quick query re power calculations. I've read the very clear text on your website about this [that would be &lt;a href="http://www.jeremymiles.co.uk/misc/power/index.html"&gt;here&lt;/a&gt; - JM], but have one question that it doesn't answer. We are planning a study of stress, coping etc in staff in paediatric oncology. The subjects will be all staff in all the UK paediatric oncology units - so the total population rather than a sample. I think I vaguely remember that if you do a total population study then you don't do a power calculation, as it's not a sample, and anyway you can't increase&lt;br /&gt;numbers. Is that correct?&lt;/blockquote&gt;Interesting question.&lt;br /&gt;&lt;br /&gt;Two points to answer it.&lt;br /&gt;&lt;br /&gt;First, if you have the whole population, you don't need to do inferential statistics at all, so power is irrelevant.&lt;br /&gt;&lt;br /&gt;If you have a cohort of sociology students at the University of &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_0"&gt;Uttoxeter&lt;/span&gt;, their average age might be 20 years and 3 months.  There's no standard error or confidence interval on that.  Because you *know* their average age. A cohort of psychology students might be 20 years, 3 months, and 1 day.  The psychology students are older.  There's no significance test to be done, because it's a population difference.&lt;br /&gt;&lt;br /&gt;However, that's not what you are normally interested in.  Because we aren't interested in &lt;span style="font-style: italic;"&gt;just&lt;/span&gt; those cohorts of students.  We don't want to know if psychology students in 2007 in York are older than sociology students, we want to know about all psychology and sociology students in general.&lt;br /&gt;&lt;br /&gt;Similarly, we aren't really interested in all staff in paediatric oncology units.  We want to know about all possible staff in those units, and so the current population is just a sample from the population of possible staff.  We can make a statement about those staff today, but tomorrow, they might have changed.  You can assess it today but if you don't do a significance test, you can't say anything about it tomorrow.&lt;br /&gt;&lt;br /&gt;So you need to do a significance test, and if you need to do a significance test, you need to do a power calculation.  The purpose of the power calculation is to determine how large an effect you are likely to find.  If you don't get a significant result, you can say "Well, the effect would have had to be as large as X, so whatever the&lt;br /&gt;effect is, it's probably smaller than X."&lt;br /&gt;&lt;br /&gt;Two further issues, that you might not have considered.&lt;br /&gt;&lt;br /&gt;1) How large is the population? You might look into &lt;a href="http://www.psycho.uni-duesseldorf.de/aap/projects/gpower/reference/reference_manual_03.html"&gt;compromise power analysis &lt;/a&gt;if it's small, and there is nothing you can do about it.&lt;br /&gt;&lt;br /&gt;2) The sample is going to be clustered, and that's going to make life tricky, because you need to take it into account in both the analysis, and the power calculation.  That is, you need to distinguish between within unit effects, and between unit effects.  (Is it that within a unit, the people who have coping style X are more stressed, than the&lt;br /&gt;people who have coping style Y, or is it that in there are differences between units, in that some units have more coping style X, and less stress, with within a unit, there is no relation between style and stress.  It's possible for those effects to be in opposite directions.)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-115559461175063566?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/06/tricia-sent-me-email-greetings-and.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-9096776619043009692</guid><pubDate>Thu, 03 May 2007 04:48:00 +0000</pubDate><atom:updated>2007-05-04T15:35:39.929+01:00</atom:updated><title>You might hope ...</title><description>[Update: Read the link associated with the first comment, it does reveal that Furedi was somewhat misreported - or at least reported out of context.]&lt;br /&gt;&lt;br /&gt;Ann Furedi is chief executive of the British Pregnancy Advisory Service.  You'd hope that someone who was providing advice would base it on evidence of some sort, and that that person would understand evidence.  There was an a&lt;a href="http://www.guardian.co.uk/uk_news/story/0,,2071032,00.html"&gt;rticle in today's Guardian&lt;/a&gt;, about whether GPs would sign abortion refusal forms - reporting on a study that survey 309 GPs which found that nearly one in five refused to sign abortion refusal forms.&lt;br /&gt;&lt;br /&gt;Furedi was quoted in the Guardian as saying "she did not believe the survey accurately reflected GPs' opinions as it polled less than 1% of the UK's 40,000 GPs".  But it's not the proportion of people that you sample that counts, it's the number, and it's their representativeness.&lt;br /&gt;&lt;br /&gt;Opinion polls do a pretty good job of predicting elections, but they tend to survey around 1000 people.  No one says "There are [approx, I'm guessing] 40,000,000 voters in the UK, and they sampled only 1 in 40,000 (or 0.0025%) of them, so I don't believe the results."  If we move to a bigger country, say the US, then 1000 people is taken from a (hypothetical voting) population of 200,000,000 - they sample only 1 in 200,000 people, or 0.0005%, but shockingly this is no worse.&lt;br /&gt;&lt;br /&gt;Experiments in psychology are doing pretty well if they have a sample of a couple of hundred people, and we're not trying to generalise to a country there, we're trying to generalise to everyone there is - and everyone there might be - that's an infinite sample, so we have, according to this logic, sampled no one at all.&lt;br /&gt;&lt;br /&gt;The crucial thing is how representative the sample is.  I have tried to have a look for the article, which seems to have been published in &lt;a href="http://www.pulse-i.co.uk/"&gt;Pulse-i&lt;/a&gt; so I could check on it, but I couldn't find it.  (You seem to need to be a doctor to register as well).  If it was like most magazine surveys, it was a volunteer sample, in which case it probably is nonsense, but not for the reasons Furedi said.&lt;br /&gt;&lt;br /&gt;Two further points:&lt;br /&gt;&lt;br /&gt;First, it's possible, indeed likely, that Furedi was horribly misquoted.  There's a good chance that the journalist who was writing the story didn't understand the issue, so took one part of what she said, and quoted it.  (Imagine she might have given a 10 minute explanation of sampling, and at the end said ..."and the sample was on the small side", and that's the bit they decide to quote.&lt;br /&gt;&lt;br /&gt;Second, the article also mentions &lt;a href="http://www.cmwf.org/fellows/fellows_show.htm?doc_id=400054"&gt;Robbie Foy&lt;/a&gt;, who's a pal of mine, so I feel sort of famous by association.  (Although not as famous as the fact that I used to have a girlfriend whose mother's hairdresser's sister wrote the words to the theme song to &lt;a href="http://www.neighbours.com/"&gt;&lt;span style="font-style: italic;"&gt;Neighbours&lt;/span&gt;&lt;/a&gt;).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-9096776619043009692?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/05/you-might-hope.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-8554592218338887020</guid><pubDate>Sun, 22 Apr 2007 06:08:00 +0000</pubDate><atom:updated>2007-04-22T07:14:30.709+01:00</atom:updated><title>Wiki on research methods</title><description>A while ago, I wrote a book called 'Research methods and statistics in psychology', which was aimed at first year students studying psychology at university in the UK.&lt;br /&gt;&lt;br /&gt;I've now made it into a Wiki (well, the text, anyway).  Anyone is free to read it, and edit it.  You can find it at &lt;a href="http://www.researchmethodsinpsychology.com"&gt;www.researchmethodsinpsychology.com&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;By the nature of Wikis, it's &lt;a href="http://en.wikipedia.org/wiki/Free_as_in_beer"&gt;free as in beer, and free as in speech&lt;/a&gt;.  I wonder if it's an interesting idea to let students write or change a textbook.  Maybe they won't.  I'm wondering how much of a role I should / will play.  Those 50,000 (or so) words were my baby.  Will I get upset if people mash them around?  As it says on every editing page "If you don't want your writing to be edited mercilessly, then don't submit it here." Will I have urges to correct things I believe to be wrong?  Will it get too big for me to keep track of whether things are wrong?  (I doubt it, but you never know).&lt;br /&gt;&lt;br /&gt;It's got Google ads on it, which means that it might make me back the $6/month that I spend on hosting.  But I suspect it won't quite.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-8554592218338887020?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/04/wiki-on-research-methods.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-6984149385629858368</guid><pubDate>Mon, 02 Apr 2007 05:20:00 +0000</pubDate><atom:updated>2007-04-02T06:22:46.918+01:00</atom:updated><title>Book page updated</title><description>I've updated the page of supplemental information for the Understanding and Using Statistics in Psychology page.  You can see it &lt;a href="http://www.jeremymiles.co.uk/usingstatistics"&gt;here&lt;/a&gt;, but it's not much use if you haven't got the book.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-6984149385629858368?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/04/book-page-updated.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-1779052691690812095</guid><pubDate>Sun, 01 Apr 2007 22:54:00 +0000</pubDate><atom:updated>2007-04-02T00:01:43.483+01:00</atom:updated><title>What's the Denominator</title><description>When you want to try to fiddle people with statistics, to make yourself look better, one thing you can do is to try to change the denominator in your equation.  (The denominator is the underneath number of a fraction, so if we say 3/4, the numerator is 3, the denominator is 4). &lt;br /&gt;&lt;br /&gt;There was &lt;a href="http://observer.guardian.co.uk/cash/story/0,,2047267,00.html"&gt;a story in today's Guardian&lt;/a&gt; that gives a nice example of this.  The story was about ticket touting, and whether &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_0"&gt;Ebay&lt;/span&gt; is used by touts.  Here's the quote:&lt;br /&gt;&lt;blockquote&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_1"&gt;Ebay&lt;/span&gt; disputes that large-scale ticket touting takes place on its site, citing research by &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_2"&gt;ICM&lt;/span&gt; last year that showed nine out of 10 eBay sellers who had listed tickets in the previous 12 months had listed less than five.&lt;/blockquote&gt;(We'll ignore the fact that they should have said 'fewer than', not less than.)&lt;br /&gt;&lt;br /&gt;They are using sellers as the denominator, but that's not the right one.  The appropriate denominator is tickets.  Let's imagine that we had 10 sellers.  9 of those 10 sellers sold 4 tickets (making 36 tickets).  The tenth seller sold 964 tickets.  This matches what &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_3"&gt;Ebay&lt;/span&gt; said, but it's not relevant.  What is relevant is that 96.4% of tickets are sold by people who sell more than 500 per year.  That looks like a problem.&lt;br /&gt;&lt;br /&gt;It isn't necessarily the case that this is true - but it's not ruled out by what they say.  Lies, damn lies and statistics may be true - but the lies aren't very hard to see through.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-1779052691690812095?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/04/whats-denominator.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-5266518421795405606</guid><pubDate>Sun, 01 Apr 2007 04:54:00 +0000</pubDate><atom:updated>2007-05-08T22:59:41.996+01:00</atom:updated><title>Genetics</title><description>&lt;div style="float: right; margin-left: 10px; margin-bottom: 10px;"&gt; &lt;a href="http://www.flickr.com/photos/jeremymiles/441560049/" title="photo sharing"&gt;&lt;img src="http://farm1.static.flickr.com/189/441560049_a5fba56bac_m.jpg" alt="" style="border: 2px solid rgb(0, 0, 0);" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;span style="margin-top: 0px;font-size:0;" &gt;  &lt;a href="http://www.flickr.com/photos/jeremymiles/441560049/"&gt;Genetics&lt;/a&gt;&lt;br /&gt;Originally uploaded by &lt;a href="http://www.flickr.com/people/jeremymiles/"&gt;Beautiful Freaks&lt;/a&gt;. &lt;/span&gt;&lt;/div&gt;These are my twin sons, Alex (on the right) and Daniel (left), aged 4,  reading a book called &lt;span style="font-style: italic;"&gt;Polytomous Item Response Theory Models &lt;/span&gt;&lt;span&gt;(click &lt;/span&gt;on the picture to make it larger).&lt;br /&gt;&lt;br /&gt;If I saw this picture, I'd think it was posed, but I really did find them like this.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-5266518421795405606?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/04/genetics.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>2</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-3957231422375132104</guid><pubDate>Wed, 28 Mar 2007 03:56:00 +0000</pubDate><atom:updated>2007-03-28T05:14:18.305+01:00</atom:updated><title>Standard Deviations (Sample and Population)</title><description>There are two different kinds of standard deviations, and they are a bit prone to getting confused.  Here's the text of an email that Darren Van &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_0"&gt;Laar&lt;/span&gt; sent to me:&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;&lt;br /&gt;There's a bit on p.&lt;span id="st" name="st" class="st"&gt;42&lt;/span&gt; that &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_1"&gt;i'm&lt;/span&gt; unsure of though - as the population and sample standard deviation terms seem to be a bit conflated unless &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_2"&gt;i'm &lt;/span&gt;going crazy (possible!).&lt;br /&gt;Sample &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_3"&gt;Sd&lt;/span&gt; is the unbiased estimator (is divided by n-1) and is denoted by s.  Pop &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_4"&gt;Sd&lt;/span&gt; is the other one.&lt;br /&gt;Also, this makes Excel correct, but probably just me...&lt;/blockquote&gt;&lt;br /&gt;As I haven't got the book yet, I'm not exactly sure what's on page 42, but I can guess what it's about.  And this is where there are slightly different terminologies for the standard deviation.&lt;br /&gt;&lt;br /&gt;If you have a sample, taken from the population (which is what you almost always have), then you use the SD which divides by N - 1.  This is the unbiased estimate of the standard deviation.  It is the best estimate of the population standard deviation.&lt;br /&gt;Because it's the estimate of the population standard deviation, it's sometimes called the population standard deviation (this is what we call it).  But it's the estimate that&lt;br /&gt;you should use when you have a sample, so you could call it the sample standard deviation as well.&lt;br /&gt;&lt;br /&gt;Similarly, if you have measured the entire population, then your standard deviation is not divided by N-1, it's divided by N.  Sometimes (and Excel is included), this is called the population standard deviation, because it's the standard deviation that's used when you have measured the population.&lt;br /&gt;&lt;br /&gt;In fact, it's mostly a lot of worrying about not very much, because we never have an entire population, so we never divide by N, we always divide by N - 1 .&lt;br /&gt;&lt;br /&gt;However, it's possible that it's wrong in the book - I haven't seen it to check, and we had such a horrible time with the typesetter that all kinds of things changed - between what we wrote and the proofs that we saw.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-3957231422375132104?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/03/standard-deviations-sample-and.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-5790135800764753247</guid><pubDate>Wed, 28 Mar 2007 03:37:00 +0000</pubDate><atom:updated>2007-03-28T04:56:33.386+01:00</atom:updated><title>Book is Out</title><description>I heard that the book I have written with Phil Banyard "Understanding and Using Statistics in Psychology" is available for purchase in your favourite bookshop.&lt;br /&gt;&lt;br /&gt;You'd imagine that someone like the publisher would tell me stuff like that - after all.  But you'd be wrong.  I discovered it was out because Darren Van Laar, from Portsmouth University, emailed to ask me a question about something on Page 42.&lt;br /&gt;&lt;br /&gt;Unfortunately, as I haven't actually got a copy of the book yet, I was only able to help partially.  Also unfortunately, this means that all the things I said I was going to put on the &lt;a href="http://www.jeremymiles.co.uk/usingstatistics"&gt;book's website&lt;/a&gt;, because they wouldn't fit in the book, will actually have to be put on the book's website.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-5790135800764753247?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/03/book-is-out.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-6181079605118399589</guid><pubDate>Wed, 28 Mar 2007 03:17:00 +0000</pubDate><atom:updated>2007-03-28T04:33:01.857+01:00</atom:updated><title>Scoring questionnaires in SPSS and Stata.</title><description>I sent an email to the psych-&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_0"&gt;postgrads&lt;/span&gt; list, about scoring questionnaires in &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_1"&gt;SPSS&lt;/span&gt;.  I thought I'd reproduce it here, and elaborate a bit to include &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_2"&gt;Stata&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;First, use syntax.  You can check syntax for errors, you can rerun it when you find another late questionnaire, you can save it and re-use it years later.  You can give it to your friends and make yourself a more popular person.&lt;br /&gt;&lt;br /&gt;Here are some tips for syntax:&lt;br /&gt;&lt;br /&gt;First, &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_3"&gt;SPSS&lt;/span&gt; doesn't mind "&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_4"&gt;whitespace&lt;/span&gt;", so you can make your syntax&lt;br /&gt;more legible, and less prone to mistakes:&lt;br /&gt;&lt;br /&gt;Instead of:&lt;br /&gt;&lt;div style="direction: ltr;"&gt;&lt;span class="q"&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt; COMPUTE &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_5"&gt;AQPHYSIC&lt;/span&gt; = q1&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_6"&gt;aq&lt;/span&gt;  + q5&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_7"&gt;aq&lt;/span&gt; + q7&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_8"&gt;aq&lt;/span&gt; + q12&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_9"&gt;aq&lt;/span&gt; + q13&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_10"&gt;aq&lt;/span&gt; + q20&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_11"&gt;aq&lt;/span&gt; + q24&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_12"&gt;aq&lt;/span&gt; +&lt;/span&gt;&lt;span style="font-family: courier new;"&gt; q27&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_13"&gt;aq&lt;/span&gt; + q29&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_14"&gt;aq&lt;/span&gt;.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="direction: ltr;"&gt;Write:&lt;br /&gt;&lt;/div&gt;&lt;script&gt;&lt;!-- D(["mb","\u003cdiv style\u003d\"direction:ltr\"\&gt;\u003cspan class\u003dq\&gt;\u003cbr /\&gt;COMPUTE AQPHYSIC \u003d\u003cbr /\&gt;q1aq  +\u003cbr /\&gt;q5aq +\u003cbr /\&gt;q7aq +\u003cbr /\&gt;q12aq +\u003cbr /\&gt;q13aq +\u003cbr /\&gt;q20aq +\u003cbr /\&gt;q24aq +\u003cbr /\&gt;q27aq +\u003cbr /\&gt;q29aq\u003cbr /\&gt;.\u003cbr /\&gt;\u003cbr /\&gt;\u003c/span\&gt;\u003c/div\&gt;",1] );  //--&gt;&lt;/script&gt;&lt;div style="direction: ltr;"&gt;&lt;span class="q"&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;COMPUTE &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_15"&gt;AQPHYSIC&lt;/span&gt; =&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;q1&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_16"&gt;aq&lt;/span&gt;  +&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;q5&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_17"&gt;aq&lt;/span&gt; +&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;q7&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_18"&gt;aq&lt;/span&gt; +&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;q12&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_19"&gt;aq&lt;/span&gt; +&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;q13&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_20"&gt;aq&lt;/span&gt; +&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;q20&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_21"&gt;aq&lt;/span&gt; +&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;q24&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_22"&gt;aq&lt;/span&gt; +&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;q27&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_23"&gt;aq&lt;/span&gt; +&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;q29&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_24"&gt;aq&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;script&gt;&lt;!-- D(["mb","\u003cdiv style\u003d\"direction:ltr\"\&gt;Secondly, some items often need to be reversed, so you need to create\u003cbr /\&gt;a new variable for that, which you might call q29aq.rev.  But if you\u003cbr /\&gt;do that, you\'ve got to remember to put it in your syntax.\u003cbr /\&gt;\u003cbr /\&gt;So instead, create new variables with the new scales.\u003cbr /\&gt;\u003cbr /\&gt;For example, for my epq scoring syntax, this creates the lie items:\u003cbr /\&gt;\u003cbr /\&gt;COMPUTE \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l1.jm\" target\u003d_blank\&gt;l1.jm\u003c/a\&gt; \u003d epq3 .\u003cbr /\&gt;COMPUTE \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l2.jm\" target\u003d_blank\&gt;l2.jm\u003c/a\&gt; \u003d epq8 .\u003cbr /\&gt;COMPUTE \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l3.jm\" target\u003d_blank\&gt;l3.jm\u003c/a\&gt; \u003d epq12 .\u003cbr /\&gt;COMPUTE \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l4.jm\" target\u003d_blank\&gt;l4.jm\u003c/a\&gt; \u003d epq16.\u003cbr /\&gt;COMPUTE \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l5.jm\" target\u003d_blank\&gt;l5.jm\u003c/a\&gt; \u003d epq20 .\u003cbr /\&gt;COMPUTE \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l6.jm\" target\u003d_blank\&gt;l6.jm\u003c/a\&gt; \u003d epq24 .\u003cbr /\&gt;COMPUTE \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l7.jm\" target\u003d_blank\&gt;l7.jm\u003c/a\&gt; \u003d epq29 .\u003cbr /\&gt;COMPUTE \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l8.jm\" target\u003d_blank\&gt;l8.jm\u003c/a\&gt; \u003d epq33 .\u003cbr /\&gt;COMPUTE \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l9.jm\" target\u003d_blank\&gt;l9.jm\u003c/a\&gt; \u003d epq37 .\u003cbr /\&gt;COMPUTE \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l10.jm\" target\u003d_blank\&gt;l10.jm\u003c/a\&gt; \u003d epq40 .\u003cbr /\&gt;COMPUTE \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l11.jm\" target\u003d_blank\&gt;l11.jm\u003c/a\&gt; \u003d epq45 .\u003cbr /\&gt;COMPUTE \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l12.jm\" target\u003d_blank\&gt;l12.jm\u003c/a\&gt; \u003d epq47 .\u003cbr /\&gt;\u003cbr /\&gt;(I call them .jm to make it obvious what they were, and because I\u003cbr /\&gt;might be using this on anyone\'s dataset, and I don\'t know what their\u003cbr /\&gt;variables will be called.\u003cbr /\&gt;",1] );  //--&gt;&lt;/script&gt;Secondly, items often need to be reversed, so you need to create a new variable for that, which you might call q29&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_25"&gt;aq&lt;/span&gt;.rev.  But if you do that, you've got to remember to put it in your syntax to score the questionnaire.  Instead of doing that, create a new variable for every item in your scale (or &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_26"&gt;subscale&lt;/span&gt;).&lt;br /&gt;&lt;br /&gt;So instead, create new variables with the new scales.&lt;br /&gt;&lt;br /&gt;For example, I wrote some syntax (in 1994) for scoring the &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_27"&gt;EPQ&lt;/span&gt;-R (short form).  Here's the bit that creates the lie scale:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;COMPUTE l1.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_28"&gt;jm&lt;/span&gt; = &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_29"&gt;epq&lt;/span&gt;3 .&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;COMPUTE l2.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_30"&gt;jm&lt;/span&gt; = &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_31"&gt;epq&lt;/span&gt;8 .&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;COMPUTE l3.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_32"&gt;jm&lt;/span&gt; = &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_33"&gt;epq&lt;/span&gt;12 .&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;COMPUTE l4.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_34"&gt;jm&lt;/span&gt; = &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_35"&gt;epq&lt;/span&gt;16.&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;COMPUTE l5.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_36"&gt;jm&lt;/span&gt; = &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_37"&gt;epq&lt;/span&gt;20 .&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;COMPUTE l6.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_38"&gt;jm&lt;/span&gt; = &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_39"&gt;epq&lt;/span&gt;24 .&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;COMPUTE l7.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_40"&gt;jm&lt;/span&gt; = &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_41"&gt;epq&lt;/span&gt;29 .&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;COMPUTE l8.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_42"&gt;jm&lt;/span&gt; = &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_43"&gt;epq&lt;/span&gt;33 .&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;COMPUTE l9.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_44"&gt;jm&lt;/span&gt; = &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_45"&gt;epq&lt;/span&gt;37 .&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;COMPUTE l10.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_46"&gt;jm&lt;/span&gt; = &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_47"&gt;epq&lt;/span&gt;40 .&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;COMPUTE l11.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_48"&gt;jm&lt;/span&gt; = &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_49"&gt;epq&lt;/span&gt;45 .&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;COMPUTE l12.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_50"&gt;jm&lt;/span&gt; = &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_51"&gt;epq&lt;/span&gt;47 .&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;I create items l1.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_52"&gt;jm&lt;/span&gt; to l12.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_53"&gt;jm&lt;/span&gt;.  (I call them .&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_54"&gt;jm&lt;/span&gt; to make it obvious what they were, and because I might be using this on &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_55"&gt;anyone's&lt;/span&gt; &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_56"&gt;dataset&lt;/span&gt;, and I don't know what their&lt;br /&gt;variables will be called.)&lt;br /&gt;&lt;script&gt;&lt;!-- D(["mb","\u003cbr /\&gt;Next you need to reverse the items that need to be reversed:\u003cbr /\&gt;\u003cbr /\&gt;RECODE\u003cbr /\&gt;  \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l1.jm\" target\u003d_blank\&gt;l1.jm\u003c/a\&gt;  \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l4.jm\" target\u003d_blank\&gt;l4.jm\u003c/a\&gt; \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l11.jm\" target\u003d_blank\&gt;l11.jm\u003c/a\&gt; (1\u003d0)  (0\u003d1)  .\u003cbr /\&gt;\u003cbr /\&gt;When you\'ve done that, you can create the sums very easily.\u003cbr /\&gt;\u003cbr /\&gt;Instead of writing all the variables out, you can use &amp;quot;to&amp;quot;.\u003cbr /\&gt;\u003cbr /\&gt;So:\u003cbr /\&gt;\u003cbr /\&gt;compute l.total \u003d sum(\u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l1.jm\" target\u003d_blank\&gt;l1.jm\u003c/a\&gt; to \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l12.jm\" target\u003d_blank\&gt;l12.jm\u003c/a\&gt;).\u003cbr /\&gt;\u003cbr /\&gt;BUT WAIT, there\'s another problem, which is missing data.  Software\u003cbr /\&gt;handles this in two ways: Excel gives it a zero.  SPSS either gives it\u003cbr /\&gt;a zero, or makes the sum missing.  Neither of these are right.\u003cbr /\&gt;\u003cbr /\&gt;The solution is to use the mean score, and then multiple the mean by\u003cbr /\&gt;the number of items.  If all variables are completed, then the mean\u003cbr /\&gt;score multiplied by the number of items will equal the total.  If a\u003cbr /\&gt;score is missing, then that item will be given the average of all the\u003cbr /\&gt;items that the person did complete.\u003cbr /\&gt;\u003cbr /\&gt;However, if a person only completed one item out of 50, we probably\u003cbr /\&gt;don\'t want to give them a score for the total.  The solution is\u003cbr /\&gt;mean.x, where x is the number of items that must have been completed\u003cbr /\&gt;for a score to be given.\u003cbr /\&gt;\u003cbr /\&gt;So, if you want to only give people a score on the lie scale if they\u003cbr /\&gt;have completed 8 items, you use:\u003cbr /\&gt;\u003cbr /\&gt;compute l.total \u003d mean.8(\u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l1.jm\" target\u003d_blank\&gt;l1.jm\u003c/a\&gt; to \u003ca onclick\u003d\"return top.js.OpenExtLink(window,event,this)\" href\u003d\"http://l12.jm\" target\u003d_blank\&gt;l12.jm\u003c/a\&gt;) * 12.\u003cbr /\&gt;\u003cbr /\&gt;However, it\'s often better not to use the total score anyway, the mean\u003cbr /\&gt;score for an item is more useful, as it\'s more interpretable, so just\u003cbr /\&gt;miss off the *12.\u003cbr /\&gt;\u003cbr /\&gt;\u003cbr /\&gt;Jeremy\u003cbr /\&gt;\u003c/div\&gt;",1] );  //--&gt;&lt;/script&gt;&lt;br /&gt;Next you need to reverse the items that need to be reversed:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;RECODE&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt; l1.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_57"&gt;jm&lt;/span&gt;  l4.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_58"&gt;jm&lt;/span&gt; l11.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_59"&gt;jm&lt;/span&gt; (1=0)  (0=1)  .&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;When you've done that, you can create the sums very easily, using compute statements.&lt;br /&gt;&lt;br /&gt;There's an additional advantage to using new variables, and that is that of writing all the variables out, you can use "to".&lt;br /&gt;&lt;br /&gt;So:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;compute l.total = sum(l1.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_60"&gt;jm&lt;/span&gt; to l12.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_61"&gt;jm&lt;/span&gt;).&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;And &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_62"&gt;SPSS&lt;/span&gt; knows that means all the between l1.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_63"&gt;jm&lt;/span&gt; and l12.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_64"&gt;jm&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;BUT WAIT, there's another problem, which is missing data.  Software handles this in two ways: Excel gives it a zero.  &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_65"&gt;SPSS&lt;/span&gt; either gives it a zero, or makes the sum variable missing.  Neither of these are right.&lt;br /&gt;&lt;br /&gt;The solution is to use the mean score, and then multiple the mean by the number of items.  If all variables are completed, then the mean score multiplied by the number of items will equal the total.  If a score is missing, then that item will be given the average of all the items that the person did complete.&lt;br /&gt;&lt;br /&gt;However, if a person only completed one item out of 50, we probably don't want to give them a score for the total.  The solution is mean.x, where x is the number of items that must have been completed for a score to be given.&lt;br /&gt;&lt;br /&gt;So, if you want to only give people a score on the lie scale if they have completed 8 items, you use:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;compute l.total = mean.8(&lt;/span&gt;&lt;span style="font-family: courier new;"&gt;l1.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_66"&gt;jm&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: courier new;"&gt; to &lt;/span&gt;&lt;span style="font-family: courier new;"&gt;l12.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_67"&gt;jm&lt;/span&gt;&lt;/span&gt;&lt;span style="font-family: courier new;"&gt;) * 12.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;However, it's often better not to use the total score anyway, the mean&lt;br /&gt;score for an item is more useful, as it's more interpretable, so just&lt;br /&gt;miss off the *12, and use:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;compute l.total = mean.8(l1.jm to l12.&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_68"&gt;jm&lt;/span&gt;) .&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;You can also do this in &lt;a href="http://www.stata.com/"&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_69"&gt;Stata&lt;/span&gt;&lt;/a&gt;, and it's far, far easier.  In &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_70"&gt;Stata&lt;/span&gt;, you use the -alpha- command.  The command works out which items need to be reversed (very, very occasionally it gets it wrong though), and calculates a mean score.&lt;br /&gt;&lt;br /&gt;&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_71"&gt;Stata&lt;/span&gt; does care about carriage returns (&lt;a href="http://www.stata.com/help.cgi?delimit"&gt;unless you tell it not to&lt;/a&gt;) so you can't use the same trick to clarify your code.&lt;br /&gt;&lt;br /&gt;In &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_72"&gt;Stata&lt;/span&gt;, you would write (all on one line):&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;alpha &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_73"&gt;epq&lt;/span&gt;8 &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_74"&gt;epq&lt;/span&gt;12 &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_75"&gt;epq&lt;/span&gt;16&lt;span class="blsp-spelling-error" id="SPELLING_ERROR_76"&gt;epq&lt;/span&gt;20 &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_77"&gt;epq&lt;/span&gt;24 &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_78"&gt;epq&lt;/span&gt;29 &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_79"&gt;epq&lt;/span&gt;33 &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_80"&gt;epq&lt;/span&gt;37 &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_81"&gt;epq&lt;/span&gt;40 &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_82"&gt;epq&lt;/span&gt;45 &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_83"&gt;epq&lt;/span&gt;47 , generate (l.total) min(8) item&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;The &lt;span style="font-family: courier new;"&gt;generate()&lt;/span&gt; option tells &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_84"&gt;Stata&lt;/span&gt; to generate a new variable.  &lt;span style="font-family: courier new;"&gt;min(8) &lt;/span&gt;tells &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_85"&gt;Stata&lt;/span&gt; to only include cases that have answered at least 8 items, and &lt;span style="font-family: courier new;"&gt;item &lt;/span&gt;tells &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_86"&gt;Stata&lt;/span&gt; to do an item analysis, so that you can see which items &lt;span class="blsp-spelling-error" id="SPELLING_ERROR_87"&gt;Stata&lt;/span&gt; thought should be reversed.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-6181079605118399589?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/03/scoring-questionnaires-in-spss-and.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>4</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-3645256518282304268</guid><pubDate>Wed, 07 Mar 2007 22:12:00 +0000</pubDate><atom:updated>2007-03-07T22:29:11.178Z</atom:updated><title>Conservapedia</title><description>If you're the kind of person who reads about stuff on the internet (which obviously you are, 'cos you're reading this), you'll have encountered &lt;a href="http://www.conservapedia.com/Main_Page"&gt;Conservapedia&lt;/a&gt;, and various websites that have &lt;a href="http://cosmicvariance.com/2007/02/22/conservapedia/"&gt;mocked some of the entries&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;I had a look for some articles on statistical sorts of things, so that I could join in the mocking, that being my idea of fun.  But I failed.  The only article I found was &lt;a href="http://www.conservapedia.com/Statistics"&gt;one on statistics&lt;/a&gt;, which is brief but I couldn't really knock for accuracy.  I couldn't find anything else, mean, median, mode, regression, correlation, Fisher, Pearson, all have no entry. &lt;br /&gt;&lt;br /&gt;I tried more  words more distantly related to statistics - causality takes you to a page on &lt;a href="http://www.conservapedia.com/Non_sequitur"&gt;non-sequiturs&lt;/a&gt; , which does a half-hearted job of explaining the relation between correlation and causality.  'Evidence' takes you to a page about Jesus. &lt;br /&gt;&lt;br /&gt;There are lots of pages to mock (read the Jesus page, but also note that the discussion shows that there are some more sensible people there), but really, it's too easy.  However, I'll point out that on the Macroevolution page they make the error of describing macroevolution as "the unproven theory".  Of course it's unproven.  Theories can be lots of things - supported, useful, good, elaborate, but what they can't be is proven.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-3645256518282304268?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/03/conservapedia.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-3343428866064713596</guid><pubDate>Wed, 14 Feb 2007 18:32:00 +0000</pubDate><atom:updated>2007-02-14T18:34:42.540Z</atom:updated><title>GPower 3</title><description>G*Power version 3, a free program for power analysis, has just been released - it's a big step up from version 2, which looked a bit ancient, but still worked well. You can download it from here: &lt;a href="http://www.psycho.uni-duesseldorf.de/abteilungen/aap/gpower3/"&gt;http://www.psycho.uni-duesseldorf.de/abteilungen/aap/gpower3/&lt;/a&gt; .  You can read more about power here: &lt;a href="http://www.jeremymiles.co.uk/misc/power"&gt;http://www.jeremymiles.co.uk/misc/power&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-3343428866064713596?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/02/gpower-3.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-5273413716549123720</guid><pubDate>Sat, 10 Feb 2007 18:44:00 +0000</pubDate><atom:updated>2007-02-10T18:44:56.824Z</atom:updated><title>What is a facilitated system</title><description>&lt;a href="http://www.jeremymiles.co.uk/learningstats/2006/12/example-of-bayesian-analysis.html"&gt;A little while ago on this blog&lt;/a&gt;, I wrote about a post on the facilitated systems blog. I said I didn't know what a facilitated system was. Well, &lt;a href="http://facilitatedsystems.com/weblog/2007/01/what-is-are-facilitated-systems.html"&gt;Bill Harris has answered my (implied) question&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-5273413716549123720?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/02/what-is-facilitated-system.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-1693238567150486120</guid><pubDate>Thu, 08 Feb 2007 05:54:00 +0000</pubDate><atom:updated>2007-01-25T23:14:45.465Z</atom:updated><title>Types of Errors</title><description>&lt;a href="http://www.stat.columbia.edu/%7Ecook/movabletype/mlm/"&gt;Andrew Gelman's blog&lt;/a&gt; had a post today which linked back to an &lt;a href="http://www.stat.columbia.edu/%7Ecook/movabletype/archives/2004/12/type_1_type_2_t.html"&gt;old post&lt;/a&gt; where he discussed the different kinds of errors.   In our &lt;a href="http://www.jeremymiles.co.uk/usingstatistics"&gt;forthcoming book&lt;/a&gt;, we discuss the fact that all null hypotheses are false.  If that's the case (and it is) you can never make a type I error.  Because you make a type I error when you reject a null hypothesis, which is true.  And if you know that all null hypotheses are false, you can't make a type II error, when you say that the null hypothesis isn't false, when it is.  Because you know it isn't false.&lt;br /&gt;&lt;br /&gt;So what do we do?  We can't say that we never make an error, so instead, we need knew kinds of errors, which Gelman calls Type M and Type S.&lt;br /&gt;&lt;br /&gt;A type M error occurs when you get the magnitude of an effect wrong.  If we test for a correlation between two measures, and we find that the correlation is not significant, we (should) say, 'Well, whatever the effect is, it's small (and we don't know what direction it's in).'  If the correlation is actually large, we've made a type M error.&lt;br /&gt;&lt;br /&gt;A type S correlation occurs when we get the sign of an effect wrong.  Let's say  we find a significant positive correlation, and conclude that the population correlation is positive.  If the population correlation is actually negative, we've made a type S error.&lt;br /&gt;&lt;br /&gt;Type M and Type S errors make a lot more sense than Type I and II errors (which, as we've seen here, don't make sense).  And they're a lot easier to remember.  Gelman then goes into a lot of Bayesian elaboration, which I don't want to go into.  I can be a Bayesian when I need to, but I've really got to need to.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-1693238567150486120?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/02/types-of-errors.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>3</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-6156549428162814956</guid><pubDate>Tue, 02 Jan 2007 18:01:00 +0000</pubDate><atom:updated>2007-01-21T05:03:19.925Z</atom:updated><title>Statistics versus Qualitative Research</title><description>I wrote an article that was published in &lt;a href="http://www.bps.org.uk/publications/thepsychologist/the-psychologist_home.cfm?&amp;redirectCount=0"&gt;The Psychologist &lt;/a&gt;  called "Why I study statistics".  (It was, if you're interested, volume 19, part 11, 672-673, and part of a series on Why I study ...).  Anyway, in the next issue (or maybe the one after) the following letter was published, written by David Mingay:&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;I HAVE just read ‘Qualitative research… Emerging from the cocoon of science’ (November 2006) and didn’t understand a word of it. I turned the page and read ‘Why I study statistics’ and understood it completely. Maybe that simple observation answers the question posed by the author of the first article:‘Why does it [qualitative research] have such a low profile within academic psychology?’&lt;/blockquote&gt;I'm not sure that anyone has ever previously said they understood &lt;span style="font-style: italic;"&gt;anything &lt;/span&gt;I wrote.&lt;br /&gt;&lt;br /&gt;Update: Article '&lt;a href="http://www.jeremymiles.co.uk/mestuff/publications/46.pdf"&gt;Why I study  ... statistics&lt;/a&gt;' available as PDF.  (Nice picture of me as well).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-6156549428162814956?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2007/01/statistics-versus-qualitative-research.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>1</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-15703134.post-8917152520935277985</guid><pubDate>Wed, 20 Dec 2006 06:09:00 +0000</pubDate><atom:updated>2006-12-20T06:27:58.638Z</atom:updated><title>Horse racing deaths</title><description>One reason that we do statistical analysis and don't rely on our brains is that our brains are not to be trusted, a great deal of the time.  Brains were designed by evolution to keep us alive in the risky environment of something like the African Savannah.  One thing that might get you into a lot of trouble is missing out on some important pattern (there's always a tiger here at dusk, 3 days after rain, there is water over there, 3 weeks after rain, it's over here, etc).  So our brains make a lot of type I errors, when it comes to spotting patterns - a type I error (seeing a pattern when there wasn't one) is a lot more serious than a type II error (not seeing a pattern, when there was one).  A type I error means you avoid a certain area at dusk, which is a slight inconvenience. A type II error means you get eaten. &lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.guardian.co.uk"&gt;The Guardian&lt;/a&gt; has a story today about horse racing, and specifically about a run of &lt;a href="http://sport.guardian.co.uk/horseracing/story/0,,1975680,00.html?gusrc=rss&amp;feed=1"&gt;deaths of horses at Wolverhampton racetrack&lt;/a&gt;.  This obviously isn't a good thing, but one needs to ask the question of whether this means that there is something wrong with the track which is causing this problem, or whether it's just a statistical fluke - an artefact, which our brains spot as a pattern. &lt;br /&gt;&lt;br /&gt;Doing the analysis to find out is tricky, for two reasons - our friends the type I and type II errors.  There are quite a lot of racetracks (in England? Britain? The UK? British Isles? Europe? The world?) - what type I error rate do we want to accept, and over what area?  And, if we control the type I error rate, we lose power, and therefore don't detect an effect. &lt;br /&gt;&lt;br /&gt;This isn't just a problem for horses - the same problem arose in the &lt;a href="http://en.wikipedia.org/wiki/Harold_Shipman"&gt;Harold Shipman&lt;/a&gt; case - a type I error means you wrongly accuse a doctor of killing their patients.  A type II error means that you miss a murderous GP.&lt;br /&gt;&lt;br /&gt;(If you're interested, you can read about appropriate tests &lt;a href="http://intqhc.oxfordjournals.org/cgi/content/full/15/1/7"&gt;here&lt;/a&gt;, or watch a video &lt;a href="http://www.sciencelive.org/component/option,com_mediadb/task,view/idstr,CUSP-BAFOS05-06_DavidSpiegelhalter"&gt;here&lt;/a&gt;.)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/15703134-8917152520935277985?l=www.jeremymiles.co.uk%2Flearningstats%2Findex.html'/&gt;&lt;/div&gt;</description><link>http://www.jeremymiles.co.uk/learningstats/2006/12/horse-racing-deaths.html</link><author>noreply@blogger.com (J)</author><thr:total xmlns:thr='http://purl.org/syndication/thread/1.0'>0</thr:total></item></channel></rss>
