{"id":1647,"date":"2019-08-15T16:30:33","date_gmt":"2019-08-15T20:30:33","guid":{"rendered":"https:\/\/pressbooks.bccampus.ca\/simplestats\/?post_type=chapter&#038;p=1647"},"modified":"2019-08-20T18:35:09","modified_gmt":"2019-08-20T22:35:09","slug":"4-4-standard-deviation","status":"publish","type":"chapter","link":"https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/4-4-standard-deviation\/","title":{"raw":"4.4 Variance Continued, Standard Deviation","rendered":"4.4 Variance Continued, Standard Deviation"},"content":{"raw":"[latexpage]\r\n\r\nI'm sure you'll agree the preceding section was a lot to take in. And here's the kicker: after all that, we arrived at something which we cannot easily or intuitively interpret, given the squared units. However, the variance is used a lot in statistics, for great many things. Generally, the larger the variance, the greater the <em>variability<\/em> of the variable, or the larger the \"dispersed-ness\" of the cases.\r\n\r\n&nbsp;\r\n\r\nDespite the seemingly convoluted way we arrived at the variance and all the calculations and mathematical notation, what we did was actually quite simple. (No, really!)\r\n\r\n&nbsp;\r\n\r\nTo recap: just like we average all values by summing them up and dividing the sum on their total to get the mean, we average the distances of the values from the mean by summing them up and dividing the sum on their total. The only difference is that in order to be able to sum the distances, we need to square each of them first, or we cannot proceed.\r\n\r\n&nbsp;\r\n\r\nHere are the formulas for the mean and the variance together so that you can compare:\r\n\r\n&nbsp;\r\n\r\n$\\frac{\\sum\\limits_{i=1}^{N}{x_i}}{N} = \\overline{x}$\u00a0 \u00a0\u2190\u00a0<em>mean<\/em>\r\n\r\n&nbsp;\r\n\r\n$\\frac{\\sum\\limits_{i=1}^{N}{(x_i-\\overline{x})^2}}{N} = \\sigma^2$\u00a0 \u00a0\u00a0\u2190\u00a0<em>variance<\/em>\r\n\r\n&nbsp;\r\n\r\nNow that I have you feeling somewhat comfortable, I have a confession to make. <strong>This above isn't the only version of the formula for variance that exists or that we will be using.<\/strong>\r\n\r\n&nbsp;\r\n\r\nBear with me (and welcome back, to those who threw the reading away in disgust) -- I promise to explain everything when we get to inferential statistics further in the textbook, as the explanation requires concepts and terminology we have not yet covered and which cannot be easily introduced at this point. (Hint: it deals with estimation and uncertainty.)[footnote]If you'd like a preview,\u00a0<strong>the alternative, to-be-explained-later, formula for variance is:<\/strong>\r\n\r\n&nbsp;\r\n\r\n$\\frac{\\sum\\limits_{i=1}^{N}{(x_i-\\overline{x})^2}}{N-1} = s^2\u00a0=$\u00a0 \u00a0\u00a0\u2190\u00a0<strong><em>variance<\/em><\/strong>\r\n\r\n&nbsp;\r\n\r\nAs you can see, the modification is quite small -- <strong>instead of dividing the sum of squares by the total number <em>N<\/em>, we actually divide it by the total <em>minus one<\/em>, <em>N<\/em>-1<\/strong>. If it makes you feel better, dividing just by <em>N<\/em> or by <em>N-1 <\/em>produces generally similar results, in terms of magnitude of the variance.\u00a0<span style=\"text-indent: 18.6667px;font-size: 14pt\">We also denote this version with a regular small-case $s^2$.<\/span><span style=\"text-indent: 1em;font-size: 14pt\">[\/footnote]\u00a0<\/span>\r\n\r\n&nbsp;\r\n\r\nOne thing worth noting, however, is that despite the lack of proper explanation as of yet, when working with typical datasets<strong>\u00a0SPSS will produce variances by dividing the sum of squares by <em>N-1<\/em> instead of by <em>N<\/em>.<\/strong>\r\n\r\n&nbsp;\r\n<div class=\"textbox textbox--learning-objectives\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\"><em><span style=\"color: #ff0000\"><strong>Watch Out!!<\/strong><\/span>\u00a0<span style=\"background-color: #000000;color: #ff0000\">#9<\/span> ... for The Order of Operations<\/em><\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n&nbsp;\r\n\r\nWhen considering the formula for variance, and the steps we took to calculate it, pay special attention to the <em>sum of squares<\/em>. That is, we need a sum of <em>squares\u00a0<\/em>(a.k.a., to add the squared distances from the mean together): <strong>we <em>first<\/em> calculate the distances, <em>then<\/em> square them, and finally sum the <em>squared<\/em> distances up<\/strong>.\r\n\r\n&nbsp;\r\n\r\nA common mistake, however, is to try to calculate the distances, sum them up, <em>then<\/em> square the sum. As explained above, the (un-squared) distances add up to zero, and squaring the zero will not improve things. A version of this mistake is also to calculate the distances, then try to sum them and divide them by <em>N<\/em>-1, and <em>then<\/em> square the result. Obviously this would also be unsuccessful. To avoid these type of frustrations, try to remember the purpose of the squaring: to \"turn\" all distances into positive numbers. Everything else we do (summing, dividing), we do to the already squared distances.\r\n\r\n<\/div>\r\n<\/div>\r\n&nbsp;\r\n\r\nIn an effort to show you that the calculation of the variance is simple when done without the protracted explanations, take another example we have used before, <em>number of siblings<\/em>.\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\"><em>Example 4.5 Variance for\u00a0Number of Siblings<\/em><\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n&nbsp;\r\n\r\nIn discussing the median in Section 3.2 (<a href=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/3-2-median\/\">https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/3-2-median\/<\/a>), we imagined you asked seven of your friends about the number of their siblings. These were the values we used:\u00a0 2, 1, 4, 2, 1, 0, 3.\r\n\r\n&nbsp;\r\n\r\nLet's produce the variance, in four simple steps, after calculating the mean; Step 1A, obtain the distances from the mean; Step 1B, square the distances from the mean; Step 2, obtain the sum of squares (i.e., sum the distances up); Step 3, divide by <em>N<\/em>.\r\n\r\n&nbsp;\r\n\r\n<strong>Preliminary step: obtain the mean.<\/strong>\r\n\r\n$\\frac{\\sum\\limits_{i=1}^{N}{x_i}}{N}=\\frac{2+1+4+2+1+0+3}{7}=\\frac{13}{7}=1.857= \\overline{x}$\r\n\r\n&nbsp;\r\n\r\n<strong>Steps 1A and 1B are presented in the table below:<\/strong>\r\n\r\n&nbsp;\r\n\r\n<em>Table 4.4 Calculating Distances To the Mean and Squaring Each Distance<\/em>\r\n<table class=\"lines\" style=\"border-collapse: collapse;width: 131.729%;height: 236px\" border=\"0\">\r\n<tbody>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 34.1359%;height: 15px;text-align: center\"><strong>$x_i$<\/strong><\/td>\r\n<td style=\"width: 36.6855%;height: 15px;text-align: center\"><strong>$(x_i - \\overline{x})$<\/strong><\/td>\r\n<td style=\"width: 61.0955%;height: 15px;text-align: center\"><strong>$(x_i - \\overline{x})^2$<\/strong><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 34.1359%;height: 15px;text-align: center\">2<\/td>\r\n<td style=\"width: 36.6855%;height: 15px;text-align: center\">(2 - 1.857) = 0.143<\/td>\r\n<td style=\"width: 61.0955%;height: 15px;text-align: center\">(0.143)<sup>2<\/sup> = 0.02<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 34.1359%;height: 15px;text-align: center\">1<\/td>\r\n<td style=\"width: 36.6855%;height: 15px;text-align: center\">(1 - 1.857) = -0.857<\/td>\r\n<td style=\"width: 61.0955%;height: 15px;text-align: center\">(-0.857)<sup>2<\/sup> = 0.734<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 34.1359%;height: 15px;text-align: center\">4<\/td>\r\n<td style=\"width: 36.6855%;height: 15px;text-align: center\">(4 - 1.857) = 2.143<\/td>\r\n<td style=\"width: 61.0955%;height: 15px;text-align: center\">(2.143)<sup>2<\/sup> = 4.592<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 34.1359%;height: 15px;text-align: center\">2<\/td>\r\n<td style=\"width: 36.6855%;height: 15px;text-align: center\">(2 - 1.857) = 0.143<\/td>\r\n<td style=\"width: 61.0955%;height: 15px;text-align: center\">(0.143)<sup>2<\/sup> = 0.02<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 34.1359%;height: 15px;text-align: center\">1<\/td>\r\n<td style=\"width: 36.6855%;height: 15px;text-align: center\">(1 - 1.857) = -0.857<\/td>\r\n<td style=\"width: 61.0955%;height: 15px;text-align: center\">(-0.857)<sup>2<\/sup> = 0.734<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 34.1359%;height: 15px;text-align: center\">0<\/td>\r\n<td style=\"width: 36.6855%;height: 15px;text-align: center\">(0 - 1.857) = -1.857<\/td>\r\n<td style=\"width: 61.0955%;height: 15px;text-align: center\">(-1.857)<sup>2<\/sup> = 3.448<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 34.1359%;height: 15px;text-align: center\">3<\/td>\r\n<td style=\"width: 36.6855%;height: 15px;text-align: center\">(3 - 1.86) = 1.143<\/td>\r\n<td style=\"width: 61.0955%;height: 15px;text-align: center\">(1.143)<sup>2<\/sup> = 1.306<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<strong>Step 2, obtain the sum of squares<\/strong>:\r\n\r\n&nbsp;\r\n\r\n$\\sum\\limits_{i=1}^{N}{(x_i-\\overline{x})^2} = (0.02)2+(0.734)2+4.592+3.448+1.306=10.854$\u00a0 \u00a0\u00a0\u2190<em>Sum of Squares<\/em>\r\n\r\n&nbsp;\r\n\r\n<strong>Step 3, divide the sum of squares<\/strong> (rounded down to two digits) <strong>by <em>N<\/em><\/strong>, i.e., by<em>\u00a07<\/em>:\r\n\r\n&nbsp;\r\n\r\n$\\frac{\\sum\\limits_{i=1}^{N}{(x_i-\\overline{x})^2}}{N}=\\frac{10.85}{7}=1.55= \\sigma^2$\u00a0 \u00a0\u00a0\u2190\u00a0<strong><em>variance<\/em><\/strong>\r\n\r\n&nbsp;\r\n\r\nThus, we find that your seven friends have an average of about 1.6 squared distances from the mean number of siblings 1.9 (rounded up from 1.857).\r\n\r\n<\/div>\r\n<\/div>\r\n&nbsp;\r\n\r\n<em>Oh, great<\/em>, you are probably thinking now, and I can imagine the sarcasm -- <em>we calculated something we can't even interpret properly<\/em>. I mean, it's more than a tad awkward to try to explain \"an average of about 1.6 squared distances from the mean number of siblings\" to anyone not versed in statistics. Maybe it would be better if we could get rid of the \"squared-ness\"?\r\n\r\n&nbsp;\r\n\r\nYou know what?\u00a0<em>We can<\/em>. The standard deviation is here to help.\r\n\r\n&nbsp;\r\n\r\n<strong>Standard deviation<\/strong>. Believe it or not, after all the steps we went through to get to the variance, calculating the standard deviation is a breeze: specifically, a breeze that turns back the squared units into <em>standard<\/em> units, hence the name.\r\n\r\n&nbsp;\r\n\r\nSee for yourself:\r\n\r\n&nbsp;\r\n\r\n$\\sqrt{\\frac{\\sum\\limits_{i=1}^{N}{(x_i-\\overline{x})^2}}{N}} = \\sqrt{\\sigma^2}=\\sigma$\u00a0 \u00a0\u00a0\u2190\u00a0<strong><em>standard deviation<\/em><\/strong>\r\n\r\n&nbsp;\r\n\r\nDespite its scary looks, this is actually just the formula for variance <em>under a square root<\/em>. That is, <strong>we take the square root of the variance to get the standard deviation<\/strong>. That's it. Nothing more. Just a regular square root, and we're there. Cue in a sigh of relief![footnote]Note, however, that just like there is an \"alternative\", to-be-explained-later, formula for variance, there is an \"alternative\" formula for standard deviation, following the same principle regarding dividing the sum of squares by <em>N-1<\/em> instead of by <em>N<\/em>:\r\n\r\n&nbsp;\r\n\r\n$\\sqrt{\\frac{\\sum\\limits_{i=1}^{N}{(x_i-\\overline{x})^2}}{N-1}} = \\sqrt{s^2}=s$\u00a0 \u00a0\u2190\u00a0<strong><em>standard deviation\u00a0<\/em><\/strong>\r\n\r\n&nbsp;\r\n\r\nAs well, SPSS will use this (<em>N<\/em>-1) version of the formula when working with variables in a dataset.[\/footnote]\r\n\r\n&nbsp;\r\n\r\nNow that we know how to get back to standard units, let's do that for the two examples we used. We had a variance of\u00a0<em>\u03c3<sup>2<\/sup><\/em> = 15.21 for <em>hours worked per week<\/em>\u00a0in the previous section and a variance of\u00a0<em>\u03c3<sup>2<\/sup><\/em> = 1.6 for <em>numbers of siblings <\/em>in the example above. Square-rooting gives us the following:\r\n\r\n&nbsp;\r\n\r\n$$\\sqrt{\\sigma^2}=\\sqrt{15.21}=3.9$$\r\n\r\n&nbsp;\r\n\r\nand\r\n\r\n&nbsp;\r\n\r\n$$\\sqrt{\\sigma^2}=\\sqrt{1.6}=1.25$$\r\n\r\n&nbsp;\r\n\r\nNow <em>these<\/em> we <em>can<\/em> interpret: on average, your hours worked per week deviated from the mean of 8.7 hours per week by 3.9\u00a0<em>hours<\/em>, and your friends\u00a0<span style=\"text-align: initial;text-indent: 2em;font-size: 14pt\">deviated from the average number of siblings,1.9, by 1.25\u00a0<\/span><em style=\"text-align: initial;text-indent: 2em;font-size: 14pt\">siblings<\/em><span style=\"text-align: initial;text-indent: 2em;font-size: 14pt\">.<\/span>\r\n\r\n&nbsp;\r\n\r\nTo repeat, <strong>the standard deviation is the square root of the variance. The standard deviation is a measure of dispersion which gives us the average deviation of the cases from the mean.<\/strong> (Technically, an average of the squared distances from the mean in standard units.)\r\n\r\n&nbsp;\r\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\"><em>Do It! 4.2 Longevity of The First Fifteen Canadian Prime Ministers<\/em><\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n&nbsp;\r\n\r\nCalculate the variance and standard deviation of the longevity of the first fifteen Prime Ministers of Canada. In chronological order (starting with Macdonald and ending with Pierre Trudeau), their ages at the time of death were: 76, 70, 72, 49, 93, 94, 77, 82, 86, 75, 76, 91, 83, 75, and 80. Interpret your results (i.e., explain what you have found beyond \"the standard deviation is ...\").\r\n\r\n&nbsp;\r\n\r\nYou can use a table like Table 4.4 to organize your calculations. (Hint: Start with calculating the mean age at death, $\\overline{x}$, and round it up to a whole number to make your job easier.) Here\u00a0$x_i$ is age at death for each PM and <em>N<\/em>=15.\r\n\r\n&nbsp;\r\n\r\nYou can check your answers in this footnote.<span style=\"text-indent: 33.6px;font-size: 0.9em\">[footnote]\u00a0<\/span><span style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">The mean is 79 years; the sum of squares 1,717; the variance 114.5; the standard deviation 10.7 years. However, if you calculated the variance and standard deviation with <\/span><em style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">N<\/em><span style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">-1 in the denominators, you will get a variance of 123 and a standard deviation of 11.1 years.\u00a0 The difference is as large as it is due to the small <\/span><em style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">N. H<\/em><span style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">ad we been working with a real dataset of hundreds or thousands of cases, the difference between the just-<\/span><em style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">N<\/em><span style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\"> and <\/span><em style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">N<\/em><span style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">-1 versions of the formulas would have been less pronounced.[\/footnote]<\/span>\r\n\r\n<\/div>\r\n<\/div>\r\n&nbsp;\r\n\r\nOf course, one wouldn't normally calculate variances and standard deviations by hand: we only do it so that you can understand what the measures are and what they really provide us with, by obtaining them ourselves. Usually, however, we simply use SPSS.\r\n\r\n&nbsp;\r\n<div class=\"textbox textbox--key-takeaways\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\"><em>SPSS Tip 4.2 Obtaining Variance and Standard Deviation<\/em><\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<ul>\r\n \t<li>From the <em>Main Menu<\/em>, select <em>Analyze,<\/em>\u00a0then<em> Descriptive Statistics, <\/em>and then<em> Frequencies<\/em>;<\/li>\r\n \t<li>Select your variable of choice from the list on the left and use the arrow to move it to the right side of the window;<\/li>\r\n \t<li>Click on the <em>Statistics<\/em> button on the right;<\/li>\r\n \t<li>In this new window, check <em>Variance<\/em> and <em>Standard deviation<\/em> in the <i>Dispersion\u00a0<\/i>section on the left at the bottom;<\/li>\r\n \t<li>Click <em>Continue<\/em>, then <em>OK<\/em>.<\/li>\r\n \t<li>The <em>Output<\/em> window will provide a table with the requested measures.<\/li>\r\n \t<li>Make sure you know how to interpret your results! (Try to use as little statistics jargon as possible.)<\/li>\r\n<\/ul>\r\n<\/div>\r\n<\/div>","rendered":"<p>I&#8217;m sure you&#8217;ll agree the preceding section was a lot to take in. And here&#8217;s the kicker: after all that, we arrived at something which we cannot easily or intuitively interpret, given the squared units. However, the variance is used a lot in statistics, for great many things. Generally, the larger the variance, the greater the <em>variability<\/em> of the variable, or the larger the &#8220;dispersed-ness&#8221; of the cases.<\/p>\n<p>&nbsp;<\/p>\n<p>Despite the seemingly convoluted way we arrived at the variance and all the calculations and mathematical notation, what we did was actually quite simple. (No, really!)<\/p>\n<p>&nbsp;<\/p>\n<p>To recap: just like we average all values by summing them up and dividing the sum on their total to get the mean, we average the distances of the values from the mean by summing them up and dividing the sum on their total. The only difference is that in order to be able to sum the distances, we need to square each of them first, or we cannot proceed.<\/p>\n<p>&nbsp;<\/p>\n<p>Here are the formulas for the mean and the variance together so that you can compare:<\/p>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-54a4f51450081873fb41ce5946b4aeec_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#115;&#117;&#109;&#92;&#108;&#105;&#109;&#105;&#116;&#115;&#95;&#123;&#105;&#61;&#49;&#125;&#94;&#123;&#78;&#125;&#123;&#120;&#95;&#105;&#125;&#125;&#123;&#78;&#125;&#32;&#61;&#32;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;\" title=\"Rendered by QuickLaTeX.com\" height=\"43\" width=\"71\" style=\"vertical-align: -6px;\" \/>\u00a0 \u00a0\u2190\u00a0<em>mean<\/em><\/p>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-606c5b2c2f045651c6d3f755c33e7d52_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#115;&#117;&#109;&#92;&#108;&#105;&#109;&#105;&#116;&#115;&#95;&#123;&#105;&#61;&#49;&#125;&#94;&#123;&#78;&#125;&#123;&#40;&#120;&#95;&#105;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;&#94;&#50;&#125;&#125;&#123;&#78;&#125;&#32;&#61;&#32;&#92;&#115;&#105;&#103;&#109;&#97;&#94;&#50;\" title=\"Rendered by QuickLaTeX.com\" height=\"43\" width=\"114\" style=\"vertical-align: -6px;\" \/>\u00a0 \u00a0\u00a0\u2190\u00a0<em>variance<\/em><\/p>\n<p>&nbsp;<\/p>\n<p>Now that I have you feeling somewhat comfortable, I have a confession to make. <strong>This above isn&#8217;t the only version of the formula for variance that exists or that we will be using.<\/strong><\/p>\n<p>&nbsp;<\/p>\n<p>Bear with me (and welcome back, to those who threw the reading away in disgust) &#8212; I promise to explain everything when we get to inferential statistics further in the textbook, as the explanation requires concepts and terminology we have not yet covered and which cannot be easily introduced at this point. (Hint: it deals with estimation and uncertainty.)<a class=\"footnote\" title=\"If you'd like a preview,\u00a0the alternative, to-be-explained-later, formula for variance is:\n\n\u00a0\n\n\u00a0 \u00a0\u00a0\u2190\u00a0variance\n\n\u00a0\n\nAs you can see, the modification is quite small -- instead of dividing the sum of squares by the total number N, we actually divide it by the total minus one, N-1. If it makes you feel better, dividing just by N or by N-1 produces generally similar results, in terms of magnitude of the variance.\u00a0We also denote this version with a regular small-case .\" id=\"return-footnote-1647-1\" href=\"#footnote-1647-1\" aria-label=\"Footnote 1\"><sup class=\"footnote\">[1]<\/sup><\/a>\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>One thing worth noting, however, is that despite the lack of proper explanation as of yet, when working with typical datasets<strong>\u00a0SPSS will produce variances by dividing the sum of squares by <em>N-1<\/em> instead of by <em>N<\/em>.<\/strong><\/p>\n<p>&nbsp;<\/p>\n<div class=\"textbox textbox--learning-objectives\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\"><em><span style=\"color: #ff0000\"><strong>Watch Out!!<\/strong><\/span>\u00a0<span style=\"background-color: #000000;color: #ff0000\">#9<\/span> &#8230; for The Order of Operations<\/em><\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>&nbsp;<\/p>\n<p>When considering the formula for variance, and the steps we took to calculate it, pay special attention to the <em>sum of squares<\/em>. That is, we need a sum of <em>squares\u00a0<\/em>(a.k.a., to add the squared distances from the mean together): <strong>we <em>first<\/em> calculate the distances, <em>then<\/em> square them, and finally sum the <em>squared<\/em> distances up<\/strong>.<\/p>\n<p>&nbsp;<\/p>\n<p>A common mistake, however, is to try to calculate the distances, sum them up, <em>then<\/em> square the sum. As explained above, the (un-squared) distances add up to zero, and squaring the zero will not improve things. A version of this mistake is also to calculate the distances, then try to sum them and divide them by <em>N<\/em>-1, and <em>then<\/em> square the result. Obviously this would also be unsuccessful. To avoid these type of frustrations, try to remember the purpose of the squaring: to &#8220;turn&#8221; all distances into positive numbers. Everything else we do (summing, dividing), we do to the already squared distances.<\/p>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>In an effort to show you that the calculation of the variance is simple when done without the protracted explanations, take another example we have used before, <em>number of siblings<\/em>.<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\"><em>Example 4.5 Variance for\u00a0Number of Siblings<\/em><\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>&nbsp;<\/p>\n<p>In discussing the median in Section 3.2 (<a href=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/3-2-median\/\">https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/3-2-median\/<\/a>), we imagined you asked seven of your friends about the number of their siblings. These were the values we used:\u00a0 2, 1, 4, 2, 1, 0, 3.<\/p>\n<p>&nbsp;<\/p>\n<p>Let&#8217;s produce the variance, in four simple steps, after calculating the mean; Step 1A, obtain the distances from the mean; Step 1B, square the distances from the mean; Step 2, obtain the sum of squares (i.e., sum the distances up); Step 3, divide by <em>N<\/em>.<\/p>\n<p>&nbsp;<\/p>\n<p><strong>Preliminary step: obtain the mean.<\/strong><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-4387311fde7759ba75049748259c14ed_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#115;&#117;&#109;&#92;&#108;&#105;&#109;&#105;&#116;&#115;&#95;&#123;&#105;&#61;&#49;&#125;&#94;&#123;&#78;&#125;&#123;&#120;&#95;&#105;&#125;&#125;&#123;&#78;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#50;&#43;&#49;&#43;&#52;&#43;&#50;&#43;&#49;&#43;&#48;&#43;&#51;&#125;&#123;&#55;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#49;&#51;&#125;&#123;&#55;&#125;&#61;&#49;&#46;&#56;&#53;&#55;&#61;&#32;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;\" title=\"Rendered by QuickLaTeX.com\" height=\"43\" width=\"316\" style=\"vertical-align: -6px;\" \/><\/p>\n<p>&nbsp;<\/p>\n<p><strong>Steps 1A and 1B are presented in the table below:<\/strong><\/p>\n<p>&nbsp;<\/p>\n<p><em>Table 4.4 Calculating Distances To the Mean and Squaring Each Distance<\/em><\/p>\n<table class=\"lines\" style=\"border-collapse: collapse;width: 131.729%;height: 236px\">\n<tbody>\n<tr style=\"height: 15px\">\n<td style=\"width: 34.1359%;height: 15px;text-align: center\"><strong><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-c8700e0258243116de0d4f288e2e3b44_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#120;&#95;&#105;\" title=\"Rendered by QuickLaTeX.com\" height=\"11\" width=\"15\" style=\"vertical-align: -3px;\" \/><\/strong><\/td>\n<td style=\"width: 36.6855%;height: 15px;text-align: center\"><strong><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-ed80b30d4524affe2f383f1b9b108ce1_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#40;&#120;&#95;&#105;&#32;&#45;&#32;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;\" title=\"Rendered by QuickLaTeX.com\" height=\"18\" width=\"59\" style=\"vertical-align: -4px;\" \/><\/strong><\/td>\n<td style=\"width: 61.0955%;height: 15px;text-align: center\"><strong><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-3b97388a55278917bb7ca9e0bcd183d6_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#40;&#120;&#95;&#105;&#32;&#45;&#32;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;&#94;&#50;\" title=\"Rendered by QuickLaTeX.com\" height=\"19\" width=\"67\" style=\"vertical-align: -4px;\" \/><\/strong><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 34.1359%;height: 15px;text-align: center\">2<\/td>\n<td style=\"width: 36.6855%;height: 15px;text-align: center\">(2 &#8211; 1.857) = 0.143<\/td>\n<td style=\"width: 61.0955%;height: 15px;text-align: center\">(0.143)<sup>2<\/sup> = 0.02<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 34.1359%;height: 15px;text-align: center\">1<\/td>\n<td style=\"width: 36.6855%;height: 15px;text-align: center\">(1 &#8211; 1.857) = -0.857<\/td>\n<td style=\"width: 61.0955%;height: 15px;text-align: center\">(-0.857)<sup>2<\/sup> = 0.734<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 34.1359%;height: 15px;text-align: center\">4<\/td>\n<td style=\"width: 36.6855%;height: 15px;text-align: center\">(4 &#8211; 1.857) = 2.143<\/td>\n<td style=\"width: 61.0955%;height: 15px;text-align: center\">(2.143)<sup>2<\/sup> = 4.592<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 34.1359%;height: 15px;text-align: center\">2<\/td>\n<td style=\"width: 36.6855%;height: 15px;text-align: center\">(2 &#8211; 1.857) = 0.143<\/td>\n<td style=\"width: 61.0955%;height: 15px;text-align: center\">(0.143)<sup>2<\/sup> = 0.02<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 34.1359%;height: 15px;text-align: center\">1<\/td>\n<td style=\"width: 36.6855%;height: 15px;text-align: center\">(1 &#8211; 1.857) = -0.857<\/td>\n<td style=\"width: 61.0955%;height: 15px;text-align: center\">(-0.857)<sup>2<\/sup> = 0.734<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 34.1359%;height: 15px;text-align: center\">0<\/td>\n<td style=\"width: 36.6855%;height: 15px;text-align: center\">(0 &#8211; 1.857) = -1.857<\/td>\n<td style=\"width: 61.0955%;height: 15px;text-align: center\">(-1.857)<sup>2<\/sup> = 3.448<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 34.1359%;height: 15px;text-align: center\">3<\/td>\n<td style=\"width: 36.6855%;height: 15px;text-align: center\">(3 &#8211; 1.86) = 1.143<\/td>\n<td style=\"width: 61.0955%;height: 15px;text-align: center\">(1.143)<sup>2<\/sup> = 1.306<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><strong>Step 2, obtain the sum of squares<\/strong>:<\/p>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-9ab06aaad8dc4359509688e0c528a108_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#115;&#117;&#109;&#92;&#108;&#105;&#109;&#105;&#116;&#115;&#95;&#123;&#105;&#61;&#49;&#125;&#94;&#123;&#78;&#125;&#123;&#40;&#120;&#95;&#105;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;&#94;&#50;&#125;&#32;&#61;&#32;&#40;&#48;&#46;&#48;&#50;&#41;&#50;&#43;&#40;&#48;&#46;&#55;&#51;&#52;&#41;&#50;&#43;&#52;&#46;&#53;&#57;&#50;&#43;&#51;&#46;&#52;&#52;&#56;&#43;&#49;&#46;&#51;&#48;&#54;&#61;&#49;&#48;&#46;&#56;&#53;&#52;\" title=\"Rendered by QuickLaTeX.com\" height=\"43\" width=\"516\" style=\"vertical-align: -17px;\" \/>\u00a0 \u00a0\u00a0\u2190<em>Sum of Squares<\/em><\/p>\n<p>&nbsp;<\/p>\n<p><strong>Step 3, divide the sum of squares<\/strong> (rounded down to two digits) <strong>by <em>N<\/em><\/strong>, i.e., by<em>\u00a07<\/em>:<\/p>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-7a38b5b72da448eea7b088bbcc81a32b_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#115;&#117;&#109;&#92;&#108;&#105;&#109;&#105;&#116;&#115;&#95;&#123;&#105;&#61;&#49;&#125;&#94;&#123;&#78;&#125;&#123;&#40;&#120;&#95;&#105;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;&#94;&#50;&#125;&#125;&#123;&#78;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#49;&#48;&#46;&#56;&#53;&#125;&#123;&#55;&#125;&#61;&#49;&#46;&#53;&#53;&#61;&#32;&#92;&#115;&#105;&#103;&#109;&#97;&#94;&#50;\" title=\"Rendered by QuickLaTeX.com\" height=\"43\" width=\"228\" style=\"vertical-align: -6px;\" \/>\u00a0 \u00a0\u00a0\u2190\u00a0<strong><em>variance<\/em><\/strong><\/p>\n<p>&nbsp;<\/p>\n<p>Thus, we find that your seven friends have an average of about 1.6 squared distances from the mean number of siblings 1.9 (rounded up from 1.857).<\/p>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p><em>Oh, great<\/em>, you are probably thinking now, and I can imagine the sarcasm &#8212; <em>we calculated something we can&#8217;t even interpret properly<\/em>. I mean, it&#8217;s more than a tad awkward to try to explain &#8220;an average of about 1.6 squared distances from the mean number of siblings&#8221; to anyone not versed in statistics. Maybe it would be better if we could get rid of the &#8220;squared-ness&#8221;?<\/p>\n<p>&nbsp;<\/p>\n<p>You know what?\u00a0<em>We can<\/em>. The standard deviation is here to help.<\/p>\n<p>&nbsp;<\/p>\n<p><strong>Standard deviation<\/strong>. Believe it or not, after all the steps we went through to get to the variance, calculating the standard deviation is a breeze: specifically, a breeze that turns back the squared units into <em>standard<\/em> units, hence the name.<\/p>\n<p>&nbsp;<\/p>\n<p>See for yourself:<\/p>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-d3dbdc2c590705a3b7b6af3a2c45df14_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#115;&#113;&#114;&#116;&#123;&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#115;&#117;&#109;&#92;&#108;&#105;&#109;&#105;&#116;&#115;&#95;&#123;&#105;&#61;&#49;&#125;&#94;&#123;&#78;&#125;&#123;&#40;&#120;&#95;&#105;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;&#94;&#50;&#125;&#125;&#123;&#78;&#125;&#125;&#32;&#61;&#32;&#92;&#115;&#113;&#114;&#116;&#123;&#92;&#115;&#105;&#103;&#109;&#97;&#94;&#50;&#125;&#61;&#92;&#115;&#105;&#103;&#109;&#97;\" title=\"Rendered by QuickLaTeX.com\" height=\"54\" width=\"183\" style=\"vertical-align: -10px;\" \/>\u00a0 \u00a0\u00a0\u2190\u00a0<strong><em>standard deviation<\/em><\/strong><\/p>\n<p>&nbsp;<\/p>\n<p>Despite its scary looks, this is actually just the formula for variance <em>under a square root<\/em>. That is, <strong>we take the square root of the variance to get the standard deviation<\/strong>. That&#8217;s it. Nothing more. Just a regular square root, and we&#8217;re there. Cue in a sigh of relief!<a class=\"footnote\" title=\"Note, however, that just like there is an &quot;alternative&quot;, to-be-explained-later, formula for variance, there is an &quot;alternative&quot; formula for standard deviation, following the same principle regarding dividing the sum of squares by N-1 instead of by N:\n\n\u00a0\n\n\u00a0 \u00a0\u2190\u00a0standard deviation\u00a0\n\n\u00a0\n\nAs well, SPSS will use this (N-1) version of the formula when working with variables in a dataset.\" id=\"return-footnote-1647-2\" href=\"#footnote-1647-2\" aria-label=\"Footnote 2\"><sup class=\"footnote\">[2]<\/sup><\/a><\/p>\n<p>&nbsp;<\/p>\n<p>Now that we know how to get back to standard units, let&#8217;s do that for the two examples we used. We had a variance of\u00a0<em>\u03c3<sup>2<\/sup><\/em> = 15.21 for <em>hours worked per week<\/em>\u00a0in the previous section and a variance of\u00a0<em>\u03c3<sup>2<\/sup><\/em> = 1.6 for <em>numbers of siblings <\/em>in the example above. Square-rooting gives us the following:<\/p>\n<p>&nbsp;<\/p>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 19px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-69666412543e62242728ad9ecd766ccc_l3.png\" height=\"19\" width=\"157\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#92;&#115;&#113;&#114;&#116;&#123;&#92;&#115;&#105;&#103;&#109;&#97;&#94;&#50;&#125;&#61;&#92;&#115;&#113;&#114;&#116;&#123;&#49;&#53;&#46;&#50;&#49;&#125;&#61;&#51;&#46;&#57;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>and<\/p>\n<p>&nbsp;<\/p>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 19px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-5e3e7c4c467a860b929e92004bf238ad_l3.png\" height=\"19\" width=\"148\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#92;&#115;&#113;&#114;&#116;&#123;&#92;&#115;&#105;&#103;&#109;&#97;&#94;&#50;&#125;&#61;&#92;&#115;&#113;&#114;&#116;&#123;&#49;&#46;&#54;&#125;&#61;&#49;&#46;&#50;&#53;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>Now <em>these<\/em> we <em>can<\/em> interpret: on average, your hours worked per week deviated from the mean of 8.7 hours per week by 3.9\u00a0<em>hours<\/em>, and your friends\u00a0<span style=\"text-align: initial;text-indent: 2em;font-size: 14pt\">deviated from the average number of siblings,1.9, by 1.25\u00a0<\/span><em style=\"text-align: initial;text-indent: 2em;font-size: 14pt\">siblings<\/em><span style=\"text-align: initial;text-indent: 2em;font-size: 14pt\">.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>To repeat, <strong>the standard deviation is the square root of the variance. The standard deviation is a measure of dispersion which gives us the average deviation of the cases from the mean.<\/strong> (Technically, an average of the squared distances from the mean in standard units.)<\/p>\n<p>&nbsp;<\/p>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\"><em>Do It! 4.2 Longevity of The First Fifteen Canadian Prime Ministers<\/em><\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>&nbsp;<\/p>\n<p>Calculate the variance and standard deviation of the longevity of the first fifteen Prime Ministers of Canada. In chronological order (starting with Macdonald and ending with Pierre Trudeau), their ages at the time of death were: 76, 70, 72, 49, 93, 94, 77, 82, 86, 75, 76, 91, 83, 75, and 80. Interpret your results (i.e., explain what you have found beyond &#8220;the standard deviation is &#8230;&#8221;).<\/p>\n<p>&nbsp;<\/p>\n<p>You can use a table like Table 4.4 to organize your calculations. (Hint: Start with calculating the mean age at death, <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-0d00c2da2b2541a97ae0ac3c10e1504e_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;\" title=\"Rendered by QuickLaTeX.com\" height=\"11\" width=\"11\" style=\"vertical-align: 0px;\" \/>, and round it up to a whole number to make your job easier.) Here\u00a0<img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-c8700e0258243116de0d4f288e2e3b44_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#120;&#95;&#105;\" title=\"Rendered by QuickLaTeX.com\" height=\"11\" width=\"15\" style=\"vertical-align: -3px;\" \/> is age at death for each PM and <em>N<\/em>=15.<\/p>\n<p>&nbsp;<\/p>\n<p>You can check your answers in this footnote.<span style=\"text-indent: 33.6px;font-size: 0.9em\"><a class=\"footnote\" title=\"\u00a0The mean is 79 years; the sum of squares 1,717; the variance 114.5; the standard deviation 10.7 years. However, if you calculated the variance and standard deviation with N-1 in the denominators, you will get a variance of 123 and a standard deviation of 11.1 years.\u00a0 The difference is as large as it is due to the small N. Had we been working with a real dataset of hundreds or thousands of cases, the difference between the just-N and N-1 versions of the formulas would have been less pronounced.\" id=\"return-footnote-1647-3\" href=\"#footnote-1647-3\" aria-label=\"Footnote 3\"><sup class=\"footnote\">[3]<\/sup><\/a><\/span><\/p>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>Of course, one wouldn&#8217;t normally calculate variances and standard deviations by hand: we only do it so that you can understand what the measures are and what they really provide us with, by obtaining them ourselves. Usually, however, we simply use SPSS.<\/p>\n<p>&nbsp;<\/p>\n<div class=\"textbox textbox--key-takeaways\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\"><em>SPSS Tip 4.2 Obtaining Variance and Standard Deviation<\/em><\/p>\n<\/header>\n<div class=\"textbox__content\">\n<ul>\n<li>From the <em>Main Menu<\/em>, select <em>Analyze,<\/em>\u00a0then<em> Descriptive Statistics, <\/em>and then<em> Frequencies<\/em>;<\/li>\n<li>Select your variable of choice from the list on the left and use the arrow to move it to the right side of the window;<\/li>\n<li>Click on the <em>Statistics<\/em> button on the right;<\/li>\n<li>In this new window, check <em>Variance<\/em> and <em>Standard deviation<\/em> in the <i>Dispersion\u00a0<\/i>section on the left at the bottom;<\/li>\n<li>Click <em>Continue<\/em>, then <em>OK<\/em>.<\/li>\n<li>The <em>Output<\/em> window will provide a table with the requested measures.<\/li>\n<li>Make sure you know how to interpret your results! (Try to use as little statistics jargon as possible.)<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<hr class=\"before-footnotes clear\" \/><div class=\"footnotes\"><ol><li id=\"footnote-1647-1\">If you'd like a preview,\u00a0<strong>the alternative, to-be-explained-later, formula for variance is:<\/strong>\r\n\r\n&nbsp;\r\n\r\n<img src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-99067ca8282c39f6931f2645c47a0ca7_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#115;&#117;&#109;&#92;&#108;&#105;&#109;&#105;&#116;&#115;&#95;&#123;&#105;&#61;&#49;&#125;&#94;&#123;&#78;&#125;&#123;&#40;&#120;&#95;&#105;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;&#94;&#50;&#125;&#125;&#123;&#78;&#45;&#49;&#125;&#32;&#61;&#32;&#115;&#94;&#50;&#32;&#61;\" title=\"Rendered by QuickLaTeX.com\" height=\"44\" width=\"130\" style=\"vertical-align: -7px;\" \/>\u00a0 \u00a0\u00a0\u2190\u00a0<strong><em>variance<\/em><\/strong>\r\n\r\n&nbsp;\r\n\r\nAs you can see, the modification is quite small -- <strong>instead of dividing the sum of squares by the total number <em>N<\/em>, we actually divide it by the total <em>minus one<\/em>, <em>N<\/em>-1<\/strong>. If it makes you feel better, dividing just by <em>N<\/em> or by <em>N-1 <\/em>produces generally similar results, in terms of magnitude of the variance.\u00a0<span style=\"text-indent: 18.6667px;font-size: 14pt\">We also denote this version with a regular small-case <img src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-01fed44bee4d1799d82fc07853d7aeea_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#115;&#94;&#50;\" title=\"Rendered by QuickLaTeX.com\" height=\"15\" width=\"15\" style=\"vertical-align: 0px;\" \/>.<\/span><span style=\"text-indent: 1em;font-size: 14pt\"> <a href=\"#return-footnote-1647-1\" class=\"return-footnote\" aria-label=\"Return to footnote 1\">&crarr;<\/a><\/li><li id=\"footnote-1647-2\">Note, however, that just like there is an \"alternative\", to-be-explained-later, formula for variance, there is an \"alternative\" formula for standard deviation, following the same principle regarding dividing the sum of squares by <em>N-1<\/em> instead of by <em>N<\/em>:\r\n\r\n&nbsp;\r\n\r\n<img src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-4f78298ead1cedf5a6a527f4d679043d_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#115;&#113;&#114;&#116;&#123;&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#115;&#117;&#109;&#92;&#108;&#105;&#109;&#105;&#116;&#115;&#95;&#123;&#105;&#61;&#49;&#125;&#94;&#123;&#78;&#125;&#123;&#40;&#120;&#95;&#105;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;&#94;&#50;&#125;&#125;&#123;&#78;&#45;&#49;&#125;&#125;&#32;&#61;&#32;&#92;&#115;&#113;&#114;&#116;&#123;&#115;&#94;&#50;&#125;&#61;&#115;\" title=\"Rendered by QuickLaTeX.com\" height=\"54\" width=\"177\" style=\"vertical-align: -10px;\" \/>\u00a0 \u00a0\u2190\u00a0<strong><em>standard deviation\u00a0<\/em><\/strong>\r\n\r\n&nbsp;\r\n\r\nAs well, SPSS will use this (<em>N<\/em>-1) version of the formula when working with variables in a dataset. <a href=\"#return-footnote-1647-2\" class=\"return-footnote\" aria-label=\"Return to footnote 2\">&crarr;<\/a><\/li><li id=\"footnote-1647-3\">\u00a0<\/span><span style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">The mean is 79 years; the sum of squares 1,717; the variance 114.5; the standard deviation 10.7 years. However, if you calculated the variance and standard deviation with <\/span><em style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">N<\/em><span style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">-1 in the denominators, you will get a variance of 123 and a standard deviation of 11.1 years.\u00a0 The difference is as large as it is due to the small <\/span><em style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">N. H<\/em><span style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">ad we been working with a real dataset of hundreds or thousands of cases, the difference between the just-<\/span><em style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">N<\/em><span style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\"> and <\/span><em style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">N<\/em><span style=\"text-align: initial;text-indent: 2em;font-size: 0.9em\">-1 versions of the formulas would have been less pronounced. <a href=\"#return-footnote-1647-3\" class=\"return-footnote\" aria-label=\"Return to footnote 3\">&crarr;<\/a><\/li><\/ol><\/div>","protected":false},"author":533,"menu_order":4,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-1647","chapter","type-chapter","status-publish","hentry"],"part":26,"_links":{"self":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/1647","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/users\/533"}],"version-history":[{"count":16,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/1647\/revisions"}],"predecessor-version":[{"id":1726,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/1647\/revisions\/1726"}],"part":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/parts\/26"}],"metadata":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/1647\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/media?parent=1647"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapter-type?post=1647"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/contributor?post=1647"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/license?post=1647"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}