{"id":65,"date":"2021-07-30T01:40:28","date_gmt":"2021-07-30T05:40:28","guid":{"rendered":"https:\/\/pressbooks.bccampus.ca\/statspsych\/?post_type=chapter&#038;p=65"},"modified":"2022-08-02T15:06:57","modified_gmt":"2022-08-02T19:06:57","slug":"chapter-2","status":"publish","type":"chapter","link":"https:\/\/pressbooks.bccampus.ca\/statspsych\/chapter\/chapter-2\/","title":{"raw":"2. Central Tendency and Variability","rendered":"2. Central Tendency and Variability"},"content":{"raw":"<h1><a id=\"2a\"><\/a>2a. Central Tendency<\/h1>\r\n<h6 style=\"text-align: right\"><a href=\"#video2a\">video lesson<\/a><\/h6>\r\n<span style=\"text-align: initial;font-size: 1em\">In this chapter we will discuss the three options for measures of central tendency. These measures are all about describing, in one number, an entire dataset.<\/span>\r\n\r\n<span style=\"text-align: initial;font-size: 1em\">A measure of central tendency is a statistical measure that defines the centre of a distribution with a single score. The purpose of a central tendency statistic is to find a single number that is most typical of the entire group. It is a number that should represent the entire group as accurately as possible.<\/span>\r\n<div class=\"textbook__content\">\r\n\r\nDepending on the shape of a distribution, one of these measures may be more accurate than the others. We will see that for symmetrical, unimodal datasets, the <strong>[pb_glossary id=\"68\"]mean[\/pb_glossary]<\/strong> will be the best choice. For asymmetrical (skewed), unimodal datasets, the <strong>[pb_glossary id=\"70\"]median[\/pb_glossary]<\/strong> is likely to be more accurate. For bimodal distributions, the only measure that can capture central tendency accurately is the <strong>[pb_glossary id=\"71\"]mode[\/pb_glossary]<\/strong>.\r\n\r\nIt is very important to note that two out of three measures of central tendency only apply to numeric data. In order to arrive at a mean or a median, the data need to be measured in number form. It makes sense, for example, to measure the average student height in a class. It does not make sense to determine the average major from a class of students.\r\n\r\nBefore we can learn to calculate a mean, we need to familiarize ourselves with some statistical notation. In statistics, when we want to denote \u201ctaking the sum\u201d of a series of numbers, we use the term <strong>[pb_glossary id=\"75\"]\u03a3[\/pb_glossary]<\/strong>. This is the Greek capital letter S, known as \u201csigma\u201d.\r\n\r\nThe tricky thing about <strong>\u03a3<\/strong> is learning how to use it within mathematical order of operations. You may remember the mnenonic BEDMAS from school. This indicates that you should first do any operations that are set off in brackets or parentheses. Next you should do exponents, then division or multiplication, and finally addition\/subtraction. But summation fits in just after division\/multiplication. So BEDMAS becomes BEDMSAS.\r\n\r\n<\/div>\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\">Order of Operations<\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<div>1. Brackets<\/div>\r\n<div>2. Exponents<\/div>\r\n<div>3. Division\/Multiplication<\/div>\r\n<div>4. <strong>Summation<\/strong><\/div>\r\n<div>5. Addition\/Subtraction<\/div>\r\n<div><\/div>\r\nBEDM<strong>S<\/strong>AS\r\n\r\n<\/div>\r\n<\/div>\r\n<div class=\"textbook__content\">Let us try some examples. We will assume my variable X represents a set of scores in a dataset:<\/div>\r\n<table class=\"aligncenter\" style=\"border-collapse: collapse;width: 50px;height: 72px\" border=\"0\" cellspacing=\"0\" cellpadding=\"0\">\r\n<tbody>\r\n<tr style=\"height: 12px\">\r\n<td style=\"width: 394.535px;height: 15px\"><span style=\"text-decoration: underline\">X<\/span><\/td>\r\n<\/tr>\r\n<tr style=\"height: 12px\">\r\n<td style=\"width: 394.535px;height: 15px\">5<\/td>\r\n<\/tr>\r\n<tr style=\"height: 12px\">\r\n<td style=\"width: 394.535px;height: 15px\">8<\/td>\r\n<\/tr>\r\n<tr style=\"height: 12px\">\r\n<td style=\"width: 394.535px;height: 15px\">9<\/td>\r\n<\/tr>\r\n<tr style=\"height: 12px\">\r\n<td style=\"width: 394.535px;height: 15px\">6<\/td>\r\n<\/tr>\r\n<tr style=\"height: 12px\">\r\n<td style=\"width: 394.535px;height: 15px\">7<\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<div class=\"textbook__content\">If you see the formula:<\/div>\r\n[latex] \\[(\\sum X)^{2}\\] [\/latex]\r\n<div class=\"textbook__content\">\r\n\r\nWhat is that telling you to do? Look at order of operations. First do anything in brackets. So first we have to do the <strong>\u03a3<\/strong>X part. This tells us to add up all the scores: 5+8+9+6+7 = 35. Next we need to take that result and square it (exponents): 35<sup>2<\/sup> = <span style=\"background-color: #ccffcc\">1225<\/span>.\r\n\r\nLet us try:\r\n\r\n<\/div>\r\n[latex] \\[\\sum X^{2}\\] [\/latex]\r\n<div class=\"textbook__content\">\r\n\r\nNow there are no brackets, so exponents come first. This formula says to square each score in the dataset, then add together all the results: 5<sup>2<\/sup>+8<sup>2<\/sup>+9<sup>2<\/sup>+6<sup>2<\/sup>+7<sup>2<\/sup> = 25+64+81+36+49 = <span style=\"background-color: #ccffcc\">255<\/span>. So what seems like a minor difference in the formula really changes the result!\r\n\r\nLet us try:\r\n\r\n[latex] \\[\\sum (X-1)^{2}\\] [\/latex]\r\n\r\nFirst brackets, so subtract off 1 from each score. Next exponents, so square each result. And finally summation, so add that all together. 42+72+82+52+62 = 16+49+64+25+36 = <span style=\"background-color: #ccffcc\">190<\/span>.\r\n\r\nAnd finally, we will try:\r\n\r\n<\/div>\r\n[latex] \\[\\sum X-1^{2}\\] [\/latex]\r\n<div class=\"textbook__content\">\r\n\r\nNow exponents is first, so square the number 1. Next is summation, so add all scores together. Finally, complete the subtraction. 1<sup>2<\/sup> = 1. 5+8+9+6+7 = 35. 35 \u2013 1 = <span style=\"background-color: #ccffcc\">34<\/span>.\r\n\r\nSo, now you see that order of operations is vital to decode summation notation. Each of these variations have very different solutions. Whenever you see a new formula, try translating it into words after reviewing order of operations.\r\n\r\nNow we are ready to take a look at calculating a <strong>mean<\/strong>. The <strong>mean<\/strong> is the most common measure of central tendency, because it has some powerful applications in statistics. The <strong>mean<\/strong> is the same thing as an average, something you are very familiar with. You also probably know that to find an average, you add up all the numbers, then divide by how many numbers there were.\r\n\r\n<\/div>\r\n[latex] \\[M=\\frac{\\sum X}{N}\\] [\/latex]\r\n<div class=\"textbook__content\">\r\n\r\nIn statistical notation, the formula for the mean is shown above. A <strong>mean<\/strong> is symbolized as <strong>[pb_glossary id=\"96\"]M[\/pb_glossary]<\/strong>. The number of scores in a dataset is symbolized as <strong>[pb_glossary id=\"97\"]N[\/pb_glossary]<\/strong>.\r\n\r\nConceptually we can think of the <strong>mean<\/strong> as the balancing point for the distribution. In a <strong>histogram<\/strong>, if we were to mentally convert the X-axis into a scale, the <strong>mean<\/strong> would be the fulcrum, or the point of the scale at which the two sides of the scale balance each other out. Each score is like a weight. Their position along the scale determines where the mean will be. Here are some examples.\r\n\r\n<\/div>\r\n<img class=\"aligncenter wp-image-112 \" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.3-1024x676.jpg\" alt=\"\" width=\"486\" height=\"321\" \/>\r\n<div class=\"textbook__content\">In the top <strong>histogram<\/strong>, the <strong>mean<\/strong> is (1+2+6+6+10)\/5 = 25\/5 = 5. In the bottom <strong>histogram<\/strong>, the <strong>mean<\/strong> is (3+6+6+9+11)\/5 = 35\/5 = 7.In the previous examples, the <strong>mean<\/strong> was pretty close to the middle of the scale \u2013 not that surprising, because the data were spread out fairly evenly. However, things can change if the data pile up toward one end of the distribution (i.e. skewed), or if any of the data points are quite extreme (outliers). In the example here, think about where the balance point would be.<\/div>\r\n<img class=\"aligncenter wp-image-114 \" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.2-1024x511.jpg\" alt=\"\" width=\"514\" height=\"257\" \/>\r\n<div class=\"textbook__content\">\r\n\r\nIf we put it close to the middle of the scale, for example at 5, the scale would tip to the right. So we have to move the balance point rightward. Let\u2019s see if that intuition is correct: <strong>M<\/strong> = (1+2+3+4+5+6+6+7+7+7+8+8+8+8+9+9)\/16 = 6.1\r\n\r\nThe <strong>median<\/strong>, unlike the <strong>mean<\/strong>, is a counting-based measure. The values of the the <strong>scores<\/strong> are not important, just how many of them there are. To get the <strong>median<\/strong>, you find the midpoint of the scores after placing them in order. The <strong>median<\/strong> is the point at which half of the scores fall above and half of the scores fall below.\u00a0 Finding the <strong>median<\/strong> for an odd number of scores is easy. Just find the single middle score, and that is the <strong>median<\/strong>.\r\n\r\n<\/div>\r\n<img class=\"aligncenter wp-image-120 \" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.5-1024x175.png\" alt=\"\" width=\"519\" height=\"89\" \/>\r\n<div class=\"textbook__content\">\r\n\r\nFor an even number of scores, it is a little more complicated. You have to find the middle two scores and average those together.\r\n\r\nHere are some graphical examples of <strong>medians<\/strong>.\r\n\r\n<\/div>\r\n<img class=\"aligncenter wp-image-113 \" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.4-1024x516.png\" alt=\"\" width=\"550\" height=\"277\" \/>\r\n<div class=\"textbook__content\">\r\n\r\nIn the top histogram, 2 scores are below the <strong>median<\/strong> of 8, and two scores are above. Notice the <strong>median<\/strong> is not a balancing point (which would be a little to the left to account for the spread of the lower scores). In the bottom histogram, there are 2 scores below and two scores above the middle set of scores (2 and 3, averaged to 2.5). The <strong>median<\/strong> is not a balancing point (which would need to be further to the right to account for the score weighting down the right end of the scale).\r\n\r\nThe <strong>mode<\/strong> is our final option for statistical measures of central tendency. The <strong>mode<\/strong> is simply the <strong>score<\/strong> that occurs most often in the dataset, the one with the highest frequency. This is the measure that can be used with <strong>nominal<\/strong> data too. It is possible to have more than one <strong>mode<\/strong>, if the dataset is <strong>bimodal<\/strong>, for example. In fact the term <strong>unimodal<\/strong> means \u201cone <strong>mode<\/strong>\u201d and <strong>bimodal<\/strong> means \u201ctwo <strong>modes<\/strong>\u201d. Note that the <strong>mode<\/strong> must correspond to an actual <strong>score<\/strong> in the dataset, so a <strong>grouped frequency table<\/strong> or <strong>histogram<\/strong> will not help you identify it.\r\n\r\nAnother thing to note is that if your dataset happens to have two <strong>modes<\/strong>, that does not necessarily mean it is appropriate to describe the distribution as <strong>bimodal<\/strong>. Remember, true <strong>bimodal<\/strong> distributions are ones that show two distinct peaks in the smoothed line with some space between, indicating there are two collections of scores that are clustered together. Basically, if the distribution does not look like a camel's back, then it is not truly <strong>bimodal<\/strong>.\r\n\r\n<\/div>\r\n\r\n[caption id=\"attachment_124\" align=\"aligncenter\" width=\"300\"]<img class=\"wp-image-124 size-medium\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.6-300x217.jpg\" alt=\"\" width=\"300\" height=\"217\" \/> Camel Farm in Mongolia 02 by Alexandr frolov is licensed under <a href=\"https:\/\/creativecommons.org\/licenses\/by-sa\/4.0\/deed.en\">CC BY-SA 4.0<\/a>[\/caption]\r\n\r\n<div><\/div>\r\n<div>\r\n<h4><a id=\"mean_return\"><\/a><a href=\"#mean\">Concept Practice: mean, median, mode<\/a><\/h4>\r\n<\/div>\r\n<div class=\"textbook__content\">Now we should take a look at the difference between a <strong>mean<\/strong> and a <strong>median<\/strong>.<\/div>\r\n<img class=\"aligncenter wp-image-129 \" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.7-1024x371.jpg\" alt=\"\" width=\"522\" height=\"189\" \/>\r\n<div class=\"textbook__content\">On the <strong>histogram<\/strong> above, you can see that there is a cluster of <strong>scores<\/strong> at the low end of the scale. However, there is one outlier with an extreme <strong>score<\/strong> at the upper end. The <strong>median<\/strong> is concerned with how many scores fall above or below, rather than their values, so it is not affected much by the 12 way out there. It still reflects the center of the distribution accurately. However, the <strong>mean<\/strong> takes into account the value of each score. Without the outlier, the <strong>mean<\/strong> would have been (2+2+2+3+3)\/5 = 2.4. With the outlier, the <strong>mean<\/strong> is (2+2+2+3+3+12)\/6 = 4. With an outlier, or a heavily-skewed distribution, the <strong>mean<\/strong> can be pulled in the direction of the outlier or skew, and is thus not the most accurate measure of central tendency. Under these circumstances, the <strong>median<\/strong> will better describe the dataset.<\/div>\r\n<div><\/div>\r\n<div>\r\n<h4><a id=\"distCT_return\"><\/a><a href=\"#distCT\">Concept Practice: distribution shape and central tendency<\/a><\/h4>\r\n<h4><a id=\"meanVS_return\"><\/a><a href=\"#meanVS\">Concept Practice: mean vs median vs mode<\/a><\/h4>\r\n<\/div>\r\n<div><\/div>\r\n<div><\/div>\r\n<h1><a id=\"2b\"><\/a>2b. Variability<\/h1>\r\n<h6 style=\"text-align: right\"><a href=\"#video2b\">video lesson<\/a><\/h6>\r\n<div class=\"textbox__content\">\r\n\r\n&nbsp;\r\n\r\n<img class=\"aligncenter wp-image-1156\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.1-butterflies-beetles-1024x444.png\" alt=\"butterflies-beetles\" width=\"836\" height=\"363\" \/>\r\n\r\nIn the image at left, it is easy to detect a difference in pattern among the rows -- even though there are variations, there is also a clear theme. The butterflies on the top are dark with large light patches, and those on the bottom are light with dark trim. In the image at right, there is no clear theme in colouring pattern top to bottom, and there is a lot of variability in beetle appearance within each grouping. This is important, because if we want to be able to assert differences in what is typical or representative between one group and another, we need enough uniformity to discern those differences. If there is too much random variability, we will be unable to say much about the data or use them for decision making.\r\n\r\nOur objectives in this part of the chapter will be that to explain the concept of variability and why it is important, and to calculate the <strong>descriptive<\/strong> statistics <strong>[pb_glossary id=\"156\"]variance[\/pb_glossary]<\/strong> and <strong>[pb_glossary id=\"157\"]standard deviation[\/pb_glossary]<\/strong>.\r\n\r\nSoon we will be progressing to <strong>inferential<\/strong> statistics, in which we will often wish to figure out if the central tendency of one group of scores is different from another group of scores. If we want to be able to assert differences in what is typical between one group and another, we need enough uniformity within each of those groups to discern those differences. If there is too much random variability, we will be unable to say much about the data or use them for decision making. There will be too little order in the chaos. For that reason, we need to learn how to measure variability in a data set and take that into account in the process of making inferences about the data.\r\n\r\nThere are many ways to measure variability. However, we will focus on the two main measures of variability that are commonly use in both descriptive and inferential statistics of the sort we will cover in this course: <strong>variance<\/strong> and <strong>standard deviation<\/strong>. It is worth noting that <strong>variance<\/strong> and <strong>standard deviation<\/strong> are directly related, but <strong>standard deviation<\/strong> is easier to interpret and is thus more often reported as a <strong>descriptive<\/strong> statistic.\r\n\r\nIn general, measures of variability describe the degree to which <strong>scores<\/strong> in a data set are spread out or clustered together. They also give us a sense of the width of a distribution. Finally, they help us understand how well any individual statistic (for example the <strong>mean<\/strong>) can possibly represent the distribution as a whole.\r\n\r\nWhen it comes to <strong>inferential<\/strong> statistics, smaller variability is better. When comparing two distributions, as we will be doing in inferential statistics, there are two ways to be confident that there is a difference. One is to have dramatically different central tendencies (such as the <strong>means<\/strong>). The other way is to have small variability, such that an individual statistic represents that distribution well.\r\n\r\nThe first measure of variability we will learn to calculate is <strong>variance<\/strong>. <strong>Variance<\/strong> summarizes the extent to which scores are spread out from the mean. To do that, we calculate the deviation of each score from the mean.\r\n\r\n<img class=\"aligncenter wp-image-1158\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.8-1024x483.jpg\" alt=\"Fig 2.8\" width=\"407\" height=\"192\" \/>\r\n\r\nHere is a graphical illustration of deviations. As the first\u00a0step in calculating <strong>variance<\/strong>, for each score we find its distance from the mean. So here if the distribution's mean is 6, a score of 1 is a deviation of 5 away from the mean, and a score of 7 is a deviation of 1 away from the mean.\r\n\r\nHere is the formula to calculate variance.\r\n\r\n<\/div>\r\n[latex] \\[SD^{2}=\\frac{\\sum (X-M)^{2}}{N}\\] [\/latex]\r\n<div class=\"textbook__content\">I have translated the steps\u00a0of the formula into words for you here. Keep in mind that we have to do each part if the formula in the order of operations.\u00a0 First brackets, then exponents, then summation, and finally division.<\/div>\r\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\">Steps to Calculate Variance<\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n<ol>\r\n \t<li>Take the distance, or \u201cdeviation\u201d, of each score from the mean<\/li>\r\n \t<li>Square each distance to get rid of the sign<\/li>\r\n \t<li>Add up all the squared deviations<\/li>\r\n \t<li>Divide by the number of scores<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div class=\"textbox__content\">\r\n\r\nThe first part of the formula we need to calculate, then, is to take the distance, or \u201cdeviation\u201d, of each score from the mean. This is written as X-M.\r\n\r\nNext, we square each distance to get rid of the sign (because some deviations will be negative numbers, which we do not want. This is the exponent outside the brackets.\r\n\r\n<\/div>\r\n[latex] \\[SD^{2}=\\frac{\\sum (X-M)^{2}}{N}\\] [\/latex]\r\n<div class=\"textbox__content\">\r\n\r\nNext, we do summation, so we add up all the squared deviations. In fact, the result of this step has its own name: <strong>[pb_glossary id=\"162\"]Sum of Squares[\/pb_glossary],<\/strong> which we will sometimes abbreviate as SS.\r\n\r\nAnd finally, we divide by the number of scores, to make sure this is an average measure of distances in the dataset. Now, we're doing the division last, because of the notation, because there is a top and a bottom. So that makes it clear that the division is the last thing that we do in the order of operations.\r\n\r\nOne thing to note is that the purpose of squaring each deviation before taking the sum is the following. All the deviations for scores smaller than the mean will come out negative. All the deviations for scores larger than the mean will come out positive. So if we added up the deviations without squaring them, the negatives would cancel out the positives and the variance would always end up zero. Squaring each deviation is mainly a way to convert them all to positive numbers.\r\n\r\n<strong>Standard deviation<\/strong> is the other measure of variability we will use in this course. It expresses the variability in terms of a typical deviation in the data set. This will be a single number that gives us the distance of typical scores in the dataset from the mean.\r\n\r\n<strong>Variance<\/strong> is essentially the average squared deviation; now we want to find the average deviation to get it back into the original units of the data. To find the <strong>standard deviation<\/strong>, we just need to take the square root of the <strong>variance<\/strong>.\r\n\r\n<\/div>\r\n[latex] \\[SD=\\sqrt{SD^{2}}\\] [\/latex]\r\n<div class=\"textbox__content\">\r\n\r\nMost of the <strong>inferential<\/strong> statistics we will use in this course will be based on our calculations of the <strong>mean<\/strong> and either the <strong>variance<\/strong> or <strong>standard deviation<\/strong>.\r\n\r\n<\/div>\r\n<h4><a id=\"variability_return\"><\/a><a href=\"#variability\">Concept Practice: variability<\/a><\/h4>\r\n<h1>Chapter Summary<\/h1>\r\nIn this chapter we examined the purpose and common methods for determining statistics representing the central tendency and the variability of a dataset. We saw the particular characteristics of each statistic that makes it most appropriate or useful for specific situations. The combination of central tendency and the variability statistics not only provides a very succinct summary of the dataset, but it also will become the basis for making inferences from data.\r\n\r\nKey Terms:\r\n<table class=\"no-lines\" style=\"border-collapse: collapse;width: 100%;height: 60px\" border=\"0\">\r\n<tbody>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 33.3333%;height: 15px\"><strong>mean<\/strong><\/td>\r\n<td style=\"width: 33.3333%;height: 15px\"><strong>\u03a3<\/strong><\/td>\r\n<td style=\"width: 33.3333%;height: 15px\"><strong>variance<\/strong><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 33.3333%;height: 15px\"><strong>median<\/strong><\/td>\r\n<td style=\"width: 33.3333%;height: 15px\"><strong>M<\/strong><\/td>\r\n<td style=\"width: 33.3333%;height: 15px\"><strong>standard deviation<\/strong><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 33.3333%;height: 15px\"><strong>mode<\/strong><\/td>\r\n<td style=\"width: 33.3333%;height: 15px\"><strong>N<\/strong><\/td>\r\n<td style=\"width: 33.3333%;height: 15px\"><strong>Sum of Squares<\/strong><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<h1>Concept Practice<\/h1>\r\n<a id=\"mean\"><\/a>[h5p id=\"96\"]\r\n<h6 style=\"text-align: right\">Return to <a href=\"#mean_return\">text<\/a><\/h6>\r\n<a id=\"distCT\"><\/a>[h5p id=\"85\"]\r\n<h6 style=\"text-align: right\">Return to <a href=\"#distCT_return\">text<\/a><\/h6>\r\n<a id=\"meanVS\"><\/a>[h5p id=\"86\"]\r\n<h6 style=\"text-align: right\">Return to <a href=\"#meanVS_return\">text<\/a><\/h6>\r\n<a id=\"variability\"><\/a>[h5p id=\"87\"]\r\n<h6 style=\"text-align: right\">Return to <a href=\"#variability_return\">text<\/a><\/h6>\r\nReturn to <a href=\"#2a\">2a. Central Tendency<\/a>\r\n<h6>Try interactive <a href=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/chapter\/worksheet-2a\/\" target=\"_blank\" rel=\"noopener\">Worksheet 2a<\/a> or download <a href=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2022\/01\/2a.-Class-worksheet.pdf\" target=\"_blank\" rel=\"noopener\">Worksheet 2a<\/a><\/h6>\r\nReturn to <a href=\"#2b\">2b. Variability<\/a>\r\n<h6>Try interactive <a href=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/chapter\/worksheet-2b\/\" target=\"_blank\" rel=\"noopener\">Worksheet 2b<\/a>\u00a0or download <a href=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2022\/01\/2b.-Class-worksheet.pdf\" target=\"_blank\" rel=\"noopener\">Worksheet 2b<\/a><\/h6>\r\n<a id=\"video2a\"><\/a>\r\nvideo 2a1\r\n\r\n[video width=\"1440\" height=\"1080\" mp4=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Chapter-2a1.mp4\"][\/video]\r\n\r\nvideo 2a2\r\n\r\n[video width=\"1440\" height=\"1080\" mp4=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Chapter-2a2.mp4\"][\/video]\r\n\r\n<a id=\"video2b\"><\/a>video 2b\r\n\r\n[video width=\"1440\" height=\"1080\" mp4=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Chapter-2b.mp4\"][\/video]","rendered":"<h1><a id=\"2a\"><\/a>2a. Central Tendency<\/h1>\n<h6 style=\"text-align: right\"><a href=\"#video2a\">video lesson<\/a><\/h6>\n<p><span style=\"text-align: initial;font-size: 1em\">In this chapter we will discuss the three options for measures of central tendency. These measures are all about describing, in one number, an entire dataset.<\/span><\/p>\n<p><span style=\"text-align: initial;font-size: 1em\">A measure of central tendency is a statistical measure that defines the centre of a distribution with a single score. The purpose of a central tendency statistic is to find a single number that is most typical of the entire group. It is a number that should represent the entire group as accurately as possible.<\/span><\/p>\n<div class=\"textbook__content\">\n<p>Depending on the shape of a distribution, one of these measures may be more accurate than the others. We will see that for symmetrical, unimodal datasets, the <strong><a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_65_68\">mean<\/a><\/strong> will be the best choice. For asymmetrical (skewed), unimodal datasets, the <strong><a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_65_70\">median<\/a><\/strong> is likely to be more accurate. For bimodal distributions, the only measure that can capture central tendency accurately is the <strong><a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_65_71\">mode<\/a><\/strong>.<\/p>\n<p>It is very important to note that two out of three measures of central tendency only apply to numeric data. In order to arrive at a mean or a median, the data need to be measured in number form. It makes sense, for example, to measure the average student height in a class. It does not make sense to determine the average major from a class of students.<\/p>\n<p>Before we can learn to calculate a mean, we need to familiarize ourselves with some statistical notation. In statistics, when we want to denote \u201ctaking the sum\u201d of a series of numbers, we use the term <strong><a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_65_75\">\u03a3<\/a><\/strong>. This is the Greek capital letter S, known as \u201csigma\u201d.<\/p>\n<p>The tricky thing about <strong>\u03a3<\/strong> is learning how to use it within mathematical order of operations. You may remember the mnenonic BEDMAS from school. This indicates that you should first do any operations that are set off in brackets or parentheses. Next you should do exponents, then division or multiplication, and finally addition\/subtraction. But summation fits in just after division\/multiplication. So BEDMAS becomes BEDMSAS.<\/p>\n<\/div>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Order of Operations<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<div>1. Brackets<\/div>\n<div>2. Exponents<\/div>\n<div>3. Division\/Multiplication<\/div>\n<div>4. <strong>Summation<\/strong><\/div>\n<div>5. Addition\/Subtraction<\/div>\n<div><\/div>\n<p>BEDM<strong>S<\/strong>AS<\/p>\n<\/div>\n<\/div>\n<div class=\"textbook__content\">Let us try some examples. We will assume my variable X represents a set of scores in a dataset:<\/div>\n<table class=\"aligncenter\" style=\"border-collapse: collapse;width: 50px;height: 72px; border-spacing: 0px;\" cellpadding=\"0\">\n<tbody>\n<tr style=\"height: 12px\">\n<td style=\"width: 394.535px;height: 15px\"><span style=\"text-decoration: underline\">X<\/span><\/td>\n<\/tr>\n<tr style=\"height: 12px\">\n<td style=\"width: 394.535px;height: 15px\">5<\/td>\n<\/tr>\n<tr style=\"height: 12px\">\n<td style=\"width: 394.535px;height: 15px\">8<\/td>\n<\/tr>\n<tr style=\"height: 12px\">\n<td style=\"width: 394.535px;height: 15px\">9<\/td>\n<\/tr>\n<tr style=\"height: 12px\">\n<td style=\"width: 394.535px;height: 15px\">6<\/td>\n<\/tr>\n<tr style=\"height: 12px\">\n<td style=\"width: 394.535px;height: 15px\">7<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<div class=\"textbook__content\">If you see the formula:<\/div>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 25px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/ql-cache\/quicklatex.com-a55c06814cae3ad9de01319b43b6f1b5_l3.png\" height=\"25\" width=\"64\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#32;&#92;&#091;&#40;&#92;&#115;&#117;&#109;&#32;&#88;&#41;&#94;&#123;&#50;&#125;&#92;&#093;&#32;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<div class=\"textbook__content\">\n<p>What is that telling you to do? Look at order of operations. First do anything in brackets. So first we have to do the <strong>\u03a3<\/strong>X part. This tells us to add up all the scores: 5+8+9+6+7 = 35. Next we need to take that result and square it (exponents): 35<sup>2<\/sup> = <span style=\"background-color: #ccffcc\">1225<\/span>.<\/p>\n<p>Let us try:<\/p>\n<\/div>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 25px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/ql-cache\/quicklatex.com-e1d821e5c6bc3fd3fb7ca51c6b34c0d2_l3.png\" height=\"25\" width=\"51\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#32;&#92;&#091;&#92;&#115;&#117;&#109;&#32;&#88;&#94;&#123;&#50;&#125;&#92;&#093;&#32;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<div class=\"textbook__content\">\n<p>Now there are no brackets, so exponents come first. This formula says to square each score in the dataset, then add together all the results: 5<sup>2<\/sup>+8<sup>2<\/sup>+9<sup>2<\/sup>+6<sup>2<\/sup>+7<sup>2<\/sup> = 25+64+81+36+49 = <span style=\"background-color: #ccffcc\">255<\/span>. So what seems like a minor difference in the formula really changes the result!<\/p>\n<p>Let us try:<\/p>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 25px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/ql-cache\/quicklatex.com-f08c9467d2573984195010f41bf5dd36_l3.png\" height=\"25\" width=\"93\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#32;&#92;&#091;&#92;&#115;&#117;&#109;&#32;&#40;&#88;&#45;&#49;&#41;&#94;&#123;&#50;&#125;&#92;&#093;&#32;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<p>First brackets, so subtract off 1 from each score. Next exponents, so square each result. And finally summation, so add that all together. 42+72+82+52+62 = 16+49+64+25+36 = <span style=\"background-color: #ccffcc\">190<\/span>.<\/p>\n<p>And finally, we will try:<\/p>\n<\/div>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 25px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/ql-cache\/quicklatex.com-6f68780ab74e1dea857c98e79dde76f4_l3.png\" height=\"25\" width=\"83\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#32;&#92;&#091;&#92;&#115;&#117;&#109;&#32;&#88;&#45;&#49;&#94;&#123;&#50;&#125;&#92;&#093;&#32;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<div class=\"textbook__content\">\n<p>Now exponents is first, so square the number 1. Next is summation, so add all scores together. Finally, complete the subtraction. 1<sup>2<\/sup> = 1. 5+8+9+6+7 = 35. 35 \u2013 1 = <span style=\"background-color: #ccffcc\">34<\/span>.<\/p>\n<p>So, now you see that order of operations is vital to decode summation notation. Each of these variations have very different solutions. Whenever you see a new formula, try translating it into words after reviewing order of operations.<\/p>\n<p>Now we are ready to take a look at calculating a <strong>mean<\/strong>. The <strong>mean<\/strong> is the most common measure of central tendency, because it has some powerful applications in statistics. The <strong>mean<\/strong> is the same thing as an average, something you are very familiar with. You also probably know that to find an average, you add up all the numbers, then divide by how many numbers there were.<\/p>\n<\/div>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 38px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/ql-cache\/quicklatex.com-55341b44493248a86727a0eed0aad993_l3.png\" height=\"38\" width=\"82\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#32;&#92;&#091;&#77;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#115;&#117;&#109;&#32;&#88;&#125;&#123;&#78;&#125;&#92;&#093;&#32;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<div class=\"textbook__content\">\n<p>In statistical notation, the formula for the mean is shown above. A <strong>mean<\/strong> is symbolized as <strong><a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_65_96\">M<\/a><\/strong>. The number of scores in a dataset is symbolized as <strong><a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_65_97\">N<\/a><\/strong>.<\/p>\n<p>Conceptually we can think of the <strong>mean<\/strong> as the balancing point for the distribution. In a <strong>histogram<\/strong>, if we were to mentally convert the X-axis into a scale, the <strong>mean<\/strong> would be the fulcrum, or the point of the scale at which the two sides of the scale balance each other out. Each score is like a weight. Their position along the scale determines where the mean will be. Here are some examples.<\/p>\n<\/div>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-112\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.3-1024x676.jpg\" alt=\"\" width=\"486\" height=\"321\" srcset=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.3-1024x676.jpg 1024w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.3-300x198.jpg 300w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.3-768x507.jpg 768w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.3-65x43.jpg 65w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.3-225x149.jpg 225w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.3-350x231.jpg 350w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.3.jpg 1134w\" sizes=\"auto, (max-width: 486px) 100vw, 486px\" \/><\/p>\n<div class=\"textbook__content\">In the top <strong>histogram<\/strong>, the <strong>mean<\/strong> is (1+2+6+6+10)\/5 = 25\/5 = 5. In the bottom <strong>histogram<\/strong>, the <strong>mean<\/strong> is (3+6+6+9+11)\/5 = 35\/5 = 7.In the previous examples, the <strong>mean<\/strong> was pretty close to the middle of the scale \u2013 not that surprising, because the data were spread out fairly evenly. However, things can change if the data pile up toward one end of the distribution (i.e. skewed), or if any of the data points are quite extreme (outliers). In the example here, think about where the balance point would be.<\/div>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-114\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.2-1024x511.jpg\" alt=\"\" width=\"514\" height=\"257\" srcset=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.2-1024x511.jpg 1024w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.2-300x150.jpg 300w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.2-768x383.jpg 768w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.2-65x32.jpg 65w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.2-225x112.jpg 225w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.2-350x175.jpg 350w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.2.jpg 1161w\" sizes=\"auto, (max-width: 514px) 100vw, 514px\" \/><\/p>\n<div class=\"textbook__content\">\n<p>If we put it close to the middle of the scale, for example at 5, the scale would tip to the right. So we have to move the balance point rightward. Let\u2019s see if that intuition is correct: <strong>M<\/strong> = (1+2+3+4+5+6+6+7+7+7+8+8+8+8+9+9)\/16 = 6.1<\/p>\n<p>The <strong>median<\/strong>, unlike the <strong>mean<\/strong>, is a counting-based measure. The values of the the <strong>scores<\/strong> are not important, just how many of them there are. To get the <strong>median<\/strong>, you find the midpoint of the scores after placing them in order. The <strong>median<\/strong> is the point at which half of the scores fall above and half of the scores fall below.\u00a0 Finding the <strong>median<\/strong> for an odd number of scores is easy. Just find the single middle score, and that is the <strong>median<\/strong>.<\/p>\n<\/div>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-120\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.5-1024x175.png\" alt=\"\" width=\"519\" height=\"89\" srcset=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.5-1024x175.png 1024w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.5-300x51.png 300w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.5-768x132.png 768w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.5-65x11.png 65w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.5-225x39.png 225w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.5-350x60.png 350w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.5.png 1535w\" sizes=\"auto, (max-width: 519px) 100vw, 519px\" \/><\/p>\n<div class=\"textbook__content\">\n<p>For an even number of scores, it is a little more complicated. You have to find the middle two scores and average those together.<\/p>\n<p>Here are some graphical examples of <strong>medians<\/strong>.<\/p>\n<\/div>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-113\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.4-1024x516.png\" alt=\"\" width=\"550\" height=\"277\" srcset=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.4-1024x516.png 1024w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.4-300x151.png 300w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.4-768x387.png 768w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.4-65x33.png 65w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.4-225x113.png 225w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.4-350x176.png 350w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.4.png 1536w\" sizes=\"auto, (max-width: 550px) 100vw, 550px\" \/><\/p>\n<div class=\"textbook__content\">\n<p>In the top histogram, 2 scores are below the <strong>median<\/strong> of 8, and two scores are above. Notice the <strong>median<\/strong> is not a balancing point (which would be a little to the left to account for the spread of the lower scores). In the bottom histogram, there are 2 scores below and two scores above the middle set of scores (2 and 3, averaged to 2.5). The <strong>median<\/strong> is not a balancing point (which would need to be further to the right to account for the score weighting down the right end of the scale).<\/p>\n<p>The <strong>mode<\/strong> is our final option for statistical measures of central tendency. The <strong>mode<\/strong> is simply the <strong>score<\/strong> that occurs most often in the dataset, the one with the highest frequency. This is the measure that can be used with <strong>nominal<\/strong> data too. It is possible to have more than one <strong>mode<\/strong>, if the dataset is <strong>bimodal<\/strong>, for example. In fact the term <strong>unimodal<\/strong> means \u201cone <strong>mode<\/strong>\u201d and <strong>bimodal<\/strong> means \u201ctwo <strong>modes<\/strong>\u201d. Note that the <strong>mode<\/strong> must correspond to an actual <strong>score<\/strong> in the dataset, so a <strong>grouped frequency table<\/strong> or <strong>histogram<\/strong> will not help you identify it.<\/p>\n<p>Another thing to note is that if your dataset happens to have two <strong>modes<\/strong>, that does not necessarily mean it is appropriate to describe the distribution as <strong>bimodal<\/strong>. Remember, true <strong>bimodal<\/strong> distributions are ones that show two distinct peaks in the smoothed line with some space between, indicating there are two collections of scores that are clustered together. Basically, if the distribution does not look like a camel&#8217;s back, then it is not truly <strong>bimodal<\/strong>.<\/p>\n<\/div>\n<figure id=\"attachment_124\" aria-describedby=\"caption-attachment-124\" style=\"width: 300px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-124 size-medium\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.6-300x217.jpg\" alt=\"\" width=\"300\" height=\"217\" srcset=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.6-300x217.jpg 300w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.6-1024x740.jpg 1024w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.6-768x555.jpg 768w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.6-1536x1111.jpg 1536w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.6-2048x1481.jpg 2048w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.6-65x47.jpg 65w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.6-225x163.jpg 225w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.6-350x253.jpg 350w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><figcaption id=\"caption-attachment-124\" class=\"wp-caption-text\">Camel Farm in Mongolia 02 by Alexandr frolov is licensed under <a href=\"https:\/\/creativecommons.org\/licenses\/by-sa\/4.0\/deed.en\">CC BY-SA 4.0<\/a><\/figcaption><\/figure>\n<div><\/div>\n<div>\n<h4><a id=\"mean_return\"><\/a><a href=\"#mean\">Concept Practice: mean, median, mode<\/a><\/h4>\n<\/div>\n<div class=\"textbook__content\">Now we should take a look at the difference between a <strong>mean<\/strong> and a <strong>median<\/strong>.<\/div>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-129\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.7-1024x371.jpg\" alt=\"\" width=\"522\" height=\"189\" srcset=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.7-1024x371.jpg 1024w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.7-300x109.jpg 300w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.7-768x278.jpg 768w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.7-65x24.jpg 65w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.7-225x81.jpg 225w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.7-350x127.jpg 350w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.7.jpg 1169w\" sizes=\"auto, (max-width: 522px) 100vw, 522px\" \/><\/p>\n<div class=\"textbook__content\">On the <strong>histogram<\/strong> above, you can see that there is a cluster of <strong>scores<\/strong> at the low end of the scale. However, there is one outlier with an extreme <strong>score<\/strong> at the upper end. The <strong>median<\/strong> is concerned with how many scores fall above or below, rather than their values, so it is not affected much by the 12 way out there. It still reflects the center of the distribution accurately. However, the <strong>mean<\/strong> takes into account the value of each score. Without the outlier, the <strong>mean<\/strong> would have been (2+2+2+3+3)\/5 = 2.4. With the outlier, the <strong>mean<\/strong> is (2+2+2+3+3+12)\/6 = 4. With an outlier, or a heavily-skewed distribution, the <strong>mean<\/strong> can be pulled in the direction of the outlier or skew, and is thus not the most accurate measure of central tendency. Under these circumstances, the <strong>median<\/strong> will better describe the dataset.<\/div>\n<div><\/div>\n<div>\n<h4><a id=\"distCT_return\"><\/a><a href=\"#distCT\">Concept Practice: distribution shape and central tendency<\/a><\/h4>\n<h4><a id=\"meanVS_return\"><\/a><a href=\"#meanVS\">Concept Practice: mean vs median vs mode<\/a><\/h4>\n<\/div>\n<div><\/div>\n<div><\/div>\n<h1><a id=\"2b\"><\/a>2b. Variability<\/h1>\n<h6 style=\"text-align: right\"><a href=\"#video2b\">video lesson<\/a><\/h6>\n<div class=\"textbox__content\">\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-1156\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.1-butterflies-beetles-1024x444.png\" alt=\"butterflies-beetles\" width=\"836\" height=\"363\" srcset=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.1-butterflies-beetles-1024x444.png 1024w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.1-butterflies-beetles-300x130.png 300w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.1-butterflies-beetles-768x333.png 768w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.1-butterflies-beetles-65x28.png 65w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.1-butterflies-beetles-225x97.png 225w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.1-butterflies-beetles-350x152.png 350w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.1-butterflies-beetles.png 1057w\" sizes=\"auto, (max-width: 836px) 100vw, 836px\" \/><\/p>\n<p>In the image at left, it is easy to detect a difference in pattern among the rows &#8212; even though there are variations, there is also a clear theme. The butterflies on the top are dark with large light patches, and those on the bottom are light with dark trim. In the image at right, there is no clear theme in colouring pattern top to bottom, and there is a lot of variability in beetle appearance within each grouping. This is important, because if we want to be able to assert differences in what is typical or representative between one group and another, we need enough uniformity to discern those differences. If there is too much random variability, we will be unable to say much about the data or use them for decision making.<\/p>\n<p>Our objectives in this part of the chapter will be that to explain the concept of variability and why it is important, and to calculate the <strong>descriptive<\/strong> statistics <strong><a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_65_156\">variance<\/a><\/strong> and <strong><a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_65_157\">standard deviation<\/a><\/strong>.<\/p>\n<p>Soon we will be progressing to <strong>inferential<\/strong> statistics, in which we will often wish to figure out if the central tendency of one group of scores is different from another group of scores. If we want to be able to assert differences in what is typical between one group and another, we need enough uniformity within each of those groups to discern those differences. If there is too much random variability, we will be unable to say much about the data or use them for decision making. There will be too little order in the chaos. For that reason, we need to learn how to measure variability in a data set and take that into account in the process of making inferences about the data.<\/p>\n<p>There are many ways to measure variability. However, we will focus on the two main measures of variability that are commonly use in both descriptive and inferential statistics of the sort we will cover in this course: <strong>variance<\/strong> and <strong>standard deviation<\/strong>. It is worth noting that <strong>variance<\/strong> and <strong>standard deviation<\/strong> are directly related, but <strong>standard deviation<\/strong> is easier to interpret and is thus more often reported as a <strong>descriptive<\/strong> statistic.<\/p>\n<p>In general, measures of variability describe the degree to which <strong>scores<\/strong> in a data set are spread out or clustered together. They also give us a sense of the width of a distribution. Finally, they help us understand how well any individual statistic (for example the <strong>mean<\/strong>) can possibly represent the distribution as a whole.<\/p>\n<p>When it comes to <strong>inferential<\/strong> statistics, smaller variability is better. When comparing two distributions, as we will be doing in inferential statistics, there are two ways to be confident that there is a difference. One is to have dramatically different central tendencies (such as the <strong>means<\/strong>). The other way is to have small variability, such that an individual statistic represents that distribution well.<\/p>\n<p>The first measure of variability we will learn to calculate is <strong>variance<\/strong>. <strong>Variance<\/strong> summarizes the extent to which scores are spread out from the mean. To do that, we calculate the deviation of each score from the mean.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-1158\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.8-1024x483.jpg\" alt=\"Fig 2.8\" width=\"407\" height=\"192\" srcset=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.8-1024x483.jpg 1024w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.8-300x142.jpg 300w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.8-768x362.jpg 768w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.8-65x31.jpg 65w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.8-225x106.jpg 225w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.8-350x165.jpg 350w, https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Fig-2.8.jpg 1229w\" sizes=\"auto, (max-width: 407px) 100vw, 407px\" \/><\/p>\n<p>Here is a graphical illustration of deviations. As the first\u00a0step in calculating <strong>variance<\/strong>, for each score we find its distance from the mean. So here if the distribution&#8217;s mean is 6, a score of 1 is a deviation of 5 away from the mean, and a score of 7 is a deviation of 1 away from the mean.<\/p>\n<p>Here is the formula to calculate variance.<\/p>\n<\/div>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 39px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/ql-cache\/quicklatex.com-1e00fac0117abd1c58db2dad8d0e5fb8_l3.png\" height=\"39\" width=\"159\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#32;&#92;&#091;&#83;&#68;&#94;&#123;&#50;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#115;&#117;&#109;&#32;&#40;&#88;&#45;&#77;&#41;&#94;&#123;&#50;&#125;&#125;&#123;&#78;&#125;&#92;&#093;&#32;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<div class=\"textbook__content\">I have translated the steps\u00a0of the formula into words for you here. Keep in mind that we have to do each part if the formula in the order of operations.\u00a0 First brackets, then exponents, then summation, and finally division.<\/div>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\">Steps to Calculate Variance<\/p>\n<\/header>\n<div class=\"textbox__content\">\n<ol>\n<li>Take the distance, or \u201cdeviation\u201d, of each score from the mean<\/li>\n<li>Square each distance to get rid of the sign<\/li>\n<li>Add up all the squared deviations<\/li>\n<li>Divide by the number of scores<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div class=\"textbox__content\">\n<p>The first part of the formula we need to calculate, then, is to take the distance, or \u201cdeviation\u201d, of each score from the mean. This is written as X-M.<\/p>\n<p>Next, we square each distance to get rid of the sign (because some deviations will be negative numbers, which we do not want. This is the exponent outside the brackets.<\/p>\n<\/div>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 39px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/ql-cache\/quicklatex.com-1e00fac0117abd1c58db2dad8d0e5fb8_l3.png\" height=\"39\" width=\"159\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#32;&#92;&#091;&#83;&#68;&#94;&#123;&#50;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#115;&#117;&#109;&#32;&#40;&#88;&#45;&#77;&#41;&#94;&#123;&#50;&#125;&#125;&#123;&#78;&#125;&#92;&#093;&#32;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<div class=\"textbox__content\">\n<p>Next, we do summation, so we add up all the squared deviations. In fact, the result of this step has its own name: <strong><a class=\"glossary-term\" aria-haspopup=\"dialog\" aria-describedby=\"definition\" href=\"#term_65_162\">Sum of Squares<\/a>,<\/strong> which we will sometimes abbreviate as SS.<\/p>\n<p>And finally, we divide by the number of scores, to make sure this is an average measure of distances in the dataset. Now, we&#8217;re doing the division last, because of the notation, because there is a top and a bottom. So that makes it clear that the division is the last thing that we do in the order of operations.<\/p>\n<p>One thing to note is that the purpose of squaring each deviation before taking the sum is the following. All the deviations for scores smaller than the mean will come out negative. All the deviations for scores larger than the mean will come out positive. So if we added up the deviations without squaring them, the negatives would cancel out the positives and the variance would always end up zero. Squaring each deviation is mainly a way to convert them all to positive numbers.<\/p>\n<p><strong>Standard deviation<\/strong> is the other measure of variability we will use in this course. It expresses the variability in terms of a typical deviation in the data set. This will be a single number that gives us the distance of typical scores in the dataset from the mean.<\/p>\n<p><strong>Variance<\/strong> is essentially the average squared deviation; now we want to find the average deviation to get it back into the original units of the data. To find the <strong>standard deviation<\/strong>, we just need to take the square root of the <strong>variance<\/strong>.<\/p>\n<\/div>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 18px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/ql-cache\/quicklatex.com-691b36862848544bb4a135c64ec0d469_l3.png\" height=\"18\" width=\"100\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#32;&#92;&#091;&#83;&#68;&#61;&#92;&#115;&#113;&#114;&#116;&#123;&#83;&#68;&#94;&#123;&#50;&#125;&#125;&#92;&#093;&#32;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<div class=\"textbox__content\">\n<p>Most of the <strong>inferential<\/strong> statistics we will use in this course will be based on our calculations of the <strong>mean<\/strong> and either the <strong>variance<\/strong> or <strong>standard deviation<\/strong>.<\/p>\n<\/div>\n<h4><a id=\"variability_return\"><\/a><a href=\"#variability\">Concept Practice: variability<\/a><\/h4>\n<h1>Chapter Summary<\/h1>\n<p>In this chapter we examined the purpose and common methods for determining statistics representing the central tendency and the variability of a dataset. We saw the particular characteristics of each statistic that makes it most appropriate or useful for specific situations. The combination of central tendency and the variability statistics not only provides a very succinct summary of the dataset, but it also will become the basis for making inferences from data.<\/p>\n<p>Key Terms:<\/p>\n<table class=\"no-lines\" style=\"border-collapse: collapse;width: 100%;height: 60px\">\n<tbody>\n<tr style=\"height: 15px\">\n<td style=\"width: 33.3333%;height: 15px\"><strong>mean<\/strong><\/td>\n<td style=\"width: 33.3333%;height: 15px\"><strong>\u03a3<\/strong><\/td>\n<td style=\"width: 33.3333%;height: 15px\"><strong>variance<\/strong><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 33.3333%;height: 15px\"><strong>median<\/strong><\/td>\n<td style=\"width: 33.3333%;height: 15px\"><strong>M<\/strong><\/td>\n<td style=\"width: 33.3333%;height: 15px\"><strong>standard deviation<\/strong><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 33.3333%;height: 15px\"><strong>mode<\/strong><\/td>\n<td style=\"width: 33.3333%;height: 15px\"><strong>N<\/strong><\/td>\n<td style=\"width: 33.3333%;height: 15px\"><strong>Sum of Squares<\/strong><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h1>Concept Practice<\/h1>\n<p><a id=\"mean\"><\/a><\/p>\n<div id=\"h5p-96\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-96\" class=\"h5p-iframe\" data-content-id=\"96\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"Practice 2a.04. Mean, Median and Mode calculation\"><\/iframe><\/div>\n<\/div>\n<h6 style=\"text-align: right\">Return to <a href=\"#mean_return\">text<\/a><\/h6>\n<p><a id=\"distCT\"><\/a><\/p>\n<div id=\"h5p-85\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-85\" class=\"h5p-iframe\" data-content-id=\"85\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"Practice 2a.01. Distribution and central tendency\"><\/iframe><\/div>\n<\/div>\n<h6 style=\"text-align: right\">Return to <a href=\"#distCT_return\">text<\/a><\/h6>\n<p><a id=\"meanVS\"><\/a><\/p>\n<div id=\"h5p-86\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-86\" class=\"h5p-iframe\" data-content-id=\"86\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"Practice 2a.03. Central Tendency\"><\/iframe><\/div>\n<\/div>\n<h6 style=\"text-align: right\">Return to <a href=\"#meanVS_return\">text<\/a><\/h6>\n<p><a id=\"variability\"><\/a><\/p>\n<div id=\"h5p-87\">\n<div class=\"h5p-iframe-wrapper\"><iframe id=\"h5p-iframe-87\" class=\"h5p-iframe\" data-content-id=\"87\" style=\"height:1px\" src=\"about:blank\" frameBorder=\"0\" scrolling=\"no\" title=\"Practice 2b.01. Variability\/Standard deviation\"><\/iframe><\/div>\n<\/div>\n<h6 style=\"text-align: right\">Return to <a href=\"#variability_return\">text<\/a><\/h6>\n<p>Return to <a href=\"#2a\">2a. Central Tendency<\/a><\/p>\n<h6>Try interactive <a href=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/chapter\/worksheet-2a\/\" target=\"_blank\" rel=\"noopener\">Worksheet 2a<\/a> or download <a href=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2022\/01\/2a.-Class-worksheet.pdf\" target=\"_blank\" rel=\"noopener\">Worksheet 2a<\/a><\/h6>\n<p>Return to <a href=\"#2b\">2b. Variability<\/a><\/p>\n<h6>Try interactive <a href=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/chapter\/worksheet-2b\/\" target=\"_blank\" rel=\"noopener\">Worksheet 2b<\/a>\u00a0or download <a href=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2022\/01\/2b.-Class-worksheet.pdf\" target=\"_blank\" rel=\"noopener\">Worksheet 2b<\/a><\/h6>\n<p><a id=\"video2a\"><\/a><br \/>\nvideo 2a1<\/p>\n<div style=\"width: 1440px;\" class=\"wp-video\"><video class=\"wp-video-shortcode\" id=\"video-65-1\" width=\"1440\" height=\"1080\" preload=\"metadata\" controls=\"controls\"><source type=\"video\/mp4\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Chapter-2a1.mp4?_=1\" \/><a href=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Chapter-2a1.mp4\">https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Chapter-2a1.mp4<\/a><\/video><\/div>\n<p>video 2a2<\/p>\n<div style=\"width: 1440px;\" class=\"wp-video\"><video class=\"wp-video-shortcode\" id=\"video-65-2\" width=\"1440\" height=\"1080\" preload=\"metadata\" controls=\"controls\"><source type=\"video\/mp4\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Chapter-2a2.mp4?_=2\" \/><a href=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Chapter-2a2.mp4\">https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Chapter-2a2.mp4<\/a><\/video><\/div>\n<p><a id=\"video2b\"><\/a>video 2b<\/p>\n<div style=\"width: 1440px;\" class=\"wp-video\"><video class=\"wp-video-shortcode\" id=\"video-65-3\" width=\"1440\" height=\"1080\" preload=\"metadata\" controls=\"controls\"><source type=\"video\/mp4\" src=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Chapter-2b.mp4?_=3\" \/><a href=\"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Chapter-2b.mp4\">https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-content\/uploads\/sites\/1469\/2021\/07\/Chapter-2b.mp4<\/a><\/video><\/div>\n<div class=\"glossary\"><span class=\"screen-reader-text\" id=\"definition\">definition<\/span><template id=\"term_65_68\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_65_68\"><div tabindex=\"-1\"><p>the same thing as an average: you add up all the numbers, then divide by how many numbers there were. Conceptually we can think of the mean as the balancing point for the distribution.<\/p>\n<\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_65_70\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_65_70\"><div tabindex=\"-1\"><p>the midpoint of the scores after placing them in order. The median is a counting-based measure: the point at which half of the scores fall above and half of the scores fall below.  <\/p>\n<\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_65_71\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_65_71\"><div tabindex=\"-1\"><p>the score(s) that occur(s) most often in the dataset<\/p>\n<\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_65_75\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_65_75\"><div tabindex=\"-1\"><p>in summation notation, a symbol that denotes \u201ctaking the sum\u201d of a series of numbers<\/p>\n<\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_65_96\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_65_96\"><div tabindex=\"-1\"><p>the symbol for the mean (average) of scores in a sample<\/p>\n<\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_65_97\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_65_97\"><div tabindex=\"-1\"><p>the symbol for the number of scores in a sample<\/p>\n<\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_65_156\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_65_156\"><div tabindex=\"-1\"><p>a common measure of variability in numeric data. The average squared distance of scores from the mean.<\/p>\n<\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_65_157\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_65_157\"><div tabindex=\"-1\"><p>a common measure of variability in numeric data. The average distance of a scores from the mean.<\/p>\n<\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><template id=\"term_65_162\"><div class=\"glossary__definition\" role=\"dialog\" data-id=\"term_65_162\"><div tabindex=\"-1\"><p>the sum of squared deviations, or differences, between scores and the mean in a numeric dataset<\/p>\n<\/div><button><span aria-hidden=\"true\">&times;<\/span><span class=\"screen-reader-text\">Close definition<\/span><\/button><\/div><\/template><\/div>","protected":false},"author":1394,"menu_order":2,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[48],"contributor":[],"license":[],"class_list":["post-65","chapter","type-chapter","status-publish","hentry","chapter-type-numberless"],"part":3,"_links":{"self":[{"href":"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-json\/pressbooks\/v2\/chapters\/65","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-json\/wp\/v2\/users\/1394"}],"version-history":[{"count":26,"href":"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-json\/pressbooks\/v2\/chapters\/65\/revisions"}],"predecessor-version":[{"id":1190,"href":"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-json\/pressbooks\/v2\/chapters\/65\/revisions\/1190"}],"part":[{"href":"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-json\/pressbooks\/v2\/parts\/3"}],"metadata":[{"href":"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-json\/pressbooks\/v2\/chapters\/65\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-json\/wp\/v2\/media?parent=65"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-json\/pressbooks\/v2\/chapter-type?post=65"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-json\/wp\/v2\/contributor?post=65"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/statspsych\/wp-json\/wp\/v2\/license?post=65"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}