{"id":72,"date":"2018-10-31T17:15:13","date_gmt":"2018-10-31T21:15:13","guid":{"rendered":"https:\/\/pressbooks.bccampus.ca\/simplestats\/?post_type=chapter&#038;p=72"},"modified":"2019-08-13T17:06:16","modified_gmt":"2019-08-13T21:06:16","slug":"3-4-mean","status":"publish","type":"chapter","link":"https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/3-4-mean\/","title":{"raw":"3.4 Mean","rendered":"3.4 Mean"},"content":{"raw":"[latexpage]\r\n\r\nThe third, and final, measure of central tendency is one you have undoubtedly encountered before. It is one that most people have had to calculate at least a few times in their lives, and that everyone has heard reported about one thing or another. You most likely know it by its common name, <strong>the average<\/strong>.\r\n\r\n&nbsp;\r\n\r\nRecall that the measures of central tendency provide information about the typical cases, or where cases tend to centre in a variable's distribution. Thus a student's Grade Point Average (GPA) provides a measure for how well they do academically, not in one class, but <em>on average<\/em>, across all of them; a hockey player's points season average provides a measure of their performance on the ice not just in one game but for a whole season; a monthly average temperature gives indication of what the typical weather for a specific month is, etc. All of these averages show what is typical or expected.\r\n\r\n&nbsp;\r\n\r\n<strong>The mean of a variable is<\/strong> therefore, quite simply put, <strong>the mathematical average of the values<\/strong> <strong>of the variable's cases<\/strong>. Reported alongside the mode and the median, it provides a fuller picture of where the cases tend to cluster, or what the typical cases are. The mode does this in the simplest way, by counting their frequency and reporting the largest one. The median does that by providing the most centrally located case in terms of order.\r\n\r\n&nbsp;\r\n\r\n<strong>Unlike the mode and the median, however, the mean takes into account the actual <em>values<\/em> of the cases.<\/strong>\r\n\r\n&nbsp;\r\n\r\nKeeping the last sentence in mind, do you think the mean will apply to all and any variables? If you have been paying attention, you would know that the answer is \"no, of course not\".\r\n\r\n&nbsp;\r\n\r\nNominal and ordinal variables have categories.\u00a0 <strong>Only interval\/ratio variables have actual numerical values, therefore, the mean applies only to them.<\/strong> After all, mathematical calculations are only possible when we have <em>numbers<\/em> with which to do the calculations: we cannot calculate an average of gender,\u00a0 or of race\/ethnicity, or of religious affiliation, etc.[footnote]Note that in specific cases it's possible to calculate <em>something like an average<\/em> for certain ordinal variables, for example, Likert-scales, to the extent that their numerical labels reflect a somewhat monotonic, stable-unit, distances. This should be done with extreme care and ample justification, however, and beginner researchers (like you) are advised against using means for ordinal variables.[\/footnote] We could, however, calculate an average age, income, score, temperature, etc.\r\n\r\n&nbsp;\r\n\r\nIf you had ever calculated your GPA, you already know how to calculate the mean. I will still give you an example to strengthen your knowledge.\r\n\r\n&nbsp;\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\"><em>Example 3.4 (A) Mean of Number of Siblings, Raw data<\/em><\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n&nbsp;\r\n\r\nIf you recall our Example 3.3 (A) from the previous Section 3.2 (<span>\u00a0<\/span><span id=\"sample-permalink\"><a href=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/3-2-median\/\">https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/<span id=\"editable-post-name\">3-2-median<\/span>\/<\/a><\/span><span>\u00a0\u200e)<\/span>, you imagined yourself asking seven of your friends about the number of siblings they had. We imagined the responses as follows: 2, 1, 4, 2, 1, 0, 3. We had to put these values in order to be able to find the median, but the mean works either way, whether the values are in order or not.\r\n\r\n&nbsp;\r\n\r\nTo calculate the average number of siblings your imagined friends have, we simply add all responses together and divide them by the total number of friends, i.e., by 7:\r\n\r\n&nbsp;\r\n\r\n$$\\frac{(2+1+4+2+1+0+3)}{7}=\\frac{13}{7}=1.86$$\r\n\r\n&nbsp;\r\n\r\nThat is, your imagined friends have 1.86 siblings on average (or not quite but closer to two, rather than one siblings on average). We could also say that the mean of <em>number of siblings<\/em> is 1.86.\r\n\r\n<\/div>\r\n<\/div>\r\n&nbsp;\r\n\r\nLet's do it again, as practice makes perfect.\r\n\r\n&nbsp;\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\"><em>Example 3.5 Textbook Prices For a Semester, Raw Data<\/em><\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n&nbsp;\r\n\r\nDepending on the courses you take in a semester, what you pay for books will vary but let's say we're interested in how much you pay for books in a typical semester.<span style=\"font-size: 1rem\">\u00a0Perhaps\u00a0<\/span><span style=\"text-indent: 1em;font-size: 1rem\">you are very-well organized and want to finish your degree as quickly as possible so you have decided to take five courses per semester. For simplicity's sake, let's assume your were assigned one book per course. These are the books' prices: \\$120, \\$230, \\$300, \\$65, \\$30. How much did you pay for a book on average?<\/span>\r\n\r\n&nbsp;\r\n\r\n$$\\frac{(120+230+300+65+30)}{5}=\\frac{745}{5}=149$$\r\n\r\n&nbsp;\r\n\r\nThat is, despite the fact that some of your books were expensive (like the \\$300 one), and some relatively cheap (like the \\$30 one), the average price you paid for a book in that semester was \\$149.\r\n\r\n<\/div>\r\n<\/div>\r\n&nbsp;\r\n\r\nNow that we've seen how the mean works in practice, let's generalize what we did in the two examples above using proper notation. Fair warning: the formula below does <em>look<\/em> complicated but remember what we just did: our calculations were quite simple (adding all values, dividing their sum by their total number), and so is the formula. As usual, it simply restates what we've said in words in a mathematical shorthand. If you know what each symbol in the shorthand stands for, you know what the formula means. So, take a deep breath:\r\n\r\n&nbsp;\r\n\r\n\\begin{equation}\r\n\r\n\\frac{x_1+x_2+x_3+\\dots+x_N}{N}=\\frac{\\sum\\limits_{i=1}^{N}{x_i}}{N}=\\overline{x}\r\n\r\n\\end{equation}\r\n\r\n&nbsp;\r\n\r\nwhere\u00a0\u2211 stands for \"sum\"[footnote] <em>\u2211<\/em> is pronounced \"SIG-ma\" and is the Greek letter S. [\/footnote], $\\sum\\limits_{i=1}^{N}$ indicates to sum all cases from the first (1) to the last (<em>N<\/em>), <em>x<sub>i<\/sub> <\/em>stands for any case with a number between 1 and <em>N<\/em>, and $\\overline{x}$ indicates the mean<span style=\"text-indent: 18.6667px;font-size: 14pt\">[footnote]$\\overline{x}$ is pronounced \"EX-bar\".[\/footnote]<\/span><span style=\"text-indent: 1em;font-size: 14pt\">, i.e., the average of all the <\/span><em style=\"text-indent: 1em;font-size: 14pt\">x<sub>i<\/sub><\/em><span style=\"text-indent: 1em;font-size: 14pt\">'s. Thus, the formula basically tells you to add all values and divide by their total, just as we did in the examples.<\/span>\r\n\r\n&nbsp;\r\n\r\nSo far, we only calculated the means for raw data, i.e., data not presented in a frequency table. Will the calculation of the mean be different if we had a frequency table instead? While the principle is the same, the fact that the values are grouped by frequency in frequency tables requires that we do a slight modification to our calculations. Here's a small-scale illustration to demonstrate the principle before we do an example with a larger<em> N<\/em>.\r\n\r\n&nbsp;\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\"><em>Example 3.4 (B) Mean for\u00a0Number of Siblings, Aggregated Data<\/em><\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n&nbsp;\r\n\r\nArranging the raw data from Example 3.4 (A) above, we again get the following table.\r\n\r\n<\/div>\r\n<em>Table 3.3 Frequency Table for Number of Siblings<\/em>\r\n<div class=\"textbox__content\">\r\n<table style=\"border-collapse: collapse;width: 50%;height: 105px\" border=\"0\">\r\n<tbody>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 2.83286%;height: 15px;text-align: center\"><strong>Value<\/strong><\/td>\r\n<td style=\"width: 2.849%;height: 15px;text-align: center\"><strong>Frequency<\/strong><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 2.83286%;height: 15px\">0<\/td>\r\n<td style=\"width: 2.849%;height: 15px\">1<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 2.83286%;height: 15px\">1<\/td>\r\n<td style=\"width: 2.849%;height: 15px\">2<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 2.83286%;height: 15px\">2<\/td>\r\n<td style=\"width: 2.849%;height: 15px\">2<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 2.83286%;height: 15px\">3<\/td>\r\n<td style=\"width: 2.849%;height: 15px\">1<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 2.83286%;height: 15px\">4<\/td>\r\n<td style=\"width: 2.849%;height: 15px\">1<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 2.83286%;height: 15px\"><strong>Total<\/strong><\/td>\r\n<td style=\"width: 2.849%;height: 15px\"><strong>7<\/strong><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nAccording to the formula for the mean, we need to add all values together and then divide their sum by their total number. When the values are disaggregated (i.e., raw), we can proceed to adding them up right away. However, when they are grouped by frequency, we first need to multiply each value by its respective frequency, and then add the value-times-frequency products together, before dividing them by the total number, like this:\r\n\r\n&nbsp;\r\n\r\n$$\\frac{\\sum\\limits_{i=1}^{N}{x_i}}{N}=\\frac{(0+1+1+2+2+3+4)}{7}=\\frac{0(1)+1(2)+2(2)+3(1)+4(1)}{7}=\\frac{13}{7}=1.86=\\overline{x}$$\r\n\r\n&nbsp;\r\n\r\nAgain, the average number of siblings of these seven friends is 1.86, as previously calculated.\r\n\r\n<\/div>\r\n<\/div>\r\n&nbsp;\r\n\r\nNow let's apply the same principle to a new, larger-<em>N<\/em> example.\r\n\r\n&nbsp;\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\"><em>Example 3.6\u00a0Age of Classmates, Aggregated Data<\/em><\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n&nbsp;\r\n\r\nImagine you are doing a survey for one of your class assignments and one of the questions is about age. You aggregate the data by frequency and you get the following table.\r\n\r\n&nbsp;\r\n\r\n<em>Table 3.5 Mean for Age of Classmates<\/em>\r\n<table style=\"border-collapse: collapse;width: 50%;height: 135px\" border=\"0\">\r\n<tbody>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 24.3218%;height: 15px;text-align: center\"><strong>Value<\/strong><\/td>\r\n<td style=\"width: 23.4743%;height: 15px;text-align: center\"><strong>Frequency<\/strong><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 24.3218%;height: 15px\">19<\/td>\r\n<td style=\"width: 23.4743%;height: 15px\">1<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 24.3218%;height: 15px\">20<\/td>\r\n<td style=\"width: 23.4743%;height: 15px\">10<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 24.3218%;height: 15px\">21<\/td>\r\n<td style=\"width: 23.4743%;height: 15px\">12<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 24.3218%;height: 15px\">22<\/td>\r\n<td style=\"width: 23.4743%;height: 15px\">8<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 24.3218%;height: 15px\">25<\/td>\r\n<td style=\"width: 23.4743%;height: 15px\">2<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 24.3218%;height: 15px\">27<\/td>\r\n<td style=\"width: 23.4743%;height: 15px\">1<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 24.3218%;height: 15px\">35<\/td>\r\n<td style=\"width: 23.4743%;height: 15px\">1<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 24.3218%;height: 15px\"><strong>TOTAL<\/strong><\/td>\r\n<td style=\"width: 23.4743%;height: 15px\"><strong>35<\/strong><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nBy the formula, we have:\r\n\r\n$\\frac{\\sum\\limits_{i=1}^{N}{x_i}}{N}=\\frac{19(1)+20(10)+21(12)+22(8)+25(2)+27(1)+35(1)}{35}=\\frac{19+200+252+176+50+27+35}{35}=\\frac{759}{35}=21.69=\\overline{x}$\r\n\r\n&nbsp;\r\n\r\nOr, now you know that the average age of your classmates in that class is 21.69 years, or a bit less than 22 years.\r\n\r\n<\/div>\r\n<\/div>","rendered":"<p>The third, and final, measure of central tendency is one you have undoubtedly encountered before. It is one that most people have had to calculate at least a few times in their lives, and that everyone has heard reported about one thing or another. You most likely know it by its common name, <strong>the average<\/strong>.<\/p>\n<p>&nbsp;<\/p>\n<p>Recall that the measures of central tendency provide information about the typical cases, or where cases tend to centre in a variable&#8217;s distribution. Thus a student&#8217;s Grade Point Average (GPA) provides a measure for how well they do academically, not in one class, but <em>on average<\/em>, across all of them; a hockey player&#8217;s points season average provides a measure of their performance on the ice not just in one game but for a whole season; a monthly average temperature gives indication of what the typical weather for a specific month is, etc. All of these averages show what is typical or expected.<\/p>\n<p>&nbsp;<\/p>\n<p><strong>The mean of a variable is<\/strong> therefore, quite simply put, <strong>the mathematical average of the values<\/strong> <strong>of the variable&#8217;s cases<\/strong>. Reported alongside the mode and the median, it provides a fuller picture of where the cases tend to cluster, or what the typical cases are. The mode does this in the simplest way, by counting their frequency and reporting the largest one. The median does that by providing the most centrally located case in terms of order.<\/p>\n<p>&nbsp;<\/p>\n<p><strong>Unlike the mode and the median, however, the mean takes into account the actual <em>values<\/em> of the cases.<\/strong><\/p>\n<p>&nbsp;<\/p>\n<p>Keeping the last sentence in mind, do you think the mean will apply to all and any variables? If you have been paying attention, you would know that the answer is &#8220;no, of course not&#8221;.<\/p>\n<p>&nbsp;<\/p>\n<p>Nominal and ordinal variables have categories.\u00a0 <strong>Only interval\/ratio variables have actual numerical values, therefore, the mean applies only to them.<\/strong> After all, mathematical calculations are only possible when we have <em>numbers<\/em> with which to do the calculations: we cannot calculate an average of gender,\u00a0 or of race\/ethnicity, or of religious affiliation, etc.<a class=\"footnote\" title=\"Note that in specific cases it's possible to calculate something like an average for certain ordinal variables, for example, Likert-scales, to the extent that their numerical labels reflect a somewhat monotonic, stable-unit, distances. This should be done with extreme care and ample justification, however, and beginner researchers (like you) are advised against using means for ordinal variables.\" id=\"return-footnote-72-1\" href=\"#footnote-72-1\" aria-label=\"Footnote 1\"><sup class=\"footnote\">[1]<\/sup><\/a> We could, however, calculate an average age, income, score, temperature, etc.<\/p>\n<p>&nbsp;<\/p>\n<p>If you had ever calculated your GPA, you already know how to calculate the mean. I will still give you an example to strengthen your knowledge.<\/p>\n<p>&nbsp;<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\"><em>Example 3.4 (A) Mean of Number of Siblings, Raw data<\/em><\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>&nbsp;<\/p>\n<p>If you recall our Example 3.3 (A) from the previous Section 3.2 (<span>\u00a0<\/span><span id=\"sample-permalink\"><a href=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/3-2-median\/\">https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/<span id=\"editable-post-name\">3-2-median<\/span>\/<\/a><\/span><span>\u00a0\u200e)<\/span>, you imagined yourself asking seven of your friends about the number of siblings they had. We imagined the responses as follows: 2, 1, 4, 2, 1, 0, 3. We had to put these values in order to be able to find the median, but the mean works either way, whether the values are in order or not.<\/p>\n<p>&nbsp;<\/p>\n<p>To calculate the average number of siblings your imagined friends have, we simply add all responses together and divide them by the total number of friends, i.e., by 7:<\/p>\n<p>&nbsp;<\/p>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 38px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-bbe3225acb3cc43b818d7648fedc67e5_l3.png\" height=\"38\" width=\"308\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#92;&#102;&#114;&#97;&#99;&#123;&#40;&#50;&#43;&#49;&#43;&#52;&#43;&#50;&#43;&#49;&#43;&#48;&#43;&#51;&#41;&#125;&#123;&#55;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#49;&#51;&#125;&#123;&#55;&#125;&#61;&#49;&#46;&#56;&#54;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>That is, your imagined friends have 1.86 siblings on average (or not quite but closer to two, rather than one siblings on average). We could also say that the mean of <em>number of siblings<\/em> is 1.86.<\/p>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>Let&#8217;s do it again, as practice makes perfect.<\/p>\n<p>&nbsp;<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\"><em>Example 3.5 Textbook Prices For a Semester, Raw Data<\/em><\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>&nbsp;<\/p>\n<p>Depending on the courses you take in a semester, what you pay for books will vary but let&#8217;s say we&#8217;re interested in how much you pay for books in a typical semester.<span style=\"font-size: 1rem\">\u00a0Perhaps\u00a0<\/span><span style=\"text-indent: 1em;font-size: 1rem\">you are very-well organized and want to finish your degree as quickly as possible so you have decided to take five courses per semester. For simplicity&#8217;s sake, let&#8217;s assume your were assigned one book per course. These are the books&#8217; prices: &#36;120, &#36;230, &#36;300, &#36;65, &#36;30. How much did you pay for a book on average?<\/span><\/p>\n<p>&nbsp;<\/p>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 38px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-35c4726df14e5e5ad2e3a3fa77e47627_l3.png\" height=\"38\" width=\"322\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#92;&#102;&#114;&#97;&#99;&#123;&#40;&#49;&#50;&#48;&#43;&#50;&#51;&#48;&#43;&#51;&#48;&#48;&#43;&#54;&#53;&#43;&#51;&#48;&#41;&#125;&#123;&#53;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#55;&#52;&#53;&#125;&#123;&#53;&#125;&#61;&#49;&#52;&#57;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>That is, despite the fact that some of your books were expensive (like the &#36;300 one), and some relatively cheap (like the &#36;30 one), the average price you paid for a book in that semester was &#36;149.<\/p>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>Now that we&#8217;ve seen how the mean works in practice, let&#8217;s generalize what we did in the two examples above using proper notation. Fair warning: the formula below does <em>look<\/em> complicated but remember what we just did: our calculations were quite simple (adding all values, dividing their sum by their total number), and so is the formula. As usual, it simply restates what we&#8217;ve said in words in a mathematical shorthand. If you know what each symbol in the shorthand stands for, you know what the formula means. So, take a deep breath:<\/p>\n<p>&nbsp;<\/p>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 63px;\"><span class=\"ql-right-eqno\"> (1) <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-cb277d855ed22b36c2c633eb11d840fa_l3.png\" height=\"63\" width=\"289\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#98;&#101;&#103;&#105;&#110;&#123;&#101;&#113;&#117;&#97;&#116;&#105;&#111;&#110;&#42;&#125; &#92;&#102;&#114;&#97;&#99;&#123;&#120;&#95;&#49;&#43;&#120;&#95;&#50;&#43;&#120;&#95;&#51;&#43;&#92;&#100;&#111;&#116;&#115;&#43;&#120;&#95;&#78;&#125;&#123;&#78;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#115;&#117;&#109;&#92;&#108;&#105;&#109;&#105;&#116;&#115;&#95;&#123;&#105;&#61;&#49;&#125;&#94;&#123;&#78;&#125;&#123;&#120;&#95;&#105;&#125;&#125;&#123;&#78;&#125;&#61;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125; &#92;&#101;&#110;&#100;&#123;&#101;&#113;&#117;&#97;&#116;&#105;&#111;&#110;&#42;&#125;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>where\u00a0\u2211 stands for &#8220;sum&#8221;<a class=\"footnote\" title=\"\u2211 is pronounced &quot;SIG-ma&quot; and is the Greek letter S.\" id=\"return-footnote-72-2\" href=\"#footnote-72-2\" aria-label=\"Footnote 2\"><sup class=\"footnote\">[2]<\/sup><\/a>, <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-9103549f48d90732e336a07fb43d194a_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#115;&#117;&#109;&#92;&#108;&#105;&#109;&#105;&#116;&#115;&#95;&#123;&#105;&#61;&#49;&#125;&#94;&#123;&#78;&#125;\" title=\"Rendered by QuickLaTeX.com\" height=\"43\" width=\"22\" style=\"vertical-align: -17px;\" \/> indicates to sum all cases from the first (1) to the last (<em>N<\/em>), <em>x<sub>i<\/sub> <\/em>stands for any case with a number between 1 and <em>N<\/em>, and <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-0d00c2da2b2541a97ae0ac3c10e1504e_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;\" title=\"Rendered by QuickLaTeX.com\" height=\"11\" width=\"11\" style=\"vertical-align: 0px;\" \/> indicates the mean<span style=\"text-indent: 18.6667px;font-size: 14pt\"><a class=\"footnote\" title=\"is pronounced &quot;EX-bar&quot;.\" id=\"return-footnote-72-3\" href=\"#footnote-72-3\" aria-label=\"Footnote 3\"><sup class=\"footnote\">[3]<\/sup><\/a><\/span><span style=\"text-indent: 1em;font-size: 14pt\">, i.e., the average of all the <\/span><em style=\"text-indent: 1em;font-size: 14pt\">x<sub>i<\/sub><\/em><span style=\"text-indent: 1em;font-size: 14pt\">&#8216;s. Thus, the formula basically tells you to add all values and divide by their total, just as we did in the examples.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>So far, we only calculated the means for raw data, i.e., data not presented in a frequency table. Will the calculation of the mean be different if we had a frequency table instead? While the principle is the same, the fact that the values are grouped by frequency in frequency tables requires that we do a slight modification to our calculations. Here&#8217;s a small-scale illustration to demonstrate the principle before we do an example with a larger<em> N<\/em>.<\/p>\n<p>&nbsp;<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\"><em>Example 3.4 (B) Mean for\u00a0Number of Siblings, Aggregated Data<\/em><\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>&nbsp;<\/p>\n<p>Arranging the raw data from Example 3.4 (A) above, we again get the following table.<\/p>\n<\/div>\n<p><em>Table 3.3 Frequency Table for Number of Siblings<\/em><\/p>\n<div class=\"textbox__content\">\n<table style=\"border-collapse: collapse;width: 50%;height: 105px\">\n<tbody>\n<tr style=\"height: 15px\">\n<td style=\"width: 2.83286%;height: 15px;text-align: center\"><strong>Value<\/strong><\/td>\n<td style=\"width: 2.849%;height: 15px;text-align: center\"><strong>Frequency<\/strong><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 2.83286%;height: 15px\">0<\/td>\n<td style=\"width: 2.849%;height: 15px\">1<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 2.83286%;height: 15px\">1<\/td>\n<td style=\"width: 2.849%;height: 15px\">2<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 2.83286%;height: 15px\">2<\/td>\n<td style=\"width: 2.849%;height: 15px\">2<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 2.83286%;height: 15px\">3<\/td>\n<td style=\"width: 2.849%;height: 15px\">1<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 2.83286%;height: 15px\">4<\/td>\n<td style=\"width: 2.849%;height: 15px\">1<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 2.83286%;height: 15px\"><strong>Total<\/strong><\/td>\n<td style=\"width: 2.849%;height: 15px\"><strong>7<\/strong><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>According to the formula for the mean, we need to add all values together and then divide their sum by their total number. When the values are disaggregated (i.e., raw), we can proceed to adding them up right away. However, when they are grouped by frequency, we first need to multiply each value by its respective frequency, and then add the value-times-frequency products together, before dividing them by the total number, like this:<\/p>\n<p>&nbsp;<\/p>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 62px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-3ce2248910a09f3fe94001cbd84a739a_l3.png\" height=\"62\" width=\"683\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#115;&#117;&#109;&#92;&#108;&#105;&#109;&#105;&#116;&#115;&#95;&#123;&#105;&#61;&#49;&#125;&#94;&#123;&#78;&#125;&#123;&#120;&#95;&#105;&#125;&#125;&#123;&#78;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#40;&#48;&#43;&#49;&#43;&#49;&#43;&#50;&#43;&#50;&#43;&#51;&#43;&#52;&#41;&#125;&#123;&#55;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#48;&#40;&#49;&#41;&#43;&#49;&#40;&#50;&#41;&#43;&#50;&#40;&#50;&#41;&#43;&#51;&#40;&#49;&#41;&#43;&#52;&#40;&#49;&#41;&#125;&#123;&#55;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#49;&#51;&#125;&#123;&#55;&#125;&#61;&#49;&#46;&#56;&#54;&#61;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>Again, the average number of siblings of these seven friends is 1.86, as previously calculated.<\/p>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>Now let&#8217;s apply the same principle to a new, larger-<em>N<\/em> example.<\/p>\n<p>&nbsp;<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\"><em>Example 3.6\u00a0Age of Classmates, Aggregated Data<\/em><\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>&nbsp;<\/p>\n<p>Imagine you are doing a survey for one of your class assignments and one of the questions is about age. You aggregate the data by frequency and you get the following table.<\/p>\n<p>&nbsp;<\/p>\n<p><em>Table 3.5 Mean for Age of Classmates<\/em><\/p>\n<table style=\"border-collapse: collapse;width: 50%;height: 135px\">\n<tbody>\n<tr style=\"height: 15px\">\n<td style=\"width: 24.3218%;height: 15px;text-align: center\"><strong>Value<\/strong><\/td>\n<td style=\"width: 23.4743%;height: 15px;text-align: center\"><strong>Frequency<\/strong><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 24.3218%;height: 15px\">19<\/td>\n<td style=\"width: 23.4743%;height: 15px\">1<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 24.3218%;height: 15px\">20<\/td>\n<td style=\"width: 23.4743%;height: 15px\">10<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 24.3218%;height: 15px\">21<\/td>\n<td style=\"width: 23.4743%;height: 15px\">12<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 24.3218%;height: 15px\">22<\/td>\n<td style=\"width: 23.4743%;height: 15px\">8<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 24.3218%;height: 15px\">25<\/td>\n<td style=\"width: 23.4743%;height: 15px\">2<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 24.3218%;height: 15px\">27<\/td>\n<td style=\"width: 23.4743%;height: 15px\">1<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 24.3218%;height: 15px\">35<\/td>\n<td style=\"width: 23.4743%;height: 15px\">1<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 24.3218%;height: 15px\"><strong>TOTAL<\/strong><\/td>\n<td style=\"width: 23.4743%;height: 15px\"><strong>35<\/strong><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>By the formula, we have:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-44e372567244526aefd0b697e581e248_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#115;&#117;&#109;&#92;&#108;&#105;&#109;&#105;&#116;&#115;&#95;&#123;&#105;&#61;&#49;&#125;&#94;&#123;&#78;&#125;&#123;&#120;&#95;&#105;&#125;&#125;&#123;&#78;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#49;&#57;&#40;&#49;&#41;&#43;&#50;&#48;&#40;&#49;&#48;&#41;&#43;&#50;&#49;&#40;&#49;&#50;&#41;&#43;&#50;&#50;&#40;&#56;&#41;&#43;&#50;&#53;&#40;&#50;&#41;&#43;&#50;&#55;&#40;&#49;&#41;&#43;&#51;&#53;&#40;&#49;&#41;&#125;&#123;&#51;&#53;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#49;&#57;&#43;&#50;&#48;&#48;&#43;&#50;&#53;&#50;&#43;&#49;&#55;&#54;&#43;&#53;&#48;&#43;&#50;&#55;&#43;&#51;&#53;&#125;&#123;&#51;&#53;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#55;&#53;&#57;&#125;&#123;&#51;&#53;&#125;&#61;&#50;&#49;&#46;&#54;&#57;&#61;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;\" title=\"Rendered by QuickLaTeX.com\" height=\"65\" width=\"580\" style=\"vertical-align: -6px;\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>Or, now you know that the average age of your classmates in that class is 21.69 years, or a bit less than 22 years.<\/p>\n<\/div>\n<\/div>\n<hr class=\"before-footnotes clear\" \/><div class=\"footnotes\"><ol><li id=\"footnote-72-1\">Note that in specific cases it's possible to calculate <em>something like an average<\/em> for certain ordinal variables, for example, Likert-scales, to the extent that their numerical labels reflect a somewhat monotonic, stable-unit, distances. This should be done with extreme care and ample justification, however, and beginner researchers (like you) are advised against using means for ordinal variables. <a href=\"#return-footnote-72-1\" class=\"return-footnote\" aria-label=\"Return to footnote 1\">&crarr;<\/a><\/li><li id=\"footnote-72-2\"> <em>\u2211<\/em> is pronounced \"SIG-ma\" and is the Greek letter S.  <a href=\"#return-footnote-72-2\" class=\"return-footnote\" aria-label=\"Return to footnote 2\">&crarr;<\/a><\/li><li id=\"footnote-72-3\"><img src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-0d00c2da2b2541a97ae0ac3c10e1504e_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;\" title=\"Rendered by QuickLaTeX.com\" height=\"11\" width=\"11\" style=\"vertical-align: 0px;\" \/> is pronounced \"EX-bar\". <a href=\"#return-footnote-72-3\" class=\"return-footnote\" aria-label=\"Return to footnote 3\">&crarr;<\/a><\/li><\/ol><\/div>","protected":false},"author":533,"menu_order":4,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-72","chapter","type-chapter","status-publish","hentry"],"part":24,"_links":{"self":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/72","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/users\/533"}],"version-history":[{"count":25,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/72\/revisions"}],"predecessor-version":[{"id":1591,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/72\/revisions\/1591"}],"part":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/parts\/24"}],"metadata":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/72\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/media?parent=72"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapter-type?post=72"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/contributor?post=72"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/license?post=72"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}