{"id":1499,"date":"2019-08-08T17:54:57","date_gmt":"2019-08-08T21:54:57","guid":{"rendered":"https:\/\/pressbooks.bccampus.ca\/simplestats\/?post_type=chapter&#038;p=1499"},"modified":"2019-09-15T23:43:00","modified_gmt":"2019-09-16T03:43:00","slug":"2-3-2-missing-data-adding-valid-percentages","status":"publish","type":"chapter","link":"https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/2-3-2-missing-data-adding-valid-percentages\/","title":{"raw":"2.3.2 Missing Data: Adding Valid Percentages","rendered":"2.3.2 Missing Data: Adding Valid Percentages"},"content":{"raw":"[latexpage]\r\n\r\nIf you've paid attention so far, you must have noticed that three of our 21 respondents provided a \"Didn't answer\" response when asked about their educational attainment. Sometimes respondents may refuse to answer a question, or the question may not have been applicable to them and wasn't asked, or a response might not get recorded due to an error, etc. In short, sometimes we have a case of what is known as <em>missing data<\/em>.\r\n\r\n&nbsp;\r\n\r\nWhat do we know about the educational attainment of the three individuals who, for whatever reason, didn't answer this question? Nothing.\r\n\r\n&nbsp;\r\n\r\nCan we in some way infer their educational attainment? Not with the data provided in the example.\r\n\r\n&nbsp;\r\n\r\nSo then what do we do? How do we analyze our <em>educational attainment<\/em> variable?\r\n\r\n&nbsp;\r\n\r\nThe most frequent -- and strongly recommended (especially for people just starting on their journey to research) -- course of action is to simply<em> drop<\/em> the missing cases[footnote]Depending on the particular data and particular situation, and assuming strong justification, researchers experienced in data analysis may have different options, such as estimation, imputation of means, etc. These, however, are beyond the scope of this text. The safest action for students\/beginners to take remains dropping any missing cases from the analysis. See <a href=\"https:\/\/www.iriseekhout.com\/missing-data\/missing-data-methods\/imputation-methods\/\">https:\/\/www.iriseekhout.com\/missing-data\/missing-data-methods\/imputation-methods\/<\/a> for a discussion. [\/footnote]. Missing cases have no part in any analysis and using them as they are would inevitably compromise conclusions -- after all, we have no information on what we want to know about them, and we cannot make that information up.\r\n\r\n&nbsp;\r\n\r\nGenerally, how statistical software deal with missing data by default settings may vary. SPSS's default is to skip missing cases so that analysis is always based on valid cases only.\r\n\r\n&nbsp;\r\n\r\nAs well, SPSS provides a separate column in <em>Data View<\/em> indicating which values in the data stand for a missing data point. As discussed in <span style=\"color: #000000\">Section 2.1 <\/span>(<a href=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/2-1-data\/\">https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/2-1-data\/<\/a>), you can find the <em>coding<\/em> of the values in the <em>Values<\/em> column in <em>Data View<\/em>.\u00a0\u00a0Clicking the specific cell in that column opens up a window with the values' code. There you may find several types of missing data, typically values such as \"Valid skip\"\/\"Not applicable\" (the respondent had not been asked the question on which the variable is based due to a previous answer)[footnote]For example, if a respondent has indicated previously that they didn't smoke, a subsequent question about how often they smoked would make no sense; the respondent then would be \"validly skipped\" from answering this subsequent question.[\/footnote], \"Don't know\" (the respondent did not know the answer to the question), \"Refusal\" (the respondent refused to answer the question), \"Not stated\" (when the question should have been answered\/ an answer should have been recorded but, for whatever reason, it hasn't been), etc.\r\n\r\n&nbsp;\r\n\r\nApart from \"Not applicable\", the codes listed here are standard Statistics Canada codes used in all their datasets and can be found in any Statistics Canada dataset documentation[footnote]Currently, Statistics Canada uses 6, 96, 996, etc. for \"Valid skip\"; 7, 97, 997, etc. for \"Don't know\"; 8, 98, 998, etc. for \"Refused\"; and 9, 99, 999, etc. for \"Not stated\". [\/footnote].\r\n\r\n&nbsp;\r\n\r\nSo given that we had three cases of missing data within our group of 21 respondents, are the percentages reported in the previous sub-section's Table 2.2 in Example 2.2 (C) <em>valid<\/em> to use?\r\n\r\n&nbsp;\r\n<div class=\"textbox textbox--learning-objectives\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\"><em><span style=\"color: #ff0000\"><strong>Watch Out!! #4<\/strong><\/span>... for Findings Based on Missing Data<\/em><\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n&nbsp;\r\n\r\nThis will be a short warning but it deserves it's own scary-red<em> Watch Out!!<\/em> reiteration: do not trust analysis and findings that include missing cases as they would be distorted and unreliable. Missing data is exactly that - <em>missing<\/em>. It simply does not exist. As a beginner researcher, always make sure you have dropped (i.e., excluded) any missing cases before analyzing your data and reporting any results.\r\n\r\n<\/div>\r\n<\/div>\r\n&nbsp;\r\n\r\nConsidering that Table 2.2 did include missing data in the calculation of percentages, let us correct that by modifying it and including another column, <strong><em>valid<\/em> percentages<\/strong>.\r\n\r\n&nbsp;\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\"><em>Example 2.2 (D)\u00a0Hypothetical Data on Educational Attainment, Organized and with Relative Frequencies and Valid Percentages Added<\/em><\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n<em>Table 2.3 Educational Attainment by Frequency, Percent and Valid Percent<\/em>\r\n<table style=\"border-collapse: collapse;width: 79.1507%;height: 209px\" border=\"0\">\r\n<tbody>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 10.9909%;text-align: center;height: 59px\"><strong>\u00a0<\/strong><\/td>\r\n<td style=\"width: 37.8257%;height: 59px;text-align: center\"><strong>Degree<\/strong><\/td>\r\n<td style=\"width: 4.26491%;height: 59px\">\r\n<p style=\"text-align: center\"><strong>\u00a0 Frequency<\/strong><\/p>\r\n<\/td>\r\n<td style=\"width: 17.6642%;height: 59px\">\r\n<p style=\"text-align: center\"><strong>Percent<\/strong><\/p>\r\n<\/td>\r\n<td style=\"width: 18.7802%;height: 59px\">\r\n<p style=\"text-align: center\"><strong>Valid Percent<\/strong><\/p>\r\n<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 10.9909%;height: 15px\">Valid<\/td>\r\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0No degree<\/td>\r\n<td style=\"width: 4.26491%;height: 15px;text-align: center\">1<\/td>\r\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">4.7<\/td>\r\n<td style=\"width: 18.7802%;text-align: center;height: 15px\">5.6<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\r\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0Secondary\/High School<\/td>\r\n<td style=\"width: 4.26491%;height: 15px;text-align: center\">6<\/td>\r\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">28.6<\/td>\r\n<td style=\"width: 18.7802%;text-align: center;height: 15px\">33.3<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\r\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0Associate's<\/td>\r\n<td style=\"width: 4.26491%;height: 15px;text-align: center\">3<\/td>\r\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">14.3<\/td>\r\n<td style=\"width: 18.7802%;text-align: center;height: 15px\">16.7<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\r\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0Bachelor's<\/td>\r\n<td style=\"width: 4.26491%;height: 15px;text-align: center\">5<\/td>\r\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">23.8<\/td>\r\n<td style=\"width: 18.7802%;text-align: center;height: 15px\">27.8<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\r\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0Master's<\/td>\r\n<td style=\"width: 4.26491%;height: 15px;text-align: center\">2<\/td>\r\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">9.5<\/td>\r\n<td style=\"width: 18.7802%;text-align: center;height: 15px\">11.1<\/td>\r\n<\/tr>\r\n<tr style=\"height: 16px\">\r\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\r\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0PhD<\/td>\r\n<td style=\"width: 4.26491%;height: 15px;text-align: center\">1<\/td>\r\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">4.7<\/td>\r\n<td style=\"width: 18.7802%;text-align: center;height: 15px\">5.6<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\r\n<td style=\"width: 37.8257%;height: 15px\"><strong>\u00a0 \u00a0Total Valid<\/strong><\/td>\r\n<td style=\"width: 4.26491%;text-align: center;height: 15px\"><strong>18<\/strong><\/td>\r\n<td style=\"width: 17.6642%;text-align: center;height: 15px\"><strong>85.6<\/strong><\/td>\r\n<td style=\"width: 18.7802%;text-align: center;height: 15px\"><strong>100.0<\/strong><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 10.9909%;height: 15px\">Missing<\/td>\r\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0Didn't answer<\/td>\r\n<td style=\"width: 4.26491%;text-align: center;height: 15px\">3<\/td>\r\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">14.3<\/td>\r\n<td style=\"width: 18.7802%;text-align: center;height: 15px\"><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\r\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0Total Missing<\/td>\r\n<td style=\"width: 4.26491%;text-align: center;height: 15px\">3<\/td>\r\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">14.3<\/td>\r\n<td style=\"width: 18.7802%;text-align: center;height: 15px\"><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\r\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0 TOTAL<\/td>\r\n<td style=\"width: 4.26491%;text-align: center;height: 15px\">21<\/td>\r\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">100.0<\/td>\r\n<td style=\"width: 18.7802%;text-align: center;height: 15px\"><strong>\u00a0<\/strong><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n&nbsp;\r\n\r\nAs you see in the modified Table 2.3 above, I have separated the missing cases from the valid cases (the cases for which we have educational attainment data). <strong>Since we have only 18 valid cases, we should use only those 18 cases for any calculations and analysis -- and not the total of 21 cases<\/strong> (which includes the missing). Thus, instead of having just\r\n\r\n&nbsp;\r\n\r\n$$\\frac{f}{N}(100)=\\frac{1}{21}(100)=0.047(100)=4.7\\%$$\r\n\r\n&nbsp;\r\n\r\nalong with the rest of the categories' percentages calculated in this way, we should calculate the categories' <em>valid<\/em> percentages, discarding he three missing cases, like this:\r\n\r\n&nbsp;\r\n\r\n$$\\frac{f}{N}(100)=\\frac{1}{18}(100)=0.056(100)=5.6\\%$$\r\n\r\n&nbsp;\r\n\r\n(As usual, I only show you the calculation for the first category as the rest follow in the same way.)\r\n\r\n&nbsp;\r\n\r\nDespite the fact that we do have the percentages based on missing data in the table, note that these - <strong>the valid percentages -- are the only percentages you should use in your analysis and report in your findings<\/strong>.\r\n\r\n&nbsp;\r\n\r\n<em>Alright<\/em>, you might say now, <em>we added percentages and valid percentages to the simple frequencies, this surely means we have a complete frequency table by now.<\/em>\r\n\r\n&nbsp;\r\n\r\nSorry, no, not yet. One thing remains.","rendered":"<p>If you&#8217;ve paid attention so far, you must have noticed that three of our 21 respondents provided a &#8220;Didn&#8217;t answer&#8221; response when asked about their educational attainment. Sometimes respondents may refuse to answer a question, or the question may not have been applicable to them and wasn&#8217;t asked, or a response might not get recorded due to an error, etc. In short, sometimes we have a case of what is known as <em>missing data<\/em>.<\/p>\n<p>&nbsp;<\/p>\n<p>What do we know about the educational attainment of the three individuals who, for whatever reason, didn&#8217;t answer this question? Nothing.<\/p>\n<p>&nbsp;<\/p>\n<p>Can we in some way infer their educational attainment? Not with the data provided in the example.<\/p>\n<p>&nbsp;<\/p>\n<p>So then what do we do? How do we analyze our <em>educational attainment<\/em> variable?<\/p>\n<p>&nbsp;<\/p>\n<p>The most frequent &#8212; and strongly recommended (especially for people just starting on their journey to research) &#8212; course of action is to simply<em> drop<\/em> the missing cases<a class=\"footnote\" title=\"Depending on the particular data and particular situation, and assuming strong justification, researchers experienced in data analysis may have different options, such as estimation, imputation of means, etc. These, however, are beyond the scope of this text. The safest action for students\/beginners to take remains dropping any missing cases from the analysis. See https:\/\/www.iriseekhout.com\/missing-data\/missing-data-methods\/imputation-methods\/ for a discussion.\" id=\"return-footnote-1499-1\" href=\"#footnote-1499-1\" aria-label=\"Footnote 1\"><sup class=\"footnote\">[1]<\/sup><\/a>. Missing cases have no part in any analysis and using them as they are would inevitably compromise conclusions &#8212; after all, we have no information on what we want to know about them, and we cannot make that information up.<\/p>\n<p>&nbsp;<\/p>\n<p>Generally, how statistical software deal with missing data by default settings may vary. SPSS&#8217;s default is to skip missing cases so that analysis is always based on valid cases only.<\/p>\n<p>&nbsp;<\/p>\n<p>As well, SPSS provides a separate column in <em>Data View<\/em> indicating which values in the data stand for a missing data point. As discussed in <span style=\"color: #000000\">Section 2.1 <\/span>(<a href=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/2-1-data\/\">https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/2-1-data\/<\/a>), you can find the <em>coding<\/em> of the values in the <em>Values<\/em> column in <em>Data View<\/em>.\u00a0\u00a0Clicking the specific cell in that column opens up a window with the values&#8217; code. There you may find several types of missing data, typically values such as &#8220;Valid skip&#8221;\/&#8221;Not applicable&#8221; (the respondent had not been asked the question on which the variable is based due to a previous answer)<a class=\"footnote\" title=\"For example, if a respondent has indicated previously that they didn't smoke, a subsequent question about how often they smoked would make no sense; the respondent then would be &quot;validly skipped&quot; from answering this subsequent question.\" id=\"return-footnote-1499-2\" href=\"#footnote-1499-2\" aria-label=\"Footnote 2\"><sup class=\"footnote\">[2]<\/sup><\/a>, &#8220;Don&#8217;t know&#8221; (the respondent did not know the answer to the question), &#8220;Refusal&#8221; (the respondent refused to answer the question), &#8220;Not stated&#8221; (when the question should have been answered\/ an answer should have been recorded but, for whatever reason, it hasn&#8217;t been), etc.<\/p>\n<p>&nbsp;<\/p>\n<p>Apart from &#8220;Not applicable&#8221;, the codes listed here are standard Statistics Canada codes used in all their datasets and can be found in any Statistics Canada dataset documentation<a class=\"footnote\" title=\"Currently, Statistics Canada uses 6, 96, 996, etc. for &quot;Valid skip&quot;; 7, 97, 997, etc. for &quot;Don't know&quot;; 8, 98, 998, etc. for &quot;Refused&quot;; and 9, 99, 999, etc. for &quot;Not stated&quot;.\" id=\"return-footnote-1499-3\" href=\"#footnote-1499-3\" aria-label=\"Footnote 3\"><sup class=\"footnote\">[3]<\/sup><\/a>.<\/p>\n<p>&nbsp;<\/p>\n<p>So given that we had three cases of missing data within our group of 21 respondents, are the percentages reported in the previous sub-section&#8217;s Table 2.2 in Example 2.2 (C) <em>valid<\/em> to use?<\/p>\n<p>&nbsp;<\/p>\n<div class=\"textbox textbox--learning-objectives\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\"><em><span style=\"color: #ff0000\"><strong>Watch Out!! #4<\/strong><\/span>&#8230; for Findings Based on Missing Data<\/em><\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>&nbsp;<\/p>\n<p>This will be a short warning but it deserves it&#8217;s own scary-red<em> Watch Out!!<\/em> reiteration: do not trust analysis and findings that include missing cases as they would be distorted and unreliable. Missing data is exactly that &#8211; <em>missing<\/em>. It simply does not exist. As a beginner researcher, always make sure you have dropped (i.e., excluded) any missing cases before analyzing your data and reporting any results.<\/p>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>Considering that Table 2.2 did include missing data in the calculation of percentages, let us correct that by modifying it and including another column, <strong><em>valid<\/em> percentages<\/strong>.<\/p>\n<p>&nbsp;<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\"><em>Example 2.2 (D)\u00a0Hypothetical Data on Educational Attainment, Organized and with Relative Frequencies and Valid Percentages Added<\/em><\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p><em>Table 2.3 Educational Attainment by Frequency, Percent and Valid Percent<\/em><\/p>\n<table style=\"border-collapse: collapse;width: 79.1507%;height: 209px\">\n<tbody>\n<tr style=\"height: 15px\">\n<td style=\"width: 10.9909%;text-align: center;height: 59px\"><strong>\u00a0<\/strong><\/td>\n<td style=\"width: 37.8257%;height: 59px;text-align: center\"><strong>Degree<\/strong><\/td>\n<td style=\"width: 4.26491%;height: 59px\">\n<p style=\"text-align: center\"><strong>\u00a0 Frequency<\/strong><\/p>\n<\/td>\n<td style=\"width: 17.6642%;height: 59px\">\n<p style=\"text-align: center\"><strong>Percent<\/strong><\/p>\n<\/td>\n<td style=\"width: 18.7802%;height: 59px\">\n<p style=\"text-align: center\"><strong>Valid Percent<\/strong><\/p>\n<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 10.9909%;height: 15px\">Valid<\/td>\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0No degree<\/td>\n<td style=\"width: 4.26491%;height: 15px;text-align: center\">1<\/td>\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">4.7<\/td>\n<td style=\"width: 18.7802%;text-align: center;height: 15px\">5.6<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0Secondary\/High School<\/td>\n<td style=\"width: 4.26491%;height: 15px;text-align: center\">6<\/td>\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">28.6<\/td>\n<td style=\"width: 18.7802%;text-align: center;height: 15px\">33.3<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0Associate&#8217;s<\/td>\n<td style=\"width: 4.26491%;height: 15px;text-align: center\">3<\/td>\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">14.3<\/td>\n<td style=\"width: 18.7802%;text-align: center;height: 15px\">16.7<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0Bachelor&#8217;s<\/td>\n<td style=\"width: 4.26491%;height: 15px;text-align: center\">5<\/td>\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">23.8<\/td>\n<td style=\"width: 18.7802%;text-align: center;height: 15px\">27.8<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0Master&#8217;s<\/td>\n<td style=\"width: 4.26491%;height: 15px;text-align: center\">2<\/td>\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">9.5<\/td>\n<td style=\"width: 18.7802%;text-align: center;height: 15px\">11.1<\/td>\n<\/tr>\n<tr style=\"height: 16px\">\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0PhD<\/td>\n<td style=\"width: 4.26491%;height: 15px;text-align: center\">1<\/td>\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">4.7<\/td>\n<td style=\"width: 18.7802%;text-align: center;height: 15px\">5.6<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\n<td style=\"width: 37.8257%;height: 15px\"><strong>\u00a0 \u00a0Total Valid<\/strong><\/td>\n<td style=\"width: 4.26491%;text-align: center;height: 15px\"><strong>18<\/strong><\/td>\n<td style=\"width: 17.6642%;text-align: center;height: 15px\"><strong>85.6<\/strong><\/td>\n<td style=\"width: 18.7802%;text-align: center;height: 15px\"><strong>100.0<\/strong><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 10.9909%;height: 15px\">Missing<\/td>\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0Didn&#8217;t answer<\/td>\n<td style=\"width: 4.26491%;text-align: center;height: 15px\">3<\/td>\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">14.3<\/td>\n<td style=\"width: 18.7802%;text-align: center;height: 15px\"><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0Total Missing<\/td>\n<td style=\"width: 4.26491%;text-align: center;height: 15px\">3<\/td>\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">14.3<\/td>\n<td style=\"width: 18.7802%;text-align: center;height: 15px\"><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 10.9909%;height: 15px\"><\/td>\n<td style=\"width: 37.8257%;height: 15px\">\u00a0 \u00a0 TOTAL<\/td>\n<td style=\"width: 4.26491%;text-align: center;height: 15px\">21<\/td>\n<td style=\"width: 17.6642%;text-align: center;height: 15px\">100.0<\/td>\n<td style=\"width: 18.7802%;text-align: center;height: 15px\"><strong>\u00a0<\/strong><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>As you see in the modified Table 2.3 above, I have separated the missing cases from the valid cases (the cases for which we have educational attainment data). <strong>Since we have only 18 valid cases, we should use only those 18 cases for any calculations and analysis &#8212; and not the total of 21 cases<\/strong> (which includes the missing). Thus, instead of having just<\/p>\n<p>&nbsp;<\/p>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 37px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-b99b8301fea2a6b7420b9a205bf3b324_l3.png\" height=\"37\" width=\"309\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#92;&#102;&#114;&#97;&#99;&#123;&#102;&#125;&#123;&#78;&#125;&#40;&#49;&#48;&#48;&#41;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#49;&#125;&#123;&#50;&#49;&#125;&#40;&#49;&#48;&#48;&#41;&#61;&#48;&#46;&#48;&#52;&#55;&#40;&#49;&#48;&#48;&#41;&#61;&#52;&#46;&#55;&#92;&#37;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>along with the rest of the categories&#8217; percentages calculated in this way, we should calculate the categories&#8217; <em>valid<\/em> percentages, discarding he three missing cases, like this:<\/p>\n<p>&nbsp;<\/p>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 37px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-b3ecb24b7a5f5abbec5ab6364fa9ffb5_l3.png\" height=\"37\" width=\"309\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#92;&#102;&#114;&#97;&#99;&#123;&#102;&#125;&#123;&#78;&#125;&#40;&#49;&#48;&#48;&#41;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#49;&#125;&#123;&#49;&#56;&#125;&#40;&#49;&#48;&#48;&#41;&#61;&#48;&#46;&#48;&#53;&#54;&#40;&#49;&#48;&#48;&#41;&#61;&#53;&#46;&#54;&#92;&#37;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>(As usual, I only show you the calculation for the first category as the rest follow in the same way.)<\/p>\n<p>&nbsp;<\/p>\n<p>Despite the fact that we do have the percentages based on missing data in the table, note that these &#8211; <strong>the valid percentages &#8212; are the only percentages you should use in your analysis and report in your findings<\/strong>.<\/p>\n<p>&nbsp;<\/p>\n<p><em>Alright<\/em>, you might say now, <em>we added percentages and valid percentages to the simple frequencies, this surely means we have a complete frequency table by now.<\/em><\/p>\n<p>&nbsp;<\/p>\n<p>Sorry, no, not yet. One thing remains.<\/p>\n<hr class=\"before-footnotes clear\" \/><div class=\"footnotes\"><ol><li id=\"footnote-1499-1\">Depending on the particular data and particular situation, and assuming strong justification, researchers experienced in data analysis may have different options, such as estimation, imputation of means, etc. These, however, are beyond the scope of this text. The safest action for students\/beginners to take remains dropping any missing cases from the analysis. See <a href=\"https:\/\/www.iriseekhout.com\/missing-data\/missing-data-methods\/imputation-methods\/\">https:\/\/www.iriseekhout.com\/missing-data\/missing-data-methods\/imputation-methods\/<\/a> for a discussion.  <a href=\"#return-footnote-1499-1\" class=\"return-footnote\" aria-label=\"Return to footnote 1\">&crarr;<\/a><\/li><li id=\"footnote-1499-2\">For example, if a respondent has indicated previously that they didn't smoke, a subsequent question about how often they smoked would make no sense; the respondent then would be \"validly skipped\" from answering this subsequent question. <a href=\"#return-footnote-1499-2\" class=\"return-footnote\" aria-label=\"Return to footnote 2\">&crarr;<\/a><\/li><li id=\"footnote-1499-3\">Currently, Statistics Canada uses 6, 96, 996, etc. for \"Valid skip\"; 7, 97, 997, etc. for \"Don't know\"; 8, 98, 998, etc. for \"Refused\"; and 9, 99, 999, etc. for \"Not stated\".  <a href=\"#return-footnote-1499-3\" class=\"return-footnote\" aria-label=\"Return to footnote 3\">&crarr;<\/a><\/li><\/ol><\/div>","protected":false},"author":533,"menu_order":5,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-1499","chapter","type-chapter","status-publish","hentry"],"part":323,"_links":{"self":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/1499","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/users\/533"}],"version-history":[{"count":7,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/1499\/revisions"}],"predecessor-version":[{"id":2017,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/1499\/revisions\/2017"}],"part":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/parts\/323"}],"metadata":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/1499\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/media?parent=1499"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapter-type?post=1499"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/contributor?post=1499"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/license?post=1499"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}