{"id":91,"date":"2018-10-31T17:37:23","date_gmt":"2018-10-31T21:37:23","guid":{"rendered":"https:\/\/pressbooks.bccampus.ca\/simplestats\/?post_type=chapter&#038;p=91"},"modified":"2019-10-18T17:51:51","modified_gmt":"2019-10-18T21:51:51","slug":"6-1-populations-and-samples","status":"publish","type":"chapter","link":"https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/6-1-populations-and-samples\/","title":{"raw":"6.1 Populations and Samples","rendered":"6.1 Populations and Samples"},"content":{"raw":"&nbsp;\r\n\r\nBefore we start, yet another word of warning: what follows is only a brief overview of the topic of sampling and types of sampling.\u00a0<span style=\"font-size: 14pt\">What I offer is enough in terms of a necessary background to statistical inference -- but the main learning objective here\u00a0<\/span><em style=\"font-size: 14pt\">is<\/em><span style=\"font-size: 14pt\">\u00a0inference,\u00a0<\/span><em style=\"font-size: 14pt\">not<\/em><span style=\"font-size: 14pt\">\u00a0everything there is to know about sampling methods and their intricacies. Thus, i<\/span><span style=\"text-indent: 1em;font-size: 14pt\">f this is the first time you encounter the concept, you would be better served to read a thorough introduction on sampling and the benefits and downsides of the different sampling methods in virtually any one of the research methods textbooks you can find as that would give a more comprehensive treatment that I do here.\u00a0<\/span>\r\n\r\n&nbsp;\r\n\r\nWith that in mind, onward to the preliminaries: populations and samples.\r\n\r\n&nbsp;\r\n\r\nIn the introduction to this chapter, I asked a question: <em>Do Canadians approve of immigration?<\/em>\u00a0How, do you think, can we go about answering it?\r\n\r\n&nbsp;\r\n\r\nPresumably, the simplest way to investigate this would be <em>to simply ask -- <\/em>imagine we contacted everyone and, indeed,\u00a0 simply asked them whatever version of the question we have decided on (i.e., whichever way we have operationalized our variable, <em>attitudes to immigration<\/em>), noting everyone's responses. Many governments, both historically and to this day, have employed and still employ this method for gathering information.\r\n\r\n&nbsp;\r\n\r\n<strong>When we gather information from everyone in whom we are interested, we are doing a <em>census<\/em>.<\/strong> You probably know that the Government of Canada, through Statistics Canada, conducts a census of the Canadian population every five years. (You might have even filled the form yourself, if you are of age, or seen your parents do it, otherwise.) Then, can the government (or any researcher\/agency for that matter) collect information about everything it might need or want through censuses, every time the information is required?\r\n\r\n&nbsp;\r\n\r\nTheoretically, it's an option. In practice, no way: it would be prohibitively expensive. You might find the reason prosaic, but any research is limited by the availability of resources, money <em>and<\/em> time. Asking one additional question on a questionnaire to one additional person has costs, which add up quickly the more questions and the more people are included in the study. Thus, censuses of the population are enormous undertakings reserved for collecting only <em>really<\/em> important (typically demographic) information, and are usually quite limited in scope.[footnote]For more information on the Canadian census program see here: https:\/\/www12.statcan.gc.ca\/census-recensement\/index-eng.cfm[\/footnote][footnote]Censuses of the population are so expensive, some governments cannot afford to do them (or at least not regularly) and instead rely on survey data from samples. As well, in some places censuses can be fraught with controversies due to racial\/ethnic and\/or religious tensions, etc. and are therefore avoided. (REFERENCE Weeks 2015).[\/footnote]\r\n\r\n&nbsp;\r\n\r\n<span style=\"text-indent: 1em;font-size: 14pt\">Given that conducting censuses for everything anyone (researches, governments, etc.) might want information on is generally impractical\/unfeasible, what can be done when information about a population is needed? <\/span>\r\n\r\n&nbsp;\r\n\r\n<span style=\"text-indent: 1em;font-size: 14pt\">Here is where statistics saves the day: with probability theory and inferential statistics, we can use the next best thing to a census -- <em>random-sample surveys! <\/em><\/span><span style=\"text-indent: 1em;font-size: 14pt\">My job in this chapter will be to convince you that you don't need to do a census of the population you want to study as long as you have a well-selected sample.<\/span>\r\n\r\n&nbsp;\r\n\r\nYou, undoubtedly, have taken a survey at some point in your life in one form or another: a survey for which you were selected\/invited or you volunteered; which included other people but definitely not <em>everyone<\/em>. In other words, unless we are discussing a census, surveys typically are administered to <em>samples<\/em>\u00a0(i.e., sub-groups) of the population. However, not all surveys are created equal: those that can \"substitute\" for the population, as it were, rely on the just-mentioned technique of <em>random sampling<\/em>.\r\n\r\n&nbsp;\r\n\r\nBut first off, let's establish what samples and populations really are. While it's intuitive to think of <em>population<\/em> as the population of a country (say, 36.7 mln. Canadians), and of <em>sample<\/em> as a sub-group of that population (say, ten thousand Canadians), this is only a special case of the general terms <em>sample<\/em> and <em>population<\/em>. <strong>In research, a <em>population<\/em> is a group encompassing everyone on whom we want information, i.e. everyone (or everything) we want to study.<\/strong> Considering that we might not be studying people (recall that the units of analysis can be countries, organizations, etc.), we say that\u00a0<strong>a population encompasses all elements under study<\/strong>. This means that we could have study populations such as \"countries in South America\", or \"hospitals and medical clinics in Toronto\", or \"departments of sociology in Canadian universities\", etc.\r\n\r\n&nbsp;\r\n\r\nAs well, while the elements may be people, instead of the whole population of a country, we might be interested in studying \"university students in Canada,\" or \"early childhood educators in British Columbia,\" or \"dog walkers in downtown Vancouver,\" or \"Telus company employees,\" or \"dentists in Surrey, BC,\" etc. All of these examples are of populations that can be defined as such by researchers interested in them.\r\n\r\n&nbsp;\r\n\r\nThus,<strong> a <em>sample<\/em> is any sub-group of the population under study<\/strong>. For example, if I decide to study \"KPU students\", my study population would be defined as \"everyone registered as a student at KPU\". If I select a hundred students for my study, I would have a sample of <em>N<\/em>=100.\r\n\r\n&nbsp;\r\n\r\nUltimately, again, <strong>what the population for a particular study is depends on what the researcher wants to study<\/strong>.\r\n\r\n&nbsp;\r\n\r\nIf we go back to the <em>Do Canadians approve of immigration?<\/em> example, the population under study would be, of course, \"Canadians\" but we have to be very careful how we define \"Canadians\": Are we interested in <em>all<\/em> Canadians, regardless of where they live\/are at the moment? (I.e., do we include ex-pats, people with dual citizenship residing abroad, Canadian tourists travelling the world, etc.?) Or do we only want to study Canadians <em>in Canada<\/em>? And do we want to study permanent residents in Canada too or only people with Canadian passports?\u00a0 Regardless of how we want to define our study population, it has to be precise and to have objective criteria that we follow consistently.\r\n\r\n&nbsp;\r\n\r\nOnce a researcher has decided on and defined a study population, and collecting data on all elements of that population is considered unfeasible (and, as you will eventually see, collecting data on all elements of the population might be even undesirable as its unnecessary, even if it were feasible), the researcher needs to select a sample for their study.\r\n\r\n&nbsp;\r\n\r\n<strong>The procedure of selecting a sample is called <em>sampling.<\/em>There are two broad types of sampling, <em>non-random<\/em> and <em>random,<\/em><\/strong> and the next section is devoted to that.","rendered":"<p>&nbsp;<\/p>\n<p>Before we start, yet another word of warning: what follows is only a brief overview of the topic of sampling and types of sampling.\u00a0<span style=\"font-size: 14pt\">What I offer is enough in terms of a necessary background to statistical inference &#8212; but the main learning objective here\u00a0<\/span><em style=\"font-size: 14pt\">is<\/em><span style=\"font-size: 14pt\">\u00a0inference,\u00a0<\/span><em style=\"font-size: 14pt\">not<\/em><span style=\"font-size: 14pt\">\u00a0everything there is to know about sampling methods and their intricacies. Thus, i<\/span><span style=\"text-indent: 1em;font-size: 14pt\">f this is the first time you encounter the concept, you would be better served to read a thorough introduction on sampling and the benefits and downsides of the different sampling methods in virtually any one of the research methods textbooks you can find as that would give a more comprehensive treatment that I do here.\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>With that in mind, onward to the preliminaries: populations and samples.<\/p>\n<p>&nbsp;<\/p>\n<p>In the introduction to this chapter, I asked a question: <em>Do Canadians approve of immigration?<\/em>\u00a0How, do you think, can we go about answering it?<\/p>\n<p>&nbsp;<\/p>\n<p>Presumably, the simplest way to investigate this would be <em>to simply ask &#8212; <\/em>imagine we contacted everyone and, indeed,\u00a0 simply asked them whatever version of the question we have decided on (i.e., whichever way we have operationalized our variable, <em>attitudes to immigration<\/em>), noting everyone&#8217;s responses. Many governments, both historically and to this day, have employed and still employ this method for gathering information.<\/p>\n<p>&nbsp;<\/p>\n<p><strong>When we gather information from everyone in whom we are interested, we are doing a <em>census<\/em>.<\/strong> You probably know that the Government of Canada, through Statistics Canada, conducts a census of the Canadian population every five years. (You might have even filled the form yourself, if you are of age, or seen your parents do it, otherwise.) Then, can the government (or any researcher\/agency for that matter) collect information about everything it might need or want through censuses, every time the information is required?<\/p>\n<p>&nbsp;<\/p>\n<p>Theoretically, it&#8217;s an option. In practice, no way: it would be prohibitively expensive. You might find the reason prosaic, but any research is limited by the availability of resources, money <em>and<\/em> time. Asking one additional question on a questionnaire to one additional person has costs, which add up quickly the more questions and the more people are included in the study. Thus, censuses of the population are enormous undertakings reserved for collecting only <em>really<\/em> important (typically demographic) information, and are usually quite limited in scope.<a class=\"footnote\" title=\"For more information on the Canadian census program see here: https:\/\/www12.statcan.gc.ca\/census-recensement\/index-eng.cfm\" id=\"return-footnote-91-1\" href=\"#footnote-91-1\" aria-label=\"Footnote 1\"><sup class=\"footnote\">[1]<\/sup><\/a><a class=\"footnote\" title=\"Censuses of the population are so expensive, some governments cannot afford to do them (or at least not regularly) and instead rely on survey data from samples. As well, in some places censuses can be fraught with controversies due to racial\/ethnic and\/or religious tensions, etc. and are therefore avoided. (REFERENCE Weeks 2015).\" id=\"return-footnote-91-2\" href=\"#footnote-91-2\" aria-label=\"Footnote 2\"><sup class=\"footnote\">[2]<\/sup><\/a><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"text-indent: 1em;font-size: 14pt\">Given that conducting censuses for everything anyone (researches, governments, etc.) might want information on is generally impractical\/unfeasible, what can be done when information about a population is needed? <\/span><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"text-indent: 1em;font-size: 14pt\">Here is where statistics saves the day: with probability theory and inferential statistics, we can use the next best thing to a census &#8212; <em>random-sample surveys! <\/em><\/span><span style=\"text-indent: 1em;font-size: 14pt\">My job in this chapter will be to convince you that you don&#8217;t need to do a census of the population you want to study as long as you have a well-selected sample.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>You, undoubtedly, have taken a survey at some point in your life in one form or another: a survey for which you were selected\/invited or you volunteered; which included other people but definitely not <em>everyone<\/em>. In other words, unless we are discussing a census, surveys typically are administered to <em>samples<\/em>\u00a0(i.e., sub-groups) of the population. However, not all surveys are created equal: those that can &#8220;substitute&#8221; for the population, as it were, rely on the just-mentioned technique of <em>random sampling<\/em>.<\/p>\n<p>&nbsp;<\/p>\n<p>But first off, let&#8217;s establish what samples and populations really are. While it&#8217;s intuitive to think of <em>population<\/em> as the population of a country (say, 36.7 mln. Canadians), and of <em>sample<\/em> as a sub-group of that population (say, ten thousand Canadians), this is only a special case of the general terms <em>sample<\/em> and <em>population<\/em>. <strong>In research, a <em>population<\/em> is a group encompassing everyone on whom we want information, i.e. everyone (or everything) we want to study.<\/strong> Considering that we might not be studying people (recall that the units of analysis can be countries, organizations, etc.), we say that\u00a0<strong>a population encompasses all elements under study<\/strong>. This means that we could have study populations such as &#8220;countries in South America&#8221;, or &#8220;hospitals and medical clinics in Toronto&#8221;, or &#8220;departments of sociology in Canadian universities&#8221;, etc.<\/p>\n<p>&nbsp;<\/p>\n<p>As well, while the elements may be people, instead of the whole population of a country, we might be interested in studying &#8220;university students in Canada,&#8221; or &#8220;early childhood educators in British Columbia,&#8221; or &#8220;dog walkers in downtown Vancouver,&#8221; or &#8220;Telus company employees,&#8221; or &#8220;dentists in Surrey, BC,&#8221; etc. All of these examples are of populations that can be defined as such by researchers interested in them.<\/p>\n<p>&nbsp;<\/p>\n<p>Thus,<strong> a <em>sample<\/em> is any sub-group of the population under study<\/strong>. For example, if I decide to study &#8220;KPU students&#8221;, my study population would be defined as &#8220;everyone registered as a student at KPU&#8221;. If I select a hundred students for my study, I would have a sample of <em>N<\/em>=100.<\/p>\n<p>&nbsp;<\/p>\n<p>Ultimately, again, <strong>what the population for a particular study is depends on what the researcher wants to study<\/strong>.<\/p>\n<p>&nbsp;<\/p>\n<p>If we go back to the <em>Do Canadians approve of immigration?<\/em> example, the population under study would be, of course, &#8220;Canadians&#8221; but we have to be very careful how we define &#8220;Canadians&#8221;: Are we interested in <em>all<\/em> Canadians, regardless of where they live\/are at the moment? (I.e., do we include ex-pats, people with dual citizenship residing abroad, Canadian tourists travelling the world, etc.?) Or do we only want to study Canadians <em>in Canada<\/em>? And do we want to study permanent residents in Canada too or only people with Canadian passports?\u00a0 Regardless of how we want to define our study population, it has to be precise and to have objective criteria that we follow consistently.<\/p>\n<p>&nbsp;<\/p>\n<p>Once a researcher has decided on and defined a study population, and collecting data on all elements of that population is considered unfeasible (and, as you will eventually see, collecting data on all elements of the population might be even undesirable as its unnecessary, even if it were feasible), the researcher needs to select a sample for their study.<\/p>\n<p>&nbsp;<\/p>\n<p><strong>The procedure of selecting a sample is called <em>sampling.<\/em>There are two broad types of sampling, <em>non-random<\/em> and <em>random,<\/em><\/strong> and the next section is devoted to that.<\/p>\n<hr class=\"before-footnotes clear\" \/><div class=\"footnotes\"><ol><li id=\"footnote-91-1\">For more information on the Canadian census program see here: https:\/\/www12.statcan.gc.ca\/census-recensement\/index-eng.cfm <a href=\"#return-footnote-91-1\" class=\"return-footnote\" aria-label=\"Return to footnote 1\">&crarr;<\/a><\/li><li id=\"footnote-91-2\">Censuses of the population are so expensive, some governments cannot afford to do them (or at least not regularly) and instead rely on survey data from samples. As well, in some places censuses can be fraught with controversies due to racial\/ethnic and\/or religious tensions, etc. and are therefore avoided. (REFERENCE Weeks 2015). <a href=\"#return-footnote-91-2\" class=\"return-footnote\" aria-label=\"Return to footnote 2\">&crarr;<\/a><\/li><\/ol><\/div>","protected":false},"author":533,"menu_order":1,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-91","chapter","type-chapter","status-publish","hentry"],"part":32,"_links":{"self":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/91","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/users\/533"}],"version-history":[{"count":15,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/91\/revisions"}],"predecessor-version":[{"id":2046,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/91\/revisions\/2046"}],"part":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/parts\/32"}],"metadata":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/91\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/media?parent=91"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapter-type?post=91"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/contributor?post=91"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/license?post=91"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}