{"id":130,"date":"2018-10-31T18:08:14","date_gmt":"2018-10-31T22:08:14","guid":{"rendered":"https:\/\/pressbooks.bccampus.ca\/simplestats\/?post_type=chapter&#038;p=130"},"modified":"2019-11-15T18:37:45","modified_gmt":"2019-11-15T23:37:45","slug":"10-1-correlation","status":"publish","type":"chapter","link":"https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/10-1-correlation\/","title":{"raw":"10.1. Correlation","rendered":"10.1. Correlation"},"content":{"raw":"[latexpage]\r\n\r\nYou will recall from Section 7.2.3 that we use the coefficient of correlation (Pearson's) <em>r<\/em> to examine associations between two continuous variables. The correlation coefficient <em>r<\/em> varies between -1 and 1. The closer it is to either, the stronger the correlation, and the closer it is to 0, the weaker the correlation[footnote]The sign of <em>r<\/em> is there <em>only<\/em> to indicate the direction of the association: positive or negative, nothing else. Thus this is a reminder not to use <em>r<\/em>'s sign as a measure of magnitude or strength of the association. Thus, for example, -0.9 is a stronger association than 0.2 because -0.9 is closer to -1 than 0.2 is to 1. (In fact, 0.2 is much closer to 0, or no association.) That is, a strong negative correlation is <em>stronger<\/em> than a weak positive one, despite that -0.9&lt;0.2.[\/footnote].\r\n\r\n&nbsp;\r\n\r\nWhere does <em>r<\/em> come from though? What does it actually measure? I doubt you have lost sleep wondering about these questions which I left unanswered in Chapter 7, but here is your chance to learn this anyway (think of it as closure of sorts).\r\n\r\n&nbsp;\r\n\r\nThe correlation coefficient is<span style=\"text-indent: 18.6667px;font-size: 14pt\">, essentially,<\/span><span style=\"text-indent: 1em;font-size: 14pt\">\u00a0a ratio of the variabilities of the two variables[footnote]To be precise, the ratio is between the covariance of <em>x<\/em> and <em>y<\/em>\u00a0(i.e., their joint variability, <em>s<sub>xy<\/sub><\/em>) and the product of their separate variances<em> s<sub>x<\/sub><\/em> and <em>s<sub>y<\/sub><\/em>:<\/span>\r\n\r\n&nbsp;\r\n\r\n$$r=\\frac{s_{xy}}{s_x s_y}$$ or\r\n\r\n&nbsp;\r\n\r\n$$\\rho=\\frac{\\sigma_{xy}}{\\sigma_x \\sigma_y}$$ if we apply it to a population instead of a sample. (Here\u00a0<em>\u03c1<\/em> is the small-case Greek letter <em>r<\/em>, pronounced [ROH].)\r\n\r\n<span style=\"text-indent: 1em;font-size: 14pt\">[\/footnote]. <\/span>\r\n\r\n<span style=\"text-indent: 1em;font-size: 14pt\"><strong>The easiest way to calculate\u00a0<\/strong><\/span><strong><em style=\"text-indent: 1em;font-size: 14pt\">r<\/em><\/strong><span style=\"text-indent: 1em;font-size: 14pt\"><strong>\u00a0between a variable<em> x<\/em> and a variable<em> y<\/em> is through the distances of the observations from the means of the two variables, or more accurately, the sums of squares<\/strong>[footnote]Recall that the sum of squares was the numerator in the formulas for the variance and the standard deviation. We take the distances of the observations from the mean, square them, and them add them altogether. (We square them <\/span><em style=\"text-indent: 1em;font-size: 14pt\">before<\/em><span style=\"text-indent: 1em;font-size: 14pt\"> adding t<\/span><span style=\"text-indent: 18.6667px;font-size: 14pt\">o turn them all positive, otherwise they'd cancel each other upon summation. See Section 4.3 (<a href=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/4-3-variance\/\">https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/4-3-variance\/<\/a>) for details.)<\/span><span style=\"text-indent: 1em;font-size: 14pt\">\u00a0[\/footnote]:\u00a0<\/span>\r\n\r\n&nbsp;\r\n\r\n$$r=\\frac{\\Sigma{(x-\\overline{x})(y-\\overline{y})}}{\\sqrt{\\Sigma{(x-\\overline{x})^2}\\Sigma{(y-\\overline{y})^2}}}$$\r\n\r\n&nbsp;\r\n\r\nFrom Section 4.3, we know that $\\Sigma{(x-\\overline{x})}^2$ is the sum of squares of the variable <em>x\u00a0<\/em><span style=\"text-indent: 18.6667px;font-size: 14pt\">(so, <em>SS<sub>x<\/sub><\/em>)<\/span><span style=\"text-indent: 1em;font-size: 14pt\">; by analogy,\u00a0<\/span><span style=\"text-indent: 1em;font-size: 14pt\"> $\\Sigma{(y-\\overline{y})}^2$ will be the sum of squares of the variable <em>y<\/em> (so, <em>SS<sub>y<\/sub><\/em>). When the distances between an observation and the two means are \"cross-multiplied\" before summing (like in the numerator), they are called the sum of products (<em>SP<sub>xy<\/sub><\/em>).\u00a0<\/span>\r\n\r\n&nbsp;\r\n\r\nThus we can restate the formula above in the following simplified (and easier to remember) way<span style=\"text-indent: 1em;font-size: 14pt\">[footnote]Note that other \"versions\" of the formula for<\/span><span style=\"text-indent: 1em;font-size: 14pt\">\u00a0<\/span><em style=\"text-indent: 1em;font-size: 14pt\">r<\/em><span style=\"text-indent: 1em;font-size: 14pt\">\u00a0<\/span><span style=\"text-indent: 1em;font-size: 14pt\">exist. All of them calculate the same <em>r<\/em>, but are just restated in different term. The two \"versions\" presented in the text above are the simplest. For example, one of the most common ways to express <em>r<\/em> you may find elsewhere (but which is rather hard on the eyes and for purposes of calculation by hand) is this:<\/span>\r\n\r\n&nbsp;\r\n\r\n<span style=\"text-indent: 1em;font-size: 14pt\"><span style=\"text-indent: 1em;font-size: 14pt\">$$r=\\frac{N\\Sigma{xy} -\\Sigma{x}\\Sigma{y}}{\\sqrt{N\\Sigma{x^2}-(\\Sigma{x})^2)(N\\Sigma{y^2}-(\\Sigma{y})^2}}$$<\/span><\/span>\r\n\r\n<span style=\"text-indent: 1em;font-size: 14pt\"><span style=\"text-indent: 1em;font-size: 14pt\">[\/footnote]<\/span>:<\/span>\r\n\r\n&nbsp;\r\n\r\n$$r=\\frac{SP_{xy}}{\\sqrt{SS_x SS_y}}$$\r\n\r\n&nbsp;\r\n\r\nExample 10.1(A) provides an empirical application of <em>r<\/em>'s calculation.\r\n\r\n&nbsp;\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\"><em>Example 10.1(A) Education and Parental Education, GSS 2018<\/em><\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n&nbsp;\r\n\r\nTable 10.1 lists the years of schooling (our variable <em>y<\/em>) of seven respondents in the <em>GSS 2018<\/em> (NORC, 2019) and the years of schooling of their respective fathers (our variable <em>x<\/em>)[footnote]Here <em>parental education<\/em> is the independent variable and <em>respondent's education<\/em> is the dependent variable, so they are denoted as <em>x<\/em> and <em>y<\/em>, respectively, according to convention. [\/footnote]. While inference with <em>N<\/em>=7 is not a serious proposition, the small observation count allows for a quick calculation for demonstration purposes only. (After all, we already know the correlation coefficient of these exact same two variables from Section 7.2.3; there the SPSS-calculated <em>r <\/em>was equal to\u00a00.413.)\r\n\r\n&nbsp;\r\n\r\nThe rest of the columns in Table 10.1 list the necessary computations (obtaining distances from the mean, squaring distances, summing distances, etc.) to produce <em>SS<sub>x<\/sub>, SS<sub>y<\/sub><\/em>, and <em>SP<sub>xy<\/sub>.<\/em>\r\n\r\n&nbsp;\r\n\r\n<em>Table 10.1 Calculating Pearson's r<\/em>\r\n<table style=\"border-collapse: collapse;width: 100%;height: 289px\" border=\"0\">\r\n<tbody>\r\n<tr style=\"height: 30px\">\r\n<td style=\"width: 1.41643%;height: 30px;text-align: center\">$x$<\/td>\r\n<td style=\"width: 2.31468%;height: 30px;text-align: center\">$y$<\/td>\r\n<td style=\"width: 21.1225%;height: 30px;text-align: center\">\u00a0$(x-\\overline{x})$<\/td>\r\n<td style=\"width: 18.0203%;height: 30px;text-align: center\">$(x-\\overline{x})^2$<\/td>\r\n<td style=\"width: 21.4724%;height: 30px;text-align: center\">$(y-\\overline{y})$<\/td>\r\n<td style=\"width: 16.8352%;height: 30px;text-align: center\">$(y-\\overline{y})^2$<\/td>\r\n<td style=\"width: 18.8183%;height: 30px;text-align: center\">\u00a0$(x-\\overline{x})(y-\\overline{y})$<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">12<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">8<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\">(12-12.4) = -0.4<\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\">-0.4<sup>2\u00a0<\/sup>= 0.2<\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\">(8-13.6) = 5.6<\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\">5.6<sup>2\u00a0<\/sup>= 31.4<\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\">(-0.4)(5.6)=2.2<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">6<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">12<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\">(6-12.4) = -6.4<\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\">-6.4<sup>2\u00a0<\/sup>= 41<\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\">(12-13.6) = -1.6<\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\">-1.6<sup>2\u00a0<\/sup>= 2.6<\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\">(-6.4)(1.6)=10.2<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">12<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">19<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\">(12-12.4) = -0.4<\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\">-0.4<sup>2\u00a0<\/sup>= 0.2<\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\">(19-13.6) = 5.4<\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\">5.4<sup>2\u00a0<\/sup>= 29.2<\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\">(-0.4)(5.4)=-2.2<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">16<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">16<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\">(16-12.4) = 3.6<\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\">3.6<sup>2\u00a0<\/sup>= 13<\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\">(16-13.6) = 2.4<\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\">2.4<sup>2\u00a0<\/sup>= 5.8<\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\">(3.6)(2.4)=8.6<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">15<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">12<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\">(15-12.4) = 2.6<\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\">2.6<sup>2\u00a0<\/sup>= 6.8<\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\">(12-13.6) = -1.6<\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\">-1.6<sup>2\u00a0<\/sup>= 2.6<\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\">(2.6)(-1.6)=-4.2<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">12<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">12<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\">(12-12.4) = -0.4<\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\">-0.4<sup>2\u00a0<\/sup>= 0.2<\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\">(12-13.6) = -1.6<\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\">-1.6<sup>2\u00a0<\/sup>= 2.6<\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\">(-0.4)(-1.6)=0.6<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">14<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">16<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\">(14-12.4) = 1.6<\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\">1.6<sup>2\u00a0<\/sup>= 2.6<\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\">(16-13.6) = 2.4<\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\">2.4<sup>2\u00a0<\/sup>= 5.8<\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\">(1.6)(2.4)=3.8<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\"><img src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-0d00c2da2b2541a97ae0ac3c10e1504e_l3.svg\" alt=\"\\overline{x}\" \/>12.4<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\"><img src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-01881adf9c51d256ce0a5af82c2e7024_l3.svg\" alt=\"\\overline{y}\" \/>13.6<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><strong><em>SS<sub>x<\/sub><\/em>=63.7<\/strong><\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><strong><em>SS<sub>y<\/sub><\/em>=79.7<\/strong><\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><strong><em>SP<sub>xy<\/sub><\/em>=19.3<\/strong><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\nThen, according to the formula for <em>r<\/em> we have:\r\n\r\n&nbsp;\r\n\r\n$$r=\\frac{SP_{xy}}{\\sqrt{SS_x SS_y}}=\\frac{19.3}{\\sqrt{63.7\\times79.7}}=\\frac{19.3}{71.3}=0.271$$\r\n\r\n&nbsp;\r\n\r\nObviously, this <em>r<\/em>=0.271 is not the same as the SPSS-produced <em>r<\/em>=0.413 we had from Section 7.2.3; in fact, it would be very surprising if they were the same, considering the former is based on <em>N<\/em>=7 while the latter is based on <em>N<\/em>=1,687. The exact value of <em>r<\/em> in the above calculation (<em>r<\/em>=0.271) doesn't matter, and doesn't serve any purpose and shouldn't be interpreted as it exists only as the end result of our demonstration.\r\n\r\n<\/div>\r\n<\/div>\r\n&nbsp;\r\n\r\nFancy trying it out on your own?\r\n\r\n&nbsp;\r\n<div class=\"textbox textbox--exercises\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\"><em>Do It! 10.1 Calculating Pearson's r<\/em><\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n&nbsp;\r\n\r\nHere are 7 more cases from the same <em>GSS 2018<\/em> dataset. Fill out the table fully and produce <em>r<\/em>.\r\n<table style=\"border-collapse: collapse;width: 100%;height: 289px\" border=\"0\">\r\n<tbody>\r\n<tr style=\"height: 30px\">\r\n<td style=\"width: 1.41643%;height: 30px;text-align: center\">$x$<\/td>\r\n<td style=\"width: 2.31468%;height: 30px;text-align: center\">$y$<\/td>\r\n<td style=\"width: 21.1225%;height: 30px;text-align: center\">\u00a0$(x-\\overline{x})$<\/td>\r\n<td style=\"width: 18.0203%;height: 30px;text-align: center\">$(x-\\overline{x})^2$<\/td>\r\n<td style=\"width: 21.4724%;height: 30px;text-align: center\">$(y-\\overline{y})$<\/td>\r\n<td style=\"width: 16.8352%;height: 30px;text-align: center\">$(y-\\overline{y})^2$<\/td>\r\n<td style=\"width: 18.8183%;height: 30px;text-align: center\">\u00a0$(x-\\overline{x})(y-\\overline{y})$<\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">12<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">12<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">12<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">14<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">13<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">13<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">13<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">16<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">14<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">20<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">20<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">16<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">21<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">18<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><\/td>\r\n<\/tr>\r\n<tr style=\"height: 15px\">\r\n<td style=\"width: 1.41643%;height: 15px;text-align: center\"><img src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-0d00c2da2b2541a97ae0ac3c10e1504e_l3.svg\" alt=\"\\overline{x}\" \/>=<\/td>\r\n<td style=\"width: 2.31468%;height: 15px;text-align: center\"><img src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-01881adf9c51d256ce0a5af82c2e7024_l3.svg\" alt=\"\\overline{y}\" \/>=<\/td>\r\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><strong><em>SS<sub>x<\/sub><\/em>=<\/strong><\/td>\r\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\r\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><strong><em>SS<sub>y<\/sub><\/em>=<\/strong><\/td>\r\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><strong><em>SP<sub>xy<\/sub><\/em>=<\/strong><\/td>\r\n<\/tr>\r\n<\/tbody>\r\n<\/table>\r\n<\/div>\r\n<\/div>\r\n&nbsp;\r\n\r\nEven if we dismiss the value of the<em> N<\/em>=7 coefficients and go back to <em>r<\/em>=0.413 based on <em>N<\/em>=1,687, we still want to know if this correlation <em>as observed in the sample<\/em> is statistically significant (i.e., generalizable to the population). Thus, we need to test <em>r<\/em>, and we do that through a <em>t<\/em>-test.\r\n\r\n&nbsp;\r\n\r\n<strong>The <em>t<\/em>-test for Pearson's <em>r<\/em> is<\/strong> given by the following formula:\r\n\r\n&nbsp;\r\n\r\n$$t=\\frac{r\\sqrt{N-2}}{\\sqrt{1-r^2}}$$\r\n\r\n&nbsp;\r\n\r\nwith <em>df<\/em>=<em>N<\/em>-2.\r\n\r\n&nbsp;\r\n<div class=\"textbox textbox--examples\"><header class=\"textbox__header\">\r\n<p class=\"textbox__title\"><em>Example 10.1(B)\u00a0Testing the<\/em>\u00a0<em>Education and Parental Education Correlation, GSS 2018<\/em><\/p>\r\n\r\n<\/header>\r\n<div class=\"textbox__content\">\r\n\r\n&nbsp;\r\n\r\nAs usual, it helps to know what we are testing exactly:\r\n<ul>\r\n \t<li>H<sub>0<\/sub>: There is no correlation between parental and offspring education;\u00a0<em>\u03c1<\/em>=0.<\/li>\r\n \t<li>H<sub>a<\/sub>: There is a correlation between parental and offspring education;\u00a0<em>\u03c1<\/em>\u22600.<\/li>\r\n<\/ul>\r\nThen, for <em>N<\/em>=1,687 and <em>r<\/em>=0.413, we have:\r\n\r\n&nbsp;\r\n\r\n$$t=\\frac{r\\sqrt{N-2}}{\\sqrt{1-r^2}}=\\frac{0.413\\sqrt{1687-2}}{\\sqrt{1-0.413^2}}=\\frac{0.413(41.1)}{0.911}=18.633$$\r\n\r\n&nbsp;\r\n\r\n<strong>With <em>t<\/em>=18.633, <em>df<\/em>=1,685, and <em>p<\/em>=0.00001 (i.e., <em>p<\/em>=0.00001&lt;0.5), we can reject the null hypothesis that parental and offspring education are not correlated. At this time, we have enough evidence to conclude that there is a moderately weak (<em>r<\/em>=0.413), statistically significant correlation between parental education and offspring education in the US population<\/strong>[footnote]Purely for demonstration purposes, we could also calculate the<em> t<\/em> for the 7 respondents whose responses we used to calculate <em>r<\/em>=0.271:\r\n\r\n&nbsp;\r\n\r\n$$t=\\frac{r\\sqrt{N-2}}{\\sqrt{1-r^2}}=\\frac{0.271\\sqrt{7-2}}{\\sqrt{1-0.271^2}}=\\frac{0.271(5)}{0.963}=1.407$$\r\n\r\n&nbsp;\r\n\r\nIn this case, we could interpret the results like this: \"With <em>t<\/em>=1.407, <em>df<\/em>=5, and <em>p<\/em>=0.218 (i.e., <em>p<\/em>=0.218&gt;0.5), we cannot reject the null hypothesis that parental and offspring education are not correlated. At this time, we do not have enough evidence to conclude that there is a statistically significant correlation between parental education and offspring education in the US population.\" However, we cannot trust this \"inference\" as it is only based on <em>N<\/em>=7.[\/footnote]<strong>.<\/strong>\r\n\r\n&nbsp;\r\n\r\n<\/div>\r\n<\/div>\r\n&nbsp;\r\n\r\nWith this, we can have established (with 99% certainty) that parental education and offspring education are correlated. Considering that parents tend to have their schooling done before their children have theirs, on average, it is also reasonable to assume that parental education affects offspring education (and not vice versa)[footnote]In terms of establishing causality, we are limited by the bivariate case we have: it is entirely possible (and expected) that other things affect offspring education too, not just their parents' education. As well, it is possible than something else (for example, wealth, income, socioecoomic class, etc.) might be affecting both parental and offspring education, rendering the effect of parental educaion on offspring education spurious. These type of considerations are exactly the purpose of mutlivariate analysis, but since we are dealing with bivariate analysis here, we have to leave these considerations aside. I bring them up here to remind you not to forget them in the discussion that follows, which will focus on the two variables at hand. [\/footnote].\r\n\r\n&nbsp;\r\n\r\nWouldn't then be good to know <em>exactly how much<\/em> effect parental education has on offspring education? That is, wouldn't you like to know that if a father had one more year of schooling compared to another father, how much more schooling the child of the former would be expected to have compared to the child of the latter? One type of regression -- called <em>linear regression<\/em> -- can tell us just that.","rendered":"<p>You will recall from Section 7.2.3 that we use the coefficient of correlation (Pearson&#8217;s) <em>r<\/em> to examine associations between two continuous variables. The correlation coefficient <em>r<\/em> varies between -1 and 1. The closer it is to either, the stronger the correlation, and the closer it is to 0, the weaker the correlation<a class=\"footnote\" title=\"The sign of r is there only to indicate the direction of the association: positive or negative, nothing else. Thus this is a reminder not to use r's sign as a measure of magnitude or strength of the association. Thus, for example, -0.9 is a stronger association than 0.2 because -0.9 is closer to -1 than 0.2 is to 1. (In fact, 0.2 is much closer to 0, or no association.) That is, a strong negative correlation is stronger than a weak positive one, despite that -0.9&lt;0.2.\" id=\"return-footnote-130-1\" href=\"#footnote-130-1\" aria-label=\"Footnote 1\"><sup class=\"footnote\">[1]<\/sup><\/a>.<\/p>\n<p>&nbsp;<\/p>\n<p>Where does <em>r<\/em> come from though? What does it actually measure? I doubt you have lost sleep wondering about these questions which I left unanswered in Chapter 7, but here is your chance to learn this anyway (think of it as closure of sorts).<\/p>\n<p>&nbsp;<\/p>\n<p>The correlation coefficient is<span style=\"text-indent: 18.6667px;font-size: 14pt\">, essentially,<\/span><span style=\"text-indent: 1em;font-size: 14pt\">\u00a0a ratio of the variabilities of the two variables<a class=\"footnote\" title=\"To be precise, the ratio is between the covariance of x and y\u00a0(i.e., their joint variability, sxy) and the product of their separate variances sx and sy:\n\n\u00a0\n\n \u00a0  \u00a0  or\n\n\u00a0\n\n \u00a0  \u00a0  if we apply it to a population instead of a sample. (Here\u00a0\u03c1 is the small-case Greek letter r, pronounced [ROH].)\" id=\"return-footnote-130-2\" href=\"#footnote-130-2\" aria-label=\"Footnote 2\"><sup class=\"footnote\">[2]<\/sup><\/a>. <\/span><\/p>\n<p><span style=\"text-indent: 1em;font-size: 14pt\"><strong>The easiest way to calculate\u00a0<\/strong><\/span><strong><em style=\"text-indent: 1em;font-size: 14pt\">r<\/em><\/strong><span style=\"text-indent: 1em;font-size: 14pt\"><strong>\u00a0between a variable<em> x<\/em> and a variable<em> y<\/em> is through the distances of the observations from the means of the two variables, or more accurately, the sums of squares<\/strong><a class=\"footnote\" title=\"Recall that the sum of squares was the numerator in the formulas for the variance and the standard deviation. We take the distances of the observations from the mean, square them, and them add them altogether. (We square them before adding to turn them all positive, otherwise they'd cancel each other upon summation. See Section 4.3 (https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/4-3-variance\/) for details.)\u00a0\" id=\"return-footnote-130-3\" href=\"#footnote-130-3\" aria-label=\"Footnote 3\"><sup class=\"footnote\">[3]<\/sup><\/a>:\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 46px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-3d062e8d716435979d8b0f86b15ee662_l3.png\" height=\"46\" width=\"203\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#114;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#83;&#105;&#103;&#109;&#97;&#123;&#40;&#120;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;&#40;&#121;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#121;&#125;&#41;&#125;&#125;&#123;&#92;&#115;&#113;&#114;&#116;&#123;&#92;&#83;&#105;&#103;&#109;&#97;&#123;&#40;&#120;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;&#94;&#50;&#125;&#92;&#83;&#105;&#103;&#109;&#97;&#123;&#40;&#121;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#121;&#125;&#41;&#94;&#50;&#125;&#125;&#125;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>From Section 4.3, we know that <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-6e211dac765eb1be909c7bacb83f987f_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#83;&#105;&#103;&#109;&#97;&#123;&#40;&#120;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;&#125;&#94;&#50;\" title=\"Rendered by QuickLaTeX.com\" height=\"21\" width=\"76\" style=\"vertical-align: -4px;\" \/> is the sum of squares of the variable <em>x\u00a0<\/em><span style=\"text-indent: 18.6667px;font-size: 14pt\">(so, <em>SS<sub>x<\/sub><\/em>)<\/span><span style=\"text-indent: 1em;font-size: 14pt\">; by analogy,\u00a0<\/span><span style=\"text-indent: 1em;font-size: 14pt\"> <img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-ef04066d1a99d3c6ffef66b2cb205d62_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#92;&#83;&#105;&#103;&#109;&#97;&#123;&#40;&#121;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#121;&#125;&#41;&#125;&#94;&#50;\" title=\"Rendered by QuickLaTeX.com\" height=\"21\" width=\"74\" style=\"vertical-align: -4px;\" \/> will be the sum of squares of the variable <em>y<\/em> (so, <em>SS<sub>y<\/sub><\/em>). When the distances between an observation and the two means are &#8220;cross-multiplied&#8221; before summing (like in the numerator), they are called the sum of products (<em>SP<sub>xy<\/sub><\/em>).\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>Thus we can restate the formula above in the following simplified (and easier to remember) way<span style=\"text-indent: 1em;font-size: 14pt\"><a class=\"footnote\" title=\"Note that other &quot;versions&quot; of the formula for\u00a0r\u00a0exist. All of them calculate the same r, but are just restated in different term. The two &quot;versions&quot; presented in the text above are the simplest. For example, one of the most common ways to express r you may find elsewhere (but which is rather hard on the eyes and for purposes of calculation by hand) is this:\n\n\u00a0\n\n \u00a0  \u00a0\" id=\"return-footnote-130-4\" href=\"#footnote-130-4\" aria-label=\"Footnote 4\"><sup class=\"footnote\">[4]<\/sup><\/a><\/span>:<\/span><\/p>\n<p>&nbsp;<\/p>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 45px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-69aedafe8a00c2fb3bd1e584f474f73d_l3.png\" height=\"45\" width=\"115\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#114;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#83;&#80;&#95;&#123;&#120;&#121;&#125;&#125;&#123;&#92;&#115;&#113;&#114;&#116;&#123;&#83;&#83;&#95;&#120;&#32;&#83;&#83;&#95;&#121;&#125;&#125;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>Example 10.1(A) provides an empirical application of <em>r<\/em>&#8216;s calculation.<\/p>\n<p>&nbsp;<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\"><em>Example 10.1(A) Education and Parental Education, GSS 2018<\/em><\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>&nbsp;<\/p>\n<p>Table 10.1 lists the years of schooling (our variable <em>y<\/em>) of seven respondents in the <em>GSS 2018<\/em> (NORC, 2019) and the years of schooling of their respective fathers (our variable <em>x<\/em>)<a class=\"footnote\" title=\"Here parental education is the independent variable and respondent's education is the dependent variable, so they are denoted as x and y, respectively, according to convention.\" id=\"return-footnote-130-5\" href=\"#footnote-130-5\" aria-label=\"Footnote 5\"><sup class=\"footnote\">[5]<\/sup><\/a>. While inference with <em>N<\/em>=7 is not a serious proposition, the small observation count allows for a quick calculation for demonstration purposes only. (After all, we already know the correlation coefficient of these exact same two variables from Section 7.2.3; there the SPSS-calculated <em>r <\/em>was equal to\u00a00.413.)<\/p>\n<p>&nbsp;<\/p>\n<p>The rest of the columns in Table 10.1 list the necessary computations (obtaining distances from the mean, squaring distances, summing distances, etc.) to produce <em>SS<sub>x<\/sub>, SS<sub>y<\/sub><\/em>, and <em>SP<sub>xy<\/sub>.<\/em><\/p>\n<p>&nbsp;<\/p>\n<p><em>Table 10.1 Calculating Pearson&#8217;s r<\/em><\/p>\n<table style=\"border-collapse: collapse;width: 100%;height: 289px\">\n<tbody>\n<tr style=\"height: 30px\">\n<td style=\"width: 1.41643%;height: 30px;text-align: center\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-ede05c264bba0eda080918aaa09c4658_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#120;\" title=\"Rendered by QuickLaTeX.com\" height=\"8\" width=\"10\" style=\"vertical-align: 0px;\" \/><\/td>\n<td style=\"width: 2.31468%;height: 30px;text-align: center\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-0af556714940c351c933bba8cf840796_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#121;\" title=\"Rendered by QuickLaTeX.com\" height=\"12\" width=\"9\" style=\"vertical-align: -4px;\" \/><\/td>\n<td style=\"width: 21.1225%;height: 30px;text-align: center\">\u00a0<img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-2be8c227df65e8134dad5fb1b20474f0_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#40;&#120;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;\" title=\"Rendered by QuickLaTeX.com\" height=\"18\" width=\"54\" style=\"vertical-align: -4px;\" \/><\/td>\n<td style=\"width: 18.0203%;height: 30px;text-align: center\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-044ade8b43a3571c6d6e964117980bf0_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#40;&#120;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;&#94;&#50;\" title=\"Rendered by QuickLaTeX.com\" height=\"19\" width=\"62\" style=\"vertical-align: -4px;\" \/><\/td>\n<td style=\"width: 21.4724%;height: 30px;text-align: center\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-293fcf26d7743d8ceeac73d9c706910b_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#40;&#121;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#121;&#125;&#41;\" title=\"Rendered by QuickLaTeX.com\" height=\"18\" width=\"52\" style=\"vertical-align: -4px;\" \/><\/td>\n<td style=\"width: 16.8352%;height: 30px;text-align: center\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-532d7ff9cbe6d68b91e39c1bcc2d6620_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#40;&#121;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#121;&#125;&#41;&#94;&#50;\" title=\"Rendered by QuickLaTeX.com\" height=\"19\" width=\"60\" style=\"vertical-align: -4px;\" \/><\/td>\n<td style=\"width: 18.8183%;height: 30px;text-align: center\">\u00a0<img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-5845ded70fd4fcb506551f41ac83a76a_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#40;&#120;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;&#40;&#121;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#121;&#125;&#41;\" title=\"Rendered by QuickLaTeX.com\" height=\"18\" width=\"108\" style=\"vertical-align: -4px;\" \/><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">12<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">8<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\">(12-12.4) = -0.4<\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\">-0.4<sup>2\u00a0<\/sup>= 0.2<\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\">(8-13.6) = 5.6<\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\">5.6<sup>2\u00a0<\/sup>= 31.4<\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\">(-0.4)(5.6)=2.2<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">6<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">12<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\">(6-12.4) = -6.4<\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\">-6.4<sup>2\u00a0<\/sup>= 41<\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\">(12-13.6) = -1.6<\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\">-1.6<sup>2\u00a0<\/sup>= 2.6<\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\">(-6.4)(1.6)=10.2<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">12<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">19<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\">(12-12.4) = -0.4<\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\">-0.4<sup>2\u00a0<\/sup>= 0.2<\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\">(19-13.6) = 5.4<\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\">5.4<sup>2\u00a0<\/sup>= 29.2<\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\">(-0.4)(5.4)=-2.2<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">16<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">16<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\">(16-12.4) = 3.6<\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\">3.6<sup>2\u00a0<\/sup>= 13<\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\">(16-13.6) = 2.4<\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\">2.4<sup>2\u00a0<\/sup>= 5.8<\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\">(3.6)(2.4)=8.6<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">15<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">12<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\">(15-12.4) = 2.6<\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\">2.6<sup>2\u00a0<\/sup>= 6.8<\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\">(12-13.6) = -1.6<\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\">-1.6<sup>2\u00a0<\/sup>= 2.6<\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\">(2.6)(-1.6)=-4.2<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">12<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">12<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\">(12-12.4) = -0.4<\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\">-0.4<sup>2\u00a0<\/sup>= 0.2<\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\">(12-13.6) = -1.6<\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\">-1.6<sup>2\u00a0<\/sup>= 2.6<\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\">(-0.4)(-1.6)=0.6<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">14<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">16<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\">(14-12.4) = 1.6<\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\">1.6<sup>2\u00a0<\/sup>= 2.6<\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\">(16-13.6) = 2.4<\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\">2.4<sup>2\u00a0<\/sup>= 5.8<\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\">(1.6)(2.4)=3.8<\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\"><img decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-0d00c2da2b2541a97ae0ac3c10e1504e_l3.svg\" alt=\"\\overline{x}\" \/>12.4<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\"><img decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-01881adf9c51d256ce0a5af82c2e7024_l3.svg\" alt=\"\\overline{y}\" \/>13.6<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><strong><em>SS<sub>x<\/sub><\/em>=63.7<\/strong><\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><strong><em>SS<sub>y<\/sub><\/em>=79.7<\/strong><\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><strong><em>SP<sub>xy<\/sub><\/em>=19.3<\/strong><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Then, according to the formula for <em>r<\/em> we have:<\/p>\n<p>&nbsp;<\/p>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 45px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-b021094a805e2f96b68ae071777ec0ac_l3.png\" height=\"45\" width=\"365\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#114;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#83;&#80;&#95;&#123;&#120;&#121;&#125;&#125;&#123;&#92;&#115;&#113;&#114;&#116;&#123;&#83;&#83;&#95;&#120;&#32;&#83;&#83;&#95;&#121;&#125;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#49;&#57;&#46;&#51;&#125;&#123;&#92;&#115;&#113;&#114;&#116;&#123;&#54;&#51;&#46;&#55;&#92;&#116;&#105;&#109;&#101;&#115;&#55;&#57;&#46;&#55;&#125;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#49;&#57;&#46;&#51;&#125;&#123;&#55;&#49;&#46;&#51;&#125;&#61;&#48;&#46;&#50;&#55;&#49;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>Obviously, this <em>r<\/em>=0.271 is not the same as the SPSS-produced <em>r<\/em>=0.413 we had from Section 7.2.3; in fact, it would be very surprising if they were the same, considering the former is based on <em>N<\/em>=7 while the latter is based on <em>N<\/em>=1,687. The exact value of <em>r<\/em> in the above calculation (<em>r<\/em>=0.271) doesn&#8217;t matter, and doesn&#8217;t serve any purpose and shouldn&#8217;t be interpreted as it exists only as the end result of our demonstration.<\/p>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>Fancy trying it out on your own?<\/p>\n<p>&nbsp;<\/p>\n<div class=\"textbox textbox--exercises\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\"><em>Do It! 10.1 Calculating Pearson&#8217;s r<\/em><\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>&nbsp;<\/p>\n<p>Here are 7 more cases from the same <em>GSS 2018<\/em> dataset. Fill out the table fully and produce <em>r<\/em>.<\/p>\n<table style=\"border-collapse: collapse;width: 100%;height: 289px\">\n<tbody>\n<tr style=\"height: 30px\">\n<td style=\"width: 1.41643%;height: 30px;text-align: center\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-ede05c264bba0eda080918aaa09c4658_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#120;\" title=\"Rendered by QuickLaTeX.com\" height=\"8\" width=\"10\" style=\"vertical-align: 0px;\" \/><\/td>\n<td style=\"width: 2.31468%;height: 30px;text-align: center\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-0af556714940c351c933bba8cf840796_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#121;\" title=\"Rendered by QuickLaTeX.com\" height=\"12\" width=\"9\" style=\"vertical-align: -4px;\" \/><\/td>\n<td style=\"width: 21.1225%;height: 30px;text-align: center\">\u00a0<img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-2be8c227df65e8134dad5fb1b20474f0_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#40;&#120;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;\" title=\"Rendered by QuickLaTeX.com\" height=\"18\" width=\"54\" style=\"vertical-align: -4px;\" \/><\/td>\n<td style=\"width: 18.0203%;height: 30px;text-align: center\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-044ade8b43a3571c6d6e964117980bf0_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#40;&#120;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;&#94;&#50;\" title=\"Rendered by QuickLaTeX.com\" height=\"19\" width=\"62\" style=\"vertical-align: -4px;\" \/><\/td>\n<td style=\"width: 21.4724%;height: 30px;text-align: center\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-293fcf26d7743d8ceeac73d9c706910b_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#40;&#121;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#121;&#125;&#41;\" title=\"Rendered by QuickLaTeX.com\" height=\"18\" width=\"52\" style=\"vertical-align: -4px;\" \/><\/td>\n<td style=\"width: 16.8352%;height: 30px;text-align: center\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-532d7ff9cbe6d68b91e39c1bcc2d6620_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#40;&#121;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#121;&#125;&#41;&#94;&#50;\" title=\"Rendered by QuickLaTeX.com\" height=\"19\" width=\"60\" style=\"vertical-align: -4px;\" \/><\/td>\n<td style=\"width: 18.8183%;height: 30px;text-align: center\">\u00a0<img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-5845ded70fd4fcb506551f41ac83a76a_l3.png\" class=\"ql-img-inline-formula quicklatex-auto-format\" alt=\"&#40;&#120;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#120;&#125;&#41;&#40;&#121;&#45;&#92;&#111;&#118;&#101;&#114;&#108;&#105;&#110;&#101;&#123;&#121;&#125;&#41;\" title=\"Rendered by QuickLaTeX.com\" height=\"18\" width=\"108\" style=\"vertical-align: -4px;\" \/><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">12<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">12<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">12<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">14<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">13<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">13<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">13<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">16<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">14<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">20<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">20<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">16<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\">21<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\">18<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><\/td>\n<\/tr>\n<tr style=\"height: 15px\">\n<td style=\"width: 1.41643%;height: 15px;text-align: center\"><img decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-0d00c2da2b2541a97ae0ac3c10e1504e_l3.svg\" alt=\"\\overline{x}\" \/>=<\/td>\n<td style=\"width: 2.31468%;height: 15px;text-align: center\"><img decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-01881adf9c51d256ce0a5af82c2e7024_l3.svg\" alt=\"\\overline{y}\" \/>=<\/td>\n<td style=\"width: 21.1225%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 18.0203%;height: 15px;text-align: center\"><strong><em>SS<sub>x<\/sub><\/em>=<\/strong><\/td>\n<td style=\"width: 21.4724%;height: 15px;text-align: center\"><\/td>\n<td style=\"width: 16.8352%;height: 15px;text-align: center\"><strong><em>SS<sub>y<\/sub><\/em>=<\/strong><\/td>\n<td style=\"width: 18.8183%;height: 15px;text-align: center\"><strong><em>SP<sub>xy<\/sub><\/em>=<\/strong><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>Even if we dismiss the value of the<em> N<\/em>=7 coefficients and go back to <em>r<\/em>=0.413 based on <em>N<\/em>=1,687, we still want to know if this correlation <em>as observed in the sample<\/em> is statistically significant (i.e., generalizable to the population). Thus, we need to test <em>r<\/em>, and we do that through a <em>t<\/em>-test.<\/p>\n<p>&nbsp;<\/p>\n<p><strong>The <em>t<\/em>-test for Pearson&#8217;s <em>r<\/em> is<\/strong> given by the following formula:<\/p>\n<p>&nbsp;<\/p>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 43px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-bb1f5ecb6b9db44148265521aeb52334_l3.png\" height=\"43\" width=\"102\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#116;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#114;&#92;&#115;&#113;&#114;&#116;&#123;&#78;&#45;&#50;&#125;&#125;&#123;&#92;&#115;&#113;&#114;&#116;&#123;&#49;&#45;&#114;&#94;&#50;&#125;&#125;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>with <em>df<\/em>=<em>N<\/em>-2.<\/p>\n<p>&nbsp;<\/p>\n<div class=\"textbox textbox--examples\">\n<header class=\"textbox__header\">\n<p class=\"textbox__title\"><em>Example 10.1(B)\u00a0Testing the<\/em>\u00a0<em>Education and Parental Education Correlation, GSS 2018<\/em><\/p>\n<\/header>\n<div class=\"textbox__content\">\n<p>&nbsp;<\/p>\n<p>As usual, it helps to know what we are testing exactly:<\/p>\n<ul>\n<li>H<sub>0<\/sub>: There is no correlation between parental and offspring education;\u00a0<em>\u03c1<\/em>=0.<\/li>\n<li>H<sub>a<\/sub>: There is a correlation between parental and offspring education;\u00a0<em>\u03c1<\/em>\u22600.<\/li>\n<\/ul>\n<p>Then, for <em>N<\/em>=1,687 and <em>r<\/em>=0.413, we have:<\/p>\n<p>&nbsp;<\/p>\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 43px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-15072c04811ea4217b91458e6ba5f557_l3.png\" height=\"43\" width=\"439\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#116;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#114;&#92;&#115;&#113;&#114;&#116;&#123;&#78;&#45;&#50;&#125;&#125;&#123;&#92;&#115;&#113;&#114;&#116;&#123;&#49;&#45;&#114;&#94;&#50;&#125;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#48;&#46;&#52;&#49;&#51;&#92;&#115;&#113;&#114;&#116;&#123;&#49;&#54;&#56;&#55;&#45;&#50;&#125;&#125;&#123;&#92;&#115;&#113;&#114;&#116;&#123;&#49;&#45;&#48;&#46;&#52;&#49;&#51;&#94;&#50;&#125;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#48;&#46;&#52;&#49;&#51;&#40;&#52;&#49;&#46;&#49;&#41;&#125;&#123;&#48;&#46;&#57;&#49;&#49;&#125;&#61;&#49;&#56;&#46;&#54;&#51;&#51;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\n<p>&nbsp;<\/p>\n<p><strong>With <em>t<\/em>=18.633, <em>df<\/em>=1,685, and <em>p<\/em>=0.00001 (i.e., <em>p<\/em>=0.00001&lt;0.5), we can reject the null hypothesis that parental and offspring education are not correlated. At this time, we have enough evidence to conclude that there is a moderately weak (<em>r<\/em>=0.413), statistically significant correlation between parental education and offspring education in the US population<\/strong><a class=\"footnote\" title=\"Purely for demonstration purposes, we could also calculate the t for the 7 respondents whose responses we used to calculate r=0.271:\n\n\u00a0\n\n \u00a0  \u00a0 \n\n\u00a0\n\nIn this case, we could interpret the results like this: &quot;With t=1.407, df=5, and p=0.218 (i.e., p=0.218&gt;0.5), we cannot reject the null hypothesis that parental and offspring education are not correlated. At this time, we do not have enough evidence to conclude that there is a statistically significant correlation between parental education and offspring education in the US population.&quot; However, we cannot trust this &quot;inference&quot; as it is only based on N=7.\" id=\"return-footnote-130-6\" href=\"#footnote-130-6\" aria-label=\"Footnote 6\"><sup class=\"footnote\">[6]<\/sup><\/a><strong>.<\/strong><\/p>\n<p>&nbsp;<\/p>\n<\/div>\n<\/div>\n<p>&nbsp;<\/p>\n<p>With this, we can have established (with 99% certainty) that parental education and offspring education are correlated. Considering that parents tend to have their schooling done before their children have theirs, on average, it is also reasonable to assume that parental education affects offspring education (and not vice versa)<a class=\"footnote\" title=\"In terms of establishing causality, we are limited by the bivariate case we have: it is entirely possible (and expected) that other things affect offspring education too, not just their parents' education. As well, it is possible than something else (for example, wealth, income, socioecoomic class, etc.) might be affecting both parental and offspring education, rendering the effect of parental educaion on offspring education spurious. These type of considerations are exactly the purpose of mutlivariate analysis, but since we are dealing with bivariate analysis here, we have to leave these considerations aside. I bring them up here to remind you not to forget them in the discussion that follows, which will focus on the two variables at hand.\" id=\"return-footnote-130-7\" href=\"#footnote-130-7\" aria-label=\"Footnote 7\"><sup class=\"footnote\">[7]<\/sup><\/a>.<\/p>\n<p>&nbsp;<\/p>\n<p>Wouldn&#8217;t then be good to know <em>exactly how much<\/em> effect parental education has on offspring education? That is, wouldn&#8217;t you like to know that if a father had one more year of schooling compared to another father, how much more schooling the child of the former would be expected to have compared to the child of the latter? One type of regression &#8212; called <em>linear regression<\/em> &#8212; can tell us just that.<\/p>\n<hr class=\"before-footnotes clear\" \/><div class=\"footnotes\"><ol><li id=\"footnote-130-1\">The sign of <em>r<\/em> is there <em>only<\/em> to indicate the direction of the association: positive or negative, nothing else. Thus this is a reminder not to use <em>r<\/em>'s sign as a measure of magnitude or strength of the association. Thus, for example, -0.9 is a stronger association than 0.2 because -0.9 is closer to -1 than 0.2 is to 1. (In fact, 0.2 is much closer to 0, or no association.) That is, a strong negative correlation is <em>stronger<\/em> than a weak positive one, despite that -0.9&lt;0.2. <a href=\"#return-footnote-130-1\" class=\"return-footnote\" aria-label=\"Return to footnote 1\">&crarr;<\/a><\/li><li id=\"footnote-130-2\">To be precise, the ratio is between the covariance of <em>x<\/em> and <em>y<\/em>\u00a0(i.e., their joint variability, <em>s<sub>xy<\/sub><\/em>) and the product of their separate variances<em> s<sub>x<\/sub><\/em> and <em>s<sub>y<\/sub><\/em>:<\/span>\r\n\r\n&nbsp;\r\n\r\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 38px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-dc38207340b7529d1766c1e33d6d24c8_l3.png\" height=\"38\" width=\"68\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#114;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#115;&#95;&#123;&#120;&#121;&#125;&#125;&#123;&#115;&#95;&#120;&#32;&#115;&#95;&#121;&#125;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p> or\r\n\r\n&nbsp;\r\n\r\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 38px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-63f436ca3c9f17308c8bb42c03d23344_l3.png\" height=\"38\" width=\"72\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#92;&#114;&#104;&#111;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#92;&#115;&#105;&#103;&#109;&#97;&#95;&#123;&#120;&#121;&#125;&#125;&#123;&#92;&#115;&#105;&#103;&#109;&#97;&#95;&#120;&#32;&#92;&#115;&#105;&#103;&#109;&#97;&#95;&#121;&#125;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p> if we apply it to a population instead of a sample. (Here\u00a0<em>\u03c1<\/em> is the small-case Greek letter <em>r<\/em>, pronounced [ROH].)\r\n\r\n<span style=\"text-indent: 1em;font-size: 14pt\"> <a href=\"#return-footnote-130-2\" class=\"return-footnote\" aria-label=\"Return to footnote 2\">&crarr;<\/a><\/li><li id=\"footnote-130-3\">Recall that the sum of squares was the numerator in the formulas for the variance and the standard deviation. We take the distances of the observations from the mean, square them, and them add them altogether. (We square them <\/span><em style=\"text-indent: 1em;font-size: 14pt\">before<\/em><span style=\"text-indent: 1em;font-size: 14pt\"> adding t<\/span><span style=\"text-indent: 18.6667px;font-size: 14pt\">o turn them all positive, otherwise they'd cancel each other upon summation. See Section 4.3 (<a href=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/4-3-variance\/\">https:\/\/pressbooks.bccampus.ca\/simplestats\/chapter\/4-3-variance\/<\/a>) for details.)<\/span><span style=\"text-indent: 1em;font-size: 14pt\">\u00a0 <a href=\"#return-footnote-130-3\" class=\"return-footnote\" aria-label=\"Return to footnote 3\">&crarr;<\/a><\/li><li id=\"footnote-130-4\">Note that other \"versions\" of the formula for<\/span><span style=\"text-indent: 1em;font-size: 14pt\">\u00a0<\/span><em style=\"text-indent: 1em;font-size: 14pt\">r<\/em><span style=\"text-indent: 1em;font-size: 14pt\">\u00a0<\/span><span style=\"text-indent: 1em;font-size: 14pt\">exist. All of them calculate the same <em>r<\/em>, but are just restated in different term. The two \"versions\" presented in the text above are the simplest. For example, one of the most common ways to express <em>r<\/em> you may find elsewhere (but which is rather hard on the eyes and for purposes of calculation by hand) is this:<\/span>\r\n\r\n&nbsp;\r\n\r\n<span style=\"text-indent: 1em;font-size: 14pt\"><span style=\"text-indent: 1em;font-size: 14pt\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-23cb9dfe344c43d3f9e2a0d329312e81_l3.png\" height=\"45\" width=\"290\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#114;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#78;&#92;&#83;&#105;&#103;&#109;&#97;&#123;&#120;&#121;&#125;&#32;&#45;&#92;&#83;&#105;&#103;&#109;&#97;&#123;&#120;&#125;&#92;&#83;&#105;&#103;&#109;&#97;&#123;&#121;&#125;&#125;&#123;&#92;&#115;&#113;&#114;&#116;&#123;&#78;&#92;&#83;&#105;&#103;&#109;&#97;&#123;&#120;&#94;&#50;&#125;&#45;&#40;&#92;&#83;&#105;&#103;&#109;&#97;&#123;&#120;&#125;&#41;&#94;&#50;&#41;&#40;&#78;&#92;&#83;&#105;&#103;&#109;&#97;&#123;&#121;&#94;&#50;&#125;&#45;&#40;&#92;&#83;&#105;&#103;&#109;&#97;&#123;&#121;&#125;&#41;&#94;&#50;&#125;&#125;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/span><\/span>\r\n\r\n<span style=\"text-indent: 1em;font-size: 14pt\"><span style=\"text-indent: 1em;font-size: 14pt\"> <a href=\"#return-footnote-130-4\" class=\"return-footnote\" aria-label=\"Return to footnote 4\">&crarr;<\/a><\/li><li id=\"footnote-130-5\">Here <em>parental education<\/em> is the independent variable and <em>respondent's education<\/em> is the dependent variable, so they are denoted as <em>x<\/em> and <em>y<\/em>, respectively, according to convention.  <a href=\"#return-footnote-130-5\" class=\"return-footnote\" aria-label=\"Return to footnote 5\">&crarr;<\/a><\/li><li id=\"footnote-130-6\">Purely for demonstration purposes, we could also calculate the<em> t<\/em> for the 7 respondents whose responses we used to calculate <em>r<\/em>=0.271:\r\n\r\n&nbsp;\r\n\r\n<p class=\"ql-center-displayed-equation\" style=\"line-height: 43px;\"><span class=\"ql-right-eqno\"> &nbsp; <\/span><span class=\"ql-left-eqno\"> &nbsp; <\/span><img src=\"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-content\/ql-cache\/quicklatex.com-78658a095169355d27a37ded7581bfb9_l3.png\" height=\"43\" width=\"381\" class=\"ql-img-displayed-equation quicklatex-auto-format\" alt=\"&#92;&#091;&#116;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#114;&#92;&#115;&#113;&#114;&#116;&#123;&#78;&#45;&#50;&#125;&#125;&#123;&#92;&#115;&#113;&#114;&#116;&#123;&#49;&#45;&#114;&#94;&#50;&#125;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#48;&#46;&#50;&#55;&#49;&#92;&#115;&#113;&#114;&#116;&#123;&#55;&#45;&#50;&#125;&#125;&#123;&#92;&#115;&#113;&#114;&#116;&#123;&#49;&#45;&#48;&#46;&#50;&#55;&#49;&#94;&#50;&#125;&#125;&#61;&#92;&#102;&#114;&#97;&#99;&#123;&#48;&#46;&#50;&#55;&#49;&#40;&#53;&#41;&#125;&#123;&#48;&#46;&#57;&#54;&#51;&#125;&#61;&#49;&#46;&#52;&#48;&#55;&#92;&#093;\" title=\"Rendered by QuickLaTeX.com\" \/><\/p>\r\n\r\n&nbsp;\r\n\r\nIn this case, we could interpret the results like this: \"With <em>t<\/em>=1.407, <em>df<\/em>=5, and <em>p<\/em>=0.218 (i.e., <em>p<\/em>=0.218&gt;0.5), we cannot reject the null hypothesis that parental and offspring education are not correlated. At this time, we do not have enough evidence to conclude that there is a statistically significant correlation between parental education and offspring education in the US population.\" However, we cannot trust this \"inference\" as it is only based on <em>N<\/em>=7. <a href=\"#return-footnote-130-6\" class=\"return-footnote\" aria-label=\"Return to footnote 6\">&crarr;<\/a><\/li><li id=\"footnote-130-7\">In terms of establishing causality, we are limited by the bivariate case we have: it is entirely possible (and expected) that other things affect offspring education too, not just their parents' education. As well, it is possible than something else (for example, wealth, income, socioecoomic class, etc.) might be affecting both parental and offspring education, rendering the effect of parental educaion on offspring education spurious. These type of considerations are exactly the purpose of mutlivariate analysis, but since we are dealing with bivariate analysis here, we have to leave these considerations aside. I bring them up here to remind you not to forget them in the discussion that follows, which will focus on the two variables at hand.  <a href=\"#return-footnote-130-7\" class=\"return-footnote\" aria-label=\"Return to footnote 7\">&crarr;<\/a><\/li><\/ol><\/div>","protected":false},"author":533,"menu_order":1,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-130","chapter","type-chapter","status-publish","hentry"],"part":128,"_links":{"self":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/130","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/users\/533"}],"version-history":[{"count":25,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/130\/revisions"}],"predecessor-version":[{"id":2145,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/130\/revisions\/2145"}],"part":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/parts\/128"}],"metadata":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapters\/130\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/media?parent=130"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/pressbooks\/v2\/chapter-type?post=130"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/contributor?post=130"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/pressbooks.bccampus.ca\/simplestats\/wp-json\/wp\/v2\/license?post=130"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}