<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="../Math/XSL/pmathml.xsl"?>
<!--        This document was created  with

            S O F T 4 S C I E N C E      S c i W r i t e r   

            http://www.soft4science.com     e-mail: info@soft4science.com
-->
<html pref:renderer="mathplayer-dl" xmlns:pref="http://www.w3.org/2002/Math/preference" xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
    <meta name="Generator" content="soft4science SciWriter" />
    <meta name="template" content="" />
    <meta name="Style-Numbering" content="&lt;Numbering sections=&quot;False&quot; tables=&quot;none&quot; /&gt;" />
    <meta name="Style-ParagraphMargins" content="&lt;ParagraphMargins h1_bottom=&quot;1&quot; h1_top=&quot;1&quot; h2_bottom=&quot;1&quot; h2_top=&quot;1&quot; h3_bottom=&quot;1&quot; h3_top=&quot;1&quot; h4_bottom=&quot;1&quot; h4_top=&quot;1&quot; h5_bottom=&quot;1&quot; h5_top=&quot;1&quot; h6_bottom=&quot;1&quot; h6_top=&quot;1&quot; /&gt;" />
    <meta name="Style-Headings" content="&lt;Headings color=&quot;Black&quot; /&gt;" />
    <meta name="Style-GeneralText" content="&lt;GeneralText textIndent=&quot;0&quot; /&gt;" />
    <meta name="PreferedMathMLRenderer" content="mathplayer_dl" />
    <meta name="guid" content="ad6f0718-294f-4466-8691-947f1c041457" />
    <meta name="date_lastUpdated" content="2005-09-30T09:56:58" />
    <meta name="date_created" content="2004-06-29T20:43:05" />
    <meta name="keywords" content="Calculus" />
    <meta name="author" content="John Weber" />
    <title>John Weber - GPC - Math 1431 - Chapter 5 Notes</title>
    <style type="text/css"><![CDATA[
    /* - CSS code generated by soft4science SciWriter  -*/

    body{font-size:14.25pt}
    table{font-size:14.25pt}
    a:link,a:active,a:visited{color:blue}
    img{border-width:0px}
    p,li,td,caption, blockquote{font-family:"Times New Roman",times}
    p{text-indent:0em}
    h1,h2,h3,h4,h5,h6{font-family:"Times New Roman",times;color:#000000}
    p{margin-top:0em;margin-bottom:0em}
    li{margin-top:0em;margin-bottom:0em}
    h1{font-size:160%;margin-top:1em;margin-bottom:1em}
    h2{font-size:150%;margin-top:1em;margin-bottom:1em}
    h3{font-size:140%;margin-top:1em;margin-bottom:1em}
    h4{font-size:130%;margin-top:1em;margin-bottom:1em}
    h5{font-size:120%;margin-top:1em;margin-bottom:1em}
    h6{font-size:110%;margin-top:1em;margin-bottom:1em}
    table{margin-top:1em;margin-bottom:1em}
    pre{margin-top:1em;margin-bottom:1em}
    hr{margin-top:1em;margin-bottom:1em}
    ul,ol{margin-top:0em;margin-bottom:0em}

    math          {font-family:"Times New Roman",times,CMSY10, CMEX10, Symbol}
    mml\:math    {font-family:"Times New Roman",times,CMSY10, CMEX10, Symbol}
    math[display="block"]{display:block;text-align:center;font-style: normal;}
    math *.[mathvariant="normal"] {font-weight: normal;font-style: normal;}
    math *.[mathvariant="bold"]   {font-weight: bold;font-style: normal;}
    math *.[mathvariant="italic"] {font-weight: normal;font-style: italic;}
    math *.[mathvariant="bold-italic"] {font-weight: bold;font-style: italic;}
    math *.[mathvariant="double-struck"]{font-family:msbm;font-weight: normal;font-style: normal;}
    math *.[mathvariant="script"]{font-family: eusb;font-weight: normal;font-style: normal;}
    math *.[mathvariant="bold-script"]{font-family: eusb;font-weight: bold;font-style: normal;}
    math *.[mathvariant="fraktur"] {font-family: eufm;font-weight: normal;font-style: normal;}
    math *.[mathvariant="bold-fraktur"] {font-family: eufm;font-weight: bold;font-style: italic;}
    math *.[mathvariant="sans-serif}"] {font-family: sans-serif,Arial,Lucida Sans Unicode, Verdana;font-weight: normal;font-style: normal;}
    math *.[mathvariant="bold-sans-serif"] {font-family: sans-serif,Arial, Lucida Sans Unicode, Verdana;font-weight: bold;font-style: normal;}
    math *.[mathvariant="sans-serif-italic"]{font-family: sans-serif,Arial, Lucida Sans Unicode, Verdana;font-weight: normal;font-style: italic;}
    math *.[mathvariant="sans-serif-bold-italic"] { font-family: sans-serif,Arial, Lucida Sans Unicode, Verdana ;font-weight: bold;font-style: italic;}
    math *.[mathvariant="monospace"] {font-family: monospace};
    math *.[mathsize="small"] {font-size: 80%};
    math *.[mathsize="big"] {font-size: 125%};
    msub>*:first-child[mathsize="big"],msup>*:first-child[mathsize="big"],msubsup>*:first-child[mathsize="big"],munder>*:first-child[mathsize="big"],mover>*:first-child[mathsize="big"],munderover>*:first-child[mathsize="big"],mmultiscripts>*:first-child[mathsize="big"],mroot>*:first-child[mathsize="big"] {font-size: 125%}
    msub>*:first-child[mathsize="small"],msup>*:first-child[mathsize="small"],msubsup>*:first-child[mathsize="small"],munder>*:first-child[mathsize="small"],mover>*:first-child[mathsize="small"],munderover>*:first-child[mathsize="small"],mmultiscripts>*:first-child[mathsize="small"],mroot>*:first-child[mathsize="small"] {  font-size: 80%}
    msub>*:first-child,msup>*:first-child,msubsup>*:first-child,munder>*:first-child,mover>*:first-child,munderover>*:first-child,mmultiscripts>*:first-child,mroot>*:first-child {  font-size: 100%}
    msub>*[mathsize="big"],msup>*[mathsize="big"],msubsup>*[mathsize="big"],munder>*[mathsize="big"],mover>*[mathsize="big"],munderover>*[mathsize="big"],mmultiscripts>*[mathsize="big"],math[display="inline"] mfrac>*[mathsize="big"],math *[scriptlevel="+1"][mathsize="big"] {  font-size: 89%  /* (.71 times 1.25) */}
    msub>* [mathsize="small"],msup>*[mathsize="small"],msubsup>*[mathsize="small"],munder>*[mathsize="small"],mover>*[mathsize="small"],munderover>*[mathsize="small"],mmultiscripts>*[mathsize="small"],math[display="inline"] mfrac>*[mathsize="small"],math *[scriptlevel="+1"][mathsize="small"] {  font-size: 57% /* (.71 times .80) */}
    msub>*,msup>*,msubsup>*,munder>*,mover>*,munderover>*,mmultiscripts>*,math[display="inline"] mfrac>*,math *[scriptlevel="+1"] {  font-size: 71%}
    mroot>*[mathsize="big"] {  font-size: 62%  /* (.50 times 1.25) */}
    mroot>*[mathsize="small"] {  font-size: 40% /* (.50 times .80) */}
    mroot>* {  font-size: 50%}
    .s4s-table-right{text-align:right}
    .s4s-table-right table{margin-left:auto;margin-right:0;text-align:left;}
    .s4s-table-right caption{margin-left:auto;margin-right:auto;text-align:center;}
    .s4s-table-center{text-align:center;}
    .s4s-table-center table{margin-left:auto;margin-right:auto;text-align:left;}
    .s4s-table-center caption{margin-left:auto;margin-right:auto;text-align:center;}
    .s4s-empty-paragraph{height:1em;width:textwidth}
    .s4s-noindent{text-indent:0em}
    .s4s-citation          {text-decoration:none}
    .s4s-section-reference {text-decoration:none}
    .s4s-theorem-reference {text-decoration:none}
    .s4s-table-reference   {text-decoration:none}
    .s4s-figure-reference  {text-decoration:none}
    .s4s-equation-reference{text-decoration:none}
    /* - CSS code generated by soft4science SciWriter  -*/
]]></style>
  </head>
  <body style="margin-left:50px;margin-right:50px;margin-top:50px;margin-bottom:50px">
    <p class="s4s-noindent">
      <span style="font-size:80%">www.john-weber.com</span>
    </p>
    <h1 class="s4s-section-numbered" id="SECTION.9c9a1249-5026-49ff-9987-db797e41b166">Chapter 5: Regression</h1>
    <hr />
    <h3 class="s4s-section-numbered" id="SECTION.e4331bc8-4fe7-496c-bde9-57b84b5faf16">
      <span style="color:Blue">Knowledge Prerequisites</span>
    </h3>
    <h4 class="s4s-section-numbered" id="SECTION.eb1947ef-373e-4cd2-a4b4-f519173f08ab">Scatterplots</h4>
    <ol>
      <li>
        <strong>declarative knowledge (definitions)</strong>
        <ol>
          <li>independent variable</li>
          <li>dependent variable</li>
          <li>response variable</li>
          <li>explanatory variable</li>
          <li>correlation</li>
        </ol>
      </li>
      <li>
        <strong>procedural knowledge</strong>
        <ol>
          <li>how to construct a scatteplot</li>
        </ol>
      </li>
      <li>
        <strong>conditional knowledge</strong>
        <ol>
          <li>how to describe a scatterplor</li>
        </ol>
      </li>
      <li>
        <strong>resources</strong>
        <ol>
          <li>
            <a href="chapt04notes.xml">Chapter 4</a> of text.</li>
        </ol>
      </li>
    </ol>
    <h4 class="s4s-section-numbered" id="SECTION.24336930-2c24-4725-ae6d-7a362fb3cf8c">Slope-intercept equation of line</h4>
    <ol>
      <li>
        <strong>declarative knowledge (definitions)</strong>
        <ol>
          <li>know what <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>m</mi></math> and <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>b</mi></math> represent in <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>y</mi><mo>&#x0003D;</mo><mi>m</mi><mo>&#x02062;</mo><mi>x</mi><mo>&#x0002B;</mo><mi>b</mi></math></li>
        </ol>
      </li>
      <li>
        <strong>procedural knowledge</strong>
        <ol>
          <li>how to graph a line</li>
        </ol>
      </li>
      <li>
        <strong>conditional knowledge</strong>
        <ol>
          <li>know how <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>m</mi></math> and <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>b</mi></math> affect the graph of <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>y</mi><mo>&#x0003D;</mo><mi>m</mi><mo>&#x02062;</mo><mi>x</mi><mo>&#x0002B;</mo><mi>b</mi></math></li>
        </ol>
      </li>
      <li>
        <strong>resources</strong>
        <ol>
          <li />
        </ol>
      </li>
    </ol>
    <hr />
    <h3 class="s4s-section-numbered" id="SECTION.541469b1-6c65-4551-9c65-ecf7706ab813">
      <span style="color:Green">Learning Goals</span>
    </h3>
    <ol>
      <li>
        <strong>declarative knowledge (definitions)</strong>
        <ol>
          <li>regression line</li>
          <li>least-squares regression line, <math xmlns="http://www.w3.org/1998/Math/MathML"><mover><mrow><mi>y</mi></mrow><mo>&#x0005E;</mo></mover><mo>&#x0003D;</mo><mi>a</mi><mo>&#x0002B;</mo><mi>b</mi><mo>&#x02062;</mo><mi>x</mi></math></li>
          <li>influential observations</li>
          <li>lurking variables</li>
          <li>extrapolation</li>
        </ol>
      </li>
      <li>
        <strong>procedural knowledge</strong>
        <ol>
          <li>how to calculate <a href="http://gpc.edu/%7Ejweber/math1431/ti83lsq.htm">the least-squares regression line</a></li>
          <li>how to make a prediction using the least-squares regression line</li>
        </ol>
      </li>
      <li>
        <strong>conditional knowledge</strong>
        <ol>
          <li>know the difference between an outlier and an influential observation.</li>
          <li>be aware of the potential problems of predictions using the least-squares regression line.</li>
          <li>know the facts about least-squares regression (see pp. 110-111 of text).</li>
          <li>know that "association does not imply causation".</li>
          <li>know what <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mrow><mi>r</mi></mrow><mrow><mn>2</mn></mrow></msup></math> tells about the relationship.</li>
        </ol>
      </li>
    </ol>
    <hr />
    <h3 class="s4s-section-numbered" id="SECTION.5107892a-0860-47da-8095-15431a1e8f9f">
      <span style="color:Red">Questions to Ponder</span>
    </h3>
    <ol>
      <li>What is the best way to study/prepare for the concepts in this section? Are there are other methods to study these concepts?</li>
      <li>What questions do you need to ask the instructor?</li>
    </ol>
    <hr />
    <h3 class="s4s-section-numbered" id="SECTION.09b05e4a-237c-4460-bc9b-eebfd8658bff">Purpose of this Section</h3>
    <ol>
      <li>To quantify the relationship between two variables (i.e., to construct a mathematical model for a linear relationship between two variables).</li>
    </ol>
    <hr />
    <h3 class="s4s-section-numbered" id="SECTION.71ab9e60-203d-44d4-9fe0-21324d832c91">General Notes</h3>
    <h6 class="s4s-section-numbered" id="SECTION.8dbf90e3-b3a1-4410-a52f-aa4b9d63b1ee">The least<math xmlns="http://www.w3.org/1998/Math/MathML"><mo>&#x02212;</mo></math>squares regression line</h6>
    <p class="s4s-noindent">We need a method for constructing a regression line that will be consistent for all looking at the data. Here is a good website (<a href="http://www.ruf.rice.edu/~lane/stat_sim/reg_by_eye/index.html">http://www.ruf.rice.edu/~lane/stat_sim/reg_by_eye/index.html</a>) that shows how close your guess at the regression line is to the line constructed by the method of least squares. This is an excellent way to develop a feel for linear regression!</p>
    <p class="s4s-empty-paragraph"> </p>
    <p>The leastsquares regression line of <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>y</mi></math> on <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>x</mi></math> is the line that makes the sum of the squares of the deviations of the data points from the line in the vertical direction as small as possible. The least-squares regression line is the line <math xmlns="http://www.w3.org/1998/Math/MathML"><mover><mrow><mi>y</mi></mrow><mo>&#x0005E;</mo></mover><mo>&#x0003D;</mo><mi>a</mi><mo>&#x0002B;</mo><mi>b</mi><mo>&#x02062;</mo><mi>x</mi></math> where <math xmlns="http://www.w3.org/1998/Math/MathML"><mover><mrow><mi>y</mi></mrow><mo>&#x0005E;</mo></mover></math> (called <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>y</mi></math>-hat) is the predicted value for the response variable. Because of the scatter of points about the line, the predicted response will usually not be exactly the same as the actually observed response <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>y</mi></math>.</p>
    <p class="s4s-empty-paragraph"> </p>
    <p>Here are steps to <a href="ti83lsq.htm">find the least<math xmlns="http://www.w3.org/1998/Math/MathML"><mo>&#x02212;</mo></math>squares regression line using the TI-83 calculator</a>.</p>
    <h6 id="SECTION.4b56d473-9ded-4691-b322-8945acf8c573">Residuals</h6>
    <p class="s4s-noindent">A residual is the difference between an observed value of the response variable and the value predicted by the regression line: </p>
    <table class="s4s-eq" width="95%">
      <tbody>
        <tr>
          <td align="center">
            <math display="block" xmlns="http://www.w3.org/1998/Math/MathML">
              <mi>residual</mi>
              <mo>&#x0003D;</mo>
              <mi>y</mi>
              <mo>&#x02212;</mo>
              <mover>
                <mrow>
                  <mi>y</mi>
                </mrow>
                <mo>&#x0005E;</mo>
              </mover>
            </math>
          </td>
        </tr>
      </tbody>
    </table>
    <p class="s4s-noindent">There are residuals for each data point. Residuals are positive when the data point is above the regression line and residuals are negative when the data point is below the regression line.</p>
    <p class="s4s-empty-paragraph"> </p>
    <p>Because the residuals show how far the data fall from our regression line, examining the residuals helps assess how well the line describes the data. The mean of the residuals from the leasts-quares regression line is always zero. A residual plot is a scatter plot of the regression residuals against the explanatory variable. Residual plots also help us ass the fit of a regression line.</p>
    <p class="s4s-empty-paragraph"> </p>
    <p>Here are steps to <a href="ti83resid.htm">finding and graphing the residuals using the TI-83 calculator</a>.</p>
    <p class="s4s-empty-paragraph"> </p>
    <p>Here are some key things to consider when examining the residual plot (see pp. 118121 of your text for example graphs of each of the key ideas below):</p>
    <ul>
      <li>Ideal case: Random pattern around 0, no unusual individual observations.</li>
      <li>Curved pattern: Relationship is NOT linear.</li>
      <li>Increasing (or decreasing) spread as <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>x</mi></math> increases: Prediction is less accurate when <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>x</mi></math> is large (small).</li>
      <li>Individual points with large residuals: These points are possible outliers.</li>
      <li>Individual points far from others in the <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>x</mi></math>-direction: These are called influential observations.</li>
    </ul>
    <h6 class="s4s-section-numbered" id="SECTION.d0f1601f-5979-4aeb-a598-a6c305abacfc">Resources</h6>
    <ul>
      <li>Visual Statistics - <a href="http://www.seeingstatistics.com/seeingTour/intro/why3.html">http://www.seeingstatistics.com/seeingTour/intro/why3.html</a> - This applet graphs some common geometric statistical concepts: data points (black dots), residuals (red lines), error squares (yellow squares), and regression or model line (blue line). Use your mouse to drag the blue line and see how the geometric objects change.</li>
      <li>Regression by Eye - <a href="http://www.ruf.rice.edu/~lane/stat_sim/reg_by_eye/index.html">http://www.ruf.rice.edu/~lane/stat_sim/reg_by_eye/index.html</a> - This applet gives you a scatterplot and you can use the mouse to indicate what seems to be the best straight line approximating the data. This is an excellent way to develop a feel for linear regression!</li>
      <li>Correlation and Regression Calculator - <a href="http://calculators.stat.ucla.edu/correlation.php">http://calculators.stat.ucla.edu/correlation.php</a> - This calculator returns the means, variances, regression coefficients, covariance, correlation coefficient, plus a scatterplot with the two regression lines.</li>
    </ul>
    <hr />
    <p class="s4s-noindent">
      <a href="math1431.htm">Back to John Weber's MATH 1431 Page</a>
    </p>
    <p>
      <a href="../../john.html">Back to john-weber.com</a>
    </p>
    <hr />
    <p class="s4s-empty-paragraph" />
    <p class="s4s-empty-paragraph" />
  </body>
</html>
