<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="../Math/XSL/pmathml.xsl"?>
<!--        This document was created  with

            S O F T 4 S C I E N C E      S c i W r i t e r   

            http://www.soft4science.com     e-mail: info@soft4science.com
-->
<html pref:renderer="mathplayer-dl" xmlns:pref="http://www.w3.org/2002/Math/preference" xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
    <meta name="Generator" content="soft4science SciWriter" />
    <meta name="template" content="" />
    <meta name="Style-Numbering" content="&lt;Numbering sections=&quot;False&quot; tables=&quot;none&quot; /&gt;" />
    <meta name="Style-ParagraphMargins" content="&lt;ParagraphMargins h1_bottom=&quot;1&quot; h1_top=&quot;1&quot; h2_bottom=&quot;1&quot; h2_top=&quot;1&quot; h3_bottom=&quot;1&quot; h3_top=&quot;1&quot; h4_bottom=&quot;1&quot; h4_top=&quot;1&quot; h5_bottom=&quot;1&quot; h5_top=&quot;1&quot; h6_bottom=&quot;1&quot; h6_top=&quot;1&quot; /&gt;" />
    <meta name="Style-Headings" content="&lt;Headings color=&quot;Black&quot; /&gt;" />
    <meta name="Style-GeneralText" content="&lt;GeneralText textIndent=&quot;0&quot; /&gt;" />
    <meta name="PreferedMathMLRenderer" content="mathplayer_dl" />
    <meta name="guid" content="ad6f0718-294f-4466-8691-947f1c041457" />
    <meta name="date_lastUpdated" content="2005-09-23T08:56:07" />
    <meta name="date_created" content="2004-06-29T20:43:05" />
    <meta name="keywords" content="Calculus" />
    <meta name="author" content="John Weber" />
    <title>John Weber - GPC - Math 1431 - Chapter 1 Notes</title>
    <style type="text/css"><![CDATA[
    /* - CSS code generated by soft4science SciWriter  -*/

    body{font-size:14.25pt}
    table{font-size:14.25pt}
    a:link,a:active,a:visited{color:blue}
    img{border-width:0px}
    p,li,td,caption, blockquote{font-family:"Times New Roman",times}
    p{text-indent:0em}
    h1,h2,h3,h4,h5,h6{font-family:"Times New Roman",times;color:#000000}
    p{margin-top:0em;margin-bottom:0em}
    li{margin-top:0em;margin-bottom:0em}
    h1{font-size:160%;margin-top:1em;margin-bottom:1em}
    h2{font-size:150%;margin-top:1em;margin-bottom:1em}
    h3{font-size:140%;margin-top:1em;margin-bottom:1em}
    h4{font-size:130%;margin-top:1em;margin-bottom:1em}
    h5{font-size:120%;margin-top:1em;margin-bottom:1em}
    h6{font-size:110%;margin-top:1em;margin-bottom:1em}
    table{margin-top:1em;margin-bottom:1em}
    pre{margin-top:1em;margin-bottom:1em}
    hr{margin-top:1em;margin-bottom:1em}
    ul,ol{margin-top:0em;margin-bottom:0em}

    math          {font-family:"Times New Roman",times,CMSY10, CMEX10, Symbol}
    mml\:math    {font-family:"Times New Roman",times,CMSY10, CMEX10, Symbol}
    math[display="block"]{display:block;text-align:center;font-style: normal;}
    math *.[mathvariant="normal"] {font-weight: normal;font-style: normal;}
    math *.[mathvariant="bold"]   {font-weight: bold;font-style: normal;}
    math *.[mathvariant="italic"] {font-weight: normal;font-style: italic;}
    math *.[mathvariant="bold-italic"] {font-weight: bold;font-style: italic;}
    math *.[mathvariant="double-struck"]{font-family:msbm;font-weight: normal;font-style: normal;}
    math *.[mathvariant="script"]{font-family: eusb;font-weight: normal;font-style: normal;}
    math *.[mathvariant="bold-script"]{font-family: eusb;font-weight: bold;font-style: normal;}
    math *.[mathvariant="fraktur"] {font-family: eufm;font-weight: normal;font-style: normal;}
    math *.[mathvariant="bold-fraktur"] {font-family: eufm;font-weight: bold;font-style: italic;}
    math *.[mathvariant="sans-serif}"] {font-family: sans-serif,Arial,Lucida Sans Unicode, Verdana;font-weight: normal;font-style: normal;}
    math *.[mathvariant="bold-sans-serif"] {font-family: sans-serif,Arial, Lucida Sans Unicode, Verdana;font-weight: bold;font-style: normal;}
    math *.[mathvariant="sans-serif-italic"]{font-family: sans-serif,Arial, Lucida Sans Unicode, Verdana;font-weight: normal;font-style: italic;}
    math *.[mathvariant="sans-serif-bold-italic"] { font-family: sans-serif,Arial, Lucida Sans Unicode, Verdana ;font-weight: bold;font-style: italic;}
    math *.[mathvariant="monospace"] {font-family: monospace};
    math *.[mathsize="small"] {font-size: 80%};
    math *.[mathsize="big"] {font-size: 125%};
    msub>*:first-child[mathsize="big"],msup>*:first-child[mathsize="big"],msubsup>*:first-child[mathsize="big"],munder>*:first-child[mathsize="big"],mover>*:first-child[mathsize="big"],munderover>*:first-child[mathsize="big"],mmultiscripts>*:first-child[mathsize="big"],mroot>*:first-child[mathsize="big"] {font-size: 125%}
    msub>*:first-child[mathsize="small"],msup>*:first-child[mathsize="small"],msubsup>*:first-child[mathsize="small"],munder>*:first-child[mathsize="small"],mover>*:first-child[mathsize="small"],munderover>*:first-child[mathsize="small"],mmultiscripts>*:first-child[mathsize="small"],mroot>*:first-child[mathsize="small"] {  font-size: 80%}
    msub>*:first-child,msup>*:first-child,msubsup>*:first-child,munder>*:first-child,mover>*:first-child,munderover>*:first-child,mmultiscripts>*:first-child,mroot>*:first-child {  font-size: 100%}
    msub>*[mathsize="big"],msup>*[mathsize="big"],msubsup>*[mathsize="big"],munder>*[mathsize="big"],mover>*[mathsize="big"],munderover>*[mathsize="big"],mmultiscripts>*[mathsize="big"],math[display="inline"] mfrac>*[mathsize="big"],math *[scriptlevel="+1"][mathsize="big"] {  font-size: 89%  /* (.71 times 1.25) */}
    msub>* [mathsize="small"],msup>*[mathsize="small"],msubsup>*[mathsize="small"],munder>*[mathsize="small"],mover>*[mathsize="small"],munderover>*[mathsize="small"],mmultiscripts>*[mathsize="small"],math[display="inline"] mfrac>*[mathsize="small"],math *[scriptlevel="+1"][mathsize="small"] {  font-size: 57% /* (.71 times .80) */}
    msub>*,msup>*,msubsup>*,munder>*,mover>*,munderover>*,mmultiscripts>*,math[display="inline"] mfrac>*,math *[scriptlevel="+1"] {  font-size: 71%}
    mroot>*[mathsize="big"] {  font-size: 62%  /* (.50 times 1.25) */}
    mroot>*[mathsize="small"] {  font-size: 40% /* (.50 times .80) */}
    mroot>* {  font-size: 50%}
    .s4s-table-right{text-align:right}
    .s4s-table-right table{margin-left:auto;margin-right:0;text-align:left;}
    .s4s-table-right caption{margin-left:auto;margin-right:auto;text-align:center;}
    .s4s-table-center{text-align:center;}
    .s4s-table-center table{margin-left:auto;margin-right:auto;text-align:left;}
    .s4s-table-center caption{margin-left:auto;margin-right:auto;text-align:center;}
    .s4s-empty-paragraph{height:1em;width:textwidth}
    .s4s-noindent{text-indent:0em}
    .s4s-citation          {text-decoration:none}
    .s4s-section-reference {text-decoration:none}
    .s4s-theorem-reference {text-decoration:none}
    .s4s-table-reference   {text-decoration:none}
    .s4s-figure-reference  {text-decoration:none}
    .s4s-equation-reference{text-decoration:none}
    /* - CSS code generated by soft4science SciWriter  -*/
]]></style>
  </head>
  <body style="margin-left:50px;margin-right:50px;margin-top:50px;margin-bottom:50px">
    <p class="s4s-noindent">
      <span style="font-size:80%">www.john-weber.com</span>
    </p>
    <h1 class="s4s-section-numbered" id="SECTION.9c9a1249-5026-49ff-9987-db797e41b166">Chapter 1: Picturing Distributions with Graphs</h1>
    <hr />
    <h3 class="s4s-section-numbered" id="SECTION.e4331bc8-4fe7-496c-bde9-57b84b5faf16">
      <span style="color:Blue">Knowledge Prerequisites</span>
    </h3>
    <h4 class="s4s-section-numbered" id="SECTION.3d5ea94c-093c-40fa-afd5-117332b7deca">none</h4>
    <hr />
    <h3 class="s4s-section-numbered" id="SECTION.541469b1-6c65-4551-9c65-ecf7706ab813">
      <span style="color:Green">Learning Goals</span>
    </h3>
    <ol>
      <li>
        <strong>declarative knowledge (definitions)</strong>
        <ol>
          <li>individuals</li>
          <li>categorical variable</li>
          <li>quantitative variable</li>
          <li>distribution</li>
          <li>exploratory data analysis</li>
          <li>pie chart</li>
          <li>bar graph</li>
          <li>histogram</li>
          <li>class width</li>
          <li>shape, center and spread of a distribution</li>
          <li>outlier of a distribution</li>
          <li>skewness of a distribution</li>
          <li>stem and leaf of a stemplot</li>
          <li>split stems</li>
          <li>time plot</li>
        </ol>
      </li>
      <li>
        <strong>procedural knowledge</strong>
        <ol>
          <li>how to construct a pie chart from given data</li>
          <li>how to construct a bar graph from given data</li>
          <li>how to construct a <a href="http://gpc.edu/~jweber/math1431/ti83hist.htm">histogram</a> from given data</li>
          <li>how to construct a <a href="http://gpc.edu/~jweber/math1431/ti83stem.htm">stemplot</a> from given data</li>
          <li>how to construct a <a href="http://gpc.edu/~jweber/math1431/ti83time.htm">timeplot</a> from given data</li>
        </ol>
      </li>
      <li>
        <strong>conditional knowledge</strong>
        <ol>
          <li>be able to distinguish between types of data.</li>
          <li>know which graph is most appropriate for given data. </li>
          <li>know how to analyze a histogram or stemplot:<ol><li>describe overall pattern and any deviations (outliers)</li><li>describe shape, center and spread </li></ol></li>
        </ol>
      </li>
    </ol>
    <hr />
    <h3 class="s4s-section-numbered" id="SECTION.5107892a-0860-47da-8095-15431a1e8f9f">
      <span style="color:Red">Questions to Ponder</span>
    </h3>
    <ol>
      <li>What is the best way to study/prepare for the concepts in this section? Are there are other methods to study these concepts?</li>
      <li>What questions do you need to ask the instructor?</li>
    </ol>
    <hr />
    <h3 class="s4s-section-numbered" id="SECTION.09b05e4a-237c-4460-bc9b-eebfd8658bff">Purpose of this Section</h3>
    <p class="s4s-noindent">Chapter 1 considers single variable statistics (i.e., one measured aspect of an individual).</p>
    <p class="s4s-empty-paragraph"> </p>
    <p>This section includes:</p>
    <ul>
      <li>
        <strong>distribution of a variable</strong> - describes what values the variable takes and how often it takes these values.</li>
      <li>
        <strong>exploratory data analysis -</strong> the use of graphs and numerical summaries to describe the variables in a data set and the relations among them.</li>
    </ul>
    <p class="s4s-noindent">This information will be very important for later Chapters </p>
    <hr />
    <h3 class="s4s-section-numbered" id="SECTION.71ab9e60-203d-44d4-9fe0-21324d832c91">General Notes</h3>
    <p class="s4s-noindent">For several good description of statistics, see the resources page: <a href="statMiscTopics.htm">miscellaneous topics</a>.</p>
    <p class="s4s-empty-paragraph"> </p>
    <p>Strategies for exploratory data analysis:</p>
    <ol>
      <li>Examine each variable individually, then examine relationships among them.<ol><li>Graph the data.<ol><li>Types:<ol><li>Pie charts (for categorial/qualitative variables)<ol><li>Here is a previous class separated into groups by zip code (Note that the area of each slice is proportional to the percentage of people in each zip code):<ol><li><a href="GIFs/pieChart.jpg">jpeg</a> or <a href="pieChart.xls">MS excel</a></li></ol></li></ol></li><li>Bar Graphs (for categorial/qualitative variables)<ol><li>Here is a previous class separated into groups by zip code (<ol><li>Note that the height (or area) of each bar represents the number of people in each zip code:<ol><li><a href="GIFs/barGraph1.jpg">jpeg</a> or <a href="barGraph1.xls">MS excel</a></li></ol></li><li>Or the height (or area) of each bar can represent the percentage of people in each zip code:<ol><li><a href="GIFs/barGraph2.jpg">jpeg</a> or <a href="barGraph2.xls">MS excel</a></li></ol></li></ol></li></ol></li><li>Histograms (for quantitative variables)<ol><li>Formal steps to construct a histogram by hand:<ol><li>Divide the range of values of the variable into classes of equal width: <math xmlns="http://www.w3.org/1998/Math/MathML"><mfrac><mrow><mrow><mo>&#x00028;</mo><mi>max</mi><mo>&#x02212;</mo><mi>min</mi><mo>&#x00029;</mo></mrow></mrow><mrow><mo>&#x00023;</mo><mspace width="mediummathspace" height="0.2em" /><mi>of</mi><mspace width="mediummathspace" height="0.2em" /><mi>classes</mi></mrow></mfrac></math></li><li>Count the number of observations in each class</li><li>Make a histogram</li></ol></li><li>Luckily, the TI83 constructs histograms in a more efficient manner (see <a href="ti83hist.htm">ti83hist.htm</a>).</li><li>NOTE: choose classes wisely. The class width affects the graph of the data. To see this effect, here is a java demonstration of <a href="http://www.amstat.org/publications/jse/v6n3/applets/Histogram.html">how class width affects a histogram</a>.</li><li>Percentages are preferred for large data sets. </li></ol></li><li>Stemplots (for quantitative variables)<ol><li>Stemplots work best for small numbers of observations.</li><li>Steps in constructing a stemplot:<ol><li>Separate each observation into stem and leafs.</li><li>Write the stems in a vertical column with smallest value at top. Do not skip numbers in the stem.</li><li>Write each leaf in a row to the right of the stem in increasing order.</li></ol></li><li>Luckily, the STEMPLOT program on the TI83 constructs stemplots in a more efficient manner (see <a href="ti83stem.htm">ti83stem.htm</a>).</li><li>You can split the stems (i.e., each stem appears twice). The upper stem will have leafs 0-4 and the lower stem will have leafs 5-9. For an example, see the top of page 17 of your text.</li><li>Back-to-back stemplots are useful when we wish to compare two related distributions. </li></ol></li><li>Time plots (when data is measured over time - they can reveal trends)<ol><li>Luckily, the TI83 constructs timeplots (see <a href="ti83time.htm">ti83time.htm</a>).</li></ol></li></ol></li></ol></li><li>Preliminary examination of data sets:<ol><li>What kind of individuals?</li><li>How many variables are present?</li><li>What is the exact definition of the variable? What type? In what units? How is thevariable measured? </li></ol></li><li>Numerical summaries of specific aspects of the data.<ol><li>description of the shape of the distribution (from one of the graphs above)</li><li>identification of possible outliers (from <a href="ti83hist.htm">histogram</a>) or actual outliers (from <a href="http://gpc.edu/%7Ejweber/math1431/ti83box.htm">modified boxplot</a> to be discussed in <a href="chapt02notes.xml">Chapter 2</a>).</li><li>identify appropriate measure of center </li><li>identify appropriate measure of spread</li><li>Is there a trend? (time plots only)</li></ol></li></ol></li>
    </ol>
    <p class="s4s-noindent">Skewed distributions</p>
    <ol>
      <li>
        <strong>Skewed to right:</strong> if the right side of the distribution extends further out than the left side.</li>
      <li>
        <strong>Skewed to left:</strong> if the left side of the distribution extends further out than the right side.</li>
    </ol>
    <p class="s4s-noindent">
      <a href="skewness.htm">Steps to determine the skewness of a distribution</a>.</p>
    <p class="s4s-empty-paragraph"> </p>
    <p>NOTE: when constructing graphs, make sure to appropriately label the graphs.</p>
    <h6 class="s4s-section-numbered" id="SECTION.5184016e-264f-4428-93d2-11f4e9e82de3">Resources</h6>
    <ul>
      <li>Histogram Applet - <a href="http://www.amstat.org/publications/jse/v6n3/applets/Histogram.html">http://www.amstat.org/publications/jse/v6n3/applets/Histogram.html</a> - This applet is designed to teach students how bin widths (i.e., class widths) affect a histogram.</li>
      <li>Histogram - <a href="http://www.shodor.org/interactivate/activities/histogram/index.html">http://www.shodor.org/interactivate/activities/histogram/index.html</a> - This activity allows the user to view and make their own histograms.</li>
      <li>Histogram Explorer - <a href="http://www.ratrat.com/histogram_explorer/he.html">http://www.ratrat.com/histogram_explorer/he.html</a> - Histogram Explorer is a tool for understanding histograms and their statistics.</li>
    </ul>
    <hr />
    <p class="s4s-noindent">
      <a href="math1431.htm">Back to John Weber's MATH 1431 Page</a>
    </p>
    <p>
      <a href="../../john.html">Back to john-weber.com</a>
    </p>
    <hr />
    <p class="s4s-empty-paragraph" />
    <p class="s4s-empty-paragraph" />
  </body>
</html>
