http://www.john-weber.com

Data for Class Activities

The following data represent the number of homeruns per season for each respective player:
Barry Bonds:1625241933
 2534463733
 4240373449
 734645  

Sammy Sosa:1715108
 3325364036
 6663506449
 40    


 

The following data represent the fuel efficiency (in mpg) of my 1998 Honda Civic for each of the last 20 full tanks:
29.5630.2528.3333.0430.9030.7027.6336.0935.9131.32
34.4933.8636.9732.8134.6129.3031.4135.5432.1334.21

 

The following data represent the fuel efficiency (in mpg) of my wife's 2003 Honda Civic for each of the first eight (8) full tanks:
21.9420.8620.2826.9625.2622.1719.6523.91


 

1. The following data represent the scores of the first two rounds of the 2004 NCAA Men's Division I Basketball Tourney (St. Louis and East Rutherford Brackets):

9676100102586678535851
65606672764975766378
54579172825963767360
53757978767064534459
5543755670846580  

  1. Determine if the data is normally distributed.



2. Twenty one-day-old male chicks were fed corn. Here are the weight gains (in grams) after 21 days:

380321366356283349
402423356410329399
350384316272345360
431455    

  1. Determine if the data is normally distributed.



3. Here are the number of homeruns hit each year by Joe DiMaggio:

29463230313012
212520391432 

  1. Determine if the data is normally distributed.

 

The yearly increase of CEO compensation and the yearly corporate profits for the largest 350 U.S. publicly traded firms is given below:

 1995199619971998199920002001200220032004
CEO compensation10.4%5.2%11.7%5.2%11.0%10.0%-2.8%10.0%7.2%14.5%
Corporate Profits14.2%11.0%8.9%5.0%15.1%8.9%-17.8%14.8%19.2%23.0%


 

The following data represents the ages (in years) of seven men and their systolic blood pressures:
Age, x16253945496470
Systolic BP, y109122143132199185199

  1. Use the TI-83 to help you construct the scatterplot. Label the graph.
  2. Describe the overall pattern. Is there a direction to the relationship? If so, what is it? Is there an outlier?
  3. Suppose the men aged 25, 49 and 64 were smokers and the others were non-smokers. Show this on your scatterplot.
  4. Can you conclude anything about smokers and their systolic blood pressure? Why or why not.

Adapted from www.mathlab.isot.com/math/Statistics/Statlab4.pdf


Resources:

i. Guessing Correlations - http://www.stat.uiuc.edu/~stat100/java/GCApplet/GCAppletFrame.html This applet gives you four scatterplots and four correlation coefficients. You match them up.

ii. Correlation/Outliers Applet - http://www.math.tamu.edu/FiniteMath/FinalBuild/Classes/Correlation/ Once applet can provide you with real insight into how to read scatterplots and into how correlation coefficients can be misleading.

iii. Regression by Eye - http://www.ruf.rice.edu/~lane/stat_sim/reg_by_eye/index.html This applet gives you a scatterplot and you can use the mouse to indicate what seems to be the best straight line approximating the data. This is an excellent way to develop a feel for linear regression!

iv. Visual Statistics - http://www.seeingstatistics.com/seeingTour/intro/why3.html This applet graphssome common geometric statistical concepts: data points (black dots), residuals (red lines), error squares (yellow squares), and regression or model line (blue line). Use your mouse to drag the blue line and see how the geometric objects change.


A student wonders if people of similar heights tend to date each other. She measures herself, her dormitory roommate, and the women in the adjoining rooms, then she measures the next man each woman dates. Here are the data (heights in inches):
Women, x666466657065
Men, y726870687165

  1. Use the TI-83 to help you construct the scatterplot.
  2. Describe the overall pattern. Is there an outlier?
  3. Based on the scatterplot, do you expect the correlation to be positive or negative? near ±1 or not?
  4. Compute the correlation r.
  5. How would r change if all the men were 6 inches shorter than the heights given in the table?
  6. How r would change if the heights are measured in centimeters?

 
CEO compensation: http://www.industryweek.com/PrintArticle.aspx?ArticleID=10387
 

The following data represent my weight (in #) since starting my low-card diet:

1/12/041/19/041/26/042/2/042/9/042/16/042/23/043/1/043/8/043/15/04
187185182179176174172170171170
 
3/22/043/29/044/5/044/12/044/19/044/26/044/30/045/10/045/17/04 
169166167163164164163164163 

 

  1. Use the TI-83 to help you construct a scatterplot of my weight. Label the graph.

  2. Describe the overall pattern. Is there a direction to the relationship? If so, what is it? Is there an outlier?

  3. Based on the scatterplot, do you expect the correlation to be positive or negative? near ±1 or not?

  4. Compute the correlation r.


 

The following data represent the grade for Quiz 3 and the grade for Test I for students in one of my classes:

Student:1234567891011
Quiz:859931095658
Test:7488999063919184839890
 
Student:1213141516171819202122
Quiz:9105851088898
Test:8360809656978779829285

 

  1. Use the TI-83 to help you construct a scatterplot of quiz grades vs. test grades. Label the graph.

  2. Describe the overall pattern. Is there a direction to the relationship? If so, what is it? Is there an outlier?

  3. Based on the scatterplot, do you expect the correlation to be positive or negative? near ±1 or not?

  4. Compute the correlation r.


 

Below are the students in one of my statistics classes:

    Beth   Brandon
    Lisa   Lacey
    Julie   Suzanne
    Joanna   Kelly
    Danielle   Hollie
    Marissa   Shavonne
    Benjamin   Krissie
    Christina   Anthony
    Kevin   Brian
    Dobrinka   Tanya
    Ashleigh   Takang
    Kim   Joanne


 

The famous "The Monte Hall Problem" comes from the game show Let's Make A Deal. Suppose you're on a game show, and you're given the choice of three doors: Behind one door is a $1,000,000.00 and behind the others, goats. The host knows behind which door is the good prize (for me it's the $1,000,000.00). You pick a door, say door number 1, and the host shows you what's behind one of the other doors that hides the goat. Then he gives you a chance to stay with your initial choice or to change your choice. Do you switch your choice of doors? Explain.

Would you switch if there were four(4) doors?


 

The following data are from 20 small packets of M&Ms:

  sample      red      orange    yellow    green      blue      brown      total
110868121357
210124817859
371161312756
4111051171458
571321171656
61311999758
791021391457
814851181157
991069121056
108157137555
1181171210957
129144149959
13298792257
14910310121357
156871571659
16910213131057
1781061491158
1891128151661
1971847121058
20812211111155

Data is from http://mathforum.org/epigone/ap-stat/clubangcroi.

 

Online Resources: