Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

linear regression study guide, Assignments of Mathematics

this covers linear regression. this document is a study guide

Typology: Assignments

Pre 2010

Uploaded on 03/23/2023

jackie-16
jackie-16 🇺🇸

1 document

1 / 5

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
II-5
AP Statistics Test B – Modeling Data – Part II Name ____________________
___ 1. All but one of the statements below contain a mistake. Which one could be true?
A) The correlation between height and weight is 0.568 inches per pound.
B) The correlation between weight and length of foot is 0.488.
C) The correlation between the breed of a dog and its weight is 0.435.
D) The correlation between gender and age is -0.171.
E) If the correlation between blood alcohol level and reaction time is 0.73, then the
correlation between reaction time and blood alcohol level is -0.73.
___ 2. A correlation of zero between two quantitative variables means that
A) we have done something wrong in our calculation of r.
B) there is no association between the two variables.
C) there is no linear association between the two variables.
D) re-expressing the data will guarantee a linear association between the two variables.
E) None of the above.
___ 3. A residuals plot is useful because
I. it will help us to see whether our model is appropriate.
II. it might show a pattern in the data that was hard to see in the original scatterplot.
III. it will clearly identify influential points.
A) I only B) II only C) I and II only D) I and III only E) I, II, and III
___ 4. Which of the following is not a goal of re-expressing data?
A) Make the distribution of a variable more symmetric.
B) Make the spread of several groups more alike.
C) Make the form of a scatterplot more nearly linear.
D) Make the scatter in a scatterplot spread out evenly rather than following a fan shape.
E) All of the above are goals of re-expressing data.
___ 5. The correlation coefficient between the hours that a person is awake during a 24-hour
period and the hours that same person is asleep during a 24-hour period is most likely to be
A) exactly +1.0 B) near +0.8 C) near 0 D) near -0.8 E) exactly -1.0
___ 6. The correlation coefficient between high school grade point average (GPA) and college
GPA is 0.560. For a student with a high school GPA that is 2.5 standard deviations above
the mean, we would expect that student to have a college GPA that is the mean.
A) equal to B) 0.56 SD above C) 1.4 SD above D) 2.5 SD above
___ 7. A regression analysis of students’ college grade point averages (GPAs) and their high
school GPAs found R2 = 0.311. Which of these is true?
I. High school GPA accounts for 31.1% of college GPA.
II. 31.1% of college GPAs can be correctly predicted with this model.
III. 31.1% of the variance in college GPA can be accounted for by the model
A) I only B) II only C) III only D) I and II E) None
Copyright 2010 Pearson Education, Inc.
pf3
pf4
pf5

Partial preview of the text

Download linear regression study guide and more Assignments Mathematics in PDF only on Docsity!

II-

_AP Statistics Test B – Modeling Data – Part II Name _____________________

___ 1. All but one of the statements below contain a mistake. Which one could be true? A) The correlation between height and weight is 0.568 inches per pound. B) The correlation between weight and length of foot is 0.488. C) The correlation between the breed of a dog and its weight is 0.435. D) The correlation between gender and age is -0.171. E) If the correlation between blood alcohol level and reaction time is 0.73, then the correlation between reaction time and blood alcohol level is -0.73.

___ 2. A correlation of zero between two quantitative variables means that A) we have done something wrong in our calculation of r. B) there is no association between the two variables. C) there is no linear association between the two variables. D) re-expressing the data will guarantee a linear association between the two variables. E) None of the above.

___ 3. A residuals plot is useful because I. it will help us to see whether our model is appropriate. II. it might show a pattern in the data that was hard to see in the original scatterplot. III. it will clearly identify influential points. A) I only B) II only C) I and II only D) I and III only E) I, II, and III

___ 4. Which of the following is not a goal of re-expressing data? A) Make the distribution of a variable more symmetric. B) Make the spread of several groups more alike. C) Make the form of a scatterplot more nearly linear. D) Make the scatter in a scatterplot spread out evenly rather than following a fan shape. E) All of the above are goals of re-expressing data.

___ 5. The correlation coefficient between the hours that a person is awake during a 24-hour period and the hours that same person is asleep during a 24-hour period is most likely to be A) exactly +1.0 B) near +0.8 C) near 0 D) near -0.8 E) exactly -1.

___ 6. The correlation coefficient between high school grade point average (GPA) and college GPA is 0.560. For a student with a high school GPA that is 2.5 standard deviations above the mean, we would expect that student to have a college GPA that is the mean. A) equal to B) 0.56 SD above C) 1.4 SD above D) 2.5 SD above

___ 7. A regression analysis of students’ college grade point averages (GPAs) and their high school GPAs found R^2 = 0.311. Which of these is true? I. High school GPA accounts for 31.1% of college GPA. II. 31.1% of college GPAs can be correctly predicted with this model. III. 31.1% of the variance in college GPA can be accounted for by the model

A) I only B) II only C) III only D) I and II E) None

II-

___ 8. Although there are annual ups and downs, over the long run, growth in the stock market averages about 9% per year. A model that best describes the value of a stock portfolio is probably: A) linear B) logarithmic C) exponential D) power E) quadratic

___ 9. When using midterm exam scores to predict a student’s final grade in a class, the student would prefer to have a A) positive residual, because that means the student’s final grade is higher than we would predict with the model. B) positive residual, because that means the student’s final grade is lower than we would predict with the model. C) residual equal to zero, because that means the student’s final grade is exactly what we would predict with the model. D) negative residual, because that means the student’s final grade is lower than we would predict with the model. E) negative residual, because that means the student’s final grade is higher than we would predict with the model.

___ 10.The model distance ˆ^ 3.30  0.235( speed )can be used to predict the stopping distance

(in feet) for a car traveling at a specific speed (in mph). According to this model, about how much distance will a car going 65 mph need to stop? A) 4.3 feet B) 18.6 feet C) 27.0 feet D) 345.0 feet E) 729.0 feet

  1. Storks Data show that there is a positive association between the population of 17 European countries and the number of stork pairs in those countries. a. Briefly explain what “positive association” means in this context.

b. Wildlife advocates want the stork population to grow, and jokingly suggest that citizens should be encouraged to have children. As a statistician, what do you think of this suggestion? Explain briefly.

II-

  1. Studying for exams A philosophy professor has found a correlation of 0.80 between the number of hours students study for his exams and their exam performance. During the time he collected the data, students studied an average of 10 hours with a standard deviation of 2. hours, and scored an average of 80 points with a standard deviation of 7.5 points. a. Create a linear model to estimate the number of points a student will score on the next exam from the number of hours the student studies.

b. If a student studies for 15 hours, what score should the student expect on the next exam? Show your work.

  1. Height and weight Suppose that both height and weight of adult men can be described with Normal models, and that the correlation between these variables is 0.65. If a man’s height places him at the 60th^ percentile, at what percentile would you expect his weight to be?

II-

  1. Carbon dating QuarkNet, a project funded by the National Science Foundation and the U.S. Department of Energy, poses the following problem on its website: “Last year, deep within the Soudan mine, QuarkNet teachers began a long-term experiment to measure the amount of carbon-14 remaining in an initial 100-gram sample at 2000-year intervals. The experiment will be complete in the year 32001. Fortunately, a method for sending information backwards in time will be discovered in the year 29998, so, although the experiment is far from over, the results are in.” Here is a portion of the data: Time (yr) 0 2000 4000 6000 8000 10,000 12,000 14,000 16,000 18, Mass (g) 100 76 61 47 36 29 22 17 13 10

A scatterplot of these data looks like:

Time (yr)

Mass (g)

0 5000 10000 15000 20000

100

80

60

40

20

0

Scatterplot of Mass (g) vs Time (yr)

a. Straighten the scatterplot by re-expressing these data and create an appropriate model for predicting the mass from the year.

b. Use your model to estimate what the mass will be after 7500 years.

c. Can you use your model to predict when 50 g of the sample will be left? Explain.