Multiple Regression
Multiple regression is the extension of simple regression to the prediction of one dependent variable from more than one independent variable. The resulting models produce better predictions, but the calculations and interpretation are more complex.

In the case of several independent variables, “linear” means that the regression equation is linear in all of the independent variables and thus has the form y = b_0 + b_1 x_1 + b_2 x_2 + b_3 x_3 + ... + b_p x_p (p represents the number of predictors).

We use a model matching the model used for simple regression: we assume that in the population y is given by y = β_0 + β_1 x_1 + β_2 x_2 + ... + β_p x_p + ε, satisfying the conditions:

1. The errors are independent.
2. The errors (and so the y’s) have equal variance across the whole range of all the x_i’s.
3. The errors (that is, the ε’s) are normally distributed with mean 0.

In simple linear regression the geometric interpretation of the regression equation was a straight line. In multiple linear regression with two independent variables the regression equation, y = b_0 + b_1 x_1 + b_2 x_2, is represented by a plane in three-space. We cannot give a geometric representation for three or more independent variables.

An example:
The price of a house is to be predicted from the number of bedrooms and bathrooms.

    #bedrooms (x_1)    #bathrooms (x_2)    Price ($10,000s) (y)
          3                  2                   13.6
          2                  1                   11.6
          4                  3                   15.6
          2                  1                    9.8
          3                  2                   14.0
          2                  2                   12.2
          5                  3                   17.2
          4                  2                   15.4
          4                  2.5                 17.0
          5                  3.5                 18.2

The regression equation is y = 6.87 + 1.60 x_1 + 0.979 x_2.
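
The notes rely on Minitab for the fitting; as a rough cross-check, here is a minimal Python sketch (Python and numpy are my addition, not part of the notes) that fits the same ten houses by ordinary least squares and should reproduce coefficients close to 6.87, 1.60, and 0.979.

```python
import numpy as np

# House data from the table above: bedrooms (x1), bathrooms (x2), price in $10,000s (y)
x1 = np.array([3, 2, 4, 2, 3, 2, 5, 4, 4, 5], dtype=float)
x2 = np.array([2, 1, 3, 1, 2, 2, 3, 2, 2.5, 3.5])
y  = np.array([13.6, 11.6, 15.6, 9.8, 14.0, 12.2, 17.2, 15.4, 17.0, 18.2])

# Design matrix with a leading column of 1's for the intercept b0
X = np.column_stack([np.ones_like(x1), x1, x2])

# Least-squares estimates b0, b1, b2
b, *_ = np.linalg.lstsq(X, y, rcond=None)
print(b)   # roughly [6.87, 1.60, 0.979]
```
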
Standard deviation of the errors (residuals)

With two independent variables it is estimated by

    s = \sqrt{ \sum (y_i - \hat{y}_i)^2 / (n - 3) }

In general, with p predictors,

    s = \sqrt{ \sum (y_i - \hat{y}_i)^2 / (n - 1 - p) } = \sqrt{ SSE / (n - 1 - p) }

because there are p + 1 parameters estimated (to calculate \hat{y}_i): the numbers β_0, β_1, ..., β_p.
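
Continuing the illustrative sketch above (same assumed arrays), s follows directly from the residuals; here n = 10 and p = 2, so the divisor is n − 1 − p = 7.

```python
import numpy as np

# Same data and fit as in the earlier sketch
x1 = np.array([3, 2, 4, 2, 3, 2, 5, 4, 4, 5.0])
x2 = np.array([2, 1, 3, 1, 2, 2, 3, 2, 2.5, 3.5])
y  = np.array([13.6, 11.6, 15.6, 9.8, 14.0, 12.2, 17.2, 15.4, 17.0, 18.2])
X  = np.column_stack([np.ones_like(x1), x1, x2])
b, *_ = np.linalg.lstsq(X, y, rcond=None)

n, p = len(y), 2                 # 10 observations, 2 predictors
resid = y - X @ b                # y_i - yhat_i
SSE = np.sum(resid**2)           # sum of squared errors
s = np.sqrt(SSE / (n - 1 - p))   # residual standard deviation, df = n - 1 - p
print(s)                         # roughly 0.8 for these data
```
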
The normal equations for two independent variables:

    \sum y     = n b_0   + b_1 \sum x_1     + b_2 \sum x_2
    \sum x_1 y = b_0 \sum x_1 + b_1 \sum x_1^2   + b_2 \sum x_1 x_2
    \sum x_2 y = b_0 \sum x_2 + b_1 \sum x_1 x_2 + b_2 \sum x_2^2

We will not write down the normal equations for regression with three or more independent variables; they are really matrix equations. We will rely on Minitab for calculation of regression coefficients.
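
For the two-predictor case the normal equations are small enough to assemble and solve directly; this sketch (again an illustration in Python rather than the Minitab route the notes take) should give the same b_0, b_1, b_2 as the least-squares fit above.

```python
import numpy as np

x1 = np.array([3, 2, 4, 2, 3, 2, 5, 4, 4, 5.0])
x2 = np.array([2, 1, 3, 1, 2, 2, 3, 2, 2.5, 3.5])
y  = np.array([13.6, 11.6, 15.6, 9.8, 14.0, 12.2, 17.2, 15.4, 17.0, 18.2])
n = len(y)

# Coefficient matrix and right-hand side of the three normal equations above
A = np.array([[n,         x1.sum(),       x2.sum()],
              [x1.sum(),  (x1**2).sum(),  (x1*x2).sum()],
              [x2.sum(),  (x1*x2).sum(),  (x2**2).sum()]])
rhs = np.array([y.sum(), (x1*y).sum(), (x2*y).sum()])

b0, b1, b2 = np.linalg.solve(A, rhs)   # same b0, b1, b2 as the least-squares fit
print(b0, b1, b2)
```
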
Meaning of the coefficients:
The regression coefficients have a slightly more subtle interpretation with multiple predictors. As before, the coefficient b_i estimates β_i, but β_i gives the amount of change in y due to a change in x_i if all the other variables are held constant (no change in other variables), so it gives the effect that x_i has on y that is distinct from change due to the other variables. Sometimes this is different from (often less than) what you would see as the coefficient in simple regression.
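
With the house data this shows up clearly. The sketch below (my illustration, not from the notes) compares the bedroom coefficient from a simple regression on bedrooms alone with its coefficient in the two-predictor model; because bedrooms and bathrooms are correlated, the simple-regression slope should come out noticeably larger (roughly 2.2 versus 1.6).

```python
import numpy as np

x1 = np.array([3, 2, 4, 2, 3, 2, 5, 4, 4, 5.0])
x2 = np.array([2, 1, 3, 1, 2, 2, 3, 2, 2.5, 3.5])
y  = np.array([13.6, 11.6, 15.6, 9.8, 14.0, 12.2, 17.2, 15.4, 17.0, 18.2])

# Simple regression: y on bedrooms only
simple = np.linalg.lstsq(np.column_stack([np.ones_like(x1), x1]), y, rcond=None)[0]

# Multiple regression: y on bedrooms and bathrooms
multiple = np.linalg.lstsq(np.column_stack([np.ones_like(x1), x1, x2]), y, rcond=None)[0]

print(simple[1])    # slope for bedrooms alone (roughly 2.2)
print(multiple[1])  # bedroom coefficient with bathrooms held constant (roughly 1.6)
```
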
Coefficient of determination:
As with simple regression, R^2 [always capitalized; there is no multiple-variable correlation coefficient r] indicates the proportion of total variation that is “explained” by the regression equation.
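
A sketch of the R^2 computation for the house data, using R^2 = 1 − SSE/SST with SST the total sum of squares about the mean of y (the code itself is my illustration; the notes would read R^2 off the Minitab output).

```python
import numpy as np

x1 = np.array([3, 2, 4, 2, 3, 2, 5, 4, 4, 5.0])
x2 = np.array([2, 1, 3, 1, 2, 2, 3, 2, 2.5, 3.5])
y  = np.array([13.6, 11.6, 15.6, 9.8, 14.0, 12.2, 17.2, 15.4, 17.0, 18.2])
X  = np.column_stack([np.ones_like(x1), x1, x2])
b  = np.linalg.lstsq(X, y, rcond=None)[0]

SSE = np.sum((y - X @ b)**2)      # unexplained variation
SST = np.sum((y - y.mean())**2)   # total variation about the mean
R2  = 1 - SSE / SST               # proportion of variation explained
print(R2)                         # roughly 0.93 for these data
```
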
Adjusted coefficient of determination:

Unfortunately, adding even a total nonsense variable x_k will increase (or at least never reduce) R^2. We use

    R_a^2 = 1 - (1 - R^2) (n - 1) / (n - 1 - p)

for comparing regression models with different numbers of predictors (so that adding non-useful variables does not give the appearance of a meaningful improvement).
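
The adjustment is a one-line formula; continuing the same illustrative sketch with n = 10 and p = 2:

```python
import numpy as np

x1 = np.array([3, 2, 4, 2, 3, 2, 5, 4, 4, 5.0])
x2 = np.array([2, 1, 3, 1, 2, 2, 3, 2, 2.5, 3.5])
y  = np.array([13.6, 11.6, 15.6, 9.8, 14.0, 12.2, 17.2, 15.4, 17.0, 18.2])
X  = np.column_stack([np.ones_like(x1), x1, x2])
b  = np.linalg.lstsq(X, y, rcond=None)[0]

n, p = len(y), 2
R2     = 1 - np.sum((y - X @ b)**2) / np.sum((y - y.mean())**2)
R2_adj = 1 - (1 - R2) * (n - 1) / (n - 1 - p)   # penalizes extra predictors
print(R2, R2_adj)                                # roughly 0.93 and 0.91 here
```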

Test for significance: Our overall test for significance of the regression equation (“Do we have a linear part to any relation between y and the collection of x_i’s?”) is

    H_0: β_1 = β_2 = ... = β_p = 0   [all β_i’s are 0; in the population there is no linear part to any relationship between y and the x_i’s]
    H_a: not all β_i’s are 0   [in the population there is a linear part to a relationship between y and one or more of the x_i’s]

The test statistic is F = MSR / MSE, with numerator degrees of freedom p (degrees of freedom of the regression) and denominator degrees of freedom n − p − 1 [degrees of freedom of the error]. Here MSR and MSE [and the degrees of freedom] are the mean squares [and degrees of freedom] calculated by Minitab in the Analysis of Variance table. The square root of MSE is again s, the standard deviation of the residuals (but this time calculated with degrees of freedom n − p − 1).
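
A sketch of the overall F test for the house data (scipy is used here for the tail probability; the notes would read F, the mean squares, and the p-value off Minitab’s ANOVA table instead):

```python
import numpy as np
from scipy import stats

x1 = np.array([3, 2, 4, 2, 3, 2, 5, 4, 4, 5.0])
x2 = np.array([2, 1, 3, 1, 2, 2, 3, 2, 2.5, 3.5])
y  = np.array([13.6, 11.6, 15.6, 9.8, 14.0, 12.2, 17.2, 15.4, 17.0, 18.2])
X  = np.column_stack([np.ones_like(x1), x1, x2])
b  = np.linalg.lstsq(X, y, rcond=None)[0]

n, p = len(y), 2
SSE = np.sum((y - X @ b)**2)
SST = np.sum((y - y.mean())**2)
SSR = SST - SSE                      # variation explained by the regression

MSR = SSR / p                        # mean square for regression, df = p
MSE = SSE / (n - p - 1)              # mean square error, df = n - p - 1
F   = MSR / MSE                      # large F -> reject H0
p_value = stats.f.sf(F, p, n - p - 1)   # P(F_{p, n-p-1} > observed F)
print(F, p_value)                    # F is large here (roughly 48)
```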

Test for significance on individual coefficients: If the F-test shows the regression is significant overall, we can test for significance of individual coefficients (notice the similarity to ANOVA here: we must test for the overall model first, and only go to individuals if the overall test shows significance). The test for the i-th coefficient (i-th variable) is

    H_0: β_i = 0   [no independent linear contribution of x_i to the prediction]
    H_a: β_i ≠ 0   [x_i contributes to the linear prediction of y independently of the other variables]

The test statistic is the sample t = b_i / s_{b_i}, with n − p − 1 degrees of freedom and with s_{b_i} = s / \sqrt{SS_{x_i}} [as in the one-predictor case]. [Minitab gives the t-values, as well as this [sample] standard error for b_i, in the regression printout.]

Estimation of a coefficient: We get a (1 − α) confidence interval for β_i using b_i ± t_{α/2} s_{b_i}.
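
The notes leave these computations to Minitab. As an illustrative sketch, the standard errors s_{b_i} can be taken from the diagonal of s^2 (XᵀX)⁻¹, which is the general matrix form regression software uses; the t statistics and the (1 − α) confidence intervals then follow the formulas above.

```python
import numpy as np
from scipy import stats

x1 = np.array([3, 2, 4, 2, 3, 2, 5, 4, 4, 5.0])
x2 = np.array([2, 1, 3, 1, 2, 2, 3, 2, 2.5, 3.5])
y  = np.array([13.6, 11.6, 15.6, 9.8, 14.0, 12.2, 17.2, 15.4, 17.0, 18.2])
X  = np.column_stack([np.ones_like(x1), x1, x2])
b  = np.linalg.lstsq(X, y, rcond=None)[0]

n, p = len(y), 2
df = n - p - 1
s2 = np.sum((y - X @ b)**2) / df              # MSE = s^2
XtX_inv = np.linalg.inv(X.T @ X)
se_b = np.sqrt(s2 * np.diag(XtX_inv))         # standard errors s_{b_i}

t_stats  = b / se_b                            # t = b_i / s_{b_i}, df = n - p - 1
p_values = 2 * stats.t.sf(np.abs(t_stats), df) # two-sided p-values

alpha = 0.05
t_crit = stats.t.ppf(1 - alpha / 2, df)
ci_low, ci_high = b - t_crit * se_b, b + t_crit * se_b   # b_i ± t_{α/2} s_{b_i}
print(t_stats, p_values)
print(ci_low, ci_high)
```
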

In general, we want to eliminate from our model the variables whose coefficients are not significant; having non-useful variables costs us degrees of freedom and, in general, hides the significant relationships. It may happen, though, that two variables are each not significantly related to the response when both are in the equation, but each one is significant without the other; this indicates that they give some of the same information about the response (the independent contributions are small but the common contribution is real). In this case we would prefer to use one, but not both, of the variables as a predictor.

Confidence interval for μ_{y|x_p}: given by ŷ_p ± t_{α/2} s_{ŷ}.

Prediction interval for an individual value of y|x_p: given by ŷ_p ± t_{α/2} s_p. The standard error estimates s_{ŷ} and s_p involve matrix multiplications and the formulas will not be given here; we will use Minitab to calculate confidence intervals (for μ_{y|x_p}) and prediction intervals (for y|x_p).
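
Although the notes defer these standard errors to Minitab, the matrix form is short enough to sketch: for a new predictor row x_0 (with a leading 1 for the intercept), s_{ŷ} = s·\sqrt{x_0ᵀ (XᵀX)⁻¹ x_0} and s_p = s·\sqrt{1 + x_0ᵀ (XᵀX)⁻¹ x_0}. The example point (3 bedrooms, 2 bathrooms) is chosen only for illustration.

```python
import numpy as np
from scipy import stats

x1 = np.array([3, 2, 4, 2, 3, 2, 5, 4, 4, 5.0])
x2 = np.array([2, 1, 3, 1, 2, 2, 3, 2, 2.5, 3.5])
y  = np.array([13.6, 11.6, 15.6, 9.8, 14.0, 12.2, 17.2, 15.4, 17.0, 18.2])
X  = np.column_stack([np.ones_like(x1), x1, x2])
b  = np.linalg.lstsq(X, y, rcond=None)[0]

n, p = len(y), 2
df = n - p - 1
s = np.sqrt(np.sum((y - X @ b)**2) / df)
XtX_inv = np.linalg.inv(X.T @ X)

x0 = np.array([1.0, 3, 2])        # new point: intercept term, 3 bedrooms, 2 bathrooms
y_hat = x0 @ b                    # predicted mean price at x0
h = x0 @ XtX_inv @ x0             # the x0' (X'X)^{-1} x0 term

t_crit = stats.t.ppf(0.975, df)   # 95% intervals
ci = (y_hat - t_crit * s * np.sqrt(h),     y_hat + t_crit * s * np.sqrt(h))      # for mu_{y|x0}
pi = (y_hat - t_crit * s * np.sqrt(1 + h), y_hat + t_crit * s * np.sqrt(1 + h))  # for an individual y
print(ci, pi)
```
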

Qualitative [categorical] variables: “Dummy” or “indicator” variables [also called “binary” variables because the values are 0 or 1 for “yes” or “no”] are used for categorical variables. The coefficient indicates the effect of the “1” category (as compared to the “0” category) on the (average) value of the response. Tests for significance work as with other variables. The number of dummy variables needed to represent a categorical variable is, in general, one less than the number of categories [because one category can be represented by “none of the others”, i.e. all 0’s].
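
A small sketch of the encoding (the neighborhood variable and its categories are made up for illustration): three categories need only two 0/1 indicator columns, with the baseline category represented by both indicators being 0.

```python
import numpy as np

# Hypothetical categorical predictor with three categories
neighborhood = np.array(["north", "south", "east", "north", "east", "south"])

# Two dummy variables suffice for three categories; "east" is the baseline (both 0)
d_north = (neighborhood == "north").astype(float)
d_south = (neighborhood == "south").astype(float)

# These columns would be appended to the design matrix alongside the numeric predictors
print(np.column_stack([d_north, d_south]))
```
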

Multicollinearity: “Multicollinearity” refers to correlation among the independent variables (predictors). It makes t tests for the significance of the coefficients of the independent variables invalid, but does not harm the predictability of the dependent variable. A “rule of thumb” says that if the absolute value of the correlation coefficient (r_{x_i x_j}) between any pair of independent variables exceeds 0.7, then t-tests on the regression coefficients will not be reliable. [The coefficients will in general be less significant than indicated by the t-tests.]
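
The rule of thumb is easy to check from the correlation matrix of the predictors; for the house data the bedroom/bathroom correlation comes out around 0.89, well above 0.7, so the individual t-tests there should be read with caution.

```python
import numpy as np

x1 = np.array([3, 2, 4, 2, 3, 2, 5, 4, 4, 5.0])   # bedrooms
x2 = np.array([2, 1, 3, 1, 2, 2, 3, 2, 2.5, 3.5]) # bathrooms

r = np.corrcoef(x1, x2)[0, 1]   # pairwise correlation between the two predictors
print(r)                        # roughly 0.89, above the 0.7 rule-of-thumb threshold
```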