
LECTURE NOTE, DAY 19

ESTIMATION IN MULTIPLE REGRESSION

& GOODNESS OF FITS

ECO 3007, 2016 SPRING INSTRUCTOR : JUNGMO YOON

HANYANG UNIVERSITY

6.3. The OLS Estimator in Multiple Regression

Read pages 189 - 192. To present the material in this section well, we need matrix algebra. I will spend some time reviewing matrix notation.

6.4. Measures of Fit

Read pages 193 - 197. The standard error of the regression (SER) estimates the standard deviation of the error u.

$$SER = \sqrt{s^2_{\hat{u}}}, \quad \text{where} \quad s^2_{\hat{u}} = \frac{1}{n-k-1} \sum_{i=1}^{n} \hat{u}_i^2 = \frac{SSR}{n-k-1}.$$

Here k is the number of independent variables. The regression R² is defined by

$$R^2 = \frac{ESS}{TSS} = 1 - \frac{SSR}{TSS}$$

where $ESS = \sum_{i=1}^{n} (\hat{Y}_i - \bar{Y})^2$, $TSS = \sum_{i=1}^{n} (Y_i - \bar{Y})^2$, and $SSR = \sum_{i=1}^{n} (Y_i - \hat{Y}_i)^2$.

A problem with R²

Whenever you add regressors, $R^2$ never decreases, and in practice it almost always increases. This makes it easy to manipulate results: by adding (possibly meaningless) regressors, you can always make your regression results look better. Many people do this, so be careful and don’t be fooled.

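Example) Consider a minimization problem,

$$\min_{\beta_0, \beta_1} \sum_{i=1}^{n} (Y_i - \beta_0 - \beta_1 X_{1i})^2.$$

Once we find the solution $(\hat{\beta}_0, \hat{\beta}_1)$, the SSR of this problem is defined by $SSR_1 = \sum_{i=1}^{n} (Y_i - \hat{\beta}_0 - \hat{\beta}_1 X_{1i})^2$.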

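Now suppose you add another regressor $X_2$ and solve the following minimization problem,

$$\min_{\beta_0, \beta_1, \beta_2} \sum_{i=1}^{n} (Y_i - \beta_0 - \beta_1 X_{1i} - \beta_2 X_{2i})^2.$$

Then the SSR of the second problem is $SSR_2 = \sum_{i=1}^{n} (Y_i - \hat{\beta}_0 - \hat{\beta}_1 X_{1i} - \hat{\beta}_2 X_{2i})^2$, where $(\hat{\beta}_0, \hat{\beta}_1, \hat{\beta}_2)$ now denotes the solution of the second problem. My claim is that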

$$SSR_1 \geq SSR_2 \quad \text{all the time,}$$

that is, the sum of squared residuals of the second problem is never larger than the sum of squared residuals of the first problem. Why?

Think about this. When you approach the second problem, the least you can do is set $\beta_2 = 0$ and reuse the solution of the first problem, which gives a sum of squares exactly equal to $SSR_1$. So the minimum value of the second problem is at most $SSR_1$. By also searching for the optimal value of $\beta_2$, we can usually do better than this.

Now think about the definition, $R^2 = 1 - \frac{SSR}{TSS}$. Note that $TSS$, the variation in $Y$, is fixed: it does not depend on which regressors are included. Adding Xs can never increase $SSR$; therefore, $R^2$ can never decrease.
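The argument above can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the simulated data, the seed, and the helper name `ssr` are illustrative choices and are not part of the lecture. It fits the one-regressor model, then adds a pure-noise regressor that is unrelated to Y, and compares the two sums of squared residuals.

```python
import numpy as np

def ssr(y, regressors):
    """Sum of squared OLS residuals of y on an intercept plus the given regressors."""
    X = np.column_stack([np.ones(len(y))] + list(regressors))
    beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)  # least-squares solution
    resid = y - X @ beta_hat
    return float(resid @ resid)

# Simulated data (an assumption for illustration only)
rng = np.random.default_rng(0)
n = 50
x1 = rng.normal(size=n)
x2 = rng.normal(size=n)                  # pure noise, unrelated to y
y = 1.0 + 2.0 * x1 + rng.normal(size=n)

ssr1 = ssr(y, [x1])                      # first problem: intercept and X1
ssr2 = ssr(y, [x1, x2])                  # second problem: X2 added
tss = float(np.sum((y - y.mean()) ** 2)) # same TSS in both problems

r2_1 = 1 - ssr1 / tss
r2_2 = 1 - ssr2 / tss
print(f"SSR1 = {ssr1:.4f}, SSR2 = {ssr2:.4f}")
print(f"R2 (one regressor) = {r2_1:.4f}, R2 (two regressors) = {r2_2:.4f}")
```

Even though the extra regressor carries no information about Y, the second SSR should not exceed the first, so the second R² should not fall below the first.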

The adjusted R²

When we add one more X to a regression model, it will (i) decrease the prediction error (SSR), or at least not increase it, but (ii) increase the complexity of the model. We need to find the right balance between these two competing objectives. The adjusted $R^2$, written $\bar{R}^2$, tries to achieve this. Its definition is

$$\bar{R}^2 = 1 - \frac{n-1}{n-k-1} \cdot \frac{SSR}{TSS}.$$

Note that adding one more regressor decreases $SSR$ but increases the factor $\frac{n-1}{n-k-1}$. Therefore, the addition of extra regressors does not necessarily increase $\bar{R}^2$.
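A small arithmetic sketch may make this trade-off concrete. The numbers below (n = 20, TSS = 100, and the two SSR values) are hypothetical and chosen only for illustration, as are the function names. The second model adds one regressor that lowers SSR only slightly.

```python
def r_squared(ssr, tss):
    """R^2 = 1 - SSR / TSS."""
    return 1.0 - ssr / tss

def adjusted_r_squared(ssr, tss, n, k):
    """Adjusted R^2 = 1 - (n - 1) / (n - k - 1) * SSR / TSS."""
    return 1.0 - (n - 1) / (n - k - 1) * ssr / tss

# Hypothetical numbers, not taken from any data set
n, tss = 20, 100.0

r2_a = r_squared(40.0, tss)                     # model A: k = 1, SSR = 40
adj_a = adjusted_r_squared(40.0, tss, n, k=1)

r2_b = r_squared(39.5, tss)                     # model B: k = 2, SSR = 39.5
adj_b = adjusted_r_squared(39.5, tss, n, k=2)

print(f"Model A: R2 = {r2_a:.4f}, adjusted R2 = {adj_a:.4f}")
print(f"Model B: R2 = {r2_b:.4f}, adjusted R2 = {adj_b:.4f}")
```

With these numbers the ordinary R² rises from model A to model B, while the adjusted R² falls, because the small drop in SSR does not offset the larger degrees-of-freedom factor.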

Some observations.

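(i) $\bar{R}^2 = 1 - s^2_{\hat{u}} / s^2_Y$, where $s^2_Y = TSS/(n-1)$ is the sample variance of $Y$. This is because

$$\bar{R}^2 = 1 - \frac{n-1}{n-k-1} \cdot \frac{SSR}{TSS} = 1 - \frac{SSR/(n-k-1)}{TSS/(n-1)} = 1 - \frac{s^2_{\hat{u}}}{s^2_Y}.$$

(ii) $R^2 > \bar{R}^2$ because $1 < \frac{n-1}{n-k-1}$ (for $k \geq 1$ and $SSR > 0$).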

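(iii) $\bar{R}^2$ can be negative because

$$\bar{R}^2 = 1 - \underbrace{\frac{n-1}{n-k-1}}_{A} \cdot \underbrace{\frac{SSR}{TSS}}_{B}$$

and $0 \leq B \leq 1$, $A > 1$, so the product $A \cdot B$ can exceed 1 and you cannot know in advance whether $\bar{R}^2$ will be positive. It is negative exactly when $R^2 < k/(n-1)$.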
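The three observations can also be checked numerically. What follows is a minimal sketch, assuming NumPy; the helper `fit_measures`, the simulated data, and the small five-point data set are illustrative assumptions rather than material from the lecture. The five-point example is built so that Y is exactly uncorrelated with X, which forces SSR = TSS.

```python
import numpy as np

def fit_measures(y, X):
    """OLS of y on an intercept and the columns of X; returns the fit measures."""
    y = np.asarray(y, dtype=float)
    X = np.asarray(X, dtype=float).reshape(len(y), -1)
    n, k = X.shape                                    # k = number of regressors
    Z = np.column_stack([np.ones(n), X])              # add the intercept column
    beta_hat, *_ = np.linalg.lstsq(Z, y, rcond=None)
    resid = y - Z @ beta_hat
    ssr = float(resid @ resid)
    tss = float(np.sum((y - y.mean()) ** 2))
    s2_u = ssr / (n - k - 1)
    return {
        "s2_u": s2_u,
        "SER": float(np.sqrt(s2_u)),
        "R2": 1 - ssr / tss,
        "adj_R2": 1 - (n - 1) / (n - k - 1) * ssr / tss,
    }

# Observations (i) and (ii) on simulated data (illustrative assumption)
rng = np.random.default_rng(1)
n = 40
X = rng.normal(size=(n, 2))
y = 1.0 + X @ np.array([0.5, -1.0]) + rng.normal(size=n)
m = fit_measures(y, X)
s2_y = float(np.var(y, ddof=1))                       # sample variance, TSS / (n - 1)
print("(i)  adjusted R2 equals 1 - s2_u / s2_Y:", np.isclose(m["adj_R2"], 1 - m["s2_u"] / s2_y))
print("(ii) R2 > adjusted R2:", m["R2"] > m["adj_R2"])

# Observation (iii): Y has zero sample correlation with X, so SSR = TSS
x_small = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])
y_small = np.array([1.0, -2.0, 2.0, -2.0, 1.0])
m3 = fit_measures(y_small, x_small)
print("(iii) adjusted R2 in the uncorrelated example:", round(m3["adj_R2"], 4))
```

In the last example n = 5 and k = 1, so A = 4/3 and B = 1, and the adjusted R² comes out at about −1/3.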