A Note on Test for Model Adequacy in Nonlinear Regression


2004, Vol. 15, No. 3, pp. 689∼694

A Note on Test for Model Adequacy in Nonlinear Regression ¹⁾

Myung-Wook Kahng ²⁾

Abstract

We investigate the test for model adequacy in nonlinear regression. We can expect the usual likelihood ratio statistic to be unaffected by any parametric-effect curvature; only the effect of intrinsic curvature needs to be considered. A multiplicative correction factor, which is a function of the intrinsic curvature arrays, is derived for the limiting distribution of the test statistic.

Keywords : Correction factor, Intrinsic curvature, Intrinsic curvature array, Model adequacy, Parametric-effect curvature

1. Introduction

Once a nonlinear regression model has been chosen and the unknown parameters estimated, the question of the adequacy of the model to the data arises. Usually one would look at the residuals and carry out various plots. If the intrinsic curvature is negligible, as is often the case, then the usual residual plots and likelihood ratio test carried out for linear models are also appropriate in nonlinear regression analysis. However, if there is substantial intrinsic curvature, then the projected residuals suggested by Cook and Tsai (1985) can be used, and the F-approximation to the likelihood ratio test can be seriously affected by the curvature (Bates and Watts, 1980, 1981; Hamilton, 1986).

When the model is replicated, the adequacy of the model can be tested directly.

We consider the likelihood ratio statistic for the test of lack of fit. This test is unaffected by any parametric-effect curvature, but the effect of intrinsic curvature needs to be considered. A multiplicative correction factor is derived for the limiting distribution of the test statistic, which is a function of the intrinsic curvature arrays.

1) This research was supported by the Sookmyung Women's University Research Grants 2003.

2) Professor, Department of Statistics, Sookmyung Women's University, Seoul, 140-742, Korea

E-mail : [email protected]

If there are no replicates, then a number of test procedures are available, generally based on the idea of near-replicates. Several of these, which are also suitable for nonlinear models, are given by Miller, Neill and Sherfey (1999).

2. Nonlinear Regression Model

Consider the standard nonlinear regression model

$$ y_i = f(x_i : \theta) + \varepsilon_i, \quad i = 1, 2, \ldots, n, \qquad (2.1) $$

in which the $i$-th response $y_i$ is related to the $q$-dimensional vector of known explanatory variables $x_i$ through the known nonlinear model function $f$, which depends on the $p$-dimensional unknown parameter vector $\theta$, and $\varepsilon_i$ is the error. We assume that $f$ is twice continuously differentiable in $\theta$, and that the errors $\varepsilon_i$ are independent identically distributed normal random variables with mean 0 and variance $\sigma^2$. The least squares estimate of $\theta$, denoted by $\hat\theta$, minimizes the error sum of squares

$$ S(\theta) = \sum_{i=1}^{n} [ y_i - f(x_i : \theta) ]^2 . $$
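To make the definition of $S(\theta)$ concrete, the following minimal sketch evaluates the error sum of squares for a hypothetical model function and locates its minimizer by a crude grid search. The model form, the data, and the grid are all assumptions chosen for illustration; a real analysis would use an iterative least squares routine such as Gauss-Newton.

```python
import math

def f(x, theta):
    """Hypothetical model function: theta[0] * (1 - exp(-theta[1] * x))."""
    return theta[0] * (1.0 - math.exp(-theta[1] * x))

def error_sum_of_squares(theta, xs, ys):
    """S(theta) = sum_i [y_i - f(x_i : theta)]^2."""
    return sum((y - f(x, theta)) ** 2 for x, y in zip(xs, ys))

# Illustrative noise-free data generated from theta = (10, 0.5).
xs = [0.5, 1.0, 2.0, 4.0, 8.0]
ys = [f(x, (10.0, 0.5)) for x in xs]

# Crude grid search standing in for a proper least squares routine.
grid = [(a / 10.0, b / 100.0) for a in range(50, 151) for b in range(10, 101)]
theta_hat = min(grid, key=lambda t: error_sum_of_squares(t, xs, ys))
print(theta_hat, error_sum_of_squares(theta_hat, xs, ys))
```

Because the data here contain no noise and the generating value lies on the grid, the search recovers it with $S(\hat\theta) = 0$; with noisy data the minimum of $S(\theta)$ would be positive.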

Let the $p$-dimensional parameter vector $\theta$ be partitioned in the form $\theta = (\theta_1^T, \theta_2^T)^T$, where $\theta_1$ is a $p_1$-dimensional nuisance vector and $\theta_2$ is a $p_2$-dimensional vector of primary interest with $p_1 + p_2 = p$. Now we consider testing $H_0 : \theta_2 = \theta_{20}$. To apply the likelihood ratio procedure we need to minimize $S(\theta)$ subject to $\theta_2 = \theta_{20}$. Let $S(\tilde\theta) = S(\tilde\theta_1(\theta_{20}), \theta_{20})$ be the minimum value of $S(\theta)$ under $H_0$. Then the likelihood ratio statistic is

$$ F_{LR} = \frac{S(\tilde\theta) - S(\hat\theta)}{S(\hat\theta)} \cdot \frac{n - p}{p_2}, \qquad (2.2) $$

which is approximately distributed as $F_{p_2, n-p}$ when $H_0$ is true.
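As a small numerical illustration of (2.2), the sketch below computes $F_{LR}$ from the restricted and unrestricted minima of $S(\theta)$. The function name and the numbers are hypothetical and are not taken from any data set.

```python
def likelihood_ratio_f(s_restricted, s_full, n, p, p2):
    """F_LR of (2.2): [(S(theta~) - S(theta^)) / S(theta^)] * (n - p) / p2."""
    if s_full <= 0 or p2 <= 0 or n <= p:
        raise ValueError("need S(theta^) > 0, p2 > 0 and n > p")
    return (s_restricted - s_full) / s_full * (n - p) / p2

# Hypothetical values: n = 20 observations, p = 4 parameters, p2 = 2 under test.
f_lr = likelihood_ratio_f(s_restricted=12.0, s_full=8.0, n=20, p=4, p2=2)
print(f_lr)  # 4.0
```

The resulting value would then be referred to the $F_{p_2, n-p}$ distribution, here with 2 and 16 degrees of freedom.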


3. Lack of Fit Test

Suppose a design is replicated, say $n_i$ times at the point $x_i$, so that our nonlinear model (2.1) now becomes

$$ y_{ij} = \eta_i + \varepsilon_{ij} = f(x_i : \theta) + \varepsilon_{ij}, \quad i = 1, 2, \ldots, k, \quad j = 1, 2, \ldots, n_i, \qquad (3.1) $$

where the errors $\varepsilon_{ij}$ are assumed to be i.i.d. $N(0, \sigma^2)$ and $k$ is the number of distinct settings of the explanatory variables. This model includes the genuine variation between different experiments at the same $x$ as well as measurement error.

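Ignoring the structure on $\eta_i$, we have the usual decomposition

$$ \sum_i \sum_j (y_{ij} - \eta_i)^2 = \sum_i \sum_j (y_{ij} - \bar y_{i\cdot} + \bar y_{i\cdot} - \eta_i)^2 = \sum_i \sum_j (y_{ij} - \bar y_{i\cdot})^2 + \sum_i n_i (\bar y_{i\cdot} - \eta_i)^2, \qquad (3.2) $$

where $\bar y_{i\cdot}$ is the mean of the $n_i$ replicates at $x_i$, and $\sum_i \sum_j (y_{ij} - \bar y_{i\cdot})^2$ is usually referred to as the pure error sum of squares.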
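The last equality in (3.2) holds because the cross-product term vanishes. Writing $\bar y_{i\cdot} = n_i^{-1} \sum_j y_{ij}$ for the mean of the replicates at $x_i$, the step can be spelled out as

$$ 2 \sum_i \sum_j (y_{ij} - \bar y_{i\cdot})(\bar y_{i\cdot} - \eta_i) = 2 \sum_i (\bar y_{i\cdot} - \eta_i) \sum_j (y_{ij} - \bar y_{i\cdot}) = 0, $$

since $\sum_j (y_{ij} - \bar y_{i\cdot}) = 0$ for every $i$. The argument does not depend on the values of the $\eta_i$, so the decomposition holds for any choice of them.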

Defining $n = \sum_i n_i$, an unbiased estimate of $\sigma^2$ is

$$ s_e^2 = \frac{1}{n - k} \sum_i \sum_j (y_{ij} - \bar y_{i\cdot})^2, $$

and under the normality assumption on the $\varepsilon_{ij}$,

$$ \frac{(n - k) s_e^2}{\sigma^2} \sim \chi^2_{n-k} . $$
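The pure error estimate can be computed directly from the replicate groups. The sketch below is a minimal illustration with made-up responses; the helper name `pure_error_variance` is an assumption, not notation from the paper.

```python
def pure_error_variance(groups):
    """Return (SS_PE, s_e^2) for a list of replicate groups, one per design point."""
    n = sum(len(g) for g in groups)  # total number of observations
    k = len(groups)                  # number of distinct design points
    if n <= k:
        raise ValueError("need at least one replicated design point")
    ss_pe = 0.0
    for g in groups:
        mean = sum(g) / len(g)
        ss_pe += sum((y - mean) ** 2 for y in g)
    return ss_pe, ss_pe / (n - k)

# Made-up responses at k = 3 design points with n_i = 2, 3, 1 replicates.
groups = [[1.0, 3.0], [4.0, 6.0, 8.0], [10.0]]
ss_pe, s2_e = pure_error_variance(groups)
print(ss_pe, s2_e)
```

A design point with a single observation contributes nothing to the pure error sum of squares, which is why only replicated points carry information about $\sigma^2$ here.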

To find the least squares estimate $\hat\theta$ of $\theta$ we minimize $\sum_i \sum_j (y_{ij} - \eta_i)^2$ with respect to $\theta$, which is equivalent to minimizing $\sum_i n_i (\bar y_{i\cdot} - \eta_i)^2$, i.e. a weighted least squares analysis with weights $n_i$. The normal equations for $\hat\theta$ are therefore

$$ -2 \sum_i n_i \{ \bar y_{i\cdot} - f(x_i : \theta) \} \frac{\partial f(x_i : \theta)}{\partial \theta} \Big|_{\theta = \hat\theta} = 0 . $$
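The equivalence of the two minimization problems can be checked numerically: the two criteria differ only by the pure error sum of squares, which does not involve $\theta$. The sketch below assumes a hypothetical one-parameter model and made-up data, and uses a grid search in place of solving the normal equations.

```python
import math

def f(x, theta):
    """Hypothetical one-parameter model exp(-theta * x), assumed for illustration."""
    return math.exp(-theta * x)

# Made-up replicated data: xs[i] is the design point, groups[i] its responses.
xs = [0.5, 1.0, 2.0]
groups = [[0.62, 0.58], [0.35, 0.40, 0.39], [0.15, 0.11]]
means = [sum(g) / len(g) for g in groups]

def full_ss(theta):
    """Sum over i and j of (y_ij - eta_i)^2."""
    return sum((y - f(x, theta)) ** 2 for x, g in zip(xs, groups) for y in g)

def weighted_ss(theta):
    """Sum over i of n_i * (ybar_i - eta_i)^2."""
    return sum(len(g) * (m - f(x, theta)) ** 2
               for x, g, m in zip(xs, groups, means))

pure_error = sum((y - m) ** 2 for g, m in zip(groups, means) for y in g)

grid = [t / 1000.0 for t in range(1, 2001)]
theta_full = min(grid, key=full_ss)
theta_weighted = min(grid, key=weighted_ss)

# The two criteria differ by the constant SS_PE, so they share a minimizer.
max_gap = max(abs(full_ss(t) - weighted_ss(t) - pure_error) for t in grid)
print(theta_full, theta_weighted, max_gap)
```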

If $\eta_i$ is replaced by $\hat\eta_i = f(x_i : \hat\theta)$, the identity (3.2) still applies, so that

$$ \sum_i \sum_j (y_{ij} - \hat\eta_i)^2 = \sum_i \sum_j (y_{ij} - \bar y_{i\cdot})^2 + \sum_i n_i (\bar y_{i\cdot} - \hat\eta_i)^2, $$

that is,

$$ SSE = SS_{PE} + (SSE - SS_{PE}) = SS_{PE} + SS_{LOF} . $$

The left-hand side, called the residual sum of squares, is therefore split into a pure error sum of squares $SS_{PE}$ and a lack of fit sum of squares $SS_{LOF}$. For the case when $\eta_i$ is a linear function of $\theta$, a test for the validity of this model is given by Seber (1977),

$$ F = \frac{SS_{LOF} / (k - p)}{SS_{PE} / (n - k)} = \frac{\sum_i n_i (\bar y_{i\cdot} - \hat\eta_i)^2}{\sum_i \sum_j (y_{ij} - \bar y_{i\cdot})^2} \cdot \frac{n - k}{k - p}, \qquad (3.3) $$

where $F \sim F_{k-p, n-k}$ when the model is valid. Because of asymptotic linearity, the above F-test is approximately valid for large $n$ when the model is nonlinear.
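The statistic (3.3) needs only the replicate groups, the fitted values $\hat\eta_i$, and the number of parameters $p$. The following sketch shows the arithmetic; the fitted values are placeholders rather than the output of an actual least squares fit, and the function name is hypothetical.

```python
def lack_of_fit_f(groups, fitted, p):
    """F of (3.3) from replicate groups and fitted values eta_hat_i (one per group)."""
    k = len(groups)
    n = sum(len(g) for g in groups)
    if not (p < k < n):
        raise ValueError("need p < k < n")
    means = [sum(g) / len(g) for g in groups]
    # Pure error: variation of the replicates about their own group means.
    ss_pe = sum((y - m) ** 2 for g, m in zip(groups, means) for y in g)
    # Lack of fit: weighted variation of the group means about the fitted values.
    ss_lof = sum(len(g) * (m - e) ** 2 for g, m, e in zip(groups, means, fitted))
    return (ss_lof / (k - p)) / (ss_pe / (n - k))

groups = [[1.0, 3.0], [4.0, 6.0, 8.0], [10.0]]
fitted = [2.5, 5.0, 9.0]  # placeholder fitted values, not from an actual fit
f_stat = lack_of_fit_f(groups, fitted, p=1)
print(f_stat)
```

Here $SS_{LOF} = 4.5$ on $k - p = 2$ degrees of freedom and $SS_{PE} = 10$ on $n - k = 3$, so the ratio is $2.25 / 3.333\ldots = 0.675$.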

4. Correction Factors

Since $f(x : \theta)$ and $S(\theta)$ are invariant under one-to-one transformations of $\theta$, the likelihood ratio statistic $F_{LR}$ is unaffected by the parameter-effect curvature. However, the effect of intrinsic curvature needs to be considered.

Hamilton and Wiens (1987) studied second-order approximations for the distribution of $F_{LR}$ and developed correction terms which are functions of the intrinsic curvature arrays. Assuming normality of the $\varepsilon_i$, they showed that as $\sigma^2 \to 0$ (rather than $n \to \infty$), the limiting null distribution of $F_{LR}$ is, to order $\sigma^3$,

$$ (1 - \gamma \sigma^2) F_{p_2, n-p}, \qquad (4.1) $$

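where $\gamma = \alpha_2 / (n - p) - \alpha_1 / p_2$ and

$$ \alpha_1 = \alpha_0 - \alpha_2, $$

$$ \alpha_2 = -\frac{1}{2} \sum_{i=1}^{n-p} \mathrm{tr}\{ (A^N_{i\cdot\cdot})^2 \} + \frac{1}{4} \sum_{i=1}^{n-p} \{ \mathrm{tr}(A^N_{i\cdot\cdot}) \}^2, $$

$$ \alpha_0 = -\frac{1}{2} \sum_{i=1}^{n-p_1} \mathrm{tr}\{ (A^N_{0i\cdot\cdot})^2 \} + \frac{1}{4} \sum_{i=1}^{n-p_1} \{ \mathrm{tr}(A^N_{0i\cdot\cdot}) \}^2 . $$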

Here the $p \times p$ matrix $A^N_{i\cdot\cdot}$, $i = 1, 2, \ldots, n - p$, is the $i$-th face of the intrinsic curvature array $A^N_{\cdot\cdot}$ for the full model (2.1) (Seber and Wild, 1989, p. 142), and the $p_1 \times p_1$ matrix $A^N_{0i\cdot\cdot}$, $i = 1, 2, \ldots, n - p_1$, is the $i$-th face of the intrinsic curvature array for the restricted model obtained by fixing $\theta_2 = \theta_{20}$ and working with $\theta_1$. To evaluate $\gamma$ we use $\theta_2 = \theta_{20}$ and $\theta_1 = \tilde\theta_1(\theta_{20})$, the restricted least squares estimator of $\theta$. The variance $\sigma^2$ is estimated by $s^2 = S(\tilde\theta)/(n - p)$. The computations can be simplified by an appropriate choice of basis for the sample space.
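Once the faces of the two intrinsic curvature arrays are available, the quantities $\alpha_0$, $\alpha_2$ and $\gamma$ reduce to traces of small matrices. The sketch below shows only that arithmetic. The arrays are made-up symmetric matrices, not curvature arrays of any real model, and the helper names are hypothetical; computing the arrays themselves is outside the scope of this example.

```python
def trace(m):
    """Trace of a square matrix given as a list of rows."""
    return sum(m[i][i] for i in range(len(m)))

def matmul(a, b):
    """Product of two square matrices of the same size."""
    size = len(a)
    return [[sum(a[i][l] * b[l][j] for l in range(size)) for j in range(size)]
            for i in range(size)]

def alpha(faces):
    """-1/2 * sum_i tr(A_i^2) + 1/4 * sum_i (tr A_i)^2 over the given faces."""
    return sum(-0.5 * trace(matmul(a, a)) + 0.25 * trace(a) ** 2 for a in faces)

def gamma(alpha0, alpha2, n, p, p2):
    """gamma = alpha2 / (n - p) - alpha1 / p2 with alpha1 = alpha0 - alpha2."""
    alpha1 = alpha0 - alpha2
    return alpha2 / (n - p) - alpha1 / p2

# Made-up arrays for n = 4, p = 2, p1 = 1 (so p2 = 1): n - p = 2 faces of size
# 2 x 2 for the full model, n - p1 = 3 faces of size 1 x 1 for the restricted one.
full_faces = [[[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]]]
restricted_faces = [[[2.0]], [[0.0]], [[1.0]]]

alpha2 = alpha(full_faces)
alpha0 = alpha(restricted_faces)
g = gamma(alpha0, alpha2, n=4, p=2, p2=1)
factor = 1.0 - g * 0.04  # factor in (4.1), with sigma^2 = 0.04 assumed
print(alpha2, alpha0, g, factor)
```

With these inputs $\alpha_2 = -1$, $\alpha_0 = -1.25$, $\alpha_1 = -0.25$ and $\gamma = -0.25$, so the factor $1 - \gamma\sigma^2$ is slightly greater than one.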

The usual test statistic for evaluating lack of fit in regression is the likelihood ratio statistic for the test

$$ H_0 : y_{ij} = f(x_i : \theta) + \varepsilon_{ij} $$

$$ H_1 : y_{ij} = g_i(x_i : \theta) + \varepsilon_{ij} $$

Let $\delta_i = E(y \mid x_i) = g_i(x_i : \theta)$ for $i = 1, 2, \ldots, k$. Then the model under $H_0$ is a restriction of that under $H_1$, and a reparametrization $\phi = \phi(\theta)$ can be found for which the null hypothesis is $H_0 : \phi_2 = 0$, where $\phi = (\phi_1^T, \phi_2^T)^T$, $\phi_1$ is $p \times 1$, and $\phi_2$ is $(k - p) \times 1$. Thus the result in (4.1) can be applied, with the number of parameters equal to $k$ and the number of nuisance parameters equal to $p$. Fortunately it is not necessary to find $\phi$ or the model derivatives with respect to $\phi$, because of the invariance of the intrinsic arrays and $\alpha_{11}$ and $\alpha_{12}$ under reparametrization. The lack of fit statistic $F$ in (3.3) is then approximately distributed as

$$ \left[ 1 - \sigma^2 \left( \frac{\alpha_2}{n - k} - \frac{\alpha_1}{k - p} \right) \right] \cdot F_{k-p, n-k}, $$

where $\alpha_1$ and $\alpha_2$ are calculated for the restricted nonlinear model.
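One way to use this result in practice, sketched below under stated assumptions, is to divide the observed statistic (3.3) by the multiplicative factor and compare the result with the usual $F_{k-p, n-k}$ quantiles. The function name and all input values, including the $\alpha$ terms and the variance estimate, are hypothetical placeholders.

```python
def lack_of_fit_correction(alpha1, alpha2, sigma2, n, k, p):
    """Factor 1 - sigma^2 * (alpha2 / (n - k) - alpha1 / (k - p))."""
    if not (p < k < n):
        raise ValueError("need p < k < n")
    return 1.0 - sigma2 * (alpha2 / (n - k) - alpha1 / (k - p))

# Hypothetical inputs: the alpha values and sigma2 are placeholders.
factor = lack_of_fit_correction(alpha1=-0.25, alpha2=-1.0, sigma2=0.04,
                                n=6, k=3, p=1)
f_stat = 0.675                # an observed value of (3.3), assumed here
f_adjusted = f_stat / factor  # compare with F(k - p, n - k) quantiles
print(factor, f_adjusted)
```

If the statistic is approximately distributed as the factor times an $F_{k-p, n-k}$ variable, then the rescaled value is approximately $F_{k-p, n-k}$, so standard tables can still be used. When the intrinsic curvature is negligible the factor is close to one and the adjustment has little effect.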

References

1. Bates, D. M. and Watts, D. G. (1980). Relative curvature measures of nonlinearity (with discussion), Journal of the Royal Statistical Society, Series B, Vol. 42, 1-25.

2. Bates, D. M. and Watts, D. G. (1981). Parameter transformations for improved approximate confidence regions in nonlinear least squares, The Annals of Statistics, Vol. 9, 1152-1167.

3. Cook, R. D. and Tsai, C.-L. (1985). Residuals in nonlinear regression, Biometrika, Vol. 72, 23-29.

4. Hamilton, D. C. (1986). Confidence regions for parameter subsets in nonlinear regression, Biometrika, Vol. 73, 57-64.

5. Hamilton, D. C. and Wiens, D. (1987). Correction factors for F ratios in nonlinear regression, Biometrika, Vol. 74, 423-425.

6. Miller, F. R., Neill, J. W. and Sherfey, B. W. (1999). Implementation of a maximin power clustering criterion to select near replicates for regression lack of fit tests, Journal of the American Statistical Association, Vol. 94, 610-620.

7. Seber, G. A. F. (1977). Linear Regression Analysis, John Wiley & Sons: New York.

8. Seber, G. A. F. and Wild, C. J. (1989). Nonlinear Regression, John Wiley & Sons: New York.

[ received date : Jun. 2004, accepted date : Aug. 2004 ]
