DOI 10.4134/JKMS.2010.47.6.1137
ON THE GOODNESS OF FIT TEST FOR DISCRETELY OBSERVED SAMPLE FROM DIFFUSION PROCESSES:
DIVERGENCE MEASURE APPROACH
Sangyeol Lee
Abstract. In this paper, we study the divergence based goodness of fit test for partially observed sample from diffusion processes. In order to derive the limiting distribution of the test, we study the asymptotic behavior of the residual empirical process based on the observed sample.
It is shown that the residual empirical process converges weakly to a Brownian bridge and the associated phi-divergence test has a chi-square limiting null distribution.
1. Introduction
The diffusion process has long been popular in analyzing random phenom- ena occurring in various fields such as finance, engineering, physical and med- ical sciences. As a basic reference, we refer to Karatzas and Shreve [5] and Shiryayev [20]. As the application of diffusion processes to real situations, es- pecially to financial market, is versatile, much attention has been paid to sta- tistical inference for diffusion processes and many sophisticated methods have been developed by researchers. For a general review, we refer to Prakasa Rao [18] and Kutoyants [7]. Past experience suggests that time series models such as general autoregressive conditional heteroscedastic (GARCH) models, which are the most popular in the financial time series context, are not well fitted to financial time series due to structural changes governed by the change of finan- cial policies and social panic events. This phenomenon is frequently observed in most financial time series data with high volatility: see, for instance, Lee, Tokutsu, and Maekawa [13] who empirically verified, through the CUSUM test, that most stock prices of Nikei 225 suffer from parameter changes when the underlying model of the data is assumed to follow a GARCH(1,1) model. With regard to the parameter change test for time series models, we refer to Lee, Ha, Na, and Na [8], Lee and Na [10], Lee, Nishiyama, and Yoshida [11], and
Received January 8, 2009; Revised September 24, 2009.
2000 Mathematics Subject Classification. 60J60, 62F05.
Key words and phrases. diffusion process, discretely observed sample, residual empirical process, weak convergence to a Brownian bridge, goodness of fit test, phi-divergence test.
c
°2010 The Korean Mathematical Society
1137
the articles cited in these papers. Since in general the stability of time series models is not guaranteed as mentioned above, instead of diffusion processes, researchers often consider using alternatives such as jump diffusion processes and L´evy processes: see Barndorff-Nielsen, Mikosch and Resnick [1], Shoutens [19] and Cont and Tankov [4]. Hence, in actual practice, it is very important to check whether time series follow diffusion or other processes.
In this paper, motivated by this viewpoint, we consider the goodness of fit test for diffusion processes. In particular, we concentrate on the phi-divergence test. The divergence measures are used as indices to similarity or dissimilarity between populations. They are also used either to measure mutual information concerning two r.v.’s and to construct goodness of fit tests. The most classic discrepancy is the Kullback-Leibler discrepancy while another important fam- ily of measures is the phi-divergence known as Csiszar’s family of divergence measures of information on divergence methods: see Read and Cressie [17] and Pardo [16]. Recently, Lee and Karagrigoriou [9] rediscovered that there is a strong link between the phi-divergence test and empirical process, and verified that the asymptotic behavior of the phi-divergence test can be analyzed by studying that of the empirical process. In particular, they showed that the as- ymptotic results on the residual empirical process can be used for deriving the limiting distribution of the phi-divergence tests for autoregressive time series models.
Here, we employ the phi-divergence test constructed with the residuals ob- tained based on discretely observed sample from diffusion processes. We do this because if the data is generated from a pure diffusion process, the resid- uals obtained from the sampled observations should behave like normal r.v.’s.
Then the whole testing procedure is reduced to the normality test for nearly i.i.d. observations. Since the phi-divergence test is strongly related to the resid- ual empirical process, we study its asymptotic behavior in diffusion processes, which itself may be of an independent interest. In fact, the residual empirical process in time series models has been extensively studied by many authors.
For instance, we refer to Lee and Wei [15], who deal with infinite order au- toregressive models and unstable models, Boldin [2] and Lee and Taniguchi [12], who handled GARCH models, Lee and Wee [14], who consider the diffu- sion process with a constant dispersion component, and papers cited in those articles.
In Section 2, it is shown that under regularity conditions, the residual em-
pirical process converges weakly to a Brownian bridge, and subsequently, by
using the link between the residual empirical process and phi-divergence test,
the latter has a chi-square limiting null distribution. Based on this result, one
can easily perform a goodness of fit test in actual practice. Although we do
not carry out an empirical study here, it is well known that the phi-divergence
test performs adequately in other situations, and the same result is anticipated
in our set-up. Finally, in Section 3, we provide the proof for the main theorem
addressed in Section 2.
2. Main results Let us consider the stochastic differential equation
dX t = a(X t ; θ)dt + b(X t ; σ)dW t , X 0 = x 0 , t ≥ 0, (2.1)
where (θ, σ) is a p + q-dimensional unknown parameter, a, b are known real valued functions, and W = {W t ; t ≥ 0} is a stochastic process that allows Equation (2.1) to have a unique strong solution. Further, we assume that
(A1) There exist constants C, m > 0 such that for any (θ, σ) and x, y,
|a(x; θ) − a(y; θ)| + |b(x; σ) − b(y; σ)| ≤ C|x − y|, sup
θ
0∈N
θ|| ˙a(x; θ
0)|| + sup
θ
0∈N
σ||˙b(x; σ
0)|| ≤ C(1 + |x| m ), sup
σ
0∈N
θ||¨a(x; θ
0)|| + sup
σ
0∈N
σ||¨b(x; σ
0)|| ≤ C(1 + |x| m ),
where ˙a = ∂a/∂θ, ¨a = ∂a 2 /∂θ 2 , ˙b = ∂b/∂σ, ¨a = ∂a 2 /∂σ 2 , and N θ and N σ are open neighborhoods of θ and σ.
(A2) inf x,σ b(x, σ) > 0.
Here we consider the problem of testing the following hypotheses:
H 0 : W is a standard Winer process vs. not H 0
via using the phi-divergence test introduced in Read and Cressie [17] and Pardo [16]. To task this, we study the asymptotic behavior of the residual empirical process constructed based on discreetly observed sample from the diffusion process in (2.1), say, X ti, t i = ih n , i = 1, . . . , n, where {h n } is a sequence of positive real numbers such that h n → 0 and nh n → ∞.
In what follows, we assume
(A3) Under H 0 , sup t E|X t | γ < ∞ for all γ > 0.
Further, we assume
(A4) Under H 0 , there exists an estimator (ˆ θ n , ˆ σ n ) of (θ, σ) such that (nh n ) 1/2 (ˆ θ n − θ) = O P (1) and n 1/2 (ˆ σ n − σ) = O P (1), and
(A5) nh 2 n → 0 and (nh n ) 1/2 / log n → ∞ as n → ∞.
Sufficient conditions for (A4) can be found in Kessler [6], which will be implicitly assumed without specification.
By using the Euler approximation, we can express
X ti− X ti−1 ' h n a(X ti−1; θ) + b(X ti; σ)(W ti− W ti−1).
' h n a(X ti−1; θ) + b(X ti; σ)(W ti− W ti−1).
; σ)(W ti− W ti−1).
).
In view of this, we define the residuals as ˆ
η ni = {X ti− X ti−1− h n a(X ti−1; ˆ θ n )}/b(X ti−1; ˆ σ n )h 1/2 n .
(2.2)
− h n a(X ti−1; ˆ θ n )}/b(X ti−1; ˆ σ n )h 1/2 n .
(2.2)
; ˆ σ n )h 1/2 n .
(2.2)
Since the phi-divergence test is strongly related to the residual empirical pro- cess, we consider the residual empirical process defined by
Y ˆ n (x) = 1
√ n h nh
X
i=1
{I(ˆ η ni ≤ x) − Φ(x)}, x ∈ R, (2.3)
where n h is the largest integer that does not exceed nh n . Then we have the following result, the proof of which is provided in Section 3.
Theorem 2.1. Let
η ni = W ti√ − W ti−1
h n
. Assume that (A1)-(A5) hold. Then, as n → ∞,
Y ˆ n (x) = 1
√ n h
n
hX
i=1
{I(η ni ≤ x) − Φ(x)} + ξ n (x),
where sup x |ξ n (x)| = o P (1). Thus, Y n (Φ −1 (u)) converges weakly to a Brownian bridge W o (u).
Remark. Usually, the estimator of scale parameter affects the limiting distribu- tion of the residual empirical process. In fact, this phenomenon can be found in GARCH processes (cf. Lee and Taniguchi [12]). In our case, however, the estimation effect disappears as seen in Theorem 2.1.
Lee and Wee [14] established the weak convergence of the residual empir- ical process to a Brownian bridge when the diffusion process has a constant dispersion: many well known stochastic differential equation (SDE) models in finance like the Black-Scholes model can be transformed to SDE models with a constant dispersion. In this case, one can even verify that their result holds for n h = nh β with 0 < β < 1 by virtue of (3.2) as long as nh 1+β → ∞, although we would not provide a detailed proof for this. An advantage of using this n h
is that the time horizon n h · h diverges to ∞. This may not be possible for a general dispersion case.
Now, we introduce the phi-divergence test based on the residuals ˆ η ni . Let Λ be a class of continuously twice differentiable real valued functions ϕ defined on R + such that ϕ(1) = 0, ϕ
00(1) 6= 0, 0ϕ(0/0) = 0 and 0ϕ(p/0) = lim u→∞ ϕ(u)/u.
For M ≥ 2, ϕ ∈ Λ, and the partition A i = (x i−1 , x i ], where −∞ = x 0 < x 1 <
· · · < x M = ∞ with Φ(x i ) = i/M , the ϕ-divergence test based on the residuals is given by
T ˆ n ϕ = 2n
ϕ
00(1) D ϕ (p, ˆ p n ),
where p = (p 1 , . . . , p M )
0and ˆ p n = (ˆ p n1 , . . . , ˆ p nM )
0with ˆ p ni = n −1 P n
j=1 I((i − 1)/M < Φ(ˆ η nj ) ≤ i/M ), and D ϕ is a real valued function on R M × R M , such that for all M × 1 vectors p = (p 1 , . . . , p M )0 and q = (q 1 , . . . , q M )
0, D ϕ (p, q) = P M
i=1 p i ϕ(q i /p i ). In particular, when ϕ(u) = u λ(λ+1)
λ+1−u with λ = −1/2, 0, 2/3, 1,
the phi-divergence tests are called the Freeman-Tukey, likelihood ratio, Cressie- Read, and chi-square tests, respectively.
If we put for s ∈ [0, 1], Y ˆ n (s) = 1
√ n h nh
X
j=1
{I(Φ(ˆ η nj ) ≤ s) − s},
Y n (s) = 1
√ n h nh
X
j=1
{I(Φ(η nj ) ≤ s) − s}, we can see that
ˆ
p ni = n −1/2 h { ˆ Y n (i/M ) − ˆ Y n ((i − 1)/M )} + 1/M.
Since ˆ Y n (s) = ˆ Y n (Φ −1 (s)), by Theorem 2.1,
(ˆ p ni − p i ) 2 = n −1 h {Y n (i/M ) − Y n ((i − 1)/M ) + ζ ni } 2 = O P (n −1 h ), where sup 1≤i≤M |ζ ni | = o P (1). Hence, in view of the arguments of Pardo [16, pp. 261–262], we can have
T ˆ n φ = M X M i=1
{Y n (i/M ) − Y n ((i − 1)/M )} 2 + o P (1).
Subsequently, by Theorem 3.1 of Pardo [16], we have the following result.
Theorem 2.2. Under H 0 and the conditions in Theorem 2.1, as n → ∞, T ˆ n φ → χ d 2 M −1 ,
(2.4)
where χ 2 M −1 denotes a r.v. with a chi-square distribution with M − 1 degrees of freedom.
Given a significance level α ∈ (0, 1), we reject H 0 if T n φ ≥ χ 2 M −1 (α) where χ 2 M −1 (α) is the number such that P (χ 2 M −1 ≥ χ 2 M −1 (α)) = α.
3. The proof of Theorem 2.1 In this section, we prove Theorem 2.1. Put
∆ ni = Z ti
t
i−1{a(X s ; θ) − a(X ti−1; θ)}ds + Z ti
t
i−1{b(X s ; θ) − b(X ti−1; θ)}dW s , d ni = a(X ti−1; ˆ θ n ) − a(X ti−1; θ).
; ˆ θ n ) − a(X ti−1; θ).
Note that ∆ ni can be viewed as a model bias in the regression model in (2.1).
By using (A1), (A2), the arguments in the proof of Lemma 3.4.2 of Prakara Rao [18, p. 156], and the martingale moment inequality in Equation (3.27) of Karatzas and Shreve [5, p. 163], we can easily see that for each k ≥ 1, there exists some C k > 0 depending only upon k with
E|X t − X s | 2k ≤ C k |t − s| k .
(3.1)
Subsequently, by using H¨older’s inequality, we can have
E
¯ ¯
¯ ¯
¯ Z ti
t
i−1{a(X s ; θ) − a(X ti−1; θ)}ds
¯ ¯
¯ ¯
¯
2k
= O(h 3k n ).
(3.2)
Further, we can verify that
E
¯ ¯
¯ ¯
¯ Z ti
t
i−1{b(X s ; σ) − b(X ti−1; σ)}dW s
¯ ¯
¯ ¯
¯
2k
= O(h 2k n ).
(3.3)
Combining (3.2) and (3.3), we have
E|∆ ni | 2k = O(h 2k n ).
(3.4)
We express that Y n (x) = I n (x) + II n (x) + III n (x), where I n (x) = 1
√ n h nh
X
i=1
{I(η ni ≤ x) − Φ(x)},
II n (x) = 1
√ n h nh
X
i=1
( Φ
à b(X ti−1; ˆ σ n )
b(X ti−1; σ) x− ∆ ni
b(X ti−1; σ)h 1/2 n
− h 1/2 n d ni
b(X ti−1; σ)
!
−Φ(x) )
,
III n (x) = 1
√ n h nh
X
i=1
( I
µ
η ni ≤ b(X ti−1; ˆ σ n )
b(X ti−1; σ) x − ∆ ni
b(X ti−1; σ)h 1/2 n
− h 1/2 n d ni
b(X ti−1; σ)
!
− Φ
à b(X ti−1; ˆ σ n )
b(X ti−1; σ) x − ∆ ni
b(X ti−1; σ)h 1/2 n
− h 1/2 n d ni
b(X ti−1; σ)
!
+ Φ(x) − I(η ni ≤ x)
¾ .
By Taylor’s theorem, we can express II n (x) = P 4
i=1 II ni (x), where II n1 (x) = 1
√ n h nh
X
i=1
µ b(X ti−1; ˆ σ n ) b(X ti−1; σ) − 1
; σ) − 1
¶ xφ(x),
II n2 (x) = −1
√ n h nh
X
i=1
∆ ni
b(X ti−1; σ) √ h n
φ(x),
II n3 (x) = −1
√ n h nh
X
i=1
√ h n
b(X ti−1; σ) d ni φ(x), II n4 (x) = 1
2 √ n h
n
hX
i=1
µ ∆ ni
b(X ti−1; σ) √ h n
+
√ h n
b(X ti−1; σ) d ni
¶ 2
φ
0(ξ ∗ ni (x)),
where ξ ni ∗ (x) is a number lying between x and b(X b(Xti−1;ˆ σ
n)
ti−1
;σ) x − ∆
nib(X
ti−1;σ)h
1/2n−
h
1/2nd
nib(X
ti−1;σ) .
By (A1)-(A3) and the mean value theorem, we can easily see that sup
x |II n1 (x)| = o P (1).
(3.5)
On the other hand, by (3.4) we have E sup
x |II n2 (x)| = o(1).
(3.6)
Now, by using Taylor’s theorem, we express d ni = d (1) ni + d (2) ni with d (1) ni = ˙a(X ti−1; θ)( ˆ θ n − θ) and d (2) ni = 1
2 ( ˆ θ n − θ)
0¨a(X ti−1; θ ni ∗ )( ˆ θ n − θ), where θ ∗ ni is an intermediate point between ˆ θ n and θ. By (A1)-(A4), we can have
√ 1 n h
n
hX
i=1
√ h n
b(X ti−1; σ) d (1) ni = o P (1) and
√ 1 n h
n
hX
i=1
√ h n
b(X ti−1; σ) d (2) ni = o P (1).
Hence,
sup
x |II n3 (x)| = o P (1).
(3.7)
In a similar fashion, it can be seen that sup
x |II n4 (x)| = o P (1).
(3.8)
Hence, in view of (3.5)-(3.8), we have sup x |II n (x)| = o P (1).
Now it remains to verify that sup
x |III n (x)| = o P (1).
(3.9) Put
w n = max
1≤i≤n
h½ √
h n
b(X ti−1; σ) |d (2) ni | + |∆ ni | b(X ti−1; σ) √
; σ) √
h n
¾ . Then, by (A1)-(A4), we can get
n 1/2 h w n = o P (h 1/2 n ).
(3.10) Set
e ni = b(X ti−1; ˆ σ n ) b(X ti−1; σ) − 1.
; σ) − 1.
Observe that by the monotonicity of the indicator function, I
µ
η ni ≤ x +
√ h n
b(X ti−1; σ) d (1) ni − [w n + max
1≤i≤n
h(e ni x)]
¶
≤ I µ
η ni ≤ b(X ti−1; ˆ σ n )
b(X ti−1; σ) x − ∆ ni
b(X ti−1; σ) √ h n
+
√ h n
b(X ti−1; σ) d ni
¶
≤ I µ
η ni ≤ x +
√ h n
b(X ti−1; σ) d (1) ni + [w n + max
1≤i≤n
h(e ni x)]
¶ .
Further, by using (A1)-(A4), Taylor’s theorem, (3.10), and the fact that n 1/2 h max
1≤i≤n
h|e ni | = o P (1), it can be readily checked that
√ 1 n h
n
hX
i=1
¯ ¯
¯ ¯Φ
µ b(X ti−1; ˆ σ n )
b(X ti−1; σ) x − ∆ ni
b(X ti−1; σ) √ h n
+
√ h n
b(X ti−1; σ) d ni
¶
− Φ µ
x +
√ h n
b(X ti−1; σ) d (1) ni + [w n + max
1≤i≤n
h(e ni x)]
¶¯ ¯
¯ ¯
+ 1
√ n h
n
hX
i=1
¯ ¯
¯ ¯Φ
µ b(X ti−1; ˆ σ n )
b(X ti−1; σ) x − ∆ ni
b(X ti−1; σ) √ h n
+
√ h n
b(X ti−1; σ) d ni
¶
− Φ µ
x +
√ h n
b(X ti−1; σ) d (1) ni − [w n + max
1≤i≤n
h(e ni x)]
¶¯ ¯
¯ ¯ = o P (1).
Subsequently, we have sup x |III n (x)|
≤ sup
x
¯ ¯
¯ ¯ 1
√ n h
n
hX
i=1
½ I
µ
η ni ≤ x +
√ h n
b(X ti−1; σ) d (1) ni + [w n + max
1≤i≤n
h(e ni x)]
¶
− Φ µ
x +
√ h n
b(X ti−1; σ) d (1) ni + [w n + max
1≤i≤n
h(e ni x)]
¶
+ Φ(x) − I(η ni ≤ x)
¾¯ ¯
¯ ¯
≤ sup
x
¯ ¯
¯ ¯ 1
√ n h
n
hX
i=1
½ I
µ
η ni ≤ x +
√ h n
b(X ti−1; σ) d (1) ni − [w n + max
1≤i≤n
h(e ni x)]
¶
− Φ µ
x +
√ h n
b(X ti−1; σ) d (1) ni − [w n + max
1≤i≤n
h(e ni x)]
¶
+ Φ(x) − I(η ni ≤ x)
¾¯ ¯
¯ ¯ + o P (1),
and therefore, sup
x |III n (x)| ≤ sup
x |IV n (x)| + sup
x |V n (x)| + o P (1), where
IV n (x) = 1
√ n h
n
hX
i=1
½ I
µ
η ni ≤ x +
√ h n
b(X ti−1; σ) d (1) ni
¶
− Φ µ
x +
√ h n
b(X ti−1; σ) d (1) ni
¶
+ Φ(x) − I(η ni ≤ x)
¾
and
V n (x) = 1
√ n h
¯ ¯
¯ ¯
n
hX
i=1
{I(η ni ≤ x + [w n + max
1≤i≤n
h(e ni x)])
− Φ(x + [w n + max
1≤i≤n
h(e ni x)]) + Φ(x) − I(η ni ≤ x)}
¯ ¯
¯ ¯
+ 1
√ n h
¯ ¯
¯ ¯
n
hX
i=1
{I(η ni ≤ x − [w n + max
1≤i≤n
h(e ni x)])
− Φ(x − [w n + max
1≤i≤n
h(e ni x)]) + Φ(x) − I(η ni ≤ x)}
¯ ¯
¯ ¯.
Since sup
x |V n (x)|
≤ 2 sup
|Φ(x)−Φ(y)|≤b
n¯ ¯
¯ ¯ 1
√ n h
n
hX
i=1
{I(η ni ≤ x) − Φ(x) + Φ(y) − I(η ni ≤ y)}
¯ ¯
¯ ¯
= o P (1)
(cf. Billingsley ([3])), where b n = sup x |Φ(x+[w n +max 1≤i≤nh(e ni x)])−Φ(x)| = o P (1), it suffices to show that sup x |V I n (x)| = o P (1).
Let x j be such that −∞ = x 0 < · · · < x Nn = ∞ and Φ(x j ) = N j
n