A Berry-Esseen type bound of regression estimator based on linear process errors

(1)

J. Korean Math. Soc. 45 (2008), No. 6, pp. 1753–1767

A BERRY-ESSEEN TYPE BOUND OF REGRESSION ESTIMATOR BASED ON LINEAR PROCESS ERRORS

Han-Ying Liang and Yu-Yu Li

Reprinted from the

Journal of the Korean Mathematical Society Vol. 45, No. 6, November 2008

⃝2008 The Korean Mathematical Societyc

(2)

A BERRY-ESSEEN TYPE BOUND OF REGRESSION ESTIMATOR BASED ON LINEAR PROCESS ERRORS

Han-Ying Liang and Yu-Yu Li

Abstract. Consider the nonparametric regression model Yni= g(xni) + ϵni (1≤ i ≤ n), where g(·) is an unknown regression function, xni are known ﬁxed design points, and the correlated errors{ϵni, 1≤ i ≤ n} have the same distribution as{Vi, 1≤ i ≤ n}, here Vt=P_∞

j=−∞ψjet−jwith P_∞

j=−∞|ψj| < ∞ and {et} are negatively associated random variables.

Under appropriate conditions, we derive a Berry-Esseen type bound for the estimator of g(·). As corollary, by choice of the weights, the Berry- Esseen type bound can attain O(n^−1/4(log n)^3/4).

1. Introduction Consider the nonparametric regression model

(1.1) Y

ni

= g(x

ni

) + ϵ

ni

, i = 1, . . . , n,

where g is an unknown regression function deﬁned on A, x

ni

are known ﬁxed design points, and ϵ

ni

are random errors. As an estimate of g, we consider the following weighted regression estimator:

(1.2) g

_n

(x) =

∑

n i=1

w

_ni

Y

_ni

, x ∈ A, where w

ni

= w

ni

(x) are weight functions.

The above estimator was ﬁrst proposed by Georgiev [8] and subsequently have been studied by many authors. For instance, when ϵ

ni

are assumed to be independent, consistency and asymptotic normality have been studied by Georgiev and Greblicki [10], Georgiev [9] and M¨ uller [17] among others. Results for the case when ϵ

ni

are dependent have also been studied by various authors.

Fan [7] extended the work of Georgiev [9] and M¨ uller [17] in the estimation of the regression model to the case where {ϵ

ni

} form an L

^q

-mixingale sequence for some 1 ≤ q ≤ 2. Roussas [19] discussed strong consistency and quadratic mean consistency for g

_n

(x) under mixing conditions. Roussas et al. [22] established

Received May 15, 2007; Revised August 27, 2007.

2000 Mathematics Subject Classiﬁcation. 62G08.

Key words and phrases. nonparametric regression model, negatively associated random variable, Berry-Esseen type bound.

⃝2008 The Korean Mathematical Societyc 1753

(3)

asymptotic normality of g

_n

(x) assuming that the errors are from a strictly stationary stochastic process and satisfy the strong mixing condition. Tran et al [26] discussed again asymptotic normality of g

n

(x) assuming that the errors form a linear time series, more precisely, a weakly stationary linear process based on a martingale diﬀerence sequence.

In this paper, we consider the model (1.1) and assume the following form for {ϵ

ni

}:

(A1) For each n, {ϵ

ni

, 1 ≤ i ≤ n} have the same distribution as V

1

, . . . , V

n

, where V

t

= ∑

_∞

j=−∞

ψ

j

e

t−j

and {e

t

} are identically distributed, negatively associated random variables with Ee

i

= 0. Here {ψ

j

} is a sequence of real numbers with ∑

_∞

j=−∞

|ψ

j

| < ∞.

Here, a ﬁnite family of random variables {X

i

, 1 ≤ i ≤ n} is said to be negatively associated (NA) if, for every pair of disjoint subsets A and B of {1, 2, . . . , n}, we have

Cov(f

₁

(X

_i

, i ∈ A), f

2

(X

_j

, j ∈ B)) ≤ 0,

whenever f

1

and f

2

are coordinatewise increasing provided the covariance ex- ists. An inﬁnite family of random variables is said to be NA, if every ﬁnite subfamily is NA.

The notion of negative association was ﬁrst introduced by Alam and Saxena

[1]. Joag-Dev and Proschan [11] showed that many well known multivari-

ate distributions possess the NA property. Examples include (a) multinomial,

(b) convolution of diﬀerent multinomials, (c) multivariate hypergeometric, (d)

Dirichlet, (e) Dirichlet compound multinomial, (f) negatively correlated nor-

mal distribution, (g) permutation distribution, (h) random sampling without

replacement, and (i) joint distribution of ranks. The signiﬁcance of NA, how-

ever, seems to come from the perception that it is an appropriate model when

several species compete for the same limited resources. Because of its wide

applications in multivariate statistical analysis and systems reliability, the no-

tion of NA has recently received considerable attention. We refer to Joag-Dev

and Proschan [11] for fundamental properties, Matula [16] for the three se-

ries theorem, Shao [23] for the Rosenthal-type inequality and the Kolmogorov

exponential inequality, and Su et al. [25] for a moment inequality and weak

convergence, Shao and Su [24] for the law of the iterated logarithm, Liang and

Su [15] and Liang [12] as well as Baek et al. [2] for complete convergence, Liang

and Baek [13] for some strong law, Roussas [20] studied the central limit theo-

rems for weak stationary NA random ﬁelds. Asymptotic properties of estimates

related to NA samples have also been studied by some authors. Cai and Rous-

sas [3] gave uniform strong consistency, convergence rates and the asymptotic

distribution of the Kaplan-Meier estimator for observations under randomly

censored failure times. In Cai and Roussas [4], they established Berry-Esseen

bounds for a smooth estimate of the distribution function; Roussas [21] derived

the asymptotic normality of the kernel estimate of the probability density func-

tion; Chen et al. [6] studied strong consistency of estimator in heteroscedastic

(4)

regression model under NA samples; Yang [27] studied uniformly asymptotic normality of the regression weighted estimator for NA samples; Liang and Jing [14] discussed the asymptotic properties of g

n

(x) under NA setting. In par- ticular, Liang and Jing [14] studied the asymptotic normality of g

n

(x) under assumption (A1) for V

t

= ∑

_∞

j=0

ψ

j

e

t−j

setting.

In this paper, we further investigate model (1.1) and derive a Berry-Esseen type bound for the estimator g

n

(x) of g( ·) under the errors {ϵ

ni

} satisfy assumption (A1). By choice of the weights, the Berry-Esseen type bound can attain O(n

^−1/4

(log n)

^3/4

).

The layout of the paper is as follows. The main result is presented in Sec- tion 2. In Section 3, some preliminary lemmas are given. The proofs of the main result and preliminary lemmas are provided in Sections 4 and 5, respectively.

In the sequel, let C, c, c

₁

, . . . denote generic ﬁnite positive constants, whose values are unimportant and may change from line to line. For random variables X and Y , X = Y means that the distribution of X is the same as that of Y .

^D

All limits are taken as the sample size n tends to ∞, unless speciﬁed otherwise.

2. The main result

In order to formulate the main result, we now give the following assumptions.

(A2) The weights satisfy ∑

n

i=1

|w

ni

(x) | ≤ C, w

n

:= max

₁_≤i≤n

|w

ni

(x) | = O(σ

²_n

(x)), where σ

²_n

(x) := Var(g

n

(x)) > 0.

(A3) There exist positive integers p := p(n) and q := q(n) such that for suﬃciently large n, p + q ≤ 3n, qp

⁻¹

≤ c < ∞.

Let γ

in

→ 0 (i = 1, 2, 3) and u(q) → 0, where γ

1n

= nqp

⁻¹

w

n

, γ

2n

= pw

n

, γ

3n

= n( ∑

|j|>n

|ψ

j

|)

²

and u(q) = sup

_j_≥1

∑

|i−j|≥q

|Cov(e

i

, e

j

) |.

Our main result is as follows.

Theorem 2.1. Suppose that (A1)-(A3) are satisﬁed. If E |e

0

|

^2+δ

< ∞ for some δ > 0, then, for each x ∈ A, we have

sup

u

|P (σ

n⁻¹

(x) {g

n

(x) − Eg

n

(x) } ≤ u) − Φ(u)| = O(a

n

),

where a

n

= (γ

_1n^1/2

+γ

_2n^1/2

)(log n)

^1/2

+γ

_2n^δ/2

+γ

_3n^1/3

+u

^1/3

(q)+n

⁻¹

+((nqp

⁻¹

)

^−δ/2

+ p

^−δ/2

)(log n)

^(δ^−2)/2

→ 0, Φ(u) represents the standard normal distribution function.

Corollary 2.1. Suppose that (A1)-(A2) are satisﬁed. Let sup

n≥1

n

^7/8

(log n)

^−9/8

∑

|j|>n

|ψ

j

| < ∞

and u(n) = O(n

^9/4

e

^−3n/4

). If E |e

0

|

³

< ∞ and w

n

= O(n

⁻¹

), then for each x ∈ A, we have

sup

u

|P (σ

n⁻¹

(x) {g

n

(x) − Eg

n

(x) } ≤ u) − Φ(u)| = O(n

^−1/4

(log n)

^3/4

).

(5)

Corollary 2.2. Suppose that (A1) is satisﬁed, and that spectral density function τ (w) of V

i

is bounded away from zero and inﬁnity, i.e., 0 < c

1

≤ τ(w) <

∞, for w ∈ (π, π]. Let sup

n≥1

n

^7/8

(log n)

^−9/8

∑

|j|>n

|ψ

j

| < ∞ and u(n) = O(n

^9/4

e

^−3n/4

). If E |e

0

|

³

< ∞, w

n

= O(n

⁻¹

) and n ∑

n

i=1

w

²_ni

(x) ≥ c

0

> 0, then, for each x ∈ A, we have

sup

u

|P (σ

n⁻¹

(x) {g

n

(x) − Eg

n

(x) } ≤ u) − Φ(u)| = O(n

^−1/4

(log n)

^3/4

).

Remark 2.1. (a) In Theorem 2.1, we use u(q) → 0, which is easily satisﬁed.

For example:

(i) If u(1) < ∞ (cf. Roussas [21]), then u(q) → 0 as q → ∞.

(ii) For stationary NA sequence, Cai and Roussas [4] use the covariance coeﬃcient: u

^′

(n) := ∑

_∞

j=n

|cov(e

1

, e

j+1

) |

^1/3

and u

^′

(1) < ∞.

In this case, we have |cov(e

1

, e

j+1

) | = o(j

⁻³

). Hence

u(q) :=

∑

∞ j=q

|cov(e

1

, e

_j+1

) | = O( ∑

^∞

j=q

|cov(e

1

, e

_j+1

) |

^1/3

j

⁻²

) = O(q

⁻²

).

(b) In Roussas et al. [22], the weights are required to satisfy the conditions: ∑

n

i=1

|w

ni

(x) | ≤ C, max

1≤i≤n

|w

ni

(x) | = O( ∑

n

i=1

w

_ni²

(x)),

∑

n

i=1

w

²_ni

(x) = O(σ

_n²

(x)). Clearly, these conditions imply Assump- tion (A2).

In addition, if we choose

w

ni

(x) = x

n,i

− x

n,i−1

h

n

K( x − x

n,i

h

n

),

where {h

n

} is a sequence of positive constants tending to 0 and nh

n

→

∞, and the design points satisfy 0 = x

n,0

≤ x

n,1

≤ · · · ≤ x

n,n

= 1, this weight also was used by Tran et al. [26]. Assume that

(iii) there exist positive constants c

₁

and c

₂

such that c

₁

n

⁻¹

≤ x

n,i

− x

_n,i₋₁

≤ c

2

n

⁻¹

for i = 1, 2, . . . , n,

(iv) K(x) is nonnegative, bounded and continuous almost everywhere on R and has a majorant; that is, K(x) ≤ H(x) all x ∈ R, where H is symmetric, bounded, nonincreasing on [0, ∞) with ∫

H(y)dy <

∞, where the integral is over R.

(v) {ϵ

ni

} are stationary with Eϵ

ni

= 0 and its spectral density function f (w) is bounded away from zero and inﬁnity, i.e., 0 < c

3

≤ f (w) < ∞ for w ∈ (π, π].

Note that (v) implies that

σ

_n²

(x) = E[

∑

n i=1

w

_ni

(x)ϵ

_ni

]

²

=

∫

π

−π

f (w) |

∑

n k=1

w

_nk

(x)e

^−ikw

|

²

dw ≥ c

4

∑

n i=1

w

²_ni

(x).

(6)

While, by using Lemma 3.1 in Tran et al. [26], (iii) and (iv) imply max

1≤i≤n

|w

ni

(x) | ≤ C/nh

n

, ∑

n

i=1

w

ni

(x) ≤ C and 1

h

n

∑

n i=1

(x

n,i

− x

n,i−1

)K

²

( x − x

n,i

h

n

) →

∫

_∞

−∞

K

²

(u)du > 0.

Thus σ

_n²

(x) ≥ c

4

∑

n

i=1

w

²_ni

(x) ≥

_nh^c⁵_n

. Hence max

1≤i≤n

|w

ni

(x) | = o(σ

_n²

(x)), which shows that (A2) is mild.

(c) In Roussas et al. [22] (cf. (2.21) there), they assume that nqp

⁻¹

∑

n i=1

w

_ni²

(x) → 0, p

²

∑

n i=1

w

²_ni

(x) → 0.

Obviously, the limits above are stronger than γ

1n

→ 0 and γ

2n

→ 0 here, which were also used by Yang [27]. Under some regularity conditions, γ

_3n

→ 0 holds with the usual AR, MA and ARMA processes which are extensively used to model serially correlated data.

3. Some preliminary lemmas We write σ

²_n

= σ

_n²

(x) and

S

_n

:= σ

_n⁻¹

{g

n

(x) − Eg

n

(x) } = σ

^D _n⁻¹

∑

n i=1

w

_ni

V

_i

= σ

_n⁻¹

∑

n i=1

w

_ni

∑

n j=−n

ψ

_j

e

_i_−j

+ σ

⁻¹_n

∑

n i=1

w

_ni

∑

|j|>n

ψ

_j

e

_i_−j

:= S

1n

+ S

2n

.

(3.3) Note that

S

_1n

= σ

_n⁻¹

∑

n i=1

w

_ni

∑

n j=−n

ψ

_j

e

_i_−j

=

∑

2n l=1−n

σ

_n⁻¹

(

^min{n,l+n}

∑

i=max{1,l−n}

w

_ni

ψ

_i_−l

)

e

_l

:=

∑

2n l=1−n

σ

⁻¹_n

t

nl

e

l

.

Set Z

nl

= σ

_n⁻¹

t

nl

e

l

, l = 1 − n, 2 − n, . . . , 2n. Then S

1n

= ∑

2n

l=1−n

Z

nl

. Let k = [

_p+q³ⁿ

] and

(3.4) S

1n

= S

_1n^′

+ S

^′′_1n

+ S

_1n^′′′

, where

 



 

S

_1n^′

= ∑

k

m=1

y

nm

, y

nm

= ∑

k_m+p−1 i=k_m

Z

ni

, S

_1n^′′

= ∑

k

m=1

y

^′_nm

, y

^′_nm

= ∑

l_m+q−1 i=l_m

Z

ni

, S

_1n^′′′

= y

_nk+1^′

, y

^′_nk+1

= ∑

2n

i=k(p+q)−n+1

Z

ni

,

(7)

k

_m

= (m − 1)(p + q) + 1 − n, l

m

= (m − 1)(p + q) + p + 1 − n, m = 1, . . . , k.

From (3.3) and (3.4) we have

(3.5) S

n

= S

_1n^′

+ S

_1n^′′

+ S

_1n^′′′

+ S

2n

.

Lemma 3.1. Suppose that (A1)-(A3) are satisﬁed. If Ee

²₀

< ∞, then E(S

_1n^′′

)

²

≤ Cγ

1n

, E(S

_1n^′′′

)

²

≤ Cγ

2n

, E(S

_2n

)

²

≤ Cγ

3n

. Lemma 3.2. Suppose that (A1)-(A3) are satisﬁed. Set s

²_n

= ∑

k

m=1

Var(y

nm

).

If Ee

²₀

< ∞, then

|s

²n

− 1| ≤ C(γ

^1/21n

+ γ

_2n^1/2

+ γ

_3n^1/2

+ u(q)).

Let η

nm

, m = 1, 2, . . . , k be independent random variables and η

nm

= y

D nm

for m = 1, 2, . . . , k. Put T

n

= ∑

k

m=1

η

nm

.

Lemma 3.3. Under the assumptions of Theorem 2.1, we have sup

u

|P (T

n

/s

n

≤ u) − Φ(u)| ≤ Cγ

2n^δ/2

. Lemma 3.4. Under the assumptions of Theorem 2.1, we have

sup

u

|P (S

^′1n

≤ u) − P (T

n

≤ u)| ≤ C{γ

2n^δ/2

+ u

^1/3

(q) }.

Lemma 3.5. Under the assumptions of Theorem 2.1, we have P ( |S

1n^′′

| > cµ

n

) ≤ C{n

⁻¹

+ (nqp

⁻¹

)

^−δ/2

(log n)

^(δ^−2)/2

}, (3.6)

P ( |S

1n^′′′

| > cν

n

) ≤ C{n

⁻¹

+ p

^−δ/2

(log n)

^(δ^−2)/2

}, (3.7)

P ( |S

2n

| > τ

n

) ≤ Cγ

3n^1/3

, (3.8)

where µ

_n

= γ

_1n^1/2

(log n)

^1/2

, ν

_n

= γ

_2n^1/2

(log n)

^1/2

, τ

_n

= γ

^1/3_3n

.

Lemma 3.6. Let X and Y be random variables. Then for any a > 0 sup

u

|P (X + Y ≤ u) − Φ(u)| ≤ sup

u

|P (X ≤ u) − Φ(u)| + a

√ 2π + P ( |Y | > a).

The proof of Lemma 3.6 can be found in Chang and Rao [5].

4. Proof of Theorem 2.1 We observe that

sup

u

|P (S

1n^′

≤ u) − Φ(u)|

≤ sup

u

|P (S

1n^′

≤ u) − P (T

n

≤ u)| + sup

u

|P (T

n

≤ u) − Φ( u s

_n

) | + sup

u

|Φ( u

s

_n

) − Φ(u))|

:= J

1n

+ J

2n

+ J

3n

.

(8)

From Lemma 3.4 we have J

1n

≤ C(γ

2n^δ/2

+u

^1/3

(q)), Lemma 3.3 yields that J

2n

≤ Cγ

_2n^δ/2

, according to Lemma 3.2 we obtain that

J

3n

= |Φ(u/s

n

) − Φ(u))|

≤ C|s

²n

− 1|/s

²n

≤ C|s

²n

− 1| ≤ C(γ

1n^1/2

+ γ

_2n^1/2

+ γ

^1/2_3n

+ u(q)).

Therefore sup

u

|P (S

1n^′

≤ u) − Φ(u)| ≤ C{γ

1n^1/2

+ γ

_2n^1/2

+ γ

_2n^δ/2

+ γ

_3n^1/2

+ u

^1/3

(q) } and from Lemmas 3.5 and 3.6 we have

sup

u

|P (S

n

≤ u) − Φ(u)|

≤ sup

u

|P (S

1n^′

+ S

_1n^′′

+ S

_1n^′′′

+ S

_2n

≤ u) − Φ(u)|

≤ sup

u

|P (S

1n^′

≤ u) − Φ(u)| + (µ

n

+ ν

n

+ τ

n

)/ √ 2π + P ( |S

^′′1n

+ S

_1n^′′′

+ S

_2n

| > µ

n

+ ν

_n

+ τ

_n

)

≤ C{γ

1n^1/2

+ γ

_2n^1/2

+ γ

^δ/2_2n

+ γ

_3n^1/2

+ u

^1/3

(q) } + µ

n

+ ν

_n

+ τ

_n

+ P ( |S

^′′1n

| > µ

n

) + P ( |S

1n^′′′

| > ν

n

) + P ( |S

2n

| > τ

n

)

= O (

(γ

_1n^1/2

+ γ

_2n^1/2

)(log n)

^1/2

+ γ

_2n^δ/2

+ γ

_3n^1/3

+ u

^1/3

(q) + n

⁻¹

+ ((nqp

⁻¹

)

^−δ/2

+ p

^−δ/2

)(log n)

^(δ^−2)/2

)

= O(a

_n

).

Proof of Corollary 2.1. In Theorem 2.1, taking δ = 1, p = [n

^1/2

(log n)

^1/2

], q = [log n], then

u(q) = O(n

^−3/4

(log n)

^9/4

) by u(n) = O(n

^9/4

e

^−3n/4

);

γ

^1/3_3n

= n

^−1/4

(log n)

^3/4

(

n

^7/8

(log n)

^−9/8

∑

|j|>n

|ψ

j

| )

2/3

= O(n

^−1/4

(log n)

^3/4

) by sup

_n_≥1

n

^7/8

(log n)

^−9/8

∑

|j|>n

|ψ

j

| < ∞; γ

1n^1/2

= γ

_2n^1/2

= n

^−1/4

(log n)

^1/4

by w

_n

= O(n

⁻¹

).

Therefore, the conclusion follows from Theorem 2.1. ¤ Proof of Corollary 2.2. Note that

σ

²_n

(x) = E(

∑

n k=1

w

_nk

V

_k

)

²

=

∫

π

−π

τ (w) ¯¯ ¯

∑

n k=1

w

_nk

e

^−ikw

¯¯ ¯

²

dw ≥ C

∑

n k=1

w

²_nk

.

(9)

Hence, from n ∑

n

i=1

w

²_ni

(x) ≥ c

0

> 0 and w

_n

= O(n

⁻¹

), it follows that w

n

≤ C

∑

n i=1

w

²_ni

(x) = O(σ

_n²

(x)) and

∑

n i=1

|w

ni

(x) | ≤ C,

i.e., (A2) is satisﬁed. The rest proofs are the same as the proof of Corollary

2.1. ¤

5. Proofs of lemmas

Lemma 5.1. Let {X

j

, 1 ≤ j ≤ n} be NA random variables with EX

j

= 0 and E |X

j

|

^p

< ∞ for some p > 1, and let {a

j

, j ≥ 1} be a sequence of real numbers.

Then, there exist A

p

> 0 and B

p

> 0 such that E max

1≤k≤n

| ∑

k

j=1

a

j

X

j

|

^p

≤ A

p

∑

n

j=1

E |a

j

X

j

|

^p

for 1 < p ≤ 2 and E max

1≤k≤n

¯¯ ¯¯ ∑

^k

j=1

a

_j

X

_j

¯¯

^p

≤ B

p

{ ∑

n

j=1

E |a

j

X

_j

|

^p

+ ( ∑

n

j=1

E(a

_j

X

_j

)

²

)

p/2

}

for p > 2.

From Theorem 2 of Shao [23], it is easy to prove Lemma 5.1.

Lemma 5.2 (Yang [27]). Suppose that {X

j

, j ≥ 1} is a sequence of NA random variables.

(a) Let {a

j

, j ≥ 1} be a sequence of real numbers, 1 = m

0

< m

₁

< · · · <

m

k

= n. Denote by Y

l

:= ∑

ml

j=m_l−1+1

a

j

X

j

for 1 ≤ l ≤ k. Then

¯¯ ¯¯E exp { it

∑

k l=1

Y

_l

}

−

∏

k l=1

exp {itY

l

} ¯¯

¯¯ ≤ 4t

²

∑

1≤s<j≤n

|a

s

a

_j

||Cov(X

s

, X

_j

) |.

(b) Let EX

_j

= 0 and |X

j

| ≤ b a.s. Set ∆

n

= ∑

n

j=1

EX

_j²

. Then, for any ϵ > 0,

P ( |

∑

n j=1

X

_j

| ≥ ϵ) ≤ 2 exp {

− ϵ

²

2(2∆

_n

+ 2bϵ) }

.

Proof of Lemma 3.1. According to the deﬁnition of S

^′′_1n

, from Lemma 5.1 and (A1)-(A2) we have

E(S

_1n^′′

)

²

= E(

∑

k m=1

l_m

∑

+q−1 i=lm

Z

ni

)

²

≤ C

∑

k m=1

lm

∑

+q−1 i=lm

E(Z

_ni²

)

= C

∑

k m=1

lm

∑

+q−1 i=l_m

σ

⁻²_n

t

²_ni

Ee

²_i

(10)

≤ C

∑

k m=1

l_m

∑

+q−1 i=lm

σ

⁻²_n

(

^min{n, i+n}

∑

j=max{1, i−n}

w

nj

ψ

j−i

)

2

≤ C

∑

k m=1

l_m

∑

+q−1 i=lm

w

n

(

^min{n, i+n}

∑

j=max{1, i−n}

|ψ

j−i

| )

2

≤ Ckqw

n

( ∑

_∞

j=−∞

|ψ

j

| )

2

≤ Cnqp

⁻¹

w

n

= Cγ

1n

. Similarly,

E(S

_1n^′′′

)

²

= E

( ∑

2n

i=k(p+q)−n+1

Z

ni

)

2

≤ C

∑

2n i=k(p+q)−n+1

σ

⁻²_n

t

²_ni

Ee

²_i

≤ C

∑

2n i=k(p+q)−n+1

w

_n

(

^min{n, i+n}

∑

j=max{1, i−n}

|ψ

j−i

| )

2

≤ C[3n − k(p + q)]w

n

( ∑

∞

j=−∞

|ψ

j

| )

2

≤ Cpw

n

= Cγ

2n

. As to S

2n

, (A2) and Ee

²₀

< ∞ yield that

ES

_2n²

= E ¯¯

¯¯σ

⁻¹ⁿ

∑

ⁿ

i₁=1

w

ni1

∑

|j1|>n

ψ

j1

e

i1−j1

¯¯ ¯¯ ¯¯

¯¯σ

ⁿ⁻¹

∑

ⁿ

i₂=1

w

ni2

∑

|j2|>n

ψ

j2

e

i2−j2

¯¯ ¯¯

≤ CE { ∑

n

i1=1

|w

ni₁

|

∑

n i2=1

¯¯ ¯¯ ∑

|j1|>n

ψ

j₁

e

i₁−j1

¯¯ ¯¯ ¯¯

¯¯ ∑

|j2|>n

ψ

j₂

e

i₂−j2

¯¯ ¯¯ }

≤ Cn ( ∑

|j|>n

|ψ

j

| )

2

= Cγ

_3n

.

¤ Proof of Lemma 3.2. Let Γ

n

= ∑

1≤i<j≤k

Cov(y

ni

, y

nj

), then s

²_n

= E(S

_1n^′

)

²

− 2Γ

_n

. Note that ES

_n²

= 1, so

E(S

_1n^′

)

²

= E[S

_n

− (S

1n^′′

+ S

^′′′_1n

+ S

_2n

)]

²

= 1 + E(S

_1n^′′

+ S

^′′′_1n

+ S

2n

)

²

− 2ES

n

(S

^′′_1n

+ S

_1n^′′′

+ S

2n

).

(11)

Hence, from Lemma 3.1 we have

(5.9)

|E(S

1n^′

)

²

− 1|

≤ E(S

1n^′′

+ S

_1n^′′′

+ S

2n

)

²

+ 2 |ES

n

(S

_1n^′′

+ S

_1n^′′′

+ S

2n

) |

≤ E(S

1n^′′

+ S

_1n^′′′

+ S

2n

)

²

+ 2(ES

_n²

)

^1/2

[E(S

^′′_1n

+ S

_1n^′′′

+ S

2n

)

²

]

^1/2

≤ {[E(S

1n^′′

)

²

]

^1/2

+ [E(S

_1n^′′′

)

²

]

^1/2

+ (ES

_2n²

)

^1/2

}

²

+ 2 {[E(S

1n^′′

)

²

]

^1/2

+ [E(S

_1n^′′′

)

²

]

^1/2

+ (ES

_2n²

)

^1/2

}

≤ (γ

1n^1/2

+ γ

_2n^1/2

+ γ

^1/2_3n

)

²

+ 2(γ

_1n^1/2

+ γ

^1/2_2n

+ γ

_3n^1/2

)

≤ C(γ

1n^1/2

+ γ

_2n^1/2

+ γ

_3n^1/2

).

On the other hand, (5.10)

|Γ

n

|

= _| ^∑

1≤i<j≤k

Cov(y

ni

, y

nj

) |

≤ ^∑

1≤i<j≤k ki

∑

+p−1

s=ki

kj

∑

+p−1

t=kj

|Cov(Z

ns

, Z

nt

)|

≤

^k

^∑

⁻¹

i=1 ki+p

∑

−1

s=ki

∑

k

j=i+1 kj

∑

+p−1

t=kj

min{n,s+n}

∑

u=max{1,s−n}

min{n, t+n}

∑

v=max{1,t−n}

σ

⁻²n

|w

nu

w

nv

||ψ

u−s

ψ

v−t

||Cov(e

s

, e

t

)|

≤ ^C

k

∑

−1

i=1 ki

∑

+p−1

s=ki

min{n,s+n}

∑

u=max{1,s−n}

|ψ

u−s

||w

nu

|

∑

k

j=i+1 kj

∑

+p−1

t=kj

| Cov(e

s

, e

t

) |

min{n,t+n}

∑

v=max{1,t−n}

|ψ

v−t

|

≤ ^C

k

∑

−1

i=1 k_i

∑

+p−1

s=ki

min{n,s+n}

∑

u=max{1,s−n}

|ψ

u−s

||w

nu

| ∑

t:|t−s|≥q

|Cov(e

s

, e

t

) |

≤ ^Cu(q)

^k−1

^∑

i=1 ki

∑

+p−1

s=ki

∑

n

u=1

|w

nu

||ψ

u−s

|

≤ Cu(q).

Therefore, (5.9) and (5.10) follow that

|s

²n

− 1| ≤ C(γ

^1/21n

+ γ

_2n^1/2

+ γ

_3n^1/2

+ u(q)).

¤ Proof of Lemma 3.3. By using Berry-Esseen inequality (cf. Petrov [18], p. 154, Theorem 5.7) we have

sup

u

|P (T

n

/s

n

≤ u) − Φ(u)| ≤ C

∑

k m=1

E |η

nm

|

^2+δ

/s

^2+δ_n

.

(12)

While, according to Lemma 5.1, from (A1) and (A2) it follows that

∑

k m=1

E |η

nm

|

^2+δ

≤ C

∑

k m=1

{

km

∑

+p−1

j=k_m

¯¯ ¯¯

^min^{n,j+n}

∑

i=max{1,j−n}

σ

_n⁻¹

w

ni

ψ

i−j

¯¯ ¯¯

^2+δ

E |e

j

|

^2+δ

+

[

km

∑

+p−1

j=k_m

(

^min{n,j+n}

∑

i=max{1,j−n}

σ

⁻¹_n

w

ni

ψ

i−j

)

2

Ee

²_j

]

1+δ/2

}

≤ Cσ

^−(2+δ)n

∑

k m=1

km

∑

+p−1 j=k_m

min{n, j+n}

∑

i=max{1, j−n}

|w

ni

||ψ

i−j

| · ¯¯

¯¯

^min^{{n, j+n}}

∑

i=max{1, j−n}

w

ni

ψ

i−j

¯¯ ¯¯

^1+δ

+ C

∑k m=1

[^km∑+p−1 j=km

( min{n,j+n}∑

i=max{1,j−n}

σn⁻¹wniψi−j

)2

·(^k^m∑^+p⁻¹

j=km

( min{n,j+n}∑

i=max{1,j−n}

σn⁻¹wniψi−j

)2)δ/2]

≤ C {

w

_n^δ/2

∑

n i=1

|w

ni

| ( ∑

k

m=1 km

∑

+p−1

j=k_m

|ψ

i−j

| )

+ (pw

n

)

^δ/2

∑

k

m=1

[

^km

∑

+p−1 j=km

(

^min{n,j+n}

∑

i=max{1,j−n}

|w

ni

ψ

i−j

| )

· (

^min{n,j+n}

∑

i=max{1,j−n}

σ

⁻²_n

|w

ni

ψ

i−j

| )]}

≤ C{w

n^δ/2

+ (pw

_n

)

^δ/2

}

≤ C(pw

n

)

^δ/2

= Cγ

_2n^δ/2

.

Note that Lemma 3.2 implies s

²_n

→ 1 and therefore sup

u

|P (T

n

/s

_n

≤ u) − Φ(u)| ≤ Cγ

2n^δ/2

.

¤

Proof of Lemma 3.4. Assume that φ(t) and ψ(t) are the characteristic function of S

^′_n

and T

n

, respectively. By Esseen inequality (cf. Petrov [18], p. 146, Theorem 5.3), for any T > 0

(5.11)

sup

u

|P (S

n^′

≤ u) − P (T

n

≤ u)|

≤

∫

T

−T

| φ(t) − ψ(t)

t |dt

+ T sup

u

∫

|u|≤^C_T

|P (T

n

≤ u + y) − P (T

n

≤ u)|dy

:= I

1n

+ I

2n

.

(13)

According to Lemma 5.2(a), following the line as in (5.10) we have

¯¯ ¯¯ φ(t) − ψ(t) t

¯¯ ¯¯

= |E exp{itS

1n^′

} − Π

^km=1

E exp {ity

nm

}|

≤ 4t

²^k−1^∑

i=1 ki+p−1∑

s=ki

∑k j=i+1

kj∑+p−1 t=kj

min{n, s+n}∑

u=max{1, s−n}

min{n, t+n}∑

v=max{1, t−n}

σ_n⁻²| wnuwnv|| ψu−sψ_v−t|| Cov(es, et)|

≤ Ct

²

u(q).

Therefore I

1n

≤ Cu(q)T

²

. On the other hand, from Lemmas 3.2 and 3.3 it follows that

sup

u

|P (T

n

≤ u + y) − P (T

n

≤ u)|

≤ sup

y

|P (T

n

/s

_n

≤ (u + y)/s

n

) − Φ((u + y)/s

n

) | + sup

u

|P (T

n

/s

n

≤ u/s

n

) − Φ(u/s

n

) | + sup

u

|Φ((u + y)/s

n

) − Φ(u/s

n

) |

≤ C{γ

2n^δ/2

+ |y|/s

n

} ≤ C{γ

2n^δ/2

+ |y|}.

Therefore I

2n

≤ C(γ

^δ/22n

+ 1/T ) and choosing T = u

^−1/3

(q) from (5.11) we obtain that

sup

u

|P (S

n^′

≤ u) − P (T

n

≤ u)| ≤ C(γ

2n^δ/2

+ u

^1/3

(q)).

¤ Proof of Lemma 3.5. Note that

S

_1n^′′

=

∑

k m=1

lm

∑

+q−1 j=l_m

σ

⁻¹_n

t

nj

e

j

=

∑

k m=1

lm

∑

+q−1 j=l_m

σ

⁻¹_n

(t

⁺_nj

− t

⁻nj

)e

j

,

where a

⁺

= max {0, a}, a

⁻

= min {0, −a}, so, without loss of generality, we assume that t

nj

≥ 0 for j ≥ 1 and n ≥ 1. Let

α

nj

= −b

n

I(Z

nj

< −b

n

) + Z

nj

I( |Z

nj

| ≤ b

n

) + b

n

I(Z

nj

> b

n

), α

^′_nj

= (Z

_nj

+ b

_n

)I(Z

_nj

< −b

n

) + (Z

_nj

− b

n

)I(Z

_nj

> b

_n

), β

nj

= α

nj

− Eα

nj

, β

_nj^′

= α

_nj^′

− Eα

^′nj

,

where b

_n

> 0, which will be specialized later. It is clear that {β

nj

, j ≥ 1} and {β

nj^′

, j ≥ 1} are mean zero NA sequences from the deﬁnition of α

nj

and α

^′_nj

.

Firstly, we prove (3.6). Taking b

_n

= γ

^1/2_1n

(log n)

^−1/2

. Set

H

n

=

∑

k m=1

lm

∑

+q−1 j=lm

β

nj

, H

_n^′

=

∑

k m=1

lm

∑

+q−1 j=lm

β

_nj^′

. Then S

_1n^′′

= H

n

+ H

_n^′

and

(5.12) P ( |S

1n^′′

| > cµ

n

) ≤ P (|H

n

| > cµ

n

/2) + P ( |H

n^′

| > cµ

n

/2).

(14)

To evaluate P ( |H

n

| > cµ

n

/2), we shall use Lemma 5.2(b). Note that |β

nj

| ≤ 2b

n

and the proof of Lemma 3.1 shows that

∆

1n

:=

∑

k m=1

lm

∑

+q−1 j=lm

E(β

_nj²

) ≤

∑

k m=1

lm

∑

+q−1 j=lm

E(Z

_nj²

) ≤ Cγ

1n

. Then, by using Lemma 5.2(b), for large c > 0 we have

(5.13) P

( |H

n

| > cµ

_n

2 ) ≤ 2 exp {

− c

²

µ

²_n

8(2∆

1n

+ b

n

µ

n

)

}

≤ 2 exp{−cc

1

log n } ≤ Cn

⁻¹

. We observe that

E(β

_nj^′

)

²

≤ E(α

^′nj

)

²

≤ 2E(Z

nj²

+ b

²_n

)I( |Z

nj

| > b

n

)

≤ 4b

^−δn

E |Z

nj

|

^2+δ

≤ C(σ

⁻¹n

|t

nj

|)

^2+δ

b

^−δ_n

. Hence, by using Lemma 5.1 we have

P

( |H

n^′

| > cµ

n

2 ) ≤ C E(H

_n^′

)

²

µ

²_n

≤ Cµ

⁻²n

∑

k m=1

lm

∑

+q−1 j=lm

(σ

_n⁻¹

|t

nj

|)

^2+δ

b

^−δ_n

≤ C γ

1n

w

n^δ/2

µ

²_n

b

^δ_n

≤ C(nqp

⁻¹

)

^−δ/2

(log n)

^(δ^−2)/2

. (5.14)

Therefore, (5.13) and (5.14) follow (3.6).

Next we prove (3.7). Following the line as for (3.6). Choosing b

n

= γ

^1/2_2n

(log n)

^−1/2

.

Set Q

n

= ∑

2n

j=k(p+q)−n+1

β

nj

, Q

^′_n

= ∑

2n

j=k(p+q)−n+1

β

^′_nj

. Hence S

_1n^′′′

= Q

n

+ Q

^′_n

. Note that

∆

2n

:=

∑

2n j=k(p+q)−n+1

Eβ

²_nj

≤

∑

2n j=k(p+q)−n+1

EZ

_nj²

≤ Cγ

2n

. So, by using Lemma 5.2(b), for large c > 0, we have

(5.15) P

( |Q

n

| > cν

n

2 ) ≤ 2 exp {

− c

²

ν

²_n

8(2∆

2n

+ b

n

ν

n

)

}

≤ C exp{−cc

2

log n } ≤ Cn

⁻¹

. From E(β

_nj^′

)

²

≤ C(σ

n⁻¹

|t

nj

|)

^2+δ

b

^−δ_n

, and using Lemma 5.1 we have

P

( |Q

^′n

| > cν

n

2 ) ≤ C E(Q

^′_n

)

²

ν

_n²

≤ Cν

n⁻²

∑

2n j=k(p+q)−n+1

(σ

_n⁻¹

|t

nj

|)

^2+δ

b

^−δ_n

≤ C γ

2n

w

^δ/2n

ν

_n²

b

^δ_n

≤ Cp

^−δ/2

(log n)

^(δ^−2)/2

. (5.16)

Therefore, (5.15) and (5.16) yield (3.7).

(15)

As to (3.8), choosing τ

n

= γ

_3n^1/3

= n

^1/3

( ∑

|j|>n

|ψ

j

|)

^2/3

. In view of Lemma 3.1 we have

P ( |S

2n

| > τ

n

) ≤ E(S

2n

)

²

/τ

_n²

≤ Cγ

3n^1/3

.

¤ Acknowledgments. This research was supported by the National Natural Science Foundation of China (10571136, 10871146). The authors are grate- ful to anonymous referee for providing detailed list of comment which greatly improved the presentation of the paper.

References

[1] K. Alam and K. M. L. Saxena, Positive dependence in multivariate distributions, Comm.

Statist. A—Theory Methods 10 (1981), no. 12, 1183–1196.

[2] J. I. Baek, T. S. Kim, and H. Y. Liang, On the convergence of moving average processes under dependent conditions, Aust. N. Z. J. Stat. 45 (2003), no. 3, 331–342.

[3] Z. W. Cai and G. G. Roussas, Kaplan-Meier estimator under association, J. Multivariate Anal. 67 (1998), no. 2, 318–348.

[4] , Berry-Esseen bounds for smooth estimator of a distribution function under association, J. Nonparametr. Statist. 11 (1999), no. 1-3, 79–106.

[5] M. N. Chang and P. V. Rao, Berry-Esseen bound for the Kaplan-Meier estimator, Comm. Statist. Theory Methods 18 (1989), no. 12, 4647–4664.

[6] Z. J. Chen, H. Y. Liang, and Y. F. Ren, Strong consistency of estimators in a het- eroscedastic model under NA samples, Tongji Daxue Xuebao Ziran Kexue Ban 31 (2003), no. 8, 1001–1005.

[7] Y. Fan, Consistent nonparametric multiple regression for dependent heterogeneous pro- cesses: the ﬁxed design case, J. Multivariate Anal. 33 (1990), no. 1, 72–88.

[8] A. A. Georgiev, Local properties of function ﬁtting estimates with application to system identiﬁcation, Mathematical statistics and applications, Vol. B (Bad Tatzmannsdorf, 1983), 141–151, Reidel, Dordrecht, 1985.

[9] , Consistent nonparametric multiple regression: the ﬁxed design case, J. Multi- variate Anal. 25 (1988), no. 1, 100–110.

[10] A. A. Georgiev and W. Greblicki, Nonparametric function recovering from noisy obser- vations, J. Statist. Plann. Inference 13 (1986), no. 1, 1–14.

[11] K. Joag-Dev and F. Proschan, Negative association of random variables, with applica- tions, Ann. Statist. 11 (1983), no. 1, 286–295.

[12] H. Y. Liang, Complete convergence for weighted sums of negatively associated random variables, Statist. Probab. Lett. 48 (2000), no. 4, 317–325.

[13] H. Y. Liang and J. I. Baek, Weighted sums of negatively associated random variables, Aust. N. Z. J. Stat. 48 (2006), no. 1, 21–31.

[14] H. Y. Liang and B. Y. Jing, Asymptotic properties for estimates of nonparametric regression models based on negatively associated sequences, J. Multivariate Anal. 95 (2005), no. 2, 227–245.

[15] H. Y. Liang and C. Su, Complete convergence for weighted sums of NA sequences, Statist. Probab. Lett. 45 (1999), no. 1, 85–95.

[16] P. Matula, A note on the almost sure convergence of sums of negatively dependent random variables, Statist. Probab. Lett. 15 (1992), no. 3, 209–213.

[17] H. G. M¨uller, Weak and universal consistency of moving weighted averages, Period.

Math. Hungar. 18 (1987), no. 3, 241–250.

[18] V. V. Petrov, Limit Theorems of Probability Theory, Oxford University Press, New York, 1995.

(16)

[19] G. G. Roussas, Consistent regression estimation with ﬁxed design points under depen- dence conditions, Statist. Probab. Lett. 8 (1989), no. 1, 41–50.

[20] , Asymptotic normality of random ﬁelds of positively or negatively associated processes, J. Multivariate Anal. 50 (1994), no. 1, 152–173.

[21] , Asymptotic normality of the kernel estimate of a probability density function under association, Statist. Probab. Lett. 50 (2000), no. 1, 1–12.

[22] G. G. Roussas, L. T. Tran, and D. A. Ioannides, Fixed design regression for time series:

asymptotic normality, J. Multivariate Anal. 40 (1992), no. 2, 262–291.

[23] Q. M. Shao, A comparison theorem on moment inequalities between negatively associated and independent random variables, J. Theoret. Probab. 13 (2000), no. 2, 343–356.

[24] Q. M. Shao and C. Su, The law of the iterated logarithm for negatively associated random variables, Stochastic Process. Appl. 83 (1999), no. 1, 139–148.

[25] C. Su, L. C. Zhao, and Y. B. Wang, Moment inequalities and weak convergence for negatively associated sequences, Sci. China Ser. A 40 (1997), no. 2, 172–182.

[26] L. Tran, G. Roussas, S. Yakowitz, and B. T. Van, Fixed-design regression for linear time series, Ann. Statist. 24 (1996), no. 3, 975–991.

[27] S. C. Yang, Uniformly asymptotic normality of the regression weighted estimator for negatively associated samples, Statist. Probab. Lett. 62 (2003), no. 2, 101–110.

Han-Ying Liang

Department of Mathematics Tongji University

Shanghai 200092, P. R. China E-mail address: [email protected]

Yu-Yu Li

Department of Mathematics Tongji University

Shanghai 200092, P. R. China