Goodness-of-fit tests for the inverse Weibull or extreme value distribution based on
multiply type-II censored samples
Suk-Bok Kang 1 · Jun-Tae Han 2 · Yeon-Ju Seo 3 · Jina Jeong 4
13 Department of Statistics, Yeungnam University
24 Research Management Team, Korea Student Aid Foundation
Received 16 May 2014, revised 4 June 2014, accepted 10 June 2014
Abstract
The inverse Weibull distribution has been proposed as a model in the analysis of life testing data. Also, inverse Weibull distribution has been recently derived as a suitable model to describe degradation phenomena of mechanical components such as the dynamic components (pistons, crankshaft, etc.) of diesel engines. In this paper, we derive the approximate maximum likelihood estimators of the scale parameter and the shape parameter in the inverse Weibull distribution under multiply type-II censoring.
We also develop four modified empirical distribution function (EDF) type tests for the inverse Weibull or extreme value distribution based on multiply type-II censored samples. We also propose modified normalized sample Lorenz curve plot and new test statistic.
Keywords: Approximate maximum likelihood estimator, goodness-of-fit test, inverse Weibull distribution, modified normalized sample Lorenz curve, multiply type-II cen- sored sample.
1. Introduction
The probability density function (PDF) and the cumulative distribution function (CDF) of the two-parameter inverse Weibull distribution are given by
g (x; σ, λ) = λσ −λ x −(λ+1) exp −(x σ) −λ , x > 0, σ > 0, λ > 0 (1.1) and
G (x; σ, λ) = exp −(x σ) −λ , x > 0, σ > 0, λ > 0, (1.2) where σ and λ are scale and shape parameters respectively.
This distribution has been recently proposed as a model in the analysis of life testing data.
Calabria and Pulcini (1990) derived the maximum likelihood estimators and the least-square
1
Professor, Department of Statistics, Yeungnam University, Gyeongsan 712-749, Korea.
2
Corresponding author: Manager, Research & Statistics Team, Korea Student Aid Foundation, Seoul 100-753, Korea. E-mail: [email protected]
3
Graduate student, Department of Statistics, Yeungnam University, Gyeongsan 712-749, Korea.
4
Staff member, Research & Statistics Team, Korea Student Aid Foundation, Seoul 100-753, Korea.
estimators of the parameters in the inverse Weibull distribution. Calabria and Pulcini (1994) have studied Bayes 2-sample prediction for inverse Weibull distribution. Maswadah (2003) derived the conditional confidence intervals for the parameters based on the generalized order statistics in inverse Weibull distribution. Mahmoud et al. (2003) derived exact expression for the single moments of order statistics and calculated the double moments of order statistics from the inverse Weibull distribution. Also, they used these moments to obtain the best linear unbiased estimates (BLUEs) for the location and scale parameters.
The most common censoring schemes are type-I and type-II censoring, but the conven- tional type-I and type-II censoring schemes do not have flexibility. Multiply type-II censoring is a generalization of type-II censoring. Multiply type-II censored sampling arises in a life- testing experiment whenever the experimenter does not observe the failure times of some units placed on a like-test. Another situation where multiply censored samples arise natu- rally is when some units failed between two points of observation with the exact times of failure of these units unobserved.
The approximated maximum likelihood estimating method was first developed by Bal- akrishnan (1989) for the purpose of providing explicit estimators of the scale parameter in the Rayleigh distribution. Fei et al. (1995) studied the estimation for the two-parameter Weibull distribution and extreme-value distribution under multiply type-II censoring. They compared the mean squared errors of the maximum likelihood estimators, approximate max- imum likelihood estimators (AMLEs), and BLUEs of the parameters in the extreme value distribution. Balakrishnan et al. (2004) discussed point and interval estimation for the ex- treme value distribution under progressively type-II censoring. Kang (2007) proposed some explicit estimators of scale parameters in the half-triangle distribution under multiply type-II censoring by the approximate maximum likelihood methods. Han and Kang (2008) derived the AMLEs of the scale parameter and the location parameter in a double Rayleigh dis- tribution based on multiply type-II censored samples. Shin and Lee (2012) suggested an estimation method of the parameter in an exponential distribution based on a progressive type-I interval censored sample with semi-missing observation. Kang et al. (2013) obtained Bayes estimators and corresponding credible intervals with the highest posterior density and Bayes predictive intervals for unknown parameters based on progressively type-II censored data from an exponentiated half logistic distribution.
Porter III et al. (1992) developed three modified Kolmogorov-Smirnov, Anderson-Darling, and Cramer-von Mises tests for the Pareto distribution with unknown parameters of the location and scale and known shape parameter based on the complete samples. Puig and Stephens (2000) studied some tests of fit for the Laplace distribution based on the empirical distribution function (EDF) statistics and the application of the Laplace distribution in the least absolute deviations regression. Choulakian and Stephens (2001) studied estimation of parameters and goodness-of-fit tests for the generalized Pareto distribution.
In this paper, we derive the AMLEs of the scale parameter σ and the shape parameter λ under multiply type-II censored sample. We also propose three modified EDF type tests, including the modified Kolmogorov-Smirnov test, and the modified Anderson-Darling test, and the modified Cramer-von Mises test for the inverse Weibull distribution with unknown parameters based on multiply type-II censored samples using the AMLEs. For each test, Monte Carlo techniques are used to generate the critical values. The powers of these tests are also investigated under normal, lognormal, gamma, Weibull distributions.
We consider the modified EDF type tests using AMLEs and the modified normalized
sample Lorenz curve (NSLC) plot to test for the half-logistic distribution based on multiply type-II censored samples. We also propose new test statistics based on the NSLC for the inverse Weibull distribution under multiply type-II censored samples.
2. Approximate maximum likelihood estimators
We assume that n items are put on a life test, but only a 1 th, a 2 th, ..., a s th failures are observed, the rest are unobserved or missing, where a 1 , a 2 , ..., a s are considered to be fixed.
If this censoring arises, the scheme is known as multiply type-II censoring scheme.
If X is a inverse Weibull random variable, then Y = logX has extreme-value distribu- tion with location µ = log(1/σ) and scale parameter θ = 1/λ with PDF and CDF given respectively as;
f (y; µ, θ) = 1 θ exp
− y − µ θ
exp
−exp
− y − µ θ
(2.1)
and F (y; µ, θ) = exp
−exp
− y − µ θ
. (2.2)
Let us assume that the following multiply type-II censored sample from a sample of size n is
Y a
1:n ≤ Y a
2:n ≤ · · · ≤ Y a
s:n , (2.3) where 1 ≤ a 1 < a 2 < · · · < a s ≤ n, a 0 = 0, a s+1 = n + 1, F (y a
0:n ) = 0, and F (y a
s+1:n ) = 1.
The likelihood function based on the multiply type-II censored sample (2.3) is given by
L = n!
Q s+1
j=1 (a j − a j−1 − 1)!
s+1
Y
j=1
[F (z a
j:n ) − F (z a
j−1:n )] a
j−a
j−1−1 1 σ s
s
Y
j=1
f (z a
j:n )
= 1
σ s
n!
Q s+1
j=1 (a j − a j−1 − 1)! [F (z a
1:n )] a
1−1 [1 − F (z a
s:n )] n−a
s(2.4)
×
s
Y
j=1
f (z a
j:n )
s
Y
j=2
[F (z a
j:n ) − F (z a
j−1:n )] a
j−a
j−1−1
,
where z i:n = (y i:n − µ)/θ, and f (z) and F (z) are the pdf and the cdf of the standard Extreme-value distribution, respectively.
Since f 0 (z)/f (z) = e −z − 1, we can obtain the likelihood equations as follows;
∂lnL
∂θ = − 1 θ
"
s + (a 1 − 1)e −z
a1:nz a
1:n − (n − a s ) f (z a
s:n )
1 − F (z a
s:n ) z a
s:n +
s
X
j=1
e −z
aj :nz a
j:n
−
s
X
j=1
z a
j:n +
s
X
j=2
(a j − a j−1 − 1) f (z a
j:n )z a
j:n − f (z a
j−1:n )z a
j−1:n
F (z a
j:n ) − F (z a
j−1:n )
#
(2.5)
= 0,
and
∂lnL
∂µ = − 1
θ
"
(a 1 − 1)e −z
a1:n− (n − a s ) f (z a
s:n ) 1 − F (z a
s:n ) +
s
X
j=1
e −z
aj :n− s
+
s
X
j=2
(a j − a j−1 − 1) f (z a
j:n ) − f (z a
j−1:n ) F (z a
j:n ) − F (z a
j−1:n )
#
(2.6)
= 0.
Since the likelihood equations are very complicated, the equations (2.5) and (2.6) do not admit explicit solutions for θ and µ, respectively.
Let ξ i = F −1 (p i ) = −ln [−lnp i ] where p i = i/(n + 1), q i = 1−p i . First, we can approximate the following function by
f (z a
s:n )
1 − F (z a
s:n ) z a
s:n ' κ 1 + δ 1 z a
s:n , (2.7) e −z
aj :nz a
j:n ' e −ξ
ajξ a 2
j
+ e −ξ
aj(1 − ξ a
j)z a
j:n , (2.8) and
f (z a
j:n )z a
j:n − f (z a
j−1:n )z a
j−1:n
F (z a
j:n ) − F (z a
j−1:n ) ' α 1j + β 1j z a
j:n + γ 1j z a
j−1:n , (2.9) where
κ 1 = − ξ a 2
sq a
sf 0 (ξ a
s) + f 2 (ξ a
s) q a
s, δ 1 = 1 q a
sf (ξ a
s) + ξ a
sf 0 (ξ a
s) + f 2 (ξ a
s) q a
sξ a
s,
α 1j = K 2 − ξ a 2
j
f 0 (ξ a
j) − ξ a 2
j−1
f 0 (ξ a
j−1) p a
j− p a
j−1, β 1j = (1 − K)f (ξ a
j) + ξ a
jf 0 (ξ a
j) p a
j− p a
j−1,
γ 1j = − (1 − K)f (ξ a
j−1) + ξ a
j−1f 0 (ξ a
j−1) p a
j− p a
j−1, K = f (ξ a
j)ξ a
j− f (ξ a
j−1)ξ a
j−1p a
j− p a
j−1.
By substituting the equation (2.7), (2.8), and (2.9) into the equation (2.5), we can derive an estimator of θ as follows;
θ ˆ 1 = −B 1 + C 1 µ ˆ A 1
, (2.10)
where
A 1 = s + (a 1 − 1)e −ξ
a1ξ 2 a
1
− (n − a s )κ 1 +
s
X
j=1
e −ξ
ajξ a 2
j
+
s
X
j=2
(a j − a j−1 − 1)α 1j ,
B 1 = (a 1 − 1)e −ξ
a1(1 − ξ a
1)Y a
1:n − (n − a s )δ 1 Y a
s:n +
s
X
j=1
e −ξ
aj(1 − ξ a
j)Y a
j:n
+
s
X
j=1
Y a
j:n +
s
X
j=2
(a j − a j−1 − 1)(β 1j Y a
j:n + γ 1j Y a
j−1:n ),
C 1 = (a 1 − 1)e −ξ
aj(1 − ξ a
j) − (n − a s )δ 1 +
s
X
j=1
e −ξ
aj(1 − ξ a
j) − s
+
s
X
j=2
(a j − a j−1 − 1)(β 1j + γ 1j ).
Second, we can approximate the following functions by f (z a
s:n )
1 − F (z a
s:n ) ' κ 2 + δ 2 z a
s:n , (2.11) e −z
aj :n' e −ξ
aj(1 + ξ 2 a
j) − e −ξ
ajz a
j:n , (2.12) f (z a
j:n )
F (z a
j:n ) − F (z a
j−1:n ) ' α 2j + β 2j z a
j:n + γ 2j z a
j−1:n , (2.13)
and f (z a
j−1:n )
F (z a
j:n ) − F (z a
j−1:n ) ' α 3j + β 3j z a
j:n + γ 3j z a
j−1:n , (2.14) where
κ 2 = 1 q a
sf (ξ a
s) − ξ a
sf 0 (ξ a
s) − f 2 (ξ a
s) q a
sξ a
s, δ 2 = 1 q a
sf 0 (ξ a
s) + f 2 (ξ a
s) q a
s,
α 2j = (1 + K)f (ξ a
j) − ξ a
jf 0 (ξ a
j) p a
j− p a
j−1, β 2j = f 0 (ξ a
j) p a
j− p a
j−1−
f (ξ a
j) p a
j− p a
j−12 ,
γ 2j = f (ξ a
j)f (ξ a
j−1)
p a
j− p a
j−12 , α 3j = (1 + K)f (ξ a
j−1) − ξ a
j−1f 0 (ξ a
j−1) p a
j− p a
j−1, β 3j = −γ 2j , γ 3j = f 0 (ξ a
j−1)
p a
j− p a
j−1+
f (ξ a
j−1) p a
j− p a
j−12 .
By substituting the equations (2.11)∼(2.14) into the equation (2.5), we can derive an estimator of θ as follows;
θ ˆ 2 = −B 2 + p B 2 2
− 4sC 2
2s , (2.15)
where
B 2 = (a 1 −1)e −ξ
a1(1+ξ 2 a
1
)Y a
1:n −(n − a s )κ 2 Y a
s:n +
s
X
j=1
e −ξ
aj(1 + ξ a 2
j