Confidence intervals for the stress-strength reliability of the generalized exponential distribution
Hong Kyung Pak 1 · Sang Gil Kang 2 · Woo Dong Lee 3
1 Department of Medical Information Technology and Data Science, Daegu Haany University
2 Department of Computer and Data Information, Sangji University
3 Department of Cosmeceutical Engineering, Daegu Haany University
Received 20 August 2018, revised 13 September 2018, accepted 17 September 2018
Abstract
In this article, two likelihood-based methods are developed to obtain the confidence intervals for the stress-strength reliability of the generalized exponential distribution.
When the parameter of interest is the stress-strength reliability of the generalized exponential distribution, we construct the confidence intervals of the stress-strength reliability based on the signed log-likelihood ratio statistic and the modified signed log-likelihood ratio statistic for the parameter of interest. Simulations are performed to show the accuracy of the proposed method in terms of estimated coverage probability and bilateral symmetry of the confidence intervals when the sample size is small. And an example is provided.
Keywords: Confidence interval, generalized exponential distribution, likelihood-based inference, modified signed log-likelihood ratio statistic, signed log-likelihood ratio statis- tic, stress-strength reliability.
1. Introduction
Kundu and Gupta (1999) introduced the one parameter generalized exponential (GE) distribution as an alternative to the gamma or Weibull distribution for fitting lifetime data.
This GE distribution is very flexible to modeling a lifetime data in place of the gamma or Weibull distributions (see also Gupta and Kundu, 2001; Kundu and Gupta, 2007; Raqab, 2002; Raqab and Ahsanullah, 2001). Specially, the GE distribution has many properties that are quite similar to those of the gamma distribution. But this GE distribution has the cumulative distribution function and the failure rate in an explicit form. And the distribution of the transformed random sample distributes as the χ 2 distribution.
1
Professor, Department of Medical Information Technology and Data Science, Daegu Haany University, Kyungsan 38610, Korea.
2
Professor, Department of Computer and Data Information, Sangji University, Wonju 26339, Korea.
3
Corresponding author: Professor, Department of Cosmeceutical Engineering, Daegu Haany University,
Kyungsan 38610, Korea. E-mail : [email protected]
Since the failure rate of the GE is increasing, constant or decreasing according to the value of the shape parameter, it is a versatile distribution to analyze a lifetime data. Gupta and Kundu (2002) and Kundu and Gupta (2007) studied the statistical inference about the GE distribution. They developed the maximum likelihood estimator (MLE) and proved the sampling distributions of the estimators. They also developed the confidence intervals of the parameters based on the large sample properties of the MLE.
The statistical inference related with this stress-strength reliability is very common. In statistical reliability analysis, a system fails when the stress exceeds the strength. In other words, the system fails if and only if at any time the stress is greater than the strength. This stress-strength reliability is a measure of the system performance and has a meaning in the context of the reliability engineering. There are many studies related with statistical inference of the stress-strength reliability under various distributions such as bivariate exponential distribution (Awad et al., 1981), bivariate normal distribution (Gupta and Gupta, 1990) and exponential distribution (Tong, 1974, 1977). Kundu and Gupta (2005) studied the estimation of the stress-strength reliability for the GE distribution. They developed the MLE, uniformly minimum variance unbiased estimator and Bayes estimator of the stress-strength reliability.
Wong and Wu (2009) studied the interval estimation of the stress-strength reliability of GE using lower record data. They developed a likelihood-based analysis using Lagrange multiplier technique. When the parameter of interest is given, we use the transformation of a nuisance parameter which is orthogonal to the parameter of interest. This transformation simplifies the development of confidence intervals based on the likelihood-based statistics. In this sense, the approach is different to this paper. Lee et al. (2018) developed the likelihood based inference when the parameter of interest is the ratio of the shape parameters of the GE distributions.
In this paper, we focus on constructing the confidence intervals of the stress-strength reli- ability of the GE distribution. Generally, to construct a confidence interval of a parameter, the exact distribution of the estimator is necessary. When the exact distribution of the estimator is hard to obtain, we must resort to an approximation method like the Central Limit Theorem. However, even though the approximation method is developed, the cover- age probability of the confidence interval may be inaccurate and the confidence interval is quite asymmetric when the sample size is small. Cox and Hinkley (1974) introduced the signed log-likelihood ratio (SLR) statistic. This SLR statistic converges in distribution to the standard normal distribution when the sample size is large. But when the sample size is small, this statistic is quite inaccurate. The modified signed log-likelihood ratio (MSLR) statistic of Barndorff-Nielsen (1986, 1991) is highly accurate in the sense that it converges to the standard normal distribution very fast. So, the MSLR statistic is accurate when the sample size is small. However, the computation of the MSLR statistic is difficult.
The main purpose of this paper is to develop the confidence intervals of the stress-strength
reliability based on SLR and MSLR. Specially, we use an orthogonal transformation to
simplify computation. We want to show that the proposed confidence intervals are working
very well even with a small sample size. This paper is arranged as follows. In Section 2, the
confidence intervals based on the exact statistics, the SLR statistic and the MSLR statistic
will be developed. In Section 3, some simulations are performed to show the performances
of the proposed methods. An example using simulated data will be given. Section 4 presents
the conclusions of this paper.
2. Development of confidence intervals based on likelihood-based methods
The probability density function of the GE distribution with shape parameter λ is given by
f (x|λ) = λe −x (1 − e −x ) λ−1 , x > 0, (2.1) where λ > 0. We denote this distribution as GE(λ). The distribution function of GE(λ) is
F (x|λ) = (1 − e −x ) λ , x > 0, λ > 0.
Let X be a stress of a system whose distribution is GE(α) and Y be a strength with GE(β). Assume that X and Y are independent. Then the stress-strength reliability ψ is given by
ψ = P (X < Y ) = Z ∞
0
Z y 0
αe −x (1 − e −x ) α−1 βe −y (1 − e −y ) β−1 dxdy
= β
α + β . (2.2)
ψ is the probability of functioning of an item or a system which is subject to stress X.
Let X i , i = 1, · · · , m denote random samples of stress from GE(α) and Y j , j = 1, 2, · · · , n denote random sample of strength from GE(β). Then the likelihood function of α and β given observation x = (x i , y j ), i = 1, 2, · · · , m, j = 1, 2, · · · , n is given by
L(α, β|x) = α m e − P
mi=1x
im
Y
i=1
(1 − e −x
i) α−1 β n e − P
nj=1y
jn
Y
j=1
(1 − e −y
j) β−1
= α m β n e −t
1−t
2m
Y
i=1
(1 − e −x
i) α−1
n
Y
j=1
(1 − e −y
j) β−1 , (2.3)
where t 1 = P m
i=1 x i and t 2 = P n j=1 y j .
From the likelihood function (2.3), the MLE of α and β are calculated by α = − b m t
3
and β = − b t n
4
, where t 3 = P m
i=1 log (1 − e −x
i) and t 4 = P n
j=1 log (1 − e −y
j), respectively.
By the invariance property of the MLE, the MLE of ψ is given by
ψ = b β b α + b b β =
1 + α b
β b
−1
.
Following Kundu and Gupta (2005) or Wong and Wu (2009), since −2αt 3 ∼ χ 2 (2m) and
−2βt 4 ∼ χ 2 (2n), where χ 2 (ν) denotes a χ 2 distribution with degree of freedom ν, and these two statistics are stochastically independent,
α b β
β α b = 1 − ψ ψ
ψ b
1 − b ψ ∼ F (2m, 2n), (2.4)
where F (ν 1 , ν 2 ) denotes a F distribution with degrees of freedom ν 1 and ν 2 . Then, the exact 100(1 − γ)% confidence interval for ψ is given by
1 1 + 1− b ψ
ψ b F (1 − γ/2; 2m, 2n)
, 1
1 + 1− b ψ
ψ b F (γ/2; 2m, 2n)
, (2.5)
where F (γ; ν 1 , ν 2 ) is the 100γ% th percentile of F distribution with degrees of freedom ν 1 and ν 2 .
To develop the likelihood-based inference for ψ, let ω = α m β n be a nuisance parameter.
This transformation plays an important role in developing highly accurate methods. By this transformation, the parameter of interest ψ and the nuisance parameter ω is orthogonal, and thus the constraint MLE of ω is obtained in an explicit form and the computation of MSLR is simplified. As mentioned in the introduction, Wong and Wu (2009) also developed MSLR in this model, but they did not consider transformation of nuisance parameter. Though their results were also compatible to our results, their approach is different from ours. We can simplify the computation of MSLR by the orthogonal transformation of parameters.
Let θ = (ψ, ω). Under this reparametrization, the log-likelihood of θ = (ψ, ω) is given by
l ≡ l(θ) = l(ψ, ω) = log(ω) − t 1 − t 2 − t 3 − t 4 + ω
N1t 3 ( 1 − ψ ψ )
n N
+ t 4 ( 1 − ψ ψ )
−
mN! , (2.6)
where N = m + n. Also, the MLE of ω is given by
ω = b α b m β b n .
The first partial derivatives of the log-likelihood (2.6) with respect to ψ and ω are given by
l ψ; ≡ ∂l
∂ψ = ω
N1ψ −2 N
1 − ψ ψ
−
mNm ψ
1 − ψ t 4 − nt 3
, (2.7)
l ω; ≡ ∂l
∂ω = 1
ω + ω
N1−1 N t 3
1 − ψ ψ
Nn+ t 4
ψ
1 − ψ
mN!
. (2.8)
When the parameter of interest ψ is fixed, the constrained MLE of ω is
ω b ψ = − N
( 1−ψ ψ )
Nnt 3 + ( 1−ψ ψ ) −
mNt 4
! N
, (2.9)
and let b θ ψ = (ψ, ω b ψ ).
2.1. Confidence interval based on the signed log-likelihood ratio statistic The SLR statistic r is given by
r ≡ r(ψ) = sgn( b ψ − ψ) n
2[l(b θ) − l(b θ ψ )] o
12, (2.10)
where sgn(a) is 1 if a > 0, 0 if a = 0 and -1 if a < 0. This statistic r converges in distribution to the standard normal distribution when the sample size n → ∞
Using r, the approximate 100(1 − γ)% confidence interval for ψ can be obtained from n
ψ; |r(ψ)| ≤ z
γ2
o
, (2.11)
where z
γ2
is the 100(1 − γ 2 )% th percentile of the standard normal distribution.
It is well known that this statistic r distributes as the standard normal distribution with rate O p (n −
12) (Cox and Hinkley, 1974). When the sample size n is small, the approxima- tion to the standard normal distribution is not accurate. This fact may cause problems in constructing confidence interval about ψ with small sample size.
2.2. Confidence interval based on the modified signed log-likelihood ratio statis- tic
There are various methods to improve the accuracy of the approximation to normality of the SLR statistic. The MSLR statistic of Barndorff-Nielsen (1986, 1991) is an excellent method to improve the accuracy of the approximation. Based on r, the MSLR statistic r ∗ has the form
r ∗ ≡ r ∗ (ψ) = r(ψ) + r(ψ) −1 log u(ψ) r(ψ)
.
This statistic approximately distributes as the standard normal distribution to the order of O p (n −
32). Therefore, r ∗ is more accurate than r when the sample size is small.
The 100(1 − γ)% confidence interval for ψ can be developed from n
ψ; |r ∗ (ψ)| ≤ z
γ2
o
. (2.12)
If the log-likelihood function l(θ) can be written as l(θ; b θ, a), where a is an ancillary statistic, then u(ψ) is given by
u(ψ) =
l ;b θ (b θ) − l ;b θ (b θ ψ ) l ω;b θ (b θ ψ ) n
j θθ (b θ)
j ωω (b θ ψ )
o
12,
where the sample-space derivatives are defined as
l ;b θ (θ) = ∂l(θ; b θ, a)
∂ b θ , the mixed derivatives as
l ω;b θ (θ) = ∂ 2 l(θ; b θ, a)
∂ω∂ b θ = ∂
∂ω l ;b θ (θ),
j θθ (b θ) is the observed information matrix and j ωω (b θ ψ ) is the observed nuisance information
matrix. Except full-rank exponential family distributions and certain transformation models,
it is often difficult or impossible to compute the statistic u.
Since our model is a full-rank exponential model, the log-likelihood function (2.6) based on the data x is only related to the minimal sufficient statistic t = (t 3 , t 4 ). There is a one- to-one transformation between b θ and t, and the transformation Jacobian matrix is ∂ b θ/∂t.
The sample-space derivatives with respect to b θ in the formula for u can be computed based on the sample-space derivatives with respect to t. Since j θθ (b θ) = l θ;b θ (b θ) (Barndorff-Nielsen and Cox, 1994) and by canceling the determinant of the transformation Jacobian matrix, the statistic u can be reduced to
u(ψ) =
l ;t (b θ) − l ;t (b θ ψ ) l ω;t (b θ ψ )
l θ;t (b θ)
j θθ (b θ)
j ωω (b θ ψ )
1 2
,
where the sample-space derivatives l ;t (θ) = ∂l(θ; t)/∂t and the mixed derivatives l ω;t (θ) =
∂ 2 l(θ; t)/∂ω∂t.
In our case, we have found that
l ψψ; = ∂ 2 l
∂ψ 2 = ω
N1N
ψ
1 − ψ
mNψ −3
1
1 − ψ + 2
nt 3 − mt 4
ψ 1 − ψ
,
l ψω; = ∂ 2 l
∂ψ∂ω = ω
N1−1 ψ −2 N
ψ
1 − ψ
mNm N
ψt 4
1 − ψ − nt 3
N
,
l ωω; = ∂ 2 l
∂ω 2 = − 1
ω 2 + ω
N1−2 N
1 N − 1
t 3 1 − ψ ψ
Nn+ t 4
ψ
1 − ψ
mN! .
Then the observed information matrix j θθ (θ) is j θθ (θ) =
−l ψψ; −l ψω;
−l ψω; −l ωω;
, and j ωω (θ) = −l ωω; . The sample-space derivative is
l ;t = (l ;t
3l ;t
4) T =
1−ψ ψ
Nnω
N1t 3
ψ 1−ψ
mNω
N1t 4
T
,
and the mixed derivative is
l θ;t =
l ψ;t
3l ψ;t
4l ω;t
3l ω;t
4=
− N ψ nω
2N1t
3