Bayesian Test for the Intraclass Correlation Coefficient in the One-Way Random Effect Model

(1)

2004, Vol. 15, No. 3, pp. 645∼654

Sang Gil Kang¹⁾ ․ Hee Choon Lee²⁾

Abstract

In this paper, we develop the Bayesian test procedure for the intraclass correlation coefficient in the unbalanced one-way random effect model based on the reference priors. That is, the objective is to compare two nested model such as the independent and intraclass models using the factional Bayes factor. Thus the model comparison problem in this case amounts to testing the hypotheses H1:ρ = 0 versus H₂:ρ≠0. Some real data examples are provided.

Keywords : Fractional Bayes Factor, Independent Model, Intraclass Model, Reference Prior

1. INTRODUCTION

Consider an unbalanced one-way random effect model

Y_ij= μ +α_i+ ε_ij, for i = 1, …, m and j = 1, …, n_i, (1)

where μ is an unknown constant, and the α_i and ε_ij are independent normal variables with 0 means and variances σ²_α and σ², respectively. The testing for intraclass correlation coefficient ρ = σ²_α/(σ²_α+ σ²) is of interest.

The intraclass correlation coefficient and the ratio of variance components in the random effect model has been of interest for a long time, especially in animal

1) First Author : Assistant Professor, Department of Applied Statistics, Sangji University, Wonju, 220-702, Korea

E-mail : [email protected]

2) Professor, Department of Applied Statistics, Sangji University, Wonju, 220-702, Korea E-mail : [email protected]

(2)

of a certain trait of livestock breeders (Graybill et al., 1956). One difficult part of the analysis of the model (1) from the sampling theory of view is the possible negative estimates for σ²_α as well as for σ²_α/σ². Thus a Bayesian analysis for this model is desirable, not only because of its intrinsic merit, but also because its ability to resolving this problem. Also the intraclass correlation ρ = σ²_α/(σ²_α+ σ²) is one-to-one transformation of φ = σ²_α/σ².

The problem of estimating intraclass correlation and the ratio of variance components in the one-way random effect model has been investigated by many authors from the Bayesian point of view. We may refer to Hill (1965), Box and Tiao (1973), Donner (1986), Palmer and Broemeling (1990), among others. For balanced one-way random effect model, Ye (1994) developed the reference priors for φ, examined frequentist coverage probabilities for various φ and compared risk functions of the Bayes estimators for the reference priors. Kim, Kang and Lee (2001) provided a class of second order probability matching priors for φ and ρ.

It is shown that among all of the reference priors, the one-at-a-time reference prior satisfies a second order matching criterion. For the unbalanced random effect model, Datta, Ghosh and Kim (2002) developed the first order probability matching prior for φ. In this case, the second order probability matching prior does not exist.

In Bayesian testing problem, the Bayes factor under proper priors or informative priors have been very successful. However, limited information and time constraints often require the use of noninformative priors. Since noninformative priors such as Jeffreys' priors or reference priors (Berger and Bernardo, 1989, 1992) are typically improper so that such priors are only defined up to arbitrary constants which affects the values of Bayes factors. Spiegalhalter and Smith (1982), O'Hagan (1995) and Berger and Pericchi (1996) have made efforts to compensate for that arbitrariness.

Berger and Pericchi (1996) introduced the intrinsic Bayes factor using a data-splitting idea, which would eliminate the arbitrariness of improper priors.

O'Hagan (1995) proposed the fractional Bayes factor. For removing the arbitrariness he used to a portion of the likelihood with a so-called the fraction b.

These approaches have shown to be quite useful in many statistical areas.

The present paper focuses on Bayesian testing procedure for the intraclass correlation coefficient in the unbalanced one-way random effect model. For dealing Bayesian testing for the intraclass correlation coefficient in the unbalanced one-way random effect model, we use the fractional Bayes factor (O'Hagan, 1995).

The outline of the remaining sections is as follows. In Section 2, using the reference priors, we provide the Bayesian testing procedure based on the fractional Bayes factor. In Section 3, some real examples are given.

(3)

2. BAYESIAN TEST PROCEDURES

2.1 Preliminaries

Models (or Hypotheses) H₁, H₂, …, H_q are under consideration, with the data x = ( x₁,x₂, … ,x_n) having probability density function f_i( x ∣ θ_i) under model H_i,i= 1,2,…,q. The parameter vectors θ_i are unknown. Let π_i( θ_i) be the prior distribution of model Hi, and let pi be the prior probabilities of model Hi,

i = 1,2,…,q. Then the posterior probability that the model H_i is true is

P( H_i∣ x ) =(^∑^{j = 1}^q ^p^p^jⁱ ^⋅B ^ji)^{- 1}, (1) where B_ji is the Bayes factor of model H_j to model H_i defined by

B_ji= m_j( x ) m_i( x ) =

⌠⌡f^j( x ∣ θ_j)π_j( θ_j)d θ_j

⌠⌡fⁱ( x∣ θ_i)π_i( θ_i)d θ_i . (2) The Bji interpreted as the comparative support of the data for the model j to i. The computation of B_ji needs specification of the prior distribution π_i( θ_i) and π_j( θ). Usually, one can use the noninformative prior, often improper, such as uniform prior, Jeffreys prior, reference prior or probability matching prior. Denote it as π^N_i. The use of improper priors π^N_i(⋅) in (2) causes the Bji to contain unspecified constants. To solve this problem, O'Hagan (1995) proposed the fractional Bayes factor for Bayesian testing and model selection problem as follow.

When the π^N_i( θ_i) is noninformative prior under H_i, equation (2) becomes

B^N_ji=

⌠⌡f^j( x ∣ θ_j)π^N_j( θ_j)d θ_j

⌠⌡fⁱ( x∣ θ_i)π^N_i( θ_i)d θ_i .

Then the fractional Bayes factor of model Hj versus model Hi is

B^F_ji= B^N_ji⋅

⌠⌡f^bⁱ( x ∣ θ_i)π^N_i( θ_i)d θ_i

⌠⌡f^b^j( x∣ θ_j)π^N_j( θ_j)d θ_j = B^N_ji⋅ m^b₁( x) m^b₂( x) ,

(4)

f_i( x ∣ θ_i) is the likelihood function and b specifies a fraction of the likelihood which is to be used as a prior density. He proposed three ways for the choice of the fraction b. One frequently suggested choice is b = m/n, where m is the size of the minimal training sample, assuming this is well defined. (see O'Hagan, 1995, 1997 and the discussion by Berger and Mortera of O'Hagan, 1995).

2.2 Bayesian Model Selection using Fractional Bayes Factor

In the unbalanced one-way random effect model, the intraclass correlation coefficient is given by ρ = σ²_α/(σ²_α+ σ²). We want to test the hypotheses H₁:ρ = 0 versus H₂:ρ≠0. That is, we compare two nested models such as independent and intraclss model. In this hypothesis testing problem, our objective is to develop a Bayesian test based on the fractional Bayes factors under the reference priors.

Under the hypothesis H₁, the model is the independent model. Thus the likelihood function of parameters ( μ, σ²) is given by

L( μ,σ²) = ( 1

2π )^N/2σ^{- N}exp { - 1

2σ² [S²+ ∑^m

i = 1n_i( y_i- μ)²] }, where S²= ∑^m

i = 1∑

ni

j= 1(y_ij- y_i)², y_i=∑

ni

j= 1y _ij/n_i and N = ∑^m

i = 1∑

ni

j= 1n_i. And the reference prior for μ and σ² is given by

π₁(μ,σ²) ∝ σ ^{- 2}.

Then the element of fractional Bayes factor under M₁ is given by

m^b₁( y ) = ⌠⌡

∞ 0

⌠⌡

∞

- ∞L^b(μ,σ²∣ y )π₁(μ,σ²)dμdσ²

= π^-

Nb - 1

2 N ^-

1 2b^-

Nb

2 Γ(Nb - 1

2 )[∑

m i = 1∑

ni

j = 1y²_ij- N y²] ^-

Nb - 1

2 ,

where y = ∑^m

i = 1∑

ni

j = 1y _ij/N .

For the hypothesis H2, the model becomes intraclass model. In this case, the reference prior developed by Datta, Ghosh and Kim (2002). This prior is given by

π₂(θ₁, θ₂, θ₃)

∝ ( 1 - θ₁)^{- 2}θ^{- 1}₂ {∑^m

i = 1n²_i(1+ n_iθ₁

1 - θ₁ )^{- 2}- 1 N [∑^m

i = 1n_i(1 + n_iθ₁

1- θ₁ )^{- 1}]²}

1 2,

(5)

where θ₁= σ²_α/(σ²+ σ²_α), θ2= σ²[∏

m

i = 1(σ²+ n_iσ²_α)/σ²] ^1/N and θ₃= μ. The expression of original parametrization of this prior is convenient for the computation of the marginal distribution. So in original parametrization ( μ,σ²_α, σ²), the reference prior for μ, σ²α and σ² is given by

π₂(μ,σ²_α, σ²) ∝ σ^{- 4}[∑

m i = 1

n²_iσ⁴

(σ²+ n_iσ²_α)² - 1 N (∑

m i = 1

n_iσ²

σ²+ n_iσ²_α )²] ^1/2. And the likelihood function is

L( μ,σ²_α,σ²) = ( 1

2π )^N/2σ^{- N} ∏^m

i = 1( σ²+ n_iσ²_α σ² )^-

1 2

× exp {- 1

2σ² [S²+∑^m

i = 1

n_iσ²

σ²+ n_iσ²_α ( y_i- μ)²] }.

Thus the element of fractional Bayes factor under H2 gives as follows.

m^b₂( y ) = ⌠⌡

∞ 0

⌠⌡

∞ 0

⌠⌡

∞

- ∞L^b(μ , σ²_α, σ²∣ y )π₂(μ, σ²_α,σ²)dμdσ²_αdσ²

= π ^-

Nb - 1 2 b^-

Nb

2 Γ( Nb - 1

2 )T( y ;b), where

T( y ;b)

= ⌠⌡

∞ 0 (∑^m

i = 1

n_i 1+ n_iθ ) ^-

1 2 ∏^m

i = 1(1+ n_iθ)^-

b

2[S²+ ∑^m

i = 1

n_i

1+ n_iθ ( y_i²- μ²_θ)] ^-

Nb - 1 2

× [∑

m i = 1

n²_i

(1+ n_iθ)² - 1 N (∑

m i = 1

n_i 1+ n_iθ )²]

1 2dθ,

where μθ= ( ∑

m i = 1

n_iy_i

1+ n_iθ )/(∑

m i = 1

n_i

1+ n_iθ ). Therefore the B^N₂₁ is given by

B^N₂₁= N

1

2T( y ;1) [∑

m i = 1∑

ni

j= 1y²_ij- N y²] ^-

N - 1 2

.

Moreover

(6)

m^b₁( y ) m^b₂( y ) =

[∑

i = 1∑

j= 1y²_ij- N y²] ² N

1

2T( y ;b)

.

Thus the fractional Bayes factor of H2 versus H1 is given by

B^F₂₁ = T( y ;1) T( y ;b) [ ∑^m

i = 1∑

ni

j= 1y²_ij- N y²]^-

Nb - N 2 .

Note that the calculation of the fractional Bayes factor of H2 versus H1 is requires an one dimensional integration.

Remark 1. In the balanced one-way random effect model, the element of fractional Bayes factor under H1 is given by

m^b₁( y ) = π ^-

Nb - 1

2 N ^-

1 2b^-

Nb

2 Γ( Nb - 1

2 )[∑

m i = 1 ∑

n

j = 1y²_ij- N y²]^-

Nb - 1

2 ,

where y = ∑^m

i = 1 ∑ⁿ

j = 1y_ij/N and N = mn . For H2, the reference prior is given by π₂(μ,σ²_α, σ²) ∝ σ^{- 2}(σ²+ n σ²_α)^{- 1}.

This reference prior satisfies a second order matching criterion (Kim, Kang and Lee, 2001). Therefore the element of fractional Bayes factor under H₂ gives as follows.

m^b₂( y ) = π^-

Nb - 1

2 N ^-

1 2b^-

Nb

2 Γ( Nb - 1

2 )T( y ;b), where

T( y ;b) = ⌠⌡

∞

0 (1+ n θ) ^-

mb - 1 2 - 1

[S²₁+ n

1 + nθ S²₂]^-

Nb - 1

2 dθ.

Here S²₁= ∑^m

i = 1 ∑ⁿ

j = 1(y_ij- y_i)², y _i= ∑ⁿ

j = 1y_ij/n and S²₂= ∑^m

i = 1( y_i- y )², y = ∑^m

i = 1∑ⁿ

j = 1y_ij/N . Therefore the B^N₂₁ is given by

B^N₂₁= T( y ;1) [∑^m

i = 1 ∑ⁿ

j = 1y²_ij- N y²] ^-

N - 1 2

.

Moreover

(7)

m^b₁( y ) m^b₂( y ) =

[ ∑

m i = 1∑

n

j = 1y²_ij- N y²] ^-

Nb - 1 2

T( y ;b) .

Thus the fractional Bayes factor of H₂ versus H₁ is given by

B^F₂₁ = T( y ;1) T( y ;b) [∑^m

i = 1 ∑ⁿ

j = 1y²_ij- N y²]^-

Nb - N 2 .

3. NUMERICAL STUDIES

To investigate the Bayesian test procedures, we consider three real examples.

Example 1. This example taken from Swallow and Searle (1978). This example is also discussed by Datta, Ghosh and Kim (2002). The data consist of measurements of vegetable oil in bottles. These bottles are filled by a multiple-head machine, and the variability between these heads represents the between-group variability. There are N = 16 observations in m = 5 groups. The sizes of these groups are 4, 2, 5, 3 and 2. The measurements are given Table 1.

Table 1: New Weights of Vegetable Oil Fills by Group

1 2 3 4 5

15.70 15.68 15.64 15.60

15.69 15.71

15.75 15.82 15.75 15.71 15.84

15.68 15.66 15.59

15.65 15.60

The value of fractional Bayes factor of H2 versus H1 is B^F₂₁= 2.5057. We assume that the prior probabilities are equal. Then the posterior probability for H1

is 0.2853. Thus there is strong evidence for H₂ in terms of the posterior probability.

Example 2. The data in Table 2 is taken from Montgomery (2001). A textile company weaves a fabric on a large number of looms. It would like the looms to be homogeneous so that it obtains a fabric of uniform strength. The process engineer suspects that, in addition to the usual variation in strength within samples of fabric from the same loom, there may be significant variations in strength between looms. To investigate this, she selects four looms at random and

(8)

The value of fractional Bayes factor of H₂ versus H₁ is B^F₂₁= 19.4595. We assume that the prior probabilities are equal. Then the posterior probability for H1

is 0.0489. Thus there is strong evidence for H₂ in terms of the posterior probability.

Table 2: Strength Data

Looms

Observations 1 2 3 4 1

2 3 4

98 97 99 96 91 90 93 92 96 95 97 95 95 96 99 98

Example 3. This example is from Ott(1993). Two graduate students working for a professor in electrical engineering have been funded to record lightning discharge intensities at 3 tracking stations. Because of the high frequency of thunderstorms in the summer months, the graduate students were to choose a point a random on a map of the 20-mile-radius region and assemble their tracking equipment. Each day during the hours from 8 AM to 5 PM, they were to monitor their instrument until the maximum intensity had been recorded for 5 separate storms. The process was then repeated separately at the two other locations chosen at random. The sample data appear in Table 3.

Table 3: Lightning Discharge Intensities

Tracking station

Intensities

1 2 3

20 1050 3200 5600 50 4300 70 2560 3650 80 100 7700 8500 2960 3340

The value of fractional Bayes factor and arithmetic intrinsic Bayes factor of H2

versus H1 is B^F₂₁= 0.0863. We assume that the prior probabilities are equal.

Then the posterior probability for H1 is 0.9205. Thus there is strong evidence for H₁ in terms of the posterior probability.

(9)

REFERENCES

1. Berger, J. O. and Bernardo, J. M. (1989). Estimating a Product of Means : Bayesian Analysis with Reference Priors. Journal of the American Statistical Association, 84, 200-207.

2. Berger, J. O. and Bernardo, J. M. (1992). On the Development of Reference Priors (with discussion). Bayesian Statistics IV, J. M.

Bernardo, et. al., Oxford University Press, Oxford, 35-60.

3. Berger, J. O. and Pericchi, L. R. (1996). The Intrinsic Bayes Factor for Model Selection and Prediction, Journal of the American Statistical Association, 91, 109-122.

4. Box, G. and Tiao, G. (1973). Bayesian Inference in Statistical Analysis.

Addison-Wesly, Readings, Massachusetts.

5. Datta, G. S., Ghosh, M. and Kim, Y. H.(2002). Probability Matching Priors For One-Way Unbalanced Random Effect Models. Statistics &

Decisions, 20, 29-51.

6. Donner, A. (1986). A Review of Inference Procedures for the Intraclass Correlation Coefficient in the One-Way Random Effects Model.

International Statistical Review, 54, 67-82.

7. Graybill, F. A., Martin, F. and Godfrey, G. (1956). Confidence Intervals for Variance Ratios Specifying Genetic Heritability. Biometrics, 12, 99-109.

8. O' Hagan, A. (1995). Fractional Bayes Factors for Model Comparison (with discussion), Journal of Royal Statistical Society, B, 57, 99-118.

9. O' Hagan, A. (1997). Properties of Intrinsic and Fractional Bayes Factors, Test, 6, 101-118.

10. Hill, B. (1965). Inference about Variance Components in the One-Way Model. Journal of the American Statistical Association, 60, 806-825.

11. Kim, D. H., Kang, S. G. and Lee, W. D. (2001). On Second Order Probability Matching Criterion in the One-Way Random Effect Model.

The Korean Communications in Statistics, 8, 29-37.

12. Montgomery, C. M. (2001). Design and Analysis of Experiments. John Wiley & Sons, Inc.

13. Ott, R. L. (1993). An Introduction to Statistical Methods and Data Analysis. Duxbury Press, Belmont, CA.

14. Palmer, J. L. and Broemeling, L. D. (1990). A Comparison of Bayes and Maximum Likelihood Estimation of the Intraclass Correlation Coefficient.

Communications in Statistics - Theory and Methods, 19, 953-975.

15. Swallow, W. H. and Searle, S. R. (1978). Minimum Variance Quadratic Unbiased Estimation (MIVQUE) of Variance Components, Technometrics, 20, 265-272.

(10)

Linear and Log-Linear Models with Vague Prior Information, Journal of Royal Statistical Society, B, 44, 377-387.

17. Ye, K. (1994). Bayesian Reference Prior Analysis on the Ratio of Variances for the Balanced One-Way Random Effect Model. Journal of Statistical Planning and Inference, 41, 267-280.

[ received date : May. 2004, accepted date : Aug. 2004 ]