Bayesian test of homogenity in small areas: A discretization approach
MinSup Kim 1 · Balgobin Nandram 2 · Dal Ho Kim 3
13 Department of Statistics, Kyungpook National University
2 Department of Mathematical Sciences, Worcester Polytechnic Institute
Received 29 September 2017, revised 30 October 2017, accepted 1 November 2017
Abstract
This paper studies Bayesian test of homogeneity in contingency tables made by dis- cretizing a continuous variable. Sometimes when we are considering events of interest in small area setup, we can think of discretization approaches about the continuous vari- able. If we properly discretize the continuous variable, we can find invisible relationships between areas (groups) and a continuous variable of interest. The proper discretization of the continuous variable can support the alternative hypothesis of the homogeneity test in contingency tables even if the null hypothesis was not rejected through k-sample tests involving one-way ANOVA. In other words, the proportions of variables with a particular level can vary from group to group by the discretization. If we discretize the the continuous variable, it can be treated as an analysis of the contingency table.
In this case, the chi-squared test is the most commonly employed method. However, further discretization gives rise to more cells in the table. As a result, the count in the cells becomes smaller and the accuracy of the test becomes lower. To prevent this, we can consider the Bayesian approach and apply it to the setup of the homogeneity test.
Keywords: Contingency table, Dirichlet prior, discretization, hierarchical Bayesian model, test of homogeneity.
1. Introduction
In many studies, one of our main concerns is to test the relationship between more than two groups, especially homogeneity. When we analyze the homogeneity of k-samples with more than two groups, parametric method, one-way ANOVA test and nonparametric methods, Kruskal-Wallis test and Anderson-Darling test, which are ranked tests, can be used (Scholz and Stephens, 1987).
Sometimes when we are considering events of interest, we can think of discretization ap- proaches such as the maximally selected chi-square statistics which Miller and Siegmund
1
Ph.D. candidate, Department of Statistics, Kyungpook National University, Daegu 41566, Korea.
2
Professor, Department of Mathematical Sciences, Worcester Polytechnic Institute, 100 Institute Road, Worcester, MA 01609, USA.
3