Testing Homogeneity in a heteroscedastic contaminated normal mixture
By: Xiaoqing Niu, Pengfei Li, Yuejiao Fu
Potential Business Impact:
Finds real patterns in noisy science data.
Large-scale simultaneous hypothesis testing appears in many areas such as microarray studies, genome-wide association studies, brain imaging, disease mapping and astronomical surveys. A well-known inference method is to control the false discovery rate. One popular approach is to model the $z$-scores derived from the individual $t$-tests and then use this model to control the false discovery rate. We propose a new class of contaminated normal mixtures for modelling $z$-scores. We further design an EM-test for testing homogeneity in this class of mixture models. We show that the EM-test statistic has a shifted mixture of chi-squared limiting distribution. Simulation results show that the proposed testing procedure has accurate type I error and significantly larger power than its competitors under a variety of model specifications. A real-data example is analyzed to exemplify the application of the proposed method.
Similar Papers
Analysis of hypothesis tests for multiple uncertain finite populations with applications to normal uncertainty distributions
Methodology
Tests if groups of data are different.
$t$-Testing the Waters: Empirically Validating Assumptions for Reliable A/B-Testing
Methodology
Checks if online tests give true results.
Empirical partially Bayes two sample testing
Methodology
Finds tiny differences in biology data.