Score: 0

Evaluating Informative Cluster Size in Cluster Randomized Trials

Published: October 1, 2025 | arXiv ID: 2510.01127v1

By: Bryan S. Blette , Zhe Chen , Brennan C. Kahan and more

Potential Business Impact:

Helps scientists know if group size matters in studies.

Business Areas:
A/B Testing Data and Analytics

In cluster randomized trials, the average treatment effect among individuals (i-ATE) can be different from the cluster average treatment effect (c-ATE) when informative cluster size is present, i.e., when treatment effects or participant outcomes depend on cluster size. In such scenarios, mixed-effects models and generalized estimating equations (GEEs) with exchangeable correlation structure are biased for both the i-ATE and c-ATE estimands, whereas GEEs with an independence correlation structure or analyses of cluster-level summaries are recommended in practice. However, when cluster size is non-informative, mixed-effects models and GEEs with exchangeable correlation structure can provide unbiased estimation and notable efficiency gains over other methods. Thus, hypothesis tests for informative cluster size would be useful to assess this key phenomenon under cluster randomization. In this work, we develop model-based, model-assisted, and randomization-based tests for informative cluster size in cluster randomized trials. We construct simulation studies to examine the operating characteristics of these tests, show they have appropriate Type I error control and meaningful power, and contrast them to existing model-based tests used in the observational study setting. The proposed tests are then applied to data from a recent cluster randomized trial, and practical recommendations for using these tests are discussed.

Country of Origin
🇺🇸 United States

Page Count
25 pages

Category
Statistics:
Methodology