Learning from Neighbors with PHIBP: Predicting Infectious Disease Dynamics in Data-Sparse Environments
By: Edwin Fong, Lancelot F. James, Juho Lee
Modeling sparse count data, which arise across numerous scientific fields, presents significant statistical challenges. This chapter addresses these challenges in the context of infectious disease prediction, with a focus on predicting outbreaks in geographic regions that have historically reported zero cases. To this end, we present the detailed computational framework and experimental application of the Poisson Hierarchical Indian Buffet Process (PHIBP), with demonstrated success in handling sparse count data in microbiome and ecological studies. The PHIBP's architecture, grounded in the concept of absolute abundance, systematically borrows statistical strength from related regions and circumvents the known sensitivities of relative-rate methods to zero counts. Through a series of experiments on infectious disease data, we show that this principled approach provides a robust foundation for generating coherent predictive distributions and for the effective use of comparative measures such as alpha and beta diversity. The chapter's emphasis on algorithmic implementation and experimental results confirms that this unified framework delivers both accurate outbreak predictions and meaningful epidemiological insights in data-sparse settings.
Similar Papers
Exchangeable Gaussian Processes with application to epidemics
Methodology
Predicts disease outbreaks more accurately.
Sparse Bayesian Partially Identified Models for Sequence Count Data
Methodology
Finds real changes in tiny cell parts.
Bayesian Hierarchical Invariant Prediction
Machine Learning (CS)
Finds what truly causes things, even with lots of data.