A New Targeted-Federated Learning Framework for Estimating Heterogeneity of Treatment Effects: A Robust Framework with Applications in Aging Cohorts
By: Rong Zhao, Jason Falvey, Xu Shi and more
Potential Business Impact:
Finds how treatments work differently for people.
Analyzing data from multiple sources offers valuable opportunities to improve the estimation efficiency of causal estimands. However, this analysis also poses many challenges due to population heterogeneity and data privacy constraints. While several advanced methods for causal inference in federated settings have been developed in recent years, many focus on difference-based averaged causal effects and are not designed to study effect modification. In this study, we introduce a novel targeted-federated learning framework to study the heterogeneity of treatment effects (HTEs) for a targeted population by proposing a projection-based estimand. This HTE framework integrates information from multiple data sources without sharing raw data, while accounting for covariate distribution shifts among sources. Our proposed approach is shown to be doubly robust, conveniently supporting both difference-based estimands for continuous outcomes and odds ratio-based estimands for binary outcomes. Furthermore, we develop a communication-efficient bootstrap-based selection procedure to detect non-transportable data sources, thereby enhancing robust information aggregation without introducing bias. The superior performance of the proposed estimator over existing methods is demonstrated through extensive simulation studies, and the utility of our approach has been shown in a real-world data application using nationwide Medicare-linked data.
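To make the idea of a doubly robust, projection-based HTE estimand more concrete, here is a minimal single-source sketch in Python. It is not the authors' estimator: it omits federation, covariate-shift weighting, the odds ratio-based case, and the bootstrap selection step. It only illustrates one common construction, in which augmented inverse-probability-weighted (AIPW) pseudo-outcomes are regressed on a chosen effect modifier. All names (`fit_linear`, `predict_linear`, `projected_hte`), the simulated data, and the use of a known propensity score are assumptions made for illustration.

```python
import numpy as np

def fit_linear(X, y):
    """Ordinary least squares with an intercept; returns the coefficient vector."""
    Xd = np.column_stack([np.ones(len(X)), X])
    beta, *_ = np.linalg.lstsq(Xd, y, rcond=None)
    return beta

def predict_linear(beta, X):
    """Predictions from coefficients produced by fit_linear."""
    Xd = np.column_stack([np.ones(len(X)), X])
    return Xd @ beta

def projected_hte(X, A, Y, V, propensity):
    """Regress AIPW pseudo-outcomes on effect modifiers V (hypothetical helper)."""
    # Outcome regressions fitted separately in the treated and control arms.
    mu1 = predict_linear(fit_linear(X[A == 1], Y[A == 1]), X)
    mu0 = predict_linear(fit_linear(X[A == 0], Y[A == 0]), X)
    # AIPW pseudo-outcome: model-based contrast plus weighted residual corrections.
    phi = (mu1 - mu0
           + A * (Y - mu1) / propensity
           - (1 - A) * (Y - mu0) / (1 - propensity))
    # Linear projection of the pseudo-outcome onto V.
    return fit_linear(V, phi)

# Simulated data (assumed for illustration): true effect is 1 + 2 * X[:, 1].
rng = np.random.default_rng(0)
n = 20000
X = rng.normal(size=(n, 2))
propensity = 1.0 / (1.0 + np.exp(-0.5 * X[:, 0]))  # treated as known here
A = rng.binomial(1, propensity)
Y = X[:, 0] + A * (1.0 + 2.0 * X[:, 1]) + rng.normal(size=n)

beta = projected_hte(X, A, Y, V=X[:, [1]], propensity=propensity)
print("projection coefficients (intercept, slope):", beta)
```

In this simulation the recovered intercept and slope should land close to 1 and 2. The pseudo-outcome remains unbiased for the treatment contrast if either the outcome models or the propensity model is correct, which is the sense of "doubly robust" in this simplified setting.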
Similar Papers
Heterogeneity-Aware Federated Causal Inference Leveraging Effect-Measure Transportability
Methodology
Lets many computers learn together safely.
Federated Causal Inference in Healthcare: Methods, Challenges, and Applications
Machine Learning (CS)
Lets hospitals learn from each other's patient data safely.
Reliable Selection of Heterogeneous Treatment Effect Estimators
Machine Learning (Stat)
Finds the best way to treat each person.