Domain Adaptation Under MNAR Missingness
By: Tyrel Stokes, Hyungrok Do, Saul Blecker and more
Potential Business Impact:
Helps computers learn from messy health records.
Current domain adaptation methods under missingness shift are restricted to Missing At Random (MAR) missingness mechanisms. However, in many real-world examples, the MAR assumption may be too restrictive. When covariates are Missing Not At Random (MNAR) in both source and target data, the common covariate shift solutions, including importance weighting, are not directly applicable. We show that under reasonable assumptions, the problem of MNAR missingness shift can be reduced to an imputation problem. This allows us to leverage recent methodological developments in both the traditional statistics and machine/deep-learning literature for MNAR imputation to develop a novel domain adaptation procedure for MNAR missingness shift. We further show that our proposed procedure can be extended to handle simultaneous MNAR missingness and covariate shifts. We apply our procedure to Electronic Health Record (EHR) data from two hospitals in the southern and northeastern regions of the US. In this setting, we expect different hospital networks and regions to serve different populations and to have different procedures, practices, and software for inputting and recording data, causing simultaneous missingness and covariate shifts.
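To make the MAR versus MNAR distinction concrete, here is a minimal toy simulation. It is not the authors' procedure; every name and parameter in it (`run_demo`, the coefficients 0.8 and 1.5, the noise scale 0.6) is an assumption chosen for illustration. A covariate `x` is imputed by a regression on an always-observed covariate `z`, which is a standard MAR-style fix. When missingness depends only on `z` (MAR), the imputed mean of `x` should land close to the truth. When missingness depends on `x` itself (MNAR), the same fix should leave a clear bias, which is the kind of gap MNAR-specific imputation is meant to address.

```python
import math
import random


def sigmoid(t):
    return 1.0 / (1.0 + math.exp(-t))


def fit_line(zs, xs):
    """Ordinary least squares for x = intercept + slope * z."""
    n = len(zs)
    z_bar = sum(zs) / n
    x_bar = sum(xs) / n
    sxz = sum((z - z_bar) * (x - x_bar) for z, x in zip(zs, xs))
    szz = sum((z - z_bar) ** 2 for z in zs)
    slope = sxz / szz
    return x_bar - slope * z_bar, slope


def imputed_mean(xs, zs, missing):
    """Fit x ~ z on observed rows, fill missing x with predictions, average."""
    observed = [(z, x) for z, x, m in zip(zs, xs, missing) if not m]
    intercept, slope = fit_line([z for z, _ in observed],
                                [x for _, x in observed])
    filled = [intercept + slope * z if m else x
              for z, x, m in zip(zs, xs, missing)]
    return sum(filled) / len(filled)


def run_demo(n=20000, seed=0):
    """Hypothetical toy setup; all coefficients are illustrative assumptions."""
    rng = random.Random(seed)
    zs = [rng.gauss(0.0, 1.0) for _ in range(n)]          # always observed
    xs = [0.8 * z + rng.gauss(0.0, 0.6) for z in zs]      # subject to missingness
    # MAR: the chance that x is missing depends only on the observed z.
    miss_mar = [rng.random() < sigmoid(1.5 * z) for z in zs]
    # MNAR: the chance that x is missing depends on the value of x itself.
    miss_mnar = [rng.random() < sigmoid(1.5 * x) for x in xs]
    truth = sum(xs) / n
    return {
        "truth": truth,
        "mar_error": imputed_mean(xs, zs, miss_mar) - truth,
        "mnar_error": imputed_mean(xs, zs, miss_mnar) - truth,
    }


if __name__ == "__main__":
    for name, value in run_demo().items():
        print(f"{name}: {value:+.3f}")
```

In this sketch the MNAR mechanism preferentially hides large values of `x`, so the regression fitted on the observed rows underestimates `x` for the missing rows. Importance weighting runs into a related obstacle, since under MNAR the weights would have to depend on values that are not observed.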
Similar Papers
Robustness intervals for competing risks analysis with causes of failure missing not at random
Methodology
Makes medical studies more trustworthy with missing data.
Sensitivity analysis for nonignorable missing values in blended analysis framework: a study on the effect of bariatric surgery via electronic health records
Methodology
Fixes doctor records with missing info.
Causal View of Time Series Imputation: Some Identification Results on Missing Mechanism
Machine Learning (CS)
Fixes missing data in time records.