Post-selection Inference in Regression Models for Group Testing Data
By: Qinyan Shen, Karl Gregory, Xianzheng Huang
Potential Business Impact:
Fixes math for doctors with messy notes.
We develop methodology for valid inference after variable selection in logistic regression when the responses are partially observed, that is, when one observes a set of error-prone testing outcomes instead of the true values of the responses. Aiming at selecting important covariates while accounting for missing information in the response data, we apply the expectation-maximization algorithm to compute maximum likelihood estimators subject to LASSO penalization. Subsequent to variable selection, we make inferences on the selected covariate effects by extending post-selection inference methodology based on the polyhedral lemma. Empirical evidence from our extensive simulation study suggests that our post-selection inference results are more reliable than those from naive inference methods that use the same data to perform variable selection and inference without adjusting for variable selection.
Similar Papers
Time-to-event prediction for grouped variables using Exclusive Lasso
Methodology
Improves cancer survival predictions using patient data.
Inference post region selection
Statistics Theory
Finds important areas in images reliably.
Robust Inference Methods for Latent Group Panel Models under Possible Group Non-Separation
Econometrics
Finds hidden patterns in data to make better predictions.