Improve Power of Knockoffs with Annotation Information of Covariates
By: Xiangyu Zhang , Lijun Wang , Changjun Li and more
Potential Business Impact:
Finds important genes more accurately.
Genome-wide association studies (GWAS) often find association signals between many genetic variants and traits of interest in a genomic region. Functional annotations of these variants provide valuable prior information that helps prioritize biologically relevant variants and enhances the power to detect causal variants. However, due to substantial correlations among these variants, a critical question is how to rigorously control the false discovery rate while effectively leveraging prior knowledge. We introduce annotation-informed knockoffs (AnnoKn), a knockoff-based method that performs annotation-informed variable selection with strict control of the false discovery rate. AnnoKn integrates the knockoff procedure with adaptive Lasso regression to evaluate the importance of multiple covariates while incorporating functional annotation information within a unified Bayesian framework. To facilitate real-world applications where individual-level data are not accessible, we further extend AnnoKn to operate on summary statistics. Through simulations and real-world applications to GTEx and GWAS datasets, we show that AnnoKn achieves superior power in detecting causal genetic variants compared with existing annotation-informed variable selection methods, while maintaining valid control over false discoveries.
Similar Papers
Knockoffs for low dimensions: changing the nominal level post-hoc to gain power while controlling the FDR
Methodology
Finds hidden patterns more reliably in data.
Confirmatory Biomarker Identification via Derandomized Knockoffs for Cox Regression with k-FWER Control
Methodology
Finds important health clues for better survival.
Novel Knockoff Generation and Importance Measures with Heterogeneous Data via Conditional Residuals and Local Gradients
Methodology
Finds important data in messy, mixed-up information.