ProtoEHR: Hierarchical Prototype Learning for EHR-based Healthcare Predictions
By: Zi Cai, Yu Liu, Zhiyao Luo and more
Potential Business Impact:
Helps doctors predict patient health better.
Digital healthcare systems have enabled the collection of large volumes of healthcare data in electronic health records (EHRs), allowing artificial intelligence solutions for various healthcare prediction tasks. However, existing studies often focus on isolated components of EHR data, limiting their predictive performance and interpretability. To address this gap, we propose ProtoEHR, an interpretable hierarchical prototype learning framework that fully exploits the rich, multi-level structure of EHR data to enhance healthcare predictions. More specifically, ProtoEHR models relationships within and across three hierarchical levels of EHRs: medical codes, hospital visits, and patients. We first leverage large language models to extract semantic relationships among medical codes and construct a medical knowledge graph as the knowledge source. Building on this, we design a hierarchical representation learning framework that captures contextualized representations across the three levels, while incorporating prototype information within each level to capture intrinsic similarities and improve generalization. For a comprehensive assessment, we evaluate ProtoEHR on two public datasets and five clinically significant tasks: mortality prediction, readmission prediction, length-of-stay prediction, drug recommendation, and phenotype prediction. The results show that ProtoEHR makes accurate, robust, and interpretable predictions compared to baselines in the literature. Furthermore, ProtoEHR offers interpretable insights at the code, visit, and patient levels to aid in healthcare prediction.
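The abstract does not specify how prototype information is combined with the representations at each level, so the following is only a minimal sketch of one common way to do it, not the authors' method. It assumes that each embedding attends over a small set of prototype vectors (softmax over negative squared distances) and that the weighted prototype summary is added back to the embedding; the function names `prototype_enhance` and `mean_pool`, the mean-pooling step between levels, and all numeric values are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def prototype_enhance(embedding, prototypes):
    """Add a similarity-weighted prototype summary to an embedding.

    Assumption: similarity is the negative squared Euclidean distance.
    Returns (enhanced_embedding, prototype_weights).
    """
    scores = [
        -sum((e - p) ** 2 for e, p in zip(embedding, proto))
        for proto in prototypes
    ]
    weights = softmax(scores)
    summary = [
        sum(w * proto[i] for w, proto in zip(weights, prototypes))
        for i in range(len(embedding))
    ]
    enhanced = [e + s for e, s in zip(embedding, summary)]
    return enhanced, weights


def mean_pool(vectors):
    """Average a non-empty list of equal-length vectors (assumed aggregation)."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]


# Hypothetical toy data: two visits, each a list of 2-d code embeddings.
code_prototypes = [[1.0, 0.0], [0.0, 1.0]]
visit_prototypes = [[2.0, 0.0], [0.0, 2.0]]
patient_visits = [
    [[0.9, 0.1], [0.8, 0.3]],
    [[0.1, 0.7], [0.2, 0.9]],
]

# Code level -> visit level -> patient level.
visit_vectors = []
for codes in patient_visits:
    enhanced_codes = [prototype_enhance(c, code_prototypes)[0] for c in codes]
    visit_vec, visit_weights = prototype_enhance(
        mean_pool(enhanced_codes), visit_prototypes
    )
    visit_vectors.append(visit_vec)
patient_vector = mean_pool(visit_vectors)
```

The prototype weights returned at each level are what would make such a scheme inspectable: they indicate which prototype a given code or visit most resembles under the assumed similarity measure.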
Similar Papers
Automated Hierarchical Graph Construction for Multi-source Electronic Health Records
Machine Learning (Stat)
Connects patient records from different hospitals.
Prototype Learning to Create Refined Interpretable Digital Phenotypes from ECGs
Machine Learning (CS)
Helps doctors spot sickness from heartbeats.
CEHR-XGPT: A Scalable Multi-Task Foundation Model for Electronic Health Records
Machine Learning (CS)
Helps doctors predict patient health using past records.