Score: 0

KEEP: Integrating Medical Ontologies with Clinical Data for Robust Code Embeddings

Published: October 6, 2025 | arXiv ID: 2510.05049v1

By: Ahmed Elhussein , Paul Meddeb , Abigail Newbury and more

Potential Business Impact:

Helps doctors understand patient health better.

Business Areas:

Semantic Search Internet Services

Machine learning in healthcare requires effective representation of structured medical codes, but current methods face a trade off: knowledge graph based approaches capture formal relationships but miss real world patterns, while data driven methods learn empirical associations but often overlook structured knowledge in medical terminologies. We present KEEP (Knowledge preserving and Empirically refined Embedding Process), an efficient framework that bridges this gap by combining knowledge graph embeddings with adaptive learning from clinical data. KEEP first generates embeddings from knowledge graphs, then employs regularized training on patient records to adaptively integrate empirical patterns while preserving ontological relationships. Importantly, KEEP produces final embeddings without task specific auxiliary or end to end training enabling KEEP to support multiple downstream applications and model architectures. Evaluations on structured EHR from UK Biobank and MIMIC IV demonstrate that KEEP outperforms both traditional and Language Model based approaches in capturing semantic relationships and predicting clinical outcomes. Moreover, KEEP's minimal computational requirements make it particularly suitable for resource constrained environments.

Enhancing Omics Cohort Discovery for Research on Neurodegeneration through Ontology-Augmented Embedding Models

Computation and Language

Organizes brain disease data for faster research.

16 Jun 2025 1

87%

A Systematic Evaluation of Knowledge Graph Embeddings for Gene-Disease Association Prediction

Machine Learning (CS)

Finds new cures by connecting genes and sickness.

11 Apr 2025 1

87%

Ontology-Aligned Embeddings for Data-Driven Labour Market Analytics

Machine Learning (CS)

Helps computers understand job titles from anywhere.

5 Sep 2025 0

View PDF Login to Bookmark

Country of Origin

🇺🇸 United States

Page Count

20 pages

KEEP: Integrating Medical Ontologies with Clinical Data for Robust Code Embeddings

Helps doctors understand patient health better.

Technical Abstract

Enhancing Omics Cohort Discovery for Research on Neurodegeneration through Ontology-Augmented Embedding Models

A Systematic Evaluation of Knowledge Graph Embeddings for Gene-Disease Association Prediction

Ontology-Aligned Embeddings for Data-Driven Labour Market Analytics