TempoQL: A Readable, Precise, and Portable Query System for Electronic Health Record Data

Published: November 12, 2025 | arXiv ID: 2511.09337v1

By: Ziyong Ma, Richard D. Boyce, Adam Perer, and more

Potential Business Impact:

Makes it easy to find health data for AI.

Business Areas:

Electronic Health Record (EHR), Health Care

Electronic health record (EHR) data is an essential data source for machine learning for health, but researchers and clinicians face steep barriers in extracting and validating EHR data for modeling. Existing tools incur trade-offs between expressivity and usability and are typically specialized to a single data standard, making it difficult to write temporal queries that are ready for modern model-building pipelines and adaptable to new datasets. This paper introduces TempoQL, a Python-based toolkit designed to lower these barriers. TempoQL provides a simple, human-readable language for temporal queries; support for multiple EHR data standards, including OMOP, MEDS, and others; and an interactive notebook-based query interface with optional large language model (LLM) authoring assistance. Through a performance evaluation and two use cases on different datasets, we demonstrate that TempoQL simplifies the creation of cohorts for machine learning while maintaining precision, speed, and reproducibility.

Generating Querying Code from Text for Multi-Modal Electronic Health Record

Information Retrieval

Lets doctors find patient info easily.

25 Nov 2025

Extracting OPQRST in Electronic Health Records using Large Language Models with Reasoning

Computation and Language

Helps doctors find important patient info faster.

2 Sep 2025

Reliable Curation of EHR Dataset via Large Language Models under Environmental Constraints

Databases

Lets doctors ask questions about patient data.

2 Nov 2025

Country of Origin

🇺🇸 United States

Page Count

24 pages
