Generating Querying Code from Text for Multi-Modal Electronic Health Record
By: Mengliang ZHang
Potential Business Impact:
Lets doctors find patient info easily.
Electronic health records (EHR) contain extensive structured and unstructured data, including tabular information and free-text clinical notes. Querying relevant patient information often requires complex database operations, increasing the workload for clinicians. However, complex table relationships and professional terminology in EHRs limit the query accuracy. In this work, we construct a publicly available dataset, TQGen, that integrates both \textbf{T}ables and clinical \textbf{T}ext for natural language-to-query \textbf{Gen}eration. To address the challenges posed by complex medical terminology and diverse types of questions in EHRs, we propose TQGen-EHRQuery, a framework comprising a medical knowledge module and a questions template matching module. For processing medical text, we introduced the concept of a toolset, which encapsulates the text processing module as a callable tool, thereby improving processing efficiency and flexibility. We conducted extensive experiments to assess the effectiveness of our dataset and workflow, demonstrating their potential to enhance information querying in EHR systems.
Similar Papers
Patient-level Information Extraction by Consistent Integration of Textual and Tabular Evidence with Bayesian Networks
Artificial Intelligence
Helps doctors understand patient health better.
Towards Visualizing Electronic Medical Records via Natural Language Queries
Databases
Lets doctors see patient health from words.
TempoQL: A Readable, Precise, and Portable Query System for Electronic Health Record Data
Human-Computer Interaction
Makes it easy to find health data for AI.