Automated HIV Screening on Dutch EHR with Large Language Models
By: Lang Zhou , Amrish Jhingoer , Yinghao Luo and more
Potential Business Impact:
Finds people who need HIV tests from notes.
Efficient screening and early diagnosis of HIV are critical for reducing onward transmission. Although large scale laboratory testing is not feasible, the widespread adoption of Electronic Health Records (EHRs) offers new opportunities to address this challenge. Existing research primarily focuses on applying machine learning methods to structured data, such as patient demographics, for improving HIV diagnosis. However, these approaches often overlook unstructured text data such as clinical notes, which potentially contain valuable information relevant to HIV risk. In this study, we propose a novel pipeline that leverages a Large Language Model (LLM) to analyze unstructured EHR text and determine a patient's eligibility for further HIV testing. Experimental results on clinical data from Erasmus University Medical Center Rotterdam demonstrate that our pipeline achieved high accuracy while maintaining a low false negative rate.
Similar Papers
Integrating Large Language Models with Human Expertise for Disease Detection in Electronic Health Records
Computation and Language
Helps doctors find patient sicknesses faster.
Structured Semantics from Unstructured Notes: Language Model Approaches to EHR-Based Decision Support
Information Retrieval
Helps doctors understand patient notes better.
Balancing Natural Language Processing Accuracy and Normalisation in Extracting Medical Insights
Artificial Intelligence
Helps doctors find patient info faster.