UW-BioNLP at ChemoTimelines 2025: Thinking, Fine-Tuning, and Dictionary-Enhanced LLM Systems for Chemotherapy Timeline Extraction
By: Tianmai M. Zhang, Zhaoyi Sun, Sihang Zeng and more
Potential Business Impact:
Helps doctors track cancer medicine history.
The ChemoTimelines shared task benchmarks methods for constructing timelines of systemic anticancer treatment from the electronic health records of cancer patients. This paper describes our methods, results, and findings for subtask 2: generating patient chemotherapy timelines from raw clinical notes. We evaluated strategies involving chain-of-thought thinking, supervised fine-tuning, direct preference optimization, and dictionary-based lookup to improve timeline extraction. All of our approaches followed a two-step workflow, in which an LLM first extracted chemotherapy events from individual clinical notes, and an algorithm then normalized and aggregated the events into patient-level timelines. The methods differed in how the LLM was used and trained. Multiple approaches yielded competitive performance on the test set leaderboard, with fine-tuned Qwen3-14B achieving the best official score of 0.678. Our results and analyses could provide useful insights for future work on this task as well as for the design of similar tasks.
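The second step of the workflow described above (normalizing and aggregating note-level events into patient-level timelines) can be illustrated with a minimal sketch. This is not the authors' algorithm; it only shows the general shape of such a step. It assumes each extracted event is a (drug, relation, date) triple with ISO-formatted dates, and the names `DRUG_SYNONYMS`, `normalize_event`, and `aggregate_timelines`, as well as the tiny synonym table, are hypothetical.

```python
from collections import defaultdict

# Hypothetical synonym table (assumption): a real system would rely on a
# curated drug lexicon rather than two hand-written entries.
DRUG_SYNONYMS = {"taxol": "paclitaxel", "adriamycin": "doxorubicin"}


def normalize_event(event):
    """Map a (drug, relation, date) triple to a canonical lowercase form."""
    drug, relation, date = event
    drug = drug.strip().lower()
    drug = DRUG_SYNONYMS.get(drug, drug)
    return (drug, relation.strip().lower(), date.strip())


def aggregate_timelines(note_events):
    """Merge per-note events into deduplicated, sorted patient timelines.

    note_events: iterable of (patient_id, [(drug, relation, date), ...]),
    one entry per clinical note (assumed input shape).
    """
    timelines = defaultdict(set)
    for patient_id, events in note_events:
        for event in events:
            # A set drops duplicates that appear across notes.
            timelines[patient_id].add(normalize_event(event))
    # Sort by date, then drug, then relation for a stable output order.
    return {
        pid: sorted(events, key=lambda e: (e[2], e[0], e[1]))
        for pid, events in timelines.items()
    }


if __name__ == "__main__":
    notes = [
        ("p1", [("Taxol", "begins-on", "2013-01-05")]),
        ("p1", [("paclitaxel", "BEGINS-ON", "2013-01-05"),
                ("Adriamycin", "contains-1", "2013-02-01")]),
    ]
    for pid, timeline in aggregate_timelines(notes).items():
        print(pid, timeline)
```

In this sketch, the same event mentioned under a brand name in one note and a generic name in another collapses into a single timeline entry, which is the main reason for normalizing before aggregating.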
Similar Papers
A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports
Computation and Language
Helps doctors understand patient health timelines automatically.
Automated Extraction of Fluoropyrimidine Treatment and Treatment-Related Toxicities from Clinical Notes Using Natural Language Processing
Computation and Language
Finds cancer drug side effects in doctor's notes.
Integrating Text and Time-Series into (Large) Language Models to Predict Medical Outcomes
Computation and Language
Helps doctors understand patient health records better.