Score: 0

CNSight: Evaluation of Clinical Note Segmentation Tools

Published: December 28, 2025 | arXiv ID: 2512.22795v1

By: Risha Surana , Adrian Law , Sunwoo Kim and more

Potential Business Impact:

Organizes doctor's notes for better health insights.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Clinical notes are often stored in unstructured or semi-structured formats after extraction from electronic medical record (EMR) systems, which complicates their use for secondary analysis and downstream clinical applications. Reliable identification of section boundaries is a key step toward structuring these notes, as sections such as history of present illness, medications, and discharge instructions each provide distinct clinical contexts. In this work, we evaluate rule-based baselines, domain-specific transformer models, and large language models for clinical note segmentation using a curated dataset of 1,000 notes from MIMIC-IV. Our experiments show that large API-based models achieve the best overall performance, with GPT-5-mini reaching a best average F1 of 72.4 across sentence-level and freetext segmentation. Lightweight baselines remain competitive on structured sentence-level tasks but falter on unstructured freetext. Our results provide guidance for method selection and lay the groundwork for downstream tasks such as information extraction, cohort identification, and automated summarization.