Score: 2

Syn-STARTS: Synthesized START Triage Scenario Generation Framework for Scalable LLM Evaluation

Published: November 18, 2025 | arXiv ID: 2511.14023v1

By: Chiharu Hagiwara , Naoki Nonaka , Yuhta Hashimoto and more

Potential Business Impact:

Creates fake emergency cases for training AI doctors.

Business Areas:

Machine Learning Artificial Intelligence, Data and Analytics, Software

Triage is a critically important decision-making process in mass casualty incidents (MCIs) to maximize victim survival rates. While the role of AI in such situations is gaining attention for making optimal decisions within limited resources and time, its development and performance evaluation require benchmark datasets of sufficient quantity and quality. However, MCIs occur infrequently, and sufficient records are difficult to accumulate at the scene, making it challenging to collect large-scale realworld data for research use. Therefore, we developed Syn-STARTS, a framework that uses LLMs to generate triage cases, and verified its effectiveness. The results showed that the triage cases generated by Syn-STARTS were qualitatively indistinguishable from the TRIAGE open dataset generated by manual curation from training materials. Furthermore, when evaluating the LLM accuracy using hundreds of cases each from the green, yellow, red, and black categories defined by the standard triage method START, the results were found to be highly stable. This strongly indicates the possibility of synthetic data in developing high-performance AI models for severe and critical medical situations.