Score: 1

OncoReason: Structuring Clinical Reasoning in LLMs for Robust and Interpretable Survival Prediction

Published: October 20, 2025 | arXiv ID: 2510.17532v1

By: Raghu Vamshi Hemadri , Geetha Krishna Guruju , Kristi Topollai and more

Potential Business Impact:

Helps doctors predict cancer treatment success.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Predicting cancer treatment outcomes requires models that are both accurate and interpretable, particularly in the presence of heterogeneous clinical data. While large language models (LLMs) have shown strong performance in biomedical NLP, they often lack structured reasoning capabilities critical for high-stakes decision support. We present a unified, multi-task learning framework that aligns autoregressive LLMs with clinical reasoning for outcome prediction on the MSK-CHORD dataset. Our models are trained to jointly perform binary survival classification, continuous survival time regression, and natural language rationale generation. We evaluate three alignment strategies: (1) standard supervised fine-tuning (SFT), (2) SFT with Chain-of-Thought (CoT) prompting to elicit step-by-step reasoning, and (3) Group Relative Policy Optimization (GRPO), a reinforcement learning method that aligns model outputs to expert-derived reasoning trajectories. Experiments with LLaMa3-8B and Med42-8B backbones demonstrate that CoT prompting improves F1 by +6.0 and reduces MAE by 12%, while GRPO achieves state-of-the-art interpretability and predictive performance across BLEU, ROUGE, and BERTScore. We further show that existing biomedical LLMs often fail to produce valid reasoning traces due to architectural constraints. Our findings underscore the importance of reasoning-aware alignment in multi-task clinical modeling and set a new benchmark for interpretable, trustworthy LLMs in precision oncology.

Training and Evaluation of Guideline-Based Medical Reasoning in LLMs

Computation and Language

Teaches computers to explain medical decisions like doctors.

3 Dec 2025 1

91%

Cognitive-Mental-LLM: Evaluating Reasoning in Large Language Models for Mental Health Prediction via Online Text

Computation and Language

Helps computers understand mental health from online words.

13 Mar 2025 1

91%

Large Language Models with Temporal Reasoning for Longitudinal Clinical Summarization and Prediction

Computation and Language

Helps doctors quickly understand patient history.

30 Jan 2025 1

View PDF Login to Bookmark

Country of Origin

🇺🇸 United States

Page Count

16 pages

OncoReason: Structuring Clinical Reasoning in LLMs for Robust and Interpretable Survival Prediction

Helps doctors predict cancer treatment success.

Technical Abstract

Training and Evaluation of Guideline-Based Medical Reasoning in LLMs

Cognitive-Mental-LLM: Evaluating Reasoning in Large Language Models for Mental Health Prediction via Online Text

Large Language Models with Temporal Reasoning for Longitudinal Clinical Summarization and Prediction