SafeMed-R1: Adversarial Reinforcement Learning for Generalizable and Robust Medical Reasoning in Vision-Language Models

Published: December 22, 2025 | arXiv ID: 2512.19317v1

By: A. A. Gde Yogi Pramana , Jason Ray , Anthony Jaya and more

Vision-Language Models (VLMs) show significant promise for Medical Visual Question Answering (VQA), yet their deployment in clinical settings is hindered by severe vulnerability to adversarial attacks. Standard adversarial training, while effective for simpler tasks, often degrades both generalization performance and the quality of generated clinical reasoning. We introduce SafeMed-R1, a hybrid defense framework designed to maintain robust performance while preserving high-quality, interpretable medical reasoning. SafeMed-R1 employs a two-stage approach: at training time, we integrate Adversarial Training with Group Relative Policy Optimization (AT-GRPO) to explicitly robustify the reasoning process against worst-case perturbations; at inference time, we augment the model with Randomized Smoothing to provide certified $L_2$-norm robustness guarantees. We evaluate SafeMed-R1 on the OmniMedVQA benchmark across eight medical imaging modalities comprising over 88,000 samples. Our experiments reveal that standard fine-tuned VLMs, despite achieving 95% accuracy on clean inputs, collapse to approximately 25% accuracy under PGD attacks. In contrast, SafeMed-R1 maintains 84.45% accuracy under the same adversarial conditions, an improvement in robustness of roughly 59 percentage points. Furthermore, we demonstrate that models trained with explicit chain-of-thought reasoning exhibit superior adversarial robustness compared to instruction-only variants, suggesting a synergy between interpretability and security in medical AI systems.
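The two stages named in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's formulation, so everything here is an assumption: the function names are hypothetical, the advantage is the usual group-normalized reward associated with GRPO, the certified radius is the standard Gaussian-smoothing form sigma * Phi^{-1}(p_lower), and a simple Hoeffding bound stands in for the tighter Clopper-Pearson bound that is more commonly used. The classifier is a toy function on a two-element list, not a VLM.

```python
import math
import random
from collections import Counter
from statistics import NormalDist, mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """GRPO-style advantage (assumed form): normalize each reward
    against the mean and std of its own sampled group."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


def smoothed_predict(classify, x, sigma, n_samples, rng):
    """Majority vote of `classify` over Gaussian-perturbed copies of x.
    Returns the winning label and its empirical vote fraction."""
    votes = Counter()
    for _ in range(n_samples):
        noisy = [xi + rng.gauss(0.0, sigma) for xi in x]
        votes[classify(noisy)] += 1
    label, count = votes.most_common(1)[0]
    return label, count / n_samples


def hoeffding_lower_bound(p_hat, n, alpha=0.001):
    """One-sided lower confidence bound on the top-class probability.
    Hoeffding is a simplification chosen for this sketch."""
    return max(0.0, p_hat - math.sqrt(math.log(1.0 / alpha) / (2.0 * n)))


def certified_l2_radius(p_lower, sigma):
    """L2 radius sigma * Phi^{-1}(p_lower); 0.0 means abstain."""
    if p_lower <= 0.5:
        return 0.0
    p_lower = min(p_lower, 1.0 - 1e-12)  # inv_cdf needs 0 < p < 1
    return sigma * NormalDist().inv_cdf(p_lower)


if __name__ == "__main__":
    # Toy stand-in for a model: label 1 if the features sum to > 0.
    toy_classifier = lambda v: int(sum(v) > 0)
    rng = random.Random(0)
    label, frac = smoothed_predict(toy_classifier, [2.0, 2.0], 0.5, 200, rng)
    p_low = hoeffding_lower_bound(frac, 200)
    print("label:", label, "radius:", certified_l2_radius(p_low, 0.5))
    print("advantages:", group_relative_advantages([1.0, 0.0, 0.0, 1.0]))
```

The radius is computed from a lower confidence bound rather than the raw vote fraction because the raw fraction can overstate how far the input really is from the decision boundary.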

Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models

CV and Pattern Recognition

Helps doctors understand X-rays better and faster.

18 Mar 2025

SaFeR-VLM: Toward Safety-aware Fine-grained Reasoning in Multimodal Models

Machine Learning (CS)

Makes AI safer by teaching it to think carefully.

8 Oct 2025

RARL: Improving Medical VLM Reasoning and Generalization with Reinforcement Learning and LoRA under Data and Hardware Constraints

CV and Pattern Recognition

Helps doctors understand medical pictures better.

7 Jun 2025
