A Multimodal Multi-Agent Framework for Radiology Report Generation
By: Ziruo Yi, Ting Xiao, Mark V. Albert
Potential Business Impact:
Helps doctors write faster, more accurate patient reports.
Radiology report generation (RRG) aims to automatically produce diagnostic reports from medical images, with the potential to enhance clinical workflows and reduce radiologists' workload. While recent approaches leveraging multimodal large language models (MLLMs) and retrieval-augmented generation (RAG) have achieved strong results, they continue to face challenges such as factual inconsistency, hallucination, and cross-modal misalignment. We propose a multimodal multi-agent framework for RRG that aligns with the stepwise clinical reasoning workflow, where task-specific agents handle retrieval, draft generation, visual analysis, refinement, and synthesis. Experimental results demonstrate that our approach outperforms a strong baseline in both automatic metrics and LLM-based evaluations, producing more accurate, structured, and interpretable reports. This work highlights the potential of clinically aligned multi-agent frameworks to support explainable and trustworthy clinical AI applications.
Similar Papers
Medical AI Consensus: A Multi-Agent Framework for Radiology Report Generation and Evaluation
Artificial Intelligence
Helps doctors write patient reports faster.
Leveraging LLMs for Multimodal Retrieval-Augmented Radiology Report Generation via Key Phrase Extraction
CV and Pattern Recognition
Helps doctors write X-ray reports faster.
MRGAgents: A Multi-Agent Framework for Improved Medical Report Generation with Med-LVLMs
Multiagent Systems
Helps doctors find hidden problems in X-rays.