Score: 0

AMRG: Extend Vision Language Models for Automatic Mammography Report Generation

Published: August 12, 2025 | arXiv ID: 2508.09225v1

By: Nak-Jun Sung , Donghyun Lee , Bo Hwa Choi and more

Potential Business Impact:

Helps doctors write breast cancer reports faster.

Mammography report generation is a critical yet underexplored task in medical AI, characterized by challenges such as multiview image reasoning, high-resolution visual cues, and unstructured radiologic language. In this work, we introduce AMRG (Automatic Mammography Report Generation), the first end-to-end framework for generating narrative mammography reports using large vision-language models (VLMs). Building upon MedGemma-4B-it-a domain-specialized, instruction-tuned VLM-we employ a parameter-efficient fine-tuning (PEFT) strategy via Low-Rank Adaptation (LoRA), enabling lightweight adaptation with minimal computational overhead. We train and evaluate AMRG on DMID, a publicly available dataset of paired high-resolution mammograms and diagnostic reports. This work establishes the first reproducible benchmark for mammography report generation, addressing a longstanding gap in multimodal clinical AI. We systematically explore LoRA hyperparameter configurations and conduct comparative experiments across multiple VLM backbones, including both domain-specific and general-purpose models under a unified tuning protocol. Our framework demonstrates strong performance across both language generation and clinical metrics, achieving a ROUGE-L score of 0.5691, METEOR of 0.6152, CIDEr of 0.5818, and BI-RADS accuracy of 0.5582. Qualitative analysis further highlights improved diagnostic consistency and reduced hallucinations. AMRG offers a scalable and adaptable foundation for radiology report generation and paves the way for future research in multimodal medical AI.

EMRRG: Efficient Fine-Tuning Pre-trained X-ray Mamba Networks for Radiology Report Generation

CV and Pattern Recognition

Helps doctors write X-ray reports faster.

19 Oct 2025 1

90%

A Multimodal Multi-Agent Framework for Radiology Report Generation

Artificial Intelligence

Helps doctors write faster, more accurate patient reports.

14 May 2025 0

90%

MV-MLM: Bridging Multi-View Mammography and Language for Breast Cancer Diagnosis and Risk Prediction

CV and Pattern Recognition

Helps doctors find breast cancer faster.

30 Oct 2025 1

View PDF Login to Bookmark

Page Count

10 pages

AMRG: Extend Vision Language Models for Automatic Mammography Report Generation

Helps doctors write breast cancer reports faster.

Technical Abstract

EMRRG: Efficient Fine-Tuning Pre-trained X-ray Mamba Networks for Radiology Report Generation

A Multimodal Multi-Agent Framework for Radiology Report Generation

MV-MLM: Bridging Multi-View Mammography and Language for Breast Cancer Diagnosis and Risk Prediction