Score: 0

Visual question answering-based image-finding generation for pulmonary nodules on chest CT from structured annotations

Published: January 16, 2026 | arXiv ID: 2601.11075v1

By: Maiko Nagao , Kaito Urata , Atsushi Teramoto and more

Potential Business Impact:

Helps doctors describe lung spots from scans.

Business Areas:

Image Recognition Data and Analytics, Software

Interpretation of imaging findings based on morphological characteristics is important for diagnosing pulmonary nodules on chest computed tomography (CT) images. In this study, we constructed a visual question answering (VQA) dataset from structured data in an open dataset and investigated an image-finding generation method for chest CT images, with the aim of enabling interactive diagnostic support that presents findings based on questions that reflect physicians' interests rather than fixed descriptions. In this study, chest CT images included in the Lung Image Database Consortium and Image Database Resource Initiative (LIDC-IDRI) datasets were used. Regions of interest surrounding the pulmonary nodules were extracted from these images, and image findings and questions were defined based on morphological characteristics recorded in the database. A dataset comprising pairs of cropped images, corresponding questions, and image findings was constructed, and the VQA model was fine-tuned on it. Language evaluation metrics such as BLEU were used to evaluate the generated image findings. The VQA dataset constructed using the proposed method contained image findings with natural expressions as radiological descriptions. In addition, the generated image findings showed a high CIDEr score of 3.896, and a high agreement with the reference findings was obtained through evaluation based on morphological characteristics. We constructed a VQA dataset for chest CT images using structured information on the morphological characteristics from the LIDC-IDRI dataset. Methods for generating image findings in response to these questions have also been investigated. Based on the generated results and evaluation metric scores, the proposed method was effective as an interactive diagnostic support system that can present image findings according to physicians' interests.

Generation of Chest CT pulmonary Nodule Images by Latent Diffusion Models using the LIDC-IDRI Dataset

Image and Video Processing

Creates fake CT scans to train medical AI.

16 Jan 2026 0

89%

RadImageNet-VQA: A Large-Scale CT and MRI Dataset for Radiologic Visual Question Answering

CV and Pattern Recognition

Teaches computers to answer questions about medical scans.

19 Dec 2025 1

89%

VICCA: Visual Interpretation and Comprehension of Chest X-ray Anomalies in Generated Report Without Human Feedback

CV and Pattern Recognition

Helps AI explain X-ray pictures better.

29 Jan 2025 1

View PDF Login to Bookmark

Page Count

9 pages

Visual question answering-based image-finding generation for pulmonary nodules on chest CT from structured annotations

Helps doctors describe lung spots from scans.

Technical Abstract

Generation of Chest CT pulmonary Nodule Images by Latent Diffusion Models using the LIDC-IDRI Dataset

RadImageNet-VQA: A Large-Scale CT and MRI Dataset for Radiologic Visual Question Answering

VICCA: Visual Interpretation and Comprehension of Chest X-ray Anomalies in Generated Report Without Human Feedback