Medico 2025: Visual Question Answering for Gastrointestinal Imaging
By: Sushant Gautam , Vajira Thambawita , Michael Riegler and more
Potential Business Impact:
Helps doctors understand stomach pictures better.
The Medico 2025 challenge addresses Visual Question Answering (VQA) for Gastrointestinal (GI) imaging, organized as part of the MediaEval task series. The challenge focuses on developing Explainable Artificial Intelligence (XAI) models that answer clinically relevant questions based on GI endoscopy images while providing interpretable justifications aligned with medical reasoning. It introduces two subtasks: (1) answering diverse types of visual questions using the Kvasir-VQA-x1 dataset, and (2) generating multimodal explanations to support clinical decision-making. The Kvasir-VQA-x1 dataset, created from 6,500 images and 159,549 complex question-answer (QA) pairs, serves as the benchmark for the challenge. By combining quantitative performance metrics and expert-reviewed explainability assessments, this task aims to advance trustworthy Artificial Intelligence (AI) in medical image analysis. Instructions, data access, and an updated guide for participation are available in the official competition repository: https://github.com/simula/MediaEval-Medico-2025
Similar Papers
Multi-Task Learning for Visually Grounded Reasoning in Gastrointestinal VQA
CV and Pattern Recognition
Helps doctors understand medical images better.
Kvasir-VQA-x1: A Multimodal Dataset for Medical Reasoning and Robust MedVQA in Gastrointestinal Endoscopy
CV and Pattern Recognition
Helps doctors understand stomach camera images better.
MedXplain-VQA: Multi-Component Explainable Medical Visual Question Answering
CV and Pattern Recognition
Shows doctors why AI suggests a diagnosis.