Score: 1

MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities

Published: November 25, 2025 | arXiv ID: 2511.20650v1

By: Tooba Tehreem Sheikh , Jean Lahoud , Rao Muhammad Anwer and more

Potential Business Impact:

Finds new diseases in medical pictures.

Business Areas:

Image Recognition Data and Analytics, Software

Traditional object detection models in medical imaging operate within a closed-set paradigm, limiting their ability to detect objects of novel labels. Open-vocabulary object detection (OVOD) addresses this limitation but remains underexplored in medical imaging due to dataset scarcity and weak text-image alignment. To bridge this gap, we introduce MedROV, the first Real-time Open Vocabulary detection model for medical imaging. To enable open-vocabulary learning, we curate a large-scale dataset, Omnis, with 600K detection samples across nine imaging modalities and introduce a pseudo-labeling strategy to handle missing annotations from multi-source datasets. Additionally, we enhance generalization by incorporating knowledge from a large pre-trained foundation model. By leveraging contrastive learning and cross-modal representations, MedROV effectively detects both known and novel structures. Experimental results demonstrate that MedROV outperforms the previous state-of-the-art foundation model for medical image detection with an average absolute improvement of 40 mAP50, and surpasses closed-set detectors by more than 3 mAP50, while running at 70 FPS, setting a new benchmark in medical detection. Our source code, dataset, and trained model are available at https://github.com/toobatehreem/MedROV.

ODOV: Towards Open-Domain Open-Vocabulary Object Detection

CV and Pattern Recognition

Helps computers recognize any object anywhere.

2 Aug 2025 0

90%

OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations

CV and Pattern Recognition

Finds objects in 3D rooms without human labels.

27 Aug 2025 0

89%

Evaluating the Performance of Open-Vocabulary Object Detection in Low-quality Image

CV and Pattern Recognition

Helps computers see objects in blurry pictures.

28 Dec 2025 2

View PDF Login to Bookmark

Country of Origin

🇦🇪 United Arab Emirates

Page Count

11 pages

MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities

Finds new diseases in medical pictures.

Technical Abstract

ODOV: Towards Open-Domain Open-Vocabulary Object Detection

OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations

Evaluating the Performance of Open-Vocabulary Object Detection in Low-quality Image