Score: 0

Adaptive Detector-Verifier Framework for Zero-Shot Polyp Detection in Open-World Settings

Published: December 13, 2025 | arXiv ID: 2512.12492v1

By: Shengkai Xu , Hsiang Lun Kao , Tianxiang Xu and more

Potential Business Impact:

Finds hidden polyps better in stomach pictures.

Business Areas:

Image Recognition Data and Analytics, Software

Polyp detectors trained on clean datasets often underperform in real-world endoscopy, where illumination changes, motion blur, and occlusions degrade image quality. Existing approaches struggle with the domain gap between controlled laboratory conditions and clinical practice, where adverse imaging conditions are prevalent. In this work, we propose AdaptiveDetector, a novel two-stage detector-verifier framework comprising a YOLOv11 detector with a vision-language model (VLM) verifier. The detector adaptively adjusts per-frame confidence thresholds under VLM guidance, while the verifier is fine-tuned with Group Relative Policy Optimization (GRPO) using an asymmetric, cost-sensitive reward function specifically designed to discourage missed detections -- a critical clinical requirement. To enable realistic assessment under challenging conditions, we construct a comprehensive synthetic testbed by systematically degrading clean datasets with adverse conditions commonly encountered in clinical practice, providing a rigorous benchmark for zero-shot evaluation. Extensive zero-shot evaluation on synthetically degraded CVC-ClinicDB and Kvasir-SEG images demonstrates that our approach improves recall by 14 to 22 percentage points over YOLO alone, while precision remains within 0.7 points below to 1.7 points above the baseline. This combination of adaptive thresholding and cost-sensitive reinforcement learning achieves clinically aligned, open-world polyp detection with substantially fewer false negatives, thereby reducing the risk of missed precancerous polyps and improving patient outcomes.

ACD-CLIP: Decoupling Representation and Dynamic Fusion for Zero-Shot Anomaly Detection

CV and Pattern Recognition

Finds weird things in pictures better.

11 Aug 2025 0

88%

Robust Object Detection with Pseudo Labels from VLMs using Per-Object Co-teaching

CV and Pattern Recognition

Teaches self-driving cars to see objects faster.

13 Nov 2025 0

87%

Architectural Co-Design for Zero-Shot Anomaly Detection: Decoupling Representation and Dynamically Fusing Features in CLIP

CV and Pattern Recognition

Finds hidden problems in pictures using words.

11 Aug 2025 0

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Page Count

13 pages

Adaptive Detector-Verifier Framework for Zero-Shot Polyp Detection in Open-World Settings

Finds hidden polyps better in stomach pictures.

Technical Abstract

ACD-CLIP: Decoupling Representation and Dynamic Fusion for Zero-Shot Anomaly Detection

Robust Object Detection with Pseudo Labels from VLMs using Per-Object Co-teaching

Architectural Co-Design for Zero-Shot Anomaly Detection: Decoupling Representation and Dynamically Fusing Features in CLIP