Adaptive Detector-Verifier Framework for Zero-Shot Polyp Detection in Open-World Settings
Shengkai Xu, Hsiang Lun Kao, Tianxiang Xu, Honghui Zhang, Junqiao Wang, Runmeng Ding, Guanyu Liu, Tianyu Shi, Zhenyu Yu, Guofeng Pan, Ziqian Bi, Yuqi Ouyang

TL;DR
This paper introduces AdaptiveDetector, a two-stage framework combining a YOLOv11 detector and a vision-language verifier, designed to improve zero-shot polyp detection in challenging real-world endoscopy conditions by adaptively adjusting confidence thresholds and using cost-sensitive reinforcement learning.
Contribution
The work presents a novel adaptive detector-verifier framework with a cost-sensitive verifier fine-tuned via reinforcement learning, addressing domain gaps in polyp detection under adverse conditions.
Findings
Improves recall by 14-22 percentage points over baseline YOLO.
Maintains precision within 0.7 to 1.7 points of baseline.
Demonstrates effectiveness in synthetic adverse conditions for zero-shot evaluation.
Abstract
Polyp detectors trained on clean datasets often underperform in real-world endoscopy, where illumination changes, motion blur, and occlusions degrade image quality. Existing approaches struggle with the domain gap between controlled laboratory conditions and clinical practice, where adverse imaging conditions are prevalent. In this work, we propose AdaptiveDetector, a novel two-stage detector-verifier framework comprising a YOLOv11 detector with a vision-language model (VLM) verifier. The detector adaptively adjusts per-frame confidence thresholds under VLM guidance, while the verifier is fine-tuned with Group Relative Policy Optimization (GRPO) using an asymmetric, cost-sensitive reward function specifically designed to discourage missed detections -- a critical clinical requirement. To enable realistic assessment under challenging conditions, we construct a comprehensive synthetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColorectal Cancer Screening and Detection · Advanced Neural Network Applications · AI in cancer detection
