IBoxCLA: Towards Robust Box-supervised Segmentation of Polyp via Improved Box-dice and Contrastive Latent-anchors
Zhiwei Wang, Qiang Hu, Hongkuan Shi, Li He, Man He, Wenxuan Dai,, Yinjiao Tian, Xin Yang, Mei Liu, and Qiang Li

TL;DR
This paper introduces IBoxCLA, a novel box-supervised polyp segmentation method that decouples location and shape learning, utilizing shape decoupling and contrastive latent anchors to improve boundary accuracy and robustness.
Contribution
It proposes IBox and CLA modules that enhance box-supervised segmentation by disentangling shape and location learning, leading to superior performance over existing methods.
Findings
Achieves at least 6.5% higher mDice compared to recent methods.
Attains at least 7.5% higher mIoU over state-of-the-art box-supervised approaches.
Demonstrates competitive results with fully-supervised polyp segmentation models.
Abstract
Box-supervised polyp segmentation attracts increasing attention for its cost-effective potential. Existing solutions often rely on learning-free methods or pretrained models to laboriously generate pseudo masks, triggering Dice constraint subsequently. In this paper, we found that a model guided by the simplest box-filled masks can accurately predict polyp locations/sizes, but suffers from shape collapsing. In response, we propose two innovative learning fashions, Improved Box-dice (IBox) and Contrastive Latent-Anchors (CLA), and combine them to train a robust box-supervised model IBoxCLA. The core idea behind IBoxCLA is to decouple the learning of location/size and shape, allowing for focused constraints on each of them. Specifically, IBox transforms the segmentation map into a proxy map using shape decoupling and confusion-region swapping sequentially. Within the proxy map, shapes are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques
