First-frame Supervised Video Polyp Segmentation via Propagative and Semantic Dual-teacher Network
Qiang Hu, Mei Liu, Qiang Li, Zhiwei Wang

TL;DR
This paper introduces a novel first-frame supervised video polyp segmentation method that significantly reduces annotation costs by using a dual-teacher framework to generate pseudo labels, achieving competitive results with minimal supervision.
Contribution
The paper proposes a new task, FSVPS, and a dual-teacher network that effectively combines propagative and semantic pseudo labels to improve segmentation with only a single annotated frame.
Findings
Achieves competitive performance on SUN-SEG dataset.
Outperforms sparse-frame supervised methods by at least 4.5% in Dice score.
Reduces annotation effort to just one frame per video.
Abstract
Automatic video polyp segmentation plays a critical role in gastrointestinal cancer screening, but the cost of frameby-frame annotations is prohibitively high. While sparse-frame supervised methods have reduced this burden proportionately, the cost remains overwhelming for long-duration videos and large-scale datasets. In this paper, we, for the first time, reduce the annotation cost to just a single frame per polyp video, regardless of the video's length. To this end, we introduce a new task, First-Frame Supervised Video Polyp Segmentation (FSVPS), and propose a novel Propagative and Semantic Dual-Teacher Network (PSDNet). Specifically, PSDNet adopts a teacher-student framework but employs two distinct types of teachers: the propagative teacher and the semantic teacher. The propagative teacher is a universal object tracker that propagates the first-frame annotation to subsequent frames…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques
MethodsALIGN
