From SAM to DINOv2: Towards Distilling Foundation Models to Lightweight Baselines for Generalized Polyp Segmentation
Shivanshu Agnihotri, Snehashis Majhi, Deepak Ranjan Nayak, Debesh Jha

TL;DR
This paper introduces Polyp-DiFoM, a distillation framework that transfers foundation model knowledge into lightweight segmentation models, significantly improving polyp segmentation accuracy and efficiency in medical imaging.
Contribution
The paper presents a novel distillation method that infuses foundation model representations into lightweight models for improved polyp segmentation performance.
Findings
Polyp-DiFoM outperforms baseline models and state-of-the-art methods.
Achieves nearly 9 times reduction in computational cost.
Effective across five benchmark datasets.
Abstract
Accurate polyp segmentation during colonoscopy is critical for the early detection of colorectal cancer and still remains challenging due to significant size, shape, and color variations, and the camouflaged nature of polyps. While lightweight baseline models such as U-Net, U-Net++, and PraNet offer advantages in terms of easy deployment and low computational cost, they struggle to deal with the above issues, leading to limited segmentation performance. In contrast, large-scale vision foundation models such as SAM, DINOv2, OneFormer, and Mask2Former have exhibited impressive generalization performance across natural image domains. However, their direct transfer to medical imaging tasks (e.g., colonoscopic polyp segmentation) is not straightforward, primarily due to the scarcity of large-scale datasets and lack of domain-specific knowledge. To bridge this gap, we propose a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColorectal Cancer Screening and Detection · COVID-19 diagnosis using AI · Advanced Neural Network Applications
