TinySAM: Pushing the Envelope for Efficient Segment Anything Model
Han Shu, Wenshuo Li, Yehui Tang, Yiman Zhang, Yihao Chen, Houqiang Li,, Yunhe Wang, Xinghao Chen

TL;DR
TinySAM is a highly efficient, lightweight segmentation model that maintains strong zero-shot performance by employing knowledge distillation, quantization, and hierarchical strategies, significantly reducing computational costs for edge devices.
Contribution
The paper introduces TinySAM, a novel framework combining knowledge distillation, quantization, and hierarchical segmentation to create a lightweight, efficient segmentation model with minimal performance loss.
Findings
Orders of magnitude reduction in computational cost.
Accelerates inference by 2x with minimal performance impact.
Outperforms existing methods on zero-shot transfer tasks.
Abstract
Recently segment anything model (SAM) has shown powerful segmentation capability and has drawn great attention in computer vision fields. Massive following works have developed various applications based on the pre-trained SAM and achieved impressive performance on downstream vision tasks. However, SAM consists of heavy architectures and requires massive computational capacity, which hinders the further application of SAM on computation constrained edge devices. To this end, in this paper we propose a framework to obtain a tiny segment anything model (TinySAM) while maintaining the strong zero-shot performance. We first propose a full-stage knowledge distillation method with hard prompt sampling and hard mask weighting strategy to distill a lightweight student model. We also adapt the post-training quantization to the prompt-based segmentation task and further reduce the computational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection
MethodsKnowledge Distillation · Segment Anything Model
