SuperSAM: Crafting a SAM Supernetwork via Structured Pruning and Unstructured Parameter Prioritization
Waqwoya Abebe, Sadegh Jafari, Sixing Yu, Akash Dutta, Jan Strube,, Nathan R. Tallent, Luanzheng Guo, Pablo Munoz, Ali Jannesari

TL;DR
This paper introduces SuperSAM, a novel method for designing a supernetwork from the Segment Anything Model (SAM) using structured pruning and parameter prioritization, enabling efficient subnetwork discovery that outperforms the original model.
Contribution
It presents a new search space design strategy for Vision Transformers by converting SAM into a supernetwork with automated pruning and prioritization, improving NAS efficiency.
Findings
Subnetworks are 30-70% smaller than original SAM.
Discovered subnetworks outperform pretrained models.
Automated search space design enhances NAS for ViT.
Abstract
Neural Architecture Search (NAS) is a powerful approach of automating the design of efficient neural architectures. In contrast to traditional NAS methods, recently proposed one-shot NAS methods prove to be more efficient in performing NAS. One-shot NAS works by generating a singular weight-sharing supernetwork that acts as a search space (container) of subnetworks. Despite its achievements, designing the one-shot search space remains a major challenge. In this work we propose a search space design strategy for Vision Transformer (ViT)-based architectures. In particular, we convert the Segment Anything Model (SAM) into a weight-sharing supernetwork called SuperSAM. Our approach involves automating the search space design via layer-wise structured pruning and parameter prioritization. While the structured pruning applies probabilistic removal of certain transformer layers, parameter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModular Robots and Swarm Intelligence · Advanced Surface Polishing Techniques · Space Satellite Systems and Control
MethodsAttention Is All You Need · Absolute Position Encodings · Adam · Residual Connection · Dropout · Softmax · Byte Pair Encoding · Linear Layer · Vision Transformer · Multi-Head Attention
