SuperSAM: Crafting a SAM Supernetwork via Structured Pruning and   Unstructured Parameter Prioritization

Waqwoya Abebe; Sadegh Jafari; Sixing Yu; Akash Dutta; Jan Strube,; Nathan R. Tallent; Luanzheng Guo; Pablo Munoz; Ali Jannesari

arXiv:2501.08504·cs.CV·January 16, 2025

SuperSAM: Crafting a SAM Supernetwork via Structured Pruning and Unstructured Parameter Prioritization

Waqwoya Abebe, Sadegh Jafari, Sixing Yu, Akash Dutta, Jan Strube,, Nathan R. Tallent, Luanzheng Guo, Pablo Munoz, Ali Jannesari

PDF

Open Access 1 Repo

TL;DR

This paper introduces SuperSAM, a novel method for designing a supernetwork from the Segment Anything Model (SAM) using structured pruning and parameter prioritization, enabling efficient subnetwork discovery that outperforms the original model.

Contribution

It presents a new search space design strategy for Vision Transformers by converting SAM into a supernetwork with automated pruning and prioritization, improving NAS efficiency.

Findings

01

Subnetworks are 30-70% smaller than original SAM.

02

Discovered subnetworks outperform pretrained models.

03

Automated search space design enhances NAS for ViT.

Abstract

Neural Architecture Search (NAS) is a powerful approach of automating the design of efficient neural architectures. In contrast to traditional NAS methods, recently proposed one-shot NAS methods prove to be more efficient in performing NAS. One-shot NAS works by generating a singular weight-sharing supernetwork that acts as a search space (container) of subnetworks. Despite its achievements, designing the one-shot search space remains a major challenge. In this work we propose a search space design strategy for Vision Transformer (ViT)-based architectures. In particular, we convert the Segment Anything Model (SAM) into a weight-sharing supernetwork called SuperSAM. Our approach involves automating the search space design via layer-wise structured pruning and parameter prioritization. While the structured pruning applies probabilistic removal of certain transformer layers, parameter…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pnnl/supersam
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModular Robots and Swarm Intelligence · Advanced Surface Polishing Techniques · Space Satellite Systems and Control

MethodsAttention Is All You Need · Absolute Position Encodings · Adam · Residual Connection · Dropout · Softmax · Byte Pair Encoding · Linear Layer · Vision Transformer · Multi-Head Attention