NAS-LoRA: Empowering Parameter-Efficient Fine-Tuning for Visual Foundation Models with Searchable Adaptation
Renqi Chen, Haoyang Su, Shixiang Tang

TL;DR
NAS-LoRA introduces a neural architecture search-based parameter-efficient fine-tuning method for SAM, enhancing adaptation to specialized domains by integrating inductive bias and optimizing semantic learning with reduced training costs.
Contribution
This paper presents NAS-LoRA, a novel PEFT approach that incorporates NAS to dynamically optimize model priors, improving domain adaptation of SAM with stage-wise training strategies.
Findings
Improves adaptation performance of SAM in specialized domains.
Reduces training cost by 24.14% without increasing inference cost.
Enhances high-level semantic learning through NAS-guided architecture optimization.
Abstract
The Segment Anything Model (SAM) has emerged as a powerful visual foundation model for image segmentation. However, adapting SAM to specific downstream tasks, such as medical and agricultural imaging, remains a significant challenge. To address this, Low-Rank Adaptation (LoRA) and its variants have been widely employed to enhancing SAM's adaptation performance on diverse domains. Despite advancements, a critical question arises: can we integrate inductive bias into the model? This is particularly relevant since the Transformer encoder in SAM inherently lacks spatial priors within image patches, potentially hindering the acquisition of high-level semantic information. In this paper, we propose NAS-LoRA, a new Parameter-Efficient Fine-Tuning (PEFT) method designed to bridge the semantic gap between pre-trained SAM and specialized domains. Specifically, NAS-LoRA incorporates a lightweight…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
