NeuroAda: Activating Each Neuron's Potential for Parameter-Efficient Fine-Tuning
Zhi Zhang, Yixian Shen, Congfeng Cao, Ekaterina Shutova

TL;DR
NeuroAda introduces a parameter-efficient fine-tuning method that activates important neurons with bypass connections, achieving state-of-the-art results while significantly reducing memory usage across diverse NLP tasks.
Contribution
NeuroAda combines the strengths of addition-based and selective PEFT methods by activating key neurons with bypass connections, enabling fine-grained tuning with minimal parameters and memory.
Findings
Achieves state-of-the-art performance with ≤0.02% trainable parameters.
Reduces CUDA memory usage by up to 60%.
Effective across 23+ NLP tasks.
Abstract
Existing parameter-efficient fine-tuning (PEFT) methods primarily fall into two categories: addition-based and selective in-situ adaptation. The former, such as LoRA, introduce additional modules to adapt the model to downstream tasks, offering strong memory efficiency. However, their representational capacity is often limited, making them less suitable for fine-grained adaptation. In contrast, the latter directly fine-tunes a carefully chosen subset of the original model parameters, allowing for more precise and effective adaptation, but at the cost of significantly increased memory consumption. To reconcile this trade-off, we propose NeuroAda, a novel PEFT method that enables fine-grained model finetuning while maintaining high memory efficiency. Our approach first identifies important parameters (i.e., connections within the network) as in selective adaptation, and then introduces…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
