Pay Attention to Small Weights

Chao Zhou; Tom Jacobs; Advait Gadhikar; Rebekka Burkholz

arXiv:2506.21374·cs.LG·October 23, 2025


TL;DR

This paper introduces NANOADAM, a finetuning method that updates only small-magnitude weights. It is motivated by the observation that large gradients are often associated with small-magnitude weights during finetuning, and it improves efficiency and performance in NLP and vision tasks.

Contribution

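The paper proposes NANOADAM, a finetuning approach whose parameter-selection criterion is weight-based and gradient-free: the subset of weights to update is chosen by magnitude alone, without computing gradients. Updating only small-magnitude weights preserves the large weights that likely encode pretrained features and enhances generalization.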

Findings

1. NANOADAM outperforms standard methods in NLP and vision tasks.
2. Selective weight updating reduces resource usage.
3. Preserving large weights mitigates catastrophic forgetting.

Abstract

Finetuning large pretrained neural networks is known to be resource-intensive, both in terms of memory and computational cost. To mitigate this, a common approach is to restrict training to a subset of the model parameters. By analyzing the relationship between gradients and weights during finetuning, we observe a notable pattern: large gradients are often associated with small-magnitude weights. This correlation is more pronounced in finetuning settings than in training from scratch. Motivated by this observation, we propose NANOADAM, which dynamically updates only the small-magnitude weights during finetuning and offers several practical advantages: first, this criterion is gradient-free -- the parameter subset can be determined without gradient computation; second, it preserves large-magnitude weights, which are likely to encode critical features learned during pretraining, thereby…
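The selection rule described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names `small_weight_mask` and `masked_adam_step`, the `fraction` parameter, the flat list of weights, and the choice to compute the mask once are all assumptions made for illustration. The abstract states only that the subset is chosen by weight magnitude without gradients and is updated dynamically.

```python
import math

def small_weight_mask(weights, fraction):
    """Select the `fraction` of weights with the smallest magnitude.

    Hypothetical helper: it looks only at the weights, never at gradients,
    which is what makes the selection criterion gradient-free.
    """
    k = max(1, int(len(weights) * fraction))
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    chosen = set(order[:k])
    return [i in chosen for i in range(len(weights))]

def masked_adam_step(weights, grads, mask, m, v, t,
                     lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """Apply one standard Adam step, but only where `mask` is True.

    Weights outside the mask stay frozen, and their optimizer state
    (m, v) is never touched. Using a single global step count `t` for
    bias correction is a simplification assumed here.
    """
    new_w = list(weights)
    for i, active in enumerate(mask):
        if not active:
            continue
        m[i] = beta1 * m[i] + (1 - beta1) * grads[i]
        v[i] = beta2 * v[i] + (1 - beta2) * grads[i] ** 2
        m_hat = m[i] / (1 - beta1 ** t)
        v_hat = v[i] / (1 - beta2 ** t)
        new_w[i] = weights[i] - lr * m_hat / (math.sqrt(v_hat) + eps)
    return new_w

# Toy data (made up for illustration).
weights = [0.9, -0.01, 0.5, 0.02, -1.2, 0.003]
grads = [0.1, 0.4, -0.2, -0.3, 0.05, 0.6]

mask = small_weight_mask(weights, fraction=0.5)
m = [0.0] * len(weights)
v = [0.0] * len(weights)
updated = masked_adam_step(weights, grads, mask, m, v, t=1)
```

In this toy run only the three smallest-magnitude entries move, while the large weights are returned unchanged. A dynamic variant would presumably recompute the mask at some interval during training; how often, and how optimizer state is handled when the mask changes, is not specified in the text above.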


Videos

Pay Attention to Small Weights · SlidesLive
