NAT: Learning to Attack Neurons for Enhanced Adversarial Transferability

Krishna Kanth Nakka; Alexandre Alahi

arXiv:2508.16937·cs.CV·August 26, 2025

NAT: Learning to Attack Neurons for Enhanced Adversarial Transferability

Krishna Kanth Nakka, Alexandre Alahi

PDF

1 Models

TL;DR

NAT introduces a neuron-specific adversarial attack method that improves transferability across models by targeting individual neurons, outperforming existing methods in fooling rates with fewer queries.

Contribution

The paper proposes NAT, a novel neuron-focused attack method that enhances transferability and efficiency over traditional layer-level approaches.

Findings

01

Achieves over 14% higher fooling rates across models.

02

Surpasses baseline methods in cross-domain transferability.

03

Effective within 10 queries using combined generator attacks.

Abstract

The generation of transferable adversarial perturbations typically involves training a generator to maximize embedding separation between clean and adversarial images at a single mid-layer of a source model. In this work, we build on this approach and introduce Neuron Attack for Transferability (NAT), a method designed to target specific neuron within the embedding. Our approach is motivated by the observation that previous layer-level optimizations often disproportionately focus on a few neurons representing similar concepts, leaving other neurons within the attacked layer minimally affected. NAT shifts the focus from embedding-level separation to a more fundamental, neuron-specific approach. We find that targeting individual neurons effectively disrupts the core units of the neural network, providing a common basis for transferability across different models. Through extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
KKNakka/NAT
model· 1 dl
1 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.