Adversarial Attacks in Weight-Space Classifiers

Tamir Shor; Ethan Fetaya; Chaim Baskin; Alex Bronstein

arXiv:2502.20314·cs.LG·March 4, 2026

Adversarial Attacks in Weight-Space Classifiers

Tamir Shor, Ethan Fetaya, Chaim Baskin, Alex Bronstein

PDF

Open Access

TL;DR

This paper investigates the robustness of weight-space classifiers, especially those based on Implicit Neural Representations, against adversarial attacks, revealing increased robustness due to gradient obfuscation and introducing new attack methods.

Contribution

The study provides the first in-depth analysis of adversarial robustness in weight-space classifiers, highlighting their inherent robustness and limitations without robust training.

Findings

01

Weight-space classifiers show increased robustness to white-box attacks.

02

Gradient obfuscation during INR training contributes to robustness.

03

New adversarial attack suite effectively targets parameter-space classifiers.

Abstract

Implicit Neural Representations (INRs) have been recently garnering increasing interest in various research fields, mainly due to their ability to represent large, complex data in a compact, continuous manner. Past work further showed that numerous popular downstream tasks can be performed directly in the INR parameter-space. Doing so can substantially reduce the computational resources required to process the represented data in their native domain. A major difficulty in using modern machine-learning approaches, is their high susceptibility to adversarial attacks, which have been shown to greatly limit the reliability and applicability of such methods in a wide range of settings. In this work, we perform an in-depth security analysis of the behavior of weight-space classifiers under adversarial attacks. Our study reveals that parameter-space models trained for classification exhibit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI