Adversaries With Incentives: A Strategic Alternative to Adversarial Robustness

Maayan Ehrenberg; Roy Ganz; Nir Rosenfeld

arXiv:2406.11458·cs.LG·June 10, 2025

Adversaries With Incentives: A Strategic Alternative to Adversarial Robustness

Maayan Ehrenberg, Roy Ganz, Nir Rosenfeld

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a strategic training approach that models adversaries with incentives rather than purely malicious intent, leveraging strategic reasoning to improve robustness and reduce conservativeness in adversarial training.

Contribution

It proposes a novel framework that incorporates opponent incentives into training, enabling more flexible and potentially more effective defenses against a range of adversaries.

Findings

01

Knowledge of opponent incentives improves robustness.

02

Potential gains depend on incentive alignment with the task.

03

Strategic training reduces conservativeness compared to traditional adversarial methods.

Abstract

Adversarial training aims to defend against adversaries: malicious opponents whose sole aim is to harm predictive performance in any way possible. This presents a rather harsh perspective, which we assert results in unnecessarily conservative training. As an alternative, we propose to model opponents as simply pursuing their own goals--rather than working directly against the classifier. Employing tools from strategic modeling, our approach enables knowledge or beliefs regarding the opponent's possible incentives to be used as inductive bias for learning. Accordingly, our method of strategic training is designed to defend against all opponents within an 'incentive uncertainty set'. This resorts to adversarial learning when the set is maximal, but offers potential gains when the set can be appropriately reduced. We conduct a series of experiments that show how even mild knowledge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

maayango285/Adversaries-With-Incentives
pytorchOfficial

Videos

Adversaries With Incentives: A Strategic Alternative to Adversarial Robustness· slideslive

Taxonomy

TopicsEconomic Sanctions and International Relations

MethodsSparse Evolutionary Training