Anticipating Gaming to Incentivize Improvement: Guiding Agents in (Fair) Strategic Classification
Sura Alhanouti, Parinaz Naghizadeh

TL;DR
This paper models strategic human responses to machine learning classifiers as a game, analyzing how to design fair algorithms that incentivize genuine improvement over manipulation.
Contribution
It introduces a Stackelberg game framework to study strategic responses and proposes methods to design classifiers that promote genuine improvement and fairness.
Findings
Optimal classifiers can prevent manipulation and encourage improvement.
Different agent response classes are characterized based on costs and efficacy.
Fair strategic policies can align incentives towards genuine skill enhancement.
Abstract
As machine learning algorithms increasingly influence critical decision making in different application areas, understanding human strategic behavior in response to these systems becomes vital. We explore individuals' choice between genuinely improving their qualifications (``improvement'') vs. attempting to deceive the algorithm by manipulating their features (``manipulation'') in response to an algorithmic decision system. We further investigate an algorithm designer's ability to shape these strategic responses, and its fairness implications. Specifically, we formulate these interactions as a Stackelberg game, where a firm deploys a (fair) classifier, and individuals strategically respond. Our model incorporates both different costs and stochastic efficacy for manipulation and improvement. The analysis reveals different potential classes of agent responses, and characterizes optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Experimental Behavioral Economics Studies · Game Theory and Applications
MethodsOPT
