Explanation Design in Strategic Learning: Sufficient Explanations that Induce Non-harmful Responses

Kiet Q. H. Vo; Siu Lun Chau; Masahiro Kato; Yixin Wang; Krikamol Muandet

arXiv:2502.04058·cs.AI·May 29, 2025

Explanation Design in Strategic Learning: Sufficient Explanations that Induce Non-harmful Responses

Kiet Q. H. Vo, Siu Lun Chau, Masahiro Kato, Yixin Wang, Krikamol Muandet

PDF

Open Access

TL;DR

This paper explores how to design explanations in algorithmic decision-making that prevent strategic agents from taking harmful actions, proposing a method that balances model accuracy with agent utility.

Contribution

It introduces a necessary condition for explanations to avoid misleading agents and demonstrates that action recommendation explanations can ensure non-harmful responses under certain assumptions.

Findings

01

ARexes enable safe partial model disclosure.

02

Proposed learning procedure jointly optimizes model and explanations.

03

Experiments show improved predictive performance and agent utility.

Abstract

We study explanation design in algorithmic decision making with strategic agents, individuals who may modify their inputs in response to explanations of a decision maker's (DM's) predictive model. As the demand for transparent algorithmic systems continues to grow, most prior work assumes full model disclosure as the default solution. In practice, however, DMs such as financial institutions typically disclose only partial model information via explanations. Such partial disclosure can lead agents to misinterpret the model and take actions that unknowingly harm their utility. A key open question is how DMs can communicate explanations in a way that avoids harming strategic agents, while still supporting their own decision-making goals, e.g., minimising predictive error. In this work, we analyse well-known explanation methods, and establish a necessary condition to prevent explanations…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Decision Making