Attaining Human`s Desirable Outcomes in Human-AI Interaction via Structural Causal Games
Anjie Liu, Jianhong Wang, Haoxuan Li, Xu Chen, Jun Wang, Samuel Kaski, and Mengyue Yang

TL;DR
This paper introduces a structural causal game framework combined with a pre-policy intervention strategy to guide AI agents toward human-desirable outcomes in human-AI interactions, addressing the challenge of multiple Nash Equilibria.
Contribution
It proposes a novel SCG-based formalization and a reinforcement learning algorithm for pre-policy intervention to achieve human-aligned outcomes.
Findings
Effective in gridworld environments
Demonstrates adaptability in dialogue scenarios
Potential for real-world human-AI interaction improvements
Abstract
In human-AI interaction, a prominent goal is to attain human`s desirable outcome with the assistance of AI agents, which can be ideally delineated as a problem of seeking the optimal Nash Equilibrium that matches the human`s desirable outcome. However, reaching the outcome is usually challenging due to the existence of multiple Nash Equilibria that are related to the assisting task but do not correspond to the human`s desirable outcome. To tackle this issue, we employ a theoretical framework called structural causal game (SCG) to formalize the human-AI interactive process. Furthermore, we introduce a strategy referred to as pre-policy intervention on the SCG to steer AI agents towards attaining the human`s desirable outcome. In more detail, a pre-policy is learned as a generalized intervention to guide the agents` policy selection, under a transparent and interpretable procedure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Bayesian Modeling and Causal Inference · Logic, Reasoning, and Knowledge
