Multi-Agent Inverse Reinforcement Learning: Suboptimal Demonstrations and Alternative Solution Concepts
Sage Bergerson

TL;DR
This paper reviews multi-agent inverse reinforcement learning methods that account for suboptimal human behavior and explores alternative equilibrium concepts to better model realistic social interactions.
Contribution
It systematically compares MIRL methods handling noise, biases, and heuristics, proposing that recursive and generalized solution concepts improve social interaction modeling.
Findings
MaxEnt IRL extensions are primary for handling noise and biases.
Generalized solution concepts include correlated and entropy-regularized equilibria.
Recursive reasoning methods like feedback NE perform well.
Abstract
Multi-agent inverse reinforcement learning (MIRL) can be used to learn reward functions from agents in social environments. To model realistic social dynamics, MIRL methods must account for suboptimal human reasoning and behavior. Traditional formalisms of game theory provide computationally tractable behavioral models, but assume agents have unrealistic cognitive capabilities. This research identifies and compares mechanisms in MIRL methods which a) handle noise, biases and heuristics in agent decision making and b) model realistic equilibrium solution concepts. MIRL research is systematically reviewed to identify solutions for these challenges. The methods and results of these studies are analyzed and compared based on factors including performance accuracy, efficiency, and descriptive quality. We found that the primary methods for handling noise, biases and heuristics in MIRL were…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExperimental Behavioral Economics Studies · Evolutionary Game Theory and Cooperation · Game Theory and Applications
