DIML: Differentiable Inverse Mechanism Learning from Behaviors of Multi-Agent Learning Trajectories
Zhiyu An, Wan Du

TL;DR
DIML introduces a likelihood-based framework for recovering unknown incentive mechanisms from observed multi-agent learning behaviors, enabling counterfactual analysis and scalable inference in complex strategic environments.
Contribution
We propose DIML, a novel differentiable inverse mechanism learning method that infers unstructured incentive mechanisms from observed behaviors, with proven identifiability and scalability.
Findings
DIML accurately recovers incentive differences in simulated environments.
It supports reliable counterfactual predictions in large-scale multi-agent settings.
Performance rivals oracle methods in small environments and scales to hundreds of agents.
Abstract
We study inverse mechanism learning: recovering an unknown incentive-generating mechanism from observed strategic interaction traces of self-interested learning agents. Unlike inverse game theory and multi-agent inverse reinforcement learning, which typically infer utility/reward parameters inside a structured mechanism, our target includes unstructured mechanism -- a (possibly neural) mapping from joint actions to per-agent payoffs. Unlike differentiable mechanism design, which optimizes mechanisms forward, we infer mechanisms from behavior in an observational setting. We propose DIML, a likelihood-based framework that differentiates through a model of multi-agent learning dynamics and uses the candidate mechanism to generate counterfactual payoffs needed to predict observed actions. We establish identifiability of payoff differences under a conditional logit response model and prove…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Explainable Artificial Intelligence (XAI)
