Classifying Ambiguous Identities in Hidden-Role Stochastic Games with Multi-Agent Reinforcement Learning
Shijie Han, Siyuan Li, Bo An, Wei Zhao, Peng Liu

TL;DR
This paper introduces IDRL, a reinforcement learning framework that dynamically infers agent identities in stochastic games with ambiguous roles, improving cooperation and competition strategies in uncertain multi-agent environments.
Contribution
The work presents a novel identity detection framework with relation and danger networks, enabling agents to adaptively identify others and select appropriate policies in unknown or changing roles.
Findings
IDRL outperforms existing MARL methods in Red-10 game
Relation network matches top human performance in identity recognition
Danger network effectively reduces false-positive identifications
Abstract
Multi-agent reinforcement learning (MARL) is a prevalent learning paradigm for solving stochastic games. In most MARL studies, agents in a game are defined as teammates or enemies beforehand, and the relationships among the agents remain fixed throughout the game. However, in real-world problems, the agent relationships are commonly unknown in advance or dynamically changing. Many multi-party interactions start off by asking: who is on my team? This question arises whether it is the first day at the stock exchange or the kindergarten. Therefore, training policies for such situations in the face of imperfect information and ambiguous identities is an important problem that needs to be addressed. In this work, we develop a novel identity detection reinforcement learning (IDRL) framework that allows an agent to dynamically infer the identities of nearby agents and select an appropriate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSports Analytics and Performance
