Nested Training for Mutual Adaptation in Human-AI Teaming
Upasana Biswas, Durgesh Kalwar, Subbarao Kambhampati, Sarath Sreedharan

TL;DR
This paper introduces a nested training approach for human-AI teaming that models human adaptation explicitly, leading to more adaptable and effective robots in cooperative tasks.
Contribution
The paper proposes a nested training regime based on I-POMDPs to train agents that better adapt to human behavior without developing opaque implicit coordination.
Findings
Our method outperforms baselines in task success with unseen adaptive partners.
Agents trained with nested training show greater adaptability in team interactions.
The approach effectively captures human-like adaptive behaviors in robots.
Abstract
Mutual adaptation is a central challenge in human--AI teaming, as humans naturally adjust their strategies in response to a robot's policy. Existing approaches aim to improve diversity in training partners to approximate human behavior, but these partners are static and fail to capture adaptive behavior of humans. Exposing robots to adaptive behaviors is critical, yet when both agents learn simultaneously in a multi-agent setting, they often converge to opaque implicit coordination strategies that only work with the agents they were co-trained with. Such agents fail to generalize when paired with new partners. In order to capture the adaptive behavior of humans, we model the human-robot teaming scenario as an Interactive Partially Observable Markov Decision Process (I-POMDP), explicitly modeling human adaptation as part of the state. We propose a nested training regime to approximately…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Social Robot Interaction and HRI · Human-Automation Interaction and Safety
