Nested Training for Mutual Adaptation in Human-AI Teaming

Upasana Biswas; Durgesh Kalwar; Subbarao Kambhampati; Sarath Sreedharan

arXiv:2602.17737·cs.RO·February 23, 2026

Open Access

TL;DR

This paper introduces a nested training approach for human-AI teaming that models human adaptation explicitly, leading to more adaptable and effective robots in cooperative tasks.

Contribution

The paper proposes a nested training regime based on I-POMDPs to train agents that better adapt to human behavior without developing opaque implicit coordination.

Findings

01. Our method outperforms baselines in task success with unseen adaptive partners.

02. Agents trained with nested training show greater adaptability in team interactions.

03. The approach effectively captures human-like adaptive behaviors in robots.

Abstract

Mutual adaptation is a central challenge in human-AI teaming, as humans naturally adjust their strategies in response to a robot's policy. Existing approaches aim to improve diversity in training partners to approximate human behavior, but these partners are static and fail to capture the adaptive behavior of humans. Exposing robots to adaptive behaviors is critical, yet when both agents learn simultaneously in a multi-agent setting, they often converge to opaque implicit coordination strategies that only work with the agents they were co-trained with. Such agents fail to generalize when paired with new partners. To capture the adaptive behavior of humans, we model the human-robot teaming scenario as an Interactive Partially Observable Markov Decision Process (I-POMDP), explicitly modeling human adaptation as part of the state. We propose a nested training regime to approximately…

Taxonomy

Topics: Reinforcement Learning in Robotics · Social Robot Interaction and HRI · Human-Automation Interaction and Safety