Modeling Latent Partner Strategies for Adaptive Zero-Shot Human-Agent Collaboration

Benjamin Li; Shuyang Shi; Lucia Romero; Huao Li; Yaqi Xie; Woojun Kim; Stefanos Nikolaidis; Michael Lewis; Katia Sycara; Simon Stepputtis

arXiv:2507.05244·cs.AI·July 8, 2025

Modeling Latent Partner Strategies for Adaptive Zero-Shot Human-Agent Collaboration

Benjamin Li, Shuyang Shi, Lucia Romero, Huao Li, Yaqi Xie, Woojun Kim, Stefanos Nikolaidis, Michael Lewis, Katia Sycara, Simon Stepputtis

PDF

TL;DR

This paper introduces TALENTS, a framework that models, categorizes, and adapts to diverse human partner strategies in real-time, enhancing zero-shot collaboration in complex, dynamic tasks like cooperative cooking.

Contribution

The work presents a novel strategy-conditioned cooperator framework utilizing a variational autoencoder and regret minimization for adaptive zero-shot human-agent teamwork.

Findings

01

Outperforms baselines with unfamiliar human partners.

02

Effective in a complex cooperative cooking environment.

03

Demonstrates real-time adaptation to diverse strategies.

Abstract

In collaborative tasks, being able to adapt to your teammates is a necessary requirement for success. When teammates are heterogeneous, such as in human-agent teams, agents need to be able to observe, recognize, and adapt to their human partners in real time. This becomes particularly challenging in tasks with time pressure and complex strategic spaces where the dynamics can change rapidly. In this work, we introduce TALENTS, a strategy-conditioned cooperator framework that learns to represent, categorize, and adapt to a range of partner strategies, enabling ad-hoc teamwork. Our approach utilizes a variational autoencoder to learn a latent strategy space from trajectory data. This latent space represents the underlying strategies that agents employ. Subsequently, the system identifies different types of strategy by clustering the data. Finally, a cooperator agent is trained to generate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.