Learning When to Cooperate Under Heterogeneous Goals
Max Taylor-Davies, Neil Bramley, Christopher G. Lucas

TL;DR
This paper introduces a hierarchical learning approach for agents to decide when to cooperate in environments with heterogeneous goals, improving collaboration effectiveness.
Contribution
It extends ad hoc teamwork to heterogeneous goals and proposes a novel hierarchical imitation-reinforcement learning method for better cooperation.
Findings
Outperforms baseline methods in cooperative environments
Hierarchical approach effectively learns when to cooperate
Auxiliary teammate modeling component's impact varies with observable information
Abstract
A significant element of human cooperative intelligence lies in our ability to identify opportunities for fruitful collaboration; and conversely to recognise when the task at hand is better pursued alone. Research on flexible cooperation in machines has left this meta-level problem largely unexplored, despite its importance for successful collaboration in heterogeneous open-ended environments. Here, we extend the typical Ad Hoc Teamwork (AHT) setting to incorporate the idea of agents having heterogeneous goals that in any given scenario may or may not overlap. We introduce a novel approach to learning policies in this setting, based on a hierarchical combination of imitation and reinforcement learning, and show that it outperforms baseline methods across extended versions of two cooperative environments. We also investigate the contribution of an auxiliary component that learns to model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Mobile Crowdsensing and Crowdsourcing · Social Robot Interaction and HRI
