Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning

Rujikorn Charakorn; Poramate Manoonpong; Nat Dilokthanakul

arXiv:2111.03431·cs.AI·October 21, 2022

Learning to Cooperate with Unseen Agent via Meta-Reinforcement Learning

Rujikorn Charakorn, Poramate Manoonpong, Nat Dilokthanakul

PDF

Open Access

TL;DR

This paper explores using meta-reinforcement learning to enable agents to learn cooperative skills from data, allowing them to adapt and cooperate effectively with unseen agents in complex environments.

Contribution

It introduces a meta-RL approach for ad hoc teamwork, demonstrating its effectiveness in enabling agents to learn cooperation without domain-specific knowledge.

Findings

01

Meta-RL produces robust cooperative agents in diverse environments.

02

Agents successfully adapt to different cooperative circumstances.

03

The method outperforms traditional approaches in ad hoc teamwork scenarios.

Abstract

Ad hoc teamwork problem describes situations where an agent has to cooperate with previously unseen agents to achieve a common goal. For an agent to be successful in these scenarios, it has to have a suitable cooperative skill. One could implement cooperative skills into an agent by using domain knowledge to design the agent's behavior. However, in complex domains, domain knowledge might not be available. Therefore, it is worthwhile to explore how to directly learn cooperative skills from data. In this work, we apply meta-reinforcement learning (meta-RL) formulation in the context of the ad hoc teamwork problem. Our empirical results show that such a method could produce robust cooperative agents in two cooperative environments with different cooperative circumstances: social compliance and language interpretation. (This is a full paper of the extended abstract version.)

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Topic Modeling · Multi-Agent Systems and Negotiation

MethodsHigh-Order Consensuses