Episodic Future Thinking Mechanism for Multi-agent Reinforcement   Learning

Dongsu Lee; Minhae Kwon

arXiv:2410.17373·cs.LG·October 24, 2024

Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning

Dongsu Lee, Minhae Kwon

PDF

Open Access 1 Video

TL;DR

This paper introduces an episodic future thinking mechanism for multi-agent reinforcement learning that enables agents to infer other agents' characters, predict their future actions, and adaptively choose optimal strategies, improving performance in diverse multi-agent scenarios.

Contribution

The paper proposes a novel EFT mechanism with a multi-character policy for character inference and future action prediction in multi-agent RL, inspired by cognitive processes.

Findings

01

EFT mechanism improves reward in multi-agent autonomous driving scenarios.

02

Accurate character inference enhances decision-making and performance.

03

Effectiveness persists across societies with varying character diversity.

Abstract

Understanding cognitive processes in multi-agent interactions is a primary goal in cognitive science. It can guide the direction of artificial intelligence (AI) research toward social decision-making in multi-agent systems, which includes uncertainty from character heterogeneity. In this paper, we introduce an episodic future thinking (EFT) mechanism for a reinforcement learning (RL) agent, inspired by cognitive processes observed in animals. To enable future thinking functionality, we first develop a multi-character policy that captures diverse characters with an ensemble of heterogeneous policies. Here, the character of an agent is defined as a different weight combination on reward components, representing distinct behavioral preferences. The future thinking agent collects observation-action trajectories of the target agents and uses the pre-trained multi-character policy to infer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning· slideslive

Taxonomy

TopicsCognitive Science and Mapping