Adaptive Episode Length Adjustment for Multi-agent Reinforcement Learning

Byunghyun Yoo; Younghwan Shin; Hyunwoo Kim; Euisok Chung; Jeongmin Yang

arXiv:2505.19637·cs.MA·May 27, 2025

Adaptive Episode Length Adjustment for Multi-agent Reinforcement Learning

Byunghyun Yoo, Younghwan Shin, Hyunwoo Kim, Euisok Chung, Jeongmin Yang

PDF

Open Access

TL;DR

This paper introduces Adaptive Episode Length Adjustment (AELA) for multi-agent reinforcement learning, dynamically tuning episode lengths based on learning progress to improve convergence and performance.

Contribution

It presents the first method to adaptively adjust episode length in MARL using entropy-based learning progress assessment, enhancing training efficiency.

Findings

01

Significant improvements in convergence speed.

02

Enhanced overall performance in MARL tasks.

03

Effective balancing of exploration and exploitation.

Abstract

In standard reinforcement learning, an episode is defined as a sequence of interactions between agents and the environment, which terminates upon reaching a terminal state or a pre-defined episode length. Setting a shorter episode length enables the generation of multiple episodes with the same number of data samples, thereby facilitating an exploration of diverse states. While shorter episodes may limit the collection of long-term interactions, they may offer significant advantages when properly managed. For example, trajectory truncation in single-agent reinforcement learning has shown how the benefits of shorter episodes can be leveraged despite the trade-off of reduced long-term interaction experiences. However, this approach remains underexplored in MARL. This paper proposes a novel MARL approach, Adaptive Episode Length Adjustment (AELA), where the episode length is initially…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Mathematical and Theoretical Epidemiology and Ecology Models · Adaptive Dynamic Programming Control