Leveraging Statistical Multi-Agent Online Planning with Emergent Value   Function Approximation

Thomy Phan; Lenz Belzner; Thomas Gabor; Kyrill Schmid

arXiv:1804.06311·cs.MA·December 29, 2023·5 cites

Leveraging Statistical Multi-Agent Online Planning with Emergent Value Function Approximation

Thomy Phan, Lenz Belzner, Thomas Gabor, Kyrill Schmid

PDF

Open Access

TL;DR

This paper introduces EVADE, a novel method that enhances multi-agent online planning in stochastic environments by integrating emergent system behavior through reinforcement learning-based value function approximation, improving efficiency and performance.

Contribution

EVADE is the first approach to incorporate global emergent behavior into local online planning for multi-agent systems using reinforcement learning.

Findings

01

EVADE improves planning performance in complex stochastic environments.

02

EVADE enhances efficiency in planning breadth and depth.

03

EVADE outperforms baseline algorithms in a smart factory simulation.

Abstract

Making decisions is a great challenge in distributed autonomous environments due to enormous state spaces and uncertainty. Many online planning algorithms rely on statistical sampling to avoid searching the whole state space, while still being able to make acceptable decisions. However, planning often has to be performed under strict computational constraints making online planning in multi-agent systems highly limited, which could lead to poor system performance, especially in stochastic domains. In this paper, we propose Emergent Value function Approximation for Distributed Environments (EVADE), an approach to integrate global experience into multi-agent online planning in stochastic domains to consider global effects during local planning. For this purpose, a value function is approximated online based on the emergent system behaviour by using methods of reinforcement learning. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Simulation Techniques and Applications · Evolutionary Algorithms and Applications