Leveraging Statistical Multi-Agent Online Planning with Emergent Value Function Approximation
Thomy Phan, Lenz Belzner, Thomas Gabor, Kyrill Schmid

TL;DR
This paper introduces EVADE, a novel method that enhances multi-agent online planning in stochastic environments by integrating emergent system behavior through reinforcement learning-based value function approximation, improving efficiency and performance.
Contribution
EVADE is the first approach to incorporate global emergent behavior into local online planning for multi-agent systems using reinforcement learning.
Findings
EVADE improves planning performance in complex stochastic environments.
EVADE enhances efficiency in planning breadth and depth.
EVADE outperforms baseline algorithms in a smart factory simulation.
Abstract
Making decisions is a great challenge in distributed autonomous environments due to enormous state spaces and uncertainty. Many online planning algorithms rely on statistical sampling to avoid searching the whole state space, while still being able to make acceptable decisions. However, planning often has to be performed under strict computational constraints making online planning in multi-agent systems highly limited, which could lead to poor system performance, especially in stochastic domains. In this paper, we propose Emergent Value function Approximation for Distributed Environments (EVADE), an approach to integrate global experience into multi-agent online planning in stochastic domains to consider global effects during local planning. For this purpose, a value function is approximated online based on the emergent system behaviour by using methods of reinforcement learning. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Simulation Techniques and Applications · Evolutionary Algorithms and Applications
