MAexp: A Generic Platform for RL-based Multi-Agent Exploration

Shaohao Zhu, Jiacheng Zhou, Anjun Chen, Mingming Bai, Jiming Chen, and Jinming Xu

arXiv:2404.12824·cs.RO·April 22, 2024

TL;DR

MAexp is a versatile platform for multi-agent exploration in robotics that integrates advanced MARL algorithms, uses point clouds for high-fidelity mapping, and offers a fast, scalable environment for benchmarking diverse algorithms.

Contribution

The paper introduces MAexp, a platform that combines multiple MARL algorithms, high-fidelity environment modeling, and scalable multi-robot support for exploration tasks.

Findings

01. Point cloud-based scenarios enable high-fidelity mapping.

02. Sampling speed is approximately 40 times faster than existing platforms.

03. Benchmark results highlight strengths of various MARL algorithms across scenarios.

Abstract

The sim-to-real gap poses a significant challenge in RL-based multi-agent exploration due to scene quantization and action discretization. Existing platforms suffer from inefficient sampling and a lack of diversity in Multi-Agent Reinforcement Learning (MARL) algorithms across different scenarios, which restrains their widespread application. To fill these gaps, we propose MAexp, a generic platform for multi-agent exploration that integrates a broad range of state-of-the-art MARL algorithms and representative scenarios. Moreover, we employ point clouds to represent our exploration scenarios, leading to high-fidelity environment mapping and a sampling speed approximately 40 times faster than existing platforms. Furthermore, equipped with an attention-based Multi-Agent Target Generator and a Single-Agent Motion Planner, MAexp can work with arbitrary numbers of agents and…
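One way to see why an attention-based component can accept an arbitrary number of agents is that attention reduces any number of per-agent feature vectors to a single fixed-size vector. The following is a minimal sketch of that idea only, not MAexp's actual Multi-Agent Target Generator: the function name `attention_pool`, the toy feature vectors, and the use of raw features as both keys and values (with no learned projections) are all assumptions made for illustration.

```python
import math


def attention_pool(query, agent_features):
    """Pool any number of agent feature vectors into one vector.

    Hypothetical helper: scaled dot-product attention where each agent's
    feature vector serves as both key and value (no learned weights).
    """
    dim = len(query)
    # One similarity score per agent, scaled by sqrt(dim).
    scores = [
        sum(q * f for q, f in zip(query, feat)) / math.sqrt(dim)
        for feat in agent_features
    ]
    # Numerically stable softmax over the agents.
    max_score = max(scores)
    exp_scores = [math.exp(s - max_score) for s in scores]
    total = sum(exp_scores)
    weights = [e / total for e in exp_scores]
    # Weighted sum of the agent features; length is always `dim`.
    pooled = [
        sum(w * feat[k] for w, feat in zip(weights, agent_features))
        for k in range(dim)
    ]
    return pooled, weights


# Toy inputs (made up): the same function handles 2 or 5 agents.
query = [1.0, 0.0, 0.0]
two_agents = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
five_agents = two_agents + [[0.0, 0.0, 1.0], [1.0, 1.0, 0.0], [0.5, 0.5, 0.5]]

pooled_2, weights_2 = attention_pool(query, two_agents)
pooled_5, weights_5 = attention_pool(query, five_agents)
```

Both calls return a pooled vector of length 3, so a downstream network with a fixed input size would not need to change when the team size does. A trained model would additionally learn query, key, and value projections, which this sketch omits.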

Code & Models

Repositories

duangzhu/maexp (PyTorch, official)

Taxonomy

Topics: AI-based Problem Solving and Planning · Robotic Path Planning Algorithms · Advanced Control Systems Optimization

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings