BAMAX: Backtrack Assisted Multi-Agent Exploration using Reinforcement Learning

Geetansh Kalra; Amit Patel; Atul Chaudhari; Divye Singh

arXiv:2411.08400·cs.RO·November 14, 2024

Open Access

TL;DR

BAMAX is a reinforcement learning method for multi-agent exploration that uses backtrack assistance to cover an environment faster and with less backtracking across a range of grid sizes.

Contribution

This paper introduces BAMAX, a novel backtrack-assisted reinforcement learning approach for multi-agent exploration, addressing coordination challenges in unknown environments.

Findings

01

BAMAX outperforms traditional methods in exploration speed.

02

BAMAX reduces the amount of backtracking during exploration.

03

Performance gains are consistent across the tested hexagonal grid sizes, from 10x10 to 60x60.

Abstract

Collaborative exploration of an unknown environment by autonomous robots is still an open problem. The problem has its roots in coordination among non-stationary agents, each with only a partial view of information. It is compounded when multiple robots must completely explore the environment. In this paper, we introduce Backtrack Assisted Multi-Agent Exploration using Reinforcement Learning (BAMAX), a method for collaborative exploration in multi-agent systems which attempts to explore an entire virtual environment. As the name suggests, BAMAX leverages backtrack assistance to enhance the performance of agents in exploration tasks. To evaluate BAMAX against traditional approaches, we present the results of experiments conducted across multiple hexagonal-shaped grid sizes, ranging from 10x10 to 60x60. The results demonstrate that BAMAX outperforms other methods in terms of faster…

Taxonomy

Topics: Reinforcement Learning in Robotics · Data Stream Mining Techniques · Robotic Path Planning Algorithms