Efficient Q-Learning over Visit Frequency Maps for Multi-agent Exploration of Unknown Environments
Xuyang Chen, Ashvin N. Iyer, Zixing Wang, Ahmed H. Qureshi

TL;DR
This paper introduces an efficient, compact multi-agent exploration method using an integrated visit frequency map, improving communication efficiency and adaptability in unknown environments.
Contribution
The paper proposes a novel integrated visit frequency map and a multi-agent information exchange scheme, enhancing efficiency and scalability over traditional visit frequency maps.
Findings
Achieves similar performance to VFM with lower bandwidth.
Generalizes well to various multi-agent and real-world environments.
Reduces map size while maintaining exploration effectiveness.
Abstract
The robot exploration task has been widely studied with applications spanning from novel environment mapping to item delivery. For some time-critical tasks, such as rescue catastrophes, the agent is required to explore as efficiently as possible. Recently, Visit Frequency-based map representation achieved great success in such scenarios by discouraging repetitive visits with a frequency-based penalty. However, its relatively large size and single-agent settings hinder its further development. In this context, we propose Integrated Visit Frequency Map, which encodes identical information as Visit Frequency Map with a more compact size, and a visit frequency-based multi-agent information exchange and control scheme that is able to accommodate both representations. Through tests in diverse settings, the results indicate our proposed methods can achieve a comparable level of performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Reinforcement Learning in Robotics
