Intrinsic-Motivation Multi-Robot Social Formation Navigation with Coordinated Exploration

Hao Fu; Wei Liu; Shuai Zhou

arXiv:2512.13293·cs.RO·December 17, 2025

Intrinsic-Motivation Multi-Robot Social Formation Navigation with Coordinated Exploration

Hao Fu, Wei Liu, Shuai Zhou

PDF

Open Access

TL;DR

This paper introduces a novel intrinsic-motivation multi-robot RL algorithm for social formation navigation, effectively handling unpredictable pedestrian behaviors and improving coordinated exploration and policy learning.

Contribution

It proposes a self-learning intrinsic reward mechanism with dual-sampling and two-time-scale updates, advancing multi-robot RL for social navigation.

Findings

01

Outperforms existing methods on social formation benchmarks

02

Enhances coordinated exploration efficiency

03

Demonstrates robustness in dynamic pedestrian environments

Abstract

This paper investigates the application of reinforcement learning (RL) to multi-robot social formation navigation, a critical capability for enabling seamless human-robot coexistence. While RL offers a promising paradigm, the inherent unpredictability and often uncooperative dynamics of pedestrian behavior pose substantial challenges, particularly concerning the efficiency of coordinated exploration among robots. To address this, we propose a novel coordinated-exploration multi-robot RL algorithm introducing an intrinsic motivation exploration. Its core component is a self-learning intrinsic reward mechanism designed to collectively alleviate policy conservatism. Moreover, this algorithm incorporates a dual-sampling mode within the centralized training and decentralized execution framework to enhance the representation of both the navigation policy and the intrinsic reward, leveraging a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvacuation and Crowd Dynamics · Social Robot Interaction and HRI · Reinforcement Learning in Robotics