ORION: Option-Regularized Deep Reinforcement Learning for Cooperative Multi-Agent Online Navigation

Shizhe Zhang; Jingsong Liang; Zhitao Zhou; Shuhan Ye; Yizhuo Wang; Ming Siang Derek Tan; Jimmy Chiun; Yuhong Cao; Guillaume Sartoretti

arXiv:2601.01155 · cs.RO · May 19, 2026

TL;DR

ORION is a deep reinforcement learning framework enabling cooperative multi-agent online navigation in partially known environments by actively reducing map uncertainty through decentralized decision-making and online perception sharing.

Contribution

ORION introduces a novel option-critic-based framework with a shared graph encoder and a dual-stage cooperation strategy for scalable, real-time multi-agent navigation.

Findings

01. Outperforms state-of-the-art baselines in maze and warehouse environments.

02. Scales to teams of up to 10 robots with high-quality decentralized cooperation.

03. Demonstrates robustness and practicality on physical robot teams.

Abstract

Existing methods for multi-agent navigation typically assume fully known environments, offering limited support for partially known scenarios with outdated or imperfect prior maps, such as warehouses or factory floors. There, agents need to balance path optimality with collecting and sharing environmental information to help teammates reach their own targets. To this end, we propose ORION, a novel deep reinforcement learning framework for cooperative multi-agent online navigation in partially known environments. Starting from an imperfect prior map, ORION trains agents to make decentralized decisions, coordinate toward individual targets, and actively reduce task-relevant map uncertainty through online observation sharing in a closed perception-action loop. We first design a shared graph encoder that fuses the prior map with online perception into a unified representation, providing…

Taxonomy

Topics: Reinforcement Learning in Robotics · Multimodal Machine Learning Applications · Robotics and Sensor-Based Localization