Hilbert-Augmented Reinforcement Learning for Scalable Multi-Robot Coverage and Exploration

Tamil Selvan Gurunathan; Aryya Gangopadhyay

arXiv:2602.19400·cs.RO·February 24, 2026

Hilbert-Augmented Reinforcement Learning for Scalable Multi-Robot Coverage and Exploration

Tamil Selvan Gurunathan, Aryya Gangopadhyay

PDF

Open Access

TL;DR

This paper introduces a novel multi-robot coverage framework that incorporates Hilbert space-filling curves into reinforcement learning, enhancing exploration efficiency, scalability, and trajectory feasibility for resource-limited robots in complex environments.

Contribution

The work integrates Hilbert space-filling priors into decentralized RL algorithms, enabling scalable, efficient coverage and exploration with feasible trajectories for multi-robot systems.

Findings

01

Improved coverage efficiency and reduced redundancy.

02

Faster convergence in sparse-reward environments.

03

Successful deployment on real legged robots.

Abstract

We present a coverage framework that integrates Hilbert space-filling priors into decentralized multi-robot learning and execution. We augment DQN and PPO with Hilbert-based spatial indices to structure exploration and reduce redundancy in sparse-reward environments, and we evaluate scalability in multi-robot grid coverage. We further describe a waypoint interface that converts Hilbert orderings into curvature-bounded, time-parameterized SE(2) trajectories (planar (x, y, {\theta})), enabling onboard feasibility on resource-constrained robots. Experiments show improvements in coverage efficiency, redundancy, and convergence speed over DQN/PPO baselines. In addition, we validate the approach on a Boston Dynamics Spot legged robot, executing the generated trajectories in indoor environments and observing reliable coverage with low redundancy. These results indicate that geometric priors…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robotic Locomotion and Control · Robot Manipulation and Learning