Beyond Specialization: Robust Reinforcement Learning Navigation via Procedural Map Generators

Christian Jestel; Nicolas Bach; Marvin Wiedemann; Jan Finke; Peter Detzner

arXiv:2605.02528·cs.RO·May 5, 2026

Beyond Specialization: Robust Reinforcement Learning Navigation via Procedural Map Generators

Christian Jestel, Nicolas Bach, Marvin Wiedemann, Jan Finke, Peter Detzner

PDF

TL;DR

This paper investigates how procedural map generators affect the generalization of reinforcement learning navigation policies, demonstrating that combined generators and subgoal inputs significantly improve robustness and transfer to real-world scenarios.

Contribution

It systematically compares different procedural map generators and identifies key factors like subgoal inputs that enhance policy robustness and transferability.

Findings

01

Combined generators achieve 91.5% success rate, outperforming individual ones.

02

Subgoal inputs from A* path-planner boost success to 98.9%.

03

Recurrent policies outperform feedforward and classical controllers at higher speeds.

Abstract

Deep reinforcement learning (DRL) navigation policies often overfit to the structure of their training environments, as environmental diversity is typically constrained by the manual effort required to design diverse scenarios. While procedural map generation offers scalable diversity, no prior work systematically compares how different generator types affect policy generalization. We integrate four generators (sparse, maze, graph, and Wave Function Collapse) with guaranteed navigability into MuRoSim, a 2D simulator focusing on training efficiency for LiDAR-based navigation. We cross-evaluate five navigation policies on 1000 seeded maps per generator across three training seeds. Results show a strongly asymmetric cross-generator transfer: a specialist trained on sparse layouts falls to 3.3% success on mazes, whereas a policy trained on the combined generator set achieves 91.5 +/- 1.1%…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.