MacroNav: Multi-Task Context Representation Learning Enables Efficient Navigation in Unknown Environments

Kuankuan Sima; Longbin Tang; Zhenyu Yang; Haozhe Ma; Lin Zhao

arXiv:2511.04320·cs.RO·April 22, 2026

MacroNav: Multi-Task Context Representation Learning Enables Efficient Navigation in Unknown Environments

Kuankuan Sima, Longbin Tang, Zhenyu Yang, Haozhe Ma, Lin Zhao

PDF

TL;DR

MacroNav introduces a multi-task self-supervised context encoder combined with graph reasoning, enabling efficient, real-time navigation in unknown environments with improved success rates and computational efficiency.

Contribution

The paper presents MacroNav, a novel framework that integrates a lightweight, multi-scale spatial context encoder with graph-based reasoning for enhanced navigation.

Findings

01

Significant improvements in Success Rate and SPL over state-of-the-art methods.

02

Effective environmental understanding demonstrated in real-world deployments.

03

Achieves high performance with low computational cost.

Abstract

Autonomous navigation in unknown environments requires multi-scale spatial understanding that captures geometric details, topological connectivity, and global structure to support high-level decision making under partial observability. Existing approaches struggle to efficiently capture such multi-scale spatial understanding while maintaining low computational cost for real-time navigation. We present MacroNav, a learning-based navigation framework featuring two key components: (1) a lightweight context encoder trained via multi-task self-supervised learning to capture multi-scale, navigation-centric spatial representations; and (2) a reinforcement learning policy that seamlessly integrates these representations with graph-based reasoning for efficient action selection. Extensive experiments demonstrate the context encoder's effective and robust environmental understanding. Real-world…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.