NavOL: Navigation Policy with Online Imitation Learning
Xiaofei Wei, Chun Gu, Li Zhang

TL;DR
NavOL introduces an online imitation learning framework for robotic navigation that interacts with simulators, leveraging expert demonstrations and a pretrained diffusion policy to improve efficiency and robustness without reward engineering.
Contribution
The paper presents NavOL, a novel online imitation learning paradigm that trains navigation policies through interactive rollouts and expert guidance, reducing distribution shift and eliminating reward design.
Findings
NavOL achieves consistent performance improvements in simulation benchmarks.
It scales efficiently across multiple scenes with high throughput.
Real-world experiments validate its effectiveness in practical navigation tasks.
Abstract
Learning robust navigation policies remains a core challenge in robotics. Offline imitation learning suffers from distribution shift and compounding errors at rollout, while reinforcement learning requires reward engineering and learns inefficiently. In this paper, we propose NavOL, an online imitation learning paradigm that interacts with a simulator and updates itself using expert demonstrations gathered online. Built upon a pretrained navigation diffusion policy that maps local observations to future waypoints, NavOL trains in a rollout update loop: during rollout, the policy acts in the simulator and queries a global planner which has privileged access to the global environment for the optimal path segment as ground truth trajectory labels; during update, the policy is trained on the online collected observation trajectory pairs. This online imitation loop removes the need for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
