Self-Imitated Diffusion Policy for Efficient and Robust Visual Navigation
Runhua Zhang, Junyi Hou, Changxu Cheng, Qiyi Chen, Tao Wang, Wuyue Zhao

TL;DR
The paper introduces SIDP, a novel self-imitation framework for visual navigation that improves planning efficiency and robustness by learning from its own high-quality trajectories, reducing reliance on extensive sampling and filtering.
Contribution
SIDP presents a reward-guided self-imitation approach with curriculum learning and goal-agnostic exploration, advancing visual navigation by enhancing planning quality and efficiency.
Findings
SIDP outperforms previous methods on simulation benchmarks.
SIDP achieves 2.5x faster inference on Jetson Orin Nano.
SIDP demonstrates effective real-world robotic navigation.
Abstract
Diffusion policies (DP) have demonstrated significant potential in visual navigation by capturing diverse multi-modal trajectory distributions. However, standard imitation learning (IL), which most DP methods rely on for training, often inherits sub-optimality and redundancy from expert demonstrations, thereby necessitating a computationally intensive "generate-then-filter" pipeline that relies on auxiliary selectors during inference. To address these challenges, we propose Self-Imitated Diffusion Policy (SIDP), a novel framework that learns improved planning by selectively imitating a set of trajectories sampled from itself. Specifically, SIDP introduces a reward-guided self-imitation mechanism that encourages the policy to consistently produce high-quality trajectories efficiently, rather than outputs of inconsistent quality, thereby reducing reliance on extensive sampling and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Robotic Path Planning Algorithms
