Towards Target-Driven Visual Navigation in Indoor Scenes via Generative Imitation Learning
Qiaoyun Wu, Xiaoxi Gong, Kai Xu, Dinesh Manocha, Jingxuan Dong, Jun, Wang

TL;DR
This paper introduces a novel target-driven visual navigation system for indoor scenes that uses generative imitation learning, multi-view observations, and auxiliary tasks to improve safety, efficiency, and generalization without relying on GPS or odometry.
Contribution
It proposes a new navigation framework with a variational generative module, collision prediction, and target checking, enhancing mapless navigation performance and data efficiency.
Findings
Effective real-world navigation on a TurtleBot
Improved safety through static collision prediction
Enhanced generalization in indoor environments
Abstract
We present a target-driven navigation system to improve mapless visual navigation in indoor scenes. Our method takes a multi-view observation of a robot and a target as inputs at each time step to provide a sequence of actions that move the robot to the target without relying on odometry or GPS at runtime. The system is learned by optimizing a combinational objective encompassing three key designs. First, we propose that an agent conceives the next observation before making an action decision. This is achieved by learning a variational generative module from expert demonstrations. We then propose predicting static collision in advance, as an auxiliary task to improve safety during navigation. Moreover, to alleviate the training data imbalance problem of termination action prediction, we also introduce a target checking module to differentiate from augmenting navigation policy with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Robot Manipulation and Learning · Multimodal Machine Learning Applications
