VI-IKD: High-Speed Accurate Off-Road Navigation using Learned Visual-Inertial Inverse Kinodynamics
Haresh Karnan, Kavan Singh Sikand, Pranav Atreya, Sadegh Rabiee, Xuesu, Xiao, Garrett Warnell, Peter Stone, Joydeep Biswas

TL;DR
This paper introduces VI-IKD, a novel visual-inertial learning model that anticipates vehicle-terrain interactions for high-speed off-road navigation, significantly improving accuracy and robustness over existing methods.
Contribution
The paper presents VI-IKD, a new learned inverse kinodynamics model that incorporates visual future terrain information to enhance high-speed off-road navigation.
Findings
VI-IKD achieves more accurate navigation than state-of-the-art methods.
The approach enables robust off-road navigation at speeds up to 3.5 m/s.
Validated on a 1/5 scale UT-AlphaTruck in diverse environments.
Abstract
One of the key challenges in high speed off road navigation on ground vehicles is that the kinodynamics of the vehicle terrain interaction can differ dramatically depending on the terrain. Previous approaches to addressing this challenge have considered learning an inverse kinodynamics (IKD) model, conditioned on inertial information of the vehicle to sense the kinodynamic interactions. In this paper, we hypothesize that to enable accurate high-speed off-road navigation using a learned IKD model, in addition to inertial information from the past, one must also anticipate the kinodynamic interactions of the vehicle with the terrain in the future. To this end, we introduce Visual-Inertial Inverse Kinodynamics (VI-IKD), a novel learning based IKD model that is conditioned on visual information from a terrain patch ahead of the robot in addition to past inertial information, enabling it to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Autonomous Vehicle Technology and Safety · Robotics and Sensor-Based Localization
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
