LS-VO: Learning Dense Optical Subspace for Robust Visual Odometry Estimation
Gabriele Costante, Thomas A. Ciarfuglia

TL;DR
This paper introduces LS-VO, a deep learning architecture that models optical flow as a non-linear latent space for improved robustness in visual odometry estimation.
Contribution
It proposes a novel auto-encoder based approach to learn a non-linear optical flow manifold jointly with motion estimation, enhancing robustness and performance.
Findings
Significant performance improvement over baselines.
Slight increase in model parameters.
Effective non-linear optical flow representation.
Abstract
This work proposes a novel deep network architecture to solve the camera Ego-Motion estimation problem. A motion estimation network generally learns features similar to Optical Flow (OF) fields starting from sequences of images. This OF can be described by a lower dimensional latent space. Previous research has shown how to find linear approximations of this space. We propose to use an Auto-Encoder network to find a non-linear representation of the OF manifold. In addition, we propose to learn the latent space jointly with the estimation task, so that the learned OF features become a more robust description of the OF input. We call this novel architecture LS-VO. The experiments show that LS-VO achieves a considerable increase in performances in respect to baselines, while the number of parameters of the estimation network only slightly increases.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
