Inertial-Based Scale Estimation for Structure from Motion on Mobile   Devices

Janne Mustaniemi; Juho Kannala; Simo S\"arkk\"a; Jiri Matas; Janne; Heikkil\"a

arXiv:1611.09498·cs.CV·August 14, 2017

Inertial-Based Scale Estimation for Structure from Motion on Mobile Devices

Janne Mustaniemi, Juho Kannala, Simo S\"arkk\"a, Jiri Matas, Janne, Heikkil\"a

PDF

1 Repo

TL;DR

This paper introduces a robust, parameter-free method for estimating the metric scale in structure from motion using inertial measurements from mobile devices, improving accuracy and convergence speed.

Contribution

It presents a novel frequency domain approach for scale estimation that handles noisy data and aligns camera and IMU measurements without parameter tuning.

Findings

01

Outperforms state-of-the-art in accuracy and speed

02

Achieves around 1% scale accuracy from ground truth

03

Enhances Project Tango's motion tracking precision

Abstract

Structure from motion algorithms have an inherent limitation that the reconstruction can only be determined up to the unknown scale factor. Modern mobile devices are equipped with an inertial measurement unit (IMU), which can be used for estimating the scale of the reconstruction. We propose a method that recovers the metric scale given inertial measurements and camera poses. In the process, we also perform a temporal and spatial alignment of the camera and the IMU. Therefore, our solution can be easily combined with any existing visual reconstruction software. The method can cope with noisy camera pose estimates, typically caused by motion blur or rolling shutter artifacts, via utilizing a Rauch-Tung-Striebel (RTS) smoother. Furthermore, the scale estimation is performed in the frequency domain, which provides more robustness to inaccurate sensor time stamps and noisy IMU samples than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

robonrrd/ibse
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings