Minimalist Visual Inertial Odometry

Francesco Pasti; Jeremy Klotz; Nicola Bellotto; and Shree K. Nayar

arXiv:2605.19990·cs.RO·May 20, 2026

Minimalist Visual Inertial Odometry

Francesco Pasti, Jeremy Klotz, Nicola Bellotto, and Shree K. Nayar

PDF

1 Repo

TL;DR

This paper introduces a minimalist visual-inertial odometry system using only four photodiodes and an IMU, achieving accurate motion estimation with reduced resource requirements.

Contribution

The work demonstrates that a small set of optical measurements combined with an IMU can provide robust planar odometry, optimizing sensor design and decoding speed from minimal signals.

Findings

01

System accurately tracks ground truth across diverse terrains.

02

Minimalist sensor setup reduces resource consumption.

03

Joint optimization of mask parameters and TCN improves speed decoding.

Abstract

Visual-Inertial Odometry(VIO), which is critical to mobile robot navigation, uses cameras with a large number of pixels. Capturing and processing camera images requires significant resources. This work presents a minimalist approach to planar odometry, demonstrating that just four visual measurements and an IMU can provide robust motion estimation for differential-drive robots. Our key insight is that four downward-facing photodiodes that sense the world through optical Gabor masks produce signals that encode speed. Based on this, we jointly optimize the mask parameters alongside a Temporal Convolutional Network (TCN) using a physically-grounded simulator. The resulting model decodes speed from just the four measurements produced by the photodiodes. Pairing these estimates with the angular speed from an IMU yields a continuous planar trajectory. We validate our approach with a prototype…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pastifra/four-pixel-vio
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.