GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians

Dasong Gao; Peter Zhi Xuan Li; Vivienne Sze; and Sertac Karaman

arXiv:2409.09295·cs.RO·January 31, 2025

GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians

Dasong Gao, Peter Zhi Xuan Li, Vivienne Sze, and Sertac Karaman

PDF

Open Access 1 Repo

TL;DR

GEVO is a memory-efficient monocular visual odometry framework that uses Gaussian splatting to render scenes from a map, significantly reducing memory usage while maintaining high fidelity.

Contribution

GEVO introduces a novel GS-based SLAM method that renders images from the map instead of storing past images, greatly reducing memory consumption.

Findings

01

Achieves comparable map fidelity to prior methods

02

Reduces memory overhead to around 58 MBs, up to 94x lower

03

Delays degradation of rendered images over time

Abstract

Constructing a high-fidelity representation of the 3D scene using a monocular camera can enable a wide range of applications on mobile devices, such as micro-robots, smartphones, and AR/VR headsets. On these devices, memory is often limited in capacity and its access often dominates the consumption of compute energy. Although Gaussian Splatting (GS) allows for high-fidelity reconstruction of 3D scenes, current GS-based SLAM is not memory efficient as a large number of past images is stored to retrain Gaussians for reducing catastrophic forgetting. These images often require two-orders-of-magnitude higher memory than the map itself and thus dominate the total memory usage. In this work, we present GEVO, a GS-based monocular SLAM framework that achieves comparable fidelity as prior methods by rendering (instead of storing) them from the existing map. Novel Gaussian initialization and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mit-lean/gevo
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Image and Object Detection Techniques