Interp3R: Continuous-time 3D Geometry Estimation with Frames and Events

Shuang Guo; Filbert Febryanto; Lei Sun; Guillermo Gallego

arXiv:2603.14528·cs.CV·March 17, 2026

Interp3R: Continuous-time 3D Geometry Estimation with Frames and Events

Shuang Guo, Filbert Febryanto, Lei Sun, Guillermo Gallego

PDF

Open Access

TL;DR

Interp3R is a novel method that uses asynchronous event data to interpolate 3D scene geometry and camera poses at arbitrary times, surpassing existing discrete-time approaches in accuracy and temporal continuity.

Contribution

It introduces the first continuous-time 3D geometry estimation method leveraging event data to interpolate pointmaps from frame-based models.

Findings

01

Outperforms state-of-the-art baselines in accuracy

02

Generalizes well across synthetic and real-world data

03

Provides temporally continuous 3D scene representations

Abstract

In recent years, 3D visual foundation models pioneered by pointmap-based approaches such as DUSt3R have attracted a lot of interest, achieving impressive accuracy and strong generalization across diverse scenes. However, these methods are inherently limited to recovering scene geometry only at the discrete time instants when images are captured, leaving the scene evolution during the blind time between consecutive frames largely unexplored. We introduce Interp3R, to the best of our knowledge the first method that enhances pointmap-based models to estimate depth and camera poses at arbitrary time instants. Interp3R leverages asynchronous event data to interpolate pointmaps produced by frame-based models, enabling temporally continuous geometric representations. Depth and camera poses are then jointly recovered by aligning the interpolated pointmaps together with those predicted by the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Robotics and Sensor-Based Localization