FILT3R: Latent State Adaptive Kalman Filter for Streaming 3D Reconstruction
Seonghyun Jin, Jong Chul Ye

TL;DR
FILT3R introduces a novel, training-free latent filtering layer that adaptively updates the state in streaming 3D reconstruction using a Kalman-style approach, improving stability and accuracy over existing methods.
Contribution
The paper proposes FILT3R, a new latent filtering method that adaptively balances memory and new information in streaming 3D reconstruction without training.
Findings
FILT3R generalizes overwrite and gating policies as special cases.
It improves long-horizon stability for depth, pose, and 3D reconstruction.
FILT3R adapts gains based on scene change and uncertainty.
Abstract
Streaming 3D reconstruction maintains a persistent latent state that is updated online from incoming frames, enabling constant-memory inference. A key failure mode is the state update rule: aggressive overwrites forget useful history, while conservative updates fail to track new evidence, and both behaviors become unstable beyond the training horizon. To address this challenge, we propose FILT3R, a training-free latent filtering layer that casts recurrent state updates as stochastic state estimation in token space. FILT3R maintains a per-token variance and computes a Kalman-style gain that adaptively balances memory retention against new observations. Process noise -- governing how much the latent state is expected to change between frames -- is estimated online from EMA-normalized temporal drift of candidate tokens. Using extensive experiments, we demonstrate that FILT3R yields an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · 3D Shape Modeling and Analysis · Advanced Vision and Imaging
