MeMix: Writing Less, Remembering More for Streaming 3D Reconstruction

Jiacheng Dong; Huan Li; Sicheng Zhou; Wenhao Hu; Weili Xu; Yan Wang

arXiv:2603.15330·cs.CV·March 17, 2026

Open Access

TL;DR

MeMix is a training-free module that improves streaming 3D reconstruction by updating only selected memory patches. This reduces forgetting and improves accuracy without additional training, while inference memory stays at O(1).

Contribution

Introduces MeMix, a plug-and-play, training-free memory module that mitigates forgetting in long-sequence streaming 3D reconstruction while retaining O(1) inference memory.

Findings

01. Reduces reconstruction error by 15.3% on average.

02. Maintains O(1) inference memory.

03. Effective across multiple benchmarks.

Abstract

Reconstruction is a fundamental task in 3D vision and a fundamental capability for spatial intelligence. Particularly, streaming 3D reconstruction is central to real-time spatial perception, yet existing recurrent online models often suffer from progressive degradation on long sequences due to state drift and forgetting, motivating inference-time remedies. We present MeMix, a training-free, plug-and-play module that improves streaming reconstruction by recasting the recurrent state into a Memory Mixture. MeMix partitions the state into multiple independent memory patches and updates only the least-aligned memory patches while exactly preserving others. This selective update mitigates catastrophic forgetting while retaining $O(1)$ inference memory, and requires no fine-tuning or additional learnable parameters, making it directly applicable to existing recurrent reconstruction models.…
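
The selective update described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `selective_update`, the use of cosine similarity between each old patch and its proposed replacement as the alignment score, and the fixed write budget `k` are all assumptions made for illustration. The abstract only states that the least-aligned patches are overwritten and the rest are preserved exactly.

```python
import numpy as np


def selective_update(state, candidate, k):
    """Overwrite only the k least-aligned patches of a recurrent state.

    state, candidate: float arrays of shape (num_patches, dim).
    `candidate` stands in for the state a recurrent model would
    normally write in full at this step (hypothetical input).
    Alignment is cosine similarity per patch, which is an assumption;
    the paper's actual scoring rule is not given in this listing.
    """
    eps = 1e-8
    dots = np.sum(state * candidate, axis=1)
    norms = np.linalg.norm(state, axis=1) * np.linalg.norm(candidate, axis=1)
    alignment = dots / (norms + eps)

    # Indices of the k patches with the lowest alignment score.
    write_idx = np.argsort(alignment)[:k]

    # Copy so untouched patches are preserved bit-for-bit.
    new_state = state.copy()
    new_state[write_idx] = candidate[write_idx]
    return new_state, write_idx


# Toy example: three 2-D patches, write budget of one patch.
state = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
candidate = np.array([[1.0, 0.1], [-1.0, 0.0], [2.0, 2.0]])
new_state, write_idx = selective_update(state, candidate, k=1)
```

In this toy example only the second patch, whose candidate is orthogonal to the stored value, is overwritten. The state keeps its shape at every step, which is consistent with the $O(1)$ inference memory claim: the update changes which entries are written, not how many are stored.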

Taxonomy

Topics: Advanced Vision and Imaging · Robotics and Sensor-Based Localization · Computer Graphics and Visualization Techniques