RemixFusion: Residual-based Mixed Representation for Large-scale Online RGB-D Reconstruction

Yuqing Lan; Chenyang Zhu; Shuaifeng Zhi; Jiazhao Zhang; Zhoufeng Wang; Renjiao Yi; Yijie Wang; Kai Xu

arXiv:2507.17594·cs.CV·September 16, 2025

RemixFusion: Residual-based Mixed Representation for Large-scale Online RGB-D Reconstruction

Yuqing Lan, Chenyang Zhu, Shuaifeng Zhi, Jiazhao Zhang, Zhoufeng Wang, Renjiao Yi, Yijie Wang, Kai Xu

PDF

TL;DR

RemixFusion introduces a residual-based mixed scene representation combining explicit TSDF grids with implicit neural residuals, enabling high-quality, large-scale online RGB-D reconstruction with efficient pose estimation.

Contribution

The paper proposes a novel residual-based mixed representation and a pose optimization method that significantly improve large-scale online RGB-D reconstruction quality and efficiency.

Findings

01

Outperforms state-of-the-art methods in mapping accuracy.

02

Achieves better camera pose estimation in large-scale scenes.

03

Efficient online learning with divide-and-conquer scene factorization.

Abstract

The introduction of the neural implicit representation has notably propelled the advancement of online dense reconstruction techniques. Compared to traditional explicit representations, such as TSDF, it improves the mapping completeness and memory efficiency. However, the lack of reconstruction details and the time-consuming learning of neural representations hinder the widespread application of neural-based methods to large-scale online reconstruction. We introduce RemixFusion, a novel residual-based mixed representation for scene reconstruction and camera pose estimation dedicated to high-quality and large-scale online RGB-D reconstruction. In particular, we propose a residual-based map representation comprised of an explicit coarse TSDF grid and an implicit neural module that produces residuals representing fine-grained details to be added to the coarse grid. Such mixed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.