RemixFusion: Residual-based Mixed Representation for Large-scale Online RGB-D Reconstruction
Yuqing Lan, Chenyang Zhu, Shuaifeng Zhi, Jiazhao Zhang, Zhoufeng Wang, Renjiao Yi, Yijie Wang, Kai Xu

TL;DR
RemixFusion introduces a residual-based mixed scene representation combining explicit TSDF grids with implicit neural residuals, enabling high-quality, large-scale online RGB-D reconstruction with efficient pose estimation.
Contribution
The paper proposes a novel residual-based mixed representation and a pose optimization method that significantly improve large-scale online RGB-D reconstruction quality and efficiency.
Findings
Outperforms state-of-the-art methods in mapping accuracy.
Achieves better camera pose estimation in large-scale scenes.
Efficient online learning with divide-and-conquer scene factorization.
Abstract
The introduction of the neural implicit representation has notably propelled the advancement of online dense reconstruction techniques. Compared to traditional explicit representations, such as TSDF, it improves the mapping completeness and memory efficiency. However, the lack of reconstruction details and the time-consuming learning of neural representations hinder the widespread application of neural-based methods to large-scale online reconstruction. We introduce RemixFusion, a novel residual-based mixed representation for scene reconstruction and camera pose estimation dedicated to high-quality and large-scale online RGB-D reconstruction. In particular, we propose a residual-based map representation comprised of an explicit coarse TSDF grid and an implicit neural module that produces residuals representing fine-grained details to be added to the coarse grid. Such mixed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
