ReTR: Modeling Rendering Via Transformer for Generalizable Neural Surface Reconstruction
Yixun Liang, Hao He, Ying-cong Chen

TL;DR
ReTR introduces a transformer-based rendering framework that enhances neural surface reconstruction by modeling complex interactions, leading to improved accuracy and generalization over existing methods.
Contribution
The paper proposes a novel transformer-based rendering process with a learnable meta-ray token, improving surface reconstruction quality and robustness.
Findings
Outperforms state-of-the-art methods in reconstruction quality.
Demonstrates superior generalization across datasets.
Operates effectively within a high-dimensional feature space.
Abstract
Generalizable neural surface reconstruction techniques have attracted great attention in recent years. However, they encounter limitations of low confidence depth distribution and inaccurate surface reasoning due to the oversimplified volume rendering process employed. In this paper, we present Reconstruction TRansformer (ReTR), a novel framework that leverages the transformer architecture to redesign the rendering process, enabling complex render interaction modeling. It introduces a learnable and utilizes the cross-attention mechanism to simulate the interaction of rendering process with sampled points and render the observed color. Meanwhile, by operating within a high-dimensional feature space rather than the color space, ReTR mitigates sensitivity to projected colors in source views. Such improvements result in accurate surface assessment with high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Human Pose and Action Recognition · 3D Shape Modeling and Analysis
