ReTR: Modeling Rendering Via Transformer for Generalizable Neural   Surface Reconstruction

Yixun Liang; Hao He; Ying-cong Chen

arXiv:2305.18832·cs.CV·November 3, 2023·1 cites

ReTR: Modeling Rendering Via Transformer for Generalizable Neural Surface Reconstruction

Yixun Liang, Hao He, Ying-cong Chen

PDF

Open Access 1 Video

TL;DR

ReTR introduces a transformer-based rendering framework that enhances neural surface reconstruction by modeling complex interactions, leading to improved accuracy and generalization over existing methods.

Contribution

The paper proposes a novel transformer-based rendering process with a learnable meta-ray token, improving surface reconstruction quality and robustness.

Findings

01

Outperforms state-of-the-art methods in reconstruction quality.

02

Demonstrates superior generalization across datasets.

03

Operates effectively within a high-dimensional feature space.

Abstract

Generalizable neural surface reconstruction techniques have attracted great attention in recent years. However, they encounter limitations of low confidence depth distribution and inaccurate surface reasoning due to the oversimplified volume rendering process employed. In this paper, we present Reconstruction TRansformer (ReTR), a novel framework that leverages the transformer architecture to redesign the rendering process, enabling complex render interaction modeling. It introduces a learnable $meta-ray token$ and utilizes the cross-attention mechanism to simulate the interaction of rendering process with sampled points and render the observed color. Meanwhile, by operating within a high-dimensional feature space rather than the color space, ReTR mitigates sensitivity to projected colors in source views. Such improvements result in accurate surface assessment with high…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

ReTR: Modeling Rendering Via Transformer for Generalizable Neural Surface Reconstruction· slideslive

Taxonomy

TopicsComputer Graphics and Visualization Techniques · Human Pose and Action Recognition · 3D Shape Modeling and Analysis