Equivariant Light Field Convolution and Transformer
Yinshuang Xu, Jiahui Lei, Kostas Daniilidis

TL;DR
This paper introduces an $SE(3)$-equivariant convolution and transformer framework in ray space, enabling geometric priors to be learned from multiple views for improved 3D reconstruction and view rendering.
Contribution
It proposes a novel $SE(3)$-equivariant neural network architecture in ray space, allowing for robust 3D tasks without transformation augmentation.
Findings
Achieves robustness in roto-translated datasets
Enables equivariant neural rendering and reconstruction
Extends convolution to $SE(3)$-equivariant attention
Abstract
3D reconstruction and novel view rendering can greatly benefit from geometric priors when the input views are not sufficient in terms of coverage and inter-view baselines. Deep learning of geometric priors from 2D images often requires each image to be represented in a canonical frame and the prior to be learned in a given or learned canonical frame. In this paper, given only the relative poses of the cameras, we show how to learn priors from multiple views equivariant to coordinate frame transformations by proposing an -equivariant convolution and transformer in the space of rays in 3D. This enables the creation of a light field that remains equivariant to the choice of coordinate frame. The light field as defined in our work, refers both to the radiance field and the feature field defined on the ray space. We model the ray space, the domain of the light field, as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Medical Image Segmentation Techniques
MethodsConvolution
