TL;DR
AIR-Nets is a novel attention-based architecture for 3D shape reconstruction from point clouds that encodes local and global features, outperforming previous methods especially with sparse data and generalizing well across datasets.
Contribution
Introduces AIR-Nets, the first grid-free, encoder-based approach using attention for locally conditioned implicit 3D shape representation.
Findings
Outperforms previous state-of-the-art methods on ShapeNet.
Excels in sparse data reconstruction scenarios.
Generalizes well to unseen datasets like FAUST.
Abstract
This paper introduces Attentive Implicit Representation Networks (AIR-Nets), a simple, but highly effective architecture for 3D reconstruction from point clouds. Since representing 3D shapes in a local and modular fashion increases generalization and reconstruction quality, AIR-Nets encode an input point cloud into a set of local latent vectors anchored in 3D space, which locally describe the object's geometry, as well as a global latent description, enforcing global consistency. Our model is the first grid-free, encoder-based approach that locally describes an implicit function. The vector attention mechanism from [Zhao et al. 2020] serves as main point cloud processing module, and allows for permutation invariance and translation equivariance. When queried with a 3D coordinate, our decoder gathers information from the global and nearby local latent vectors in order to predict an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
