GARNet: Global-Aware Multi-View 3D Reconstruction Network and the   Cost-Performance Tradeoff

Zhenwei Zhu; Liying Yang; Xuxin Lin; Chaohao Jiang; Ning Li; Lin Yang,; Yanyan Liang

arXiv:2211.02299·cs.CV·May 15, 2023·1 cites

GARNet: Global-Aware Multi-View 3D Reconstruction Network and the Cost-Performance Tradeoff

Zhenwei Zhu, Liying Yang, Xuxin Lin, Chaohao Jiang, Ning Li, Lin Yang,, Yanyan Liang

PDF

Open Access 1 Repo

TL;DR

This paper introduces GARNet, a global-aware multi-view 3D reconstruction network that improves feature fusion using global correlation, with a novel loss and training strategy, outperforming state-of-the-art methods while reducing parameters.

Contribution

The paper proposes a global-aware attention fusion approach, a new loss function, and a dynamic training strategy for multi-view 3D reconstruction, enhancing accuracy and efficiency.

Findings

01

Outperforms existing SOTA methods on ShapeNet

02

Uses fewer parameters than comparable algorithms

03

Provides a view-reduction method balancing cost and performance

Abstract

Deep learning technology has made great progress in multi-view 3D reconstruction tasks. At present, most mainstream solutions establish the mapping between views and shape of an object by assembling the networks of 2D encoder and 3D decoder as the basic structure while they adopt different approaches to obtain aggregation of features from several views. Among them, the methods using attention-based fusion perform better and more stable than the others, however, they still have an obvious shortcoming -- the strong independence of each view during predicting the weights for merging leads to a lack of adaption of the global state. In this paper, we propose a global-aware attention-based fusion approach that builds the correlation between each branch and the global to provide a comprehensive foundation for weights inference. In order to enhance the ability of the network, we introduce a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

GaryZhu1996/GARNet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Human Pose and Action Recognition