Single-view 3D Mesh Reconstruction for Seen and Unseen Categories
Xianghui Yang, Guosheng Lin, Luping Zhou

TL;DR
This paper introduces GenMesh, a two-stage neural network that improves single-view 3D mesh reconstruction, especially for unseen categories, by factorizing the task, capturing shared local geometry, and using multi-view silhouette supervision.
Contribution
The paper proposes a novel end-to-end two-stage network, GenMesh, that enhances generalization to unseen categories by factorizing the reconstruction process and employing local feature sampling and multi-view supervision.
Findings
Outperforms existing methods on ShapeNet and Pix3D datasets.
Significantly improves reconstruction quality for novel objects.
Effective in reducing overfitting and enhancing generalization.
Abstract
Single-view 3D object reconstruction is a fundamental and challenging computer vision task that aims at recovering 3D shapes from single-view RGB images. Most existing deep learning based reconstruction methods are trained and evaluated on the same categories, and they cannot work well when handling objects from novel categories that are not seen during training. Focusing on this issue, this paper tackles Single-view 3D Mesh Reconstruction, to study the model generalization on unseen categories and encourage models to reconstruct objects literally. Specifically, we propose an end-to-end two-stage network, GenMesh, to break the category boundaries in reconstruction. Firstly, we factorize the complicated image-to-mesh mapping into two simpler mappings, i.e., image-to-point mapping and point-to-mesh mapping, while the latter is mainly a geometric problem and less dependent on object…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Human Pose and Action Recognition · Advanced Vision and Imaging
