Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud   Recognition

Xuanyu Yi; Jiajun Deng; Qianru Sun; Xian-Sheng Hua; Joo-Hwee Lim,; Hanwang Zhang

arXiv:2308.09694·cs.CV·August 21, 2023

Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition

Xuanyu Yi, Jiajun Deng, Qianru Sun, Xian-Sheng Hua, Joo-Hwee Lim,, Hanwang Zhang

PDF

Open Access

TL;DR

This paper introduces InvJoint, a training strategy that enhances 2D-3D ensemble collaboration in few-shot point cloud recognition by focusing on hard samples and enforcing invariance between conflicting predictions, leading to improved performance.

Contribution

Proposes InvJoint, a novel invariant training method that improves 2D-3D ensemble collaboration for few-shot point cloud recognition by emphasizing hard samples and prediction invariance.

Findings

01

Outperforms existing methods on ModelNet10/40, ScanObjectNN, Toys4K

02

Improves shape retrieval on ShapeNet-Core

03

Enhances collaborative learning between 2D and 3D models

Abstract

We tackle the data scarcity challenge in few-shot point cloud recognition of 3D objects by using a joint prediction from a conventional 3D model and a well-trained 2D model. Surprisingly, such an ensemble, though seems trivial, has hardly been shown effective in recent 2D-3D models. We find out the crux is the less effective training for the ''joint hard samples'', which have high confidence prediction on different wrong labels, implying that the 2D and 3D models do not collaborate well. To this end, our proposed invariant training strategy, called InvJoint, does not only emphasize the training more on the hard samples, but also seeks the invariance between the conflicting 2D and 3D ambiguous predictions. InvJoint can learn more collaborative 2D and 3D representations for better ensemble. Extensive experiments on 3D shape classification with widely adopted ModelNet10/40, ScanObjectNN…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Industrial Vision Systems and Defect Detection