Self-supervised Learning of Rotation-invariant 3D Point Set Features   using Transformer and its Self-distillation

Takahiko Furuya; Zhoujie Chen; Ryutarou Ohbuchi; Zhenzhong Kuang

arXiv:2308.04725·cs.CV·April 22, 2024

Self-supervised Learning of Rotation-invariant 3D Point Set Features using Transformer and its Self-distillation

Takahiko Furuya, Zhoujie Chen, Ryutarou Ohbuchi, Zhenzhong Kuang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a self-supervised learning framework using Transformer architecture to learn accurate, rotation-invariant features from unlabeled 3D point sets, addressing the challenge of label scarcity in 3D shape analysis.

Contribution

It proposes a novel lightweight DNN with self-attention and self-distillation for rotation-invariant 3D feature learning from unlabeled data, improving over existing supervised methods.

Findings

01

The proposed method learns more accurate rotation-invariant features.

02

Existing rotation-invariant architectures are less effective under self-supervision.

03

Combining multi-crop and cut-mix enhances training diversity.

Abstract

Invariance against rotations of 3D objects is an important property in analyzing 3D point set data. Conventional 3D point set DNNs having rotation invariance typically obtain accurate 3D shape features via supervised learning by using labeled 3D point sets as training samples. However, due to the rapid increase in 3D point set data and the high cost of labeling, a framework to learn rotation-invariant 3D shape features from numerous unlabeled 3D point sets is required. This paper proposes a novel self-supervised learning framework for acquiring accurate and rotation-invariant 3D point set features at object-level. Our proposed lightweight DNN architecture decomposes an input 3D point set into multiple global-scale regions, called tokens, that preserve the spatial layout of partial shapes composing the 3D object. We employ a self-attention mechanism to refine the tokens and aggregate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

takahikof/ript_sdmm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Image Processing and 3D Reconstruction · 3D Shape Modeling and Analysis