Self-Supervised Image-to-Point Distillation via Semantically Tolerant Contrastive Loss

Anas Mahmoud; Jordan S. K. Hu; Tianshu Kuai; Ali Harakeh; Liam Paull; and Steven L. Waslander

arXiv:2301.05709·cs.CV·March 27, 2023

Open Access · 1 Repo

TL;DR

This paper introduces a semantically tolerant contrastive loss and class balancing technique to improve 3D representation learning from images, effectively addressing self-similarity and class imbalance issues in autonomous driving datasets.

Contribution

It proposes a novel contrastive loss that considers semantic similarity and a class-agnostic balanced loss to enhance 2D-to-3D representation learning for perception tasks.

Findings

01

Outperforms state-of-the-art methods in 3D semantic segmentation

02

Improves representation quality across various 2D self-supervised models

03

Effectively mitigates self-similarity and class imbalance problems

Abstract

An effective framework for learning 3D representations for perception tasks is distilling rich self-supervised image features via contrastive learning. However, image-to-point representation learning for autonomous driving datasets faces two main challenges: 1) the abundance of self-similarity, which results in the contrastive losses pushing away semantically similar point and image regions and thus disturbing the local semantic structure of the learned representations, and 2) severe class imbalance as pretraining gets dominated by over-represented classes. We propose to alleviate the self-similarity problem through a novel semantically tolerant image-to-point contrastive loss that takes into consideration the semantic distance between positive and negative image regions to minimize contrasting semantically similar point and image regions. Additionally, we address class imbalance by…
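
The self-similarity idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation (the official code is in the repository listed under Code & Models); it is a simplified InfoNCE-style loss in plain Python. It assumes one paired image-region feature per point-region feature, and it assumes that "semantically similar" negatives are identified by a cosine-similarity threshold between image features. The function name `tolerant_contrastive_loss` and the parameters `temperature` and `sim_threshold` are hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def tolerant_contrastive_loss(point_feats, image_feats,
                              temperature=0.07, sim_threshold=0.9):
    """InfoNCE-style loss that skips negatives resembling the positive.

    point_feats[i] is paired with image_feats[i] (the positive). Another
    image region j is used as a negative only if its image feature is NOT
    too similar to the positive image feature. The thresholding rule is an
    assumption made for this sketch, not the paper's exact formulation.
    """
    n = len(point_feats)
    total = 0.0
    for i in range(n):
        pos_logit = cosine(point_feats[i], image_feats[i]) / temperature
        logits = [pos_logit]
        for j in range(n):
            if j == i:
                continue
            # Tolerance step: drop negatives that look like the positive.
            if cosine(image_feats[i], image_feats[j]) >= sim_threshold:
                continue
            logits.append(cosine(point_feats[i], image_feats[j]) / temperature)
        # Numerically stable log-sum-exp over the kept logits.
        m = max(logits)
        lse = m + math.log(sum(math.exp(x - m) for x in logits))
        total += lse - pos_logit
    return total / n


# Two regions share the same semantics (e.g. two road patches), one differs.
image_feats = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
point_feats = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]

# A threshold above 1.0 excludes nothing, giving a standard contrastive loss.
standard = tolerant_contrastive_loss(point_feats, image_feats, sim_threshold=2.0)
tolerant = tolerant_contrastive_loss(point_feats, image_feats, sim_threshold=0.9)
```

In this toy input the standard loss penalizes the two identical regions for not being pushed apart, while the tolerant variant removes that pair from the denominator and so yields a lower loss for the same features.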

Code & Models

Repositories

TRAILab/ST-SLidR
PyTorch · Official

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · 3D Shape Modeling and Analysis