CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding
Mohamed Afham, Isuru Dissanayake, Dinithi Dissanayake, Amaya, Dharmasiri, Kanchana Thilakarathna, Ranga Rodrigo

TL;DR
CrossPoint introduces a self-supervised cross-modal contrastive learning method that leverages 2D images and 3D point clouds to improve understanding and performance on 3D tasks without requiring manual annotations.
Contribution
It proposes a novel cross-modal contrastive learning framework that aligns 3D point clouds with corresponding 2D images to enhance 3D representation learning.
Findings
Outperforms previous unsupervised methods on 3D classification and segmentation
Effectively learns transferable 3D representations from 2D images
Demonstrates robustness to transformations in point cloud data
Abstract
Manual annotation of large-scale point cloud dataset for varying tasks such as 3D object classification, segmentation and detection is often laborious owing to the irregular structure of point clouds. Self-supervised learning, which operates without any human labeling, is a promising approach to address this issue. We observe in the real world that humans are capable of mapping the visual concepts learnt from 2D images to understand the 3D world. Encouraged by this insight, we propose CrossPoint, a simple cross-modal contrastive learning approach to learn transferable 3D point cloud representations. It enables a 3D-2D correspondence of objects by maximizing agreement between point clouds and the corresponding rendered 2D image in the invariant space, while encouraging invariance to transformations in the point cloud modality. Our joint training objective combines the feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Surveying and Cultural Heritage · Remote Sensing and LiDAR Applications · Optical measurement and interference techniques
MethodsContrastive Learning
