# 3D Local Feature Learning and Analysis on Point Cloud Parts via Momentum Contrast

**Authors:** Xuanmeng Sha, Tomohiro Mashita, Naoya Chiba, Liyun Zhang

PMC · DOI: 10.3390/s26031007 · Sensors (Basel, Switzerland) · 2026-02-03

## TL;DR

This paper introduces a new method for learning 3D local features from point clouds using momentum contrast, improving efficiency and performance in real-world scenarios.

## Contribution

The novel contribution is applying momentum contrastive learning to learn local features from partial point cloud regions, with effective augmentation strategies.

## Key findings

- The proposed method achieves comparable classification accuracy with 16% less training time.
- Approximately 30% of an object's local part is sufficient for effective learning in occluded scenarios.
- Using PointNet++ with MoCo architecture improves transferable local feature learning from point clouds.

## Abstract

Self-supervised contrastive learning has demonstrated remarkable effectiveness in learning visual representations without labeled data, yet its application to 3D local feature learning from point clouds remains underexplored. Existing methods predominantly focus on complete object shapes, neglecting the critical challenge of recognizing partial observations commonly encountered in real-world 3D perception. We propose a momentum contrastive learning framework specifically designed to learn discriminative local features from randomly sampled point cloud regions. By adapting the MoCo architecture with PointNet++ as the feature backbone, our method treats local parts of point cloud as fundamental contrastive learning units, combined with carefully designed augmentation strategies including random dropout and translation. Experiments on ShapeNet demonstrate that our approach effectively learns transferable local features and the empirical observation that approximately 30% object local part represents a practical threshold for effective learning when simulating real-world occlusion scenarios, and achieves comparable downstream classification accuracy while reducing training time by 16%.

## Full-text entities

- **Diseases:** occlusion (MESH:D001157)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12900106/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12900106/full.md

## References

51 references — full list in the complete paper: https://tomesphere.com/paper/PMC12900106/full.md

---
Source: https://tomesphere.com/paper/PMC12900106