A Systematic Survey on Deep Learning Architectures for Point Cloud Classification and Segmentation

Minhas Kamal; Hiranya Garbha Kumar; Balakrishnan Prabhakaran

arXiv:2605.17131·cs.CV·May 19, 2026

A Systematic Survey on Deep Learning Architectures for Point Cloud Classification and Segmentation

Minhas Kamal, Hiranya Garbha Kumar, Balakrishnan Prabhakaran

PDF

TL;DR

This paper systematically reviews deep learning architectures for point cloud classification and segmentation, discussing their design, performance, challenges, and future directions in 3D vision tasks.

Contribution

It provides a comprehensive categorization, performance evaluation, and critical analysis of deep learning models for point cloud tasks, highlighting architectural innovations and limitations.

Findings

01

Performance benchmarks of various architectures are compared.

02

Architectural innovations improve accuracy and efficiency.

03

Open challenges and future research directions are identified.

Abstract

Point cloud stands as the most widely adopted format for representing 3D shapes and scenes due to its simplicity and geometric fidelity. However, its inherent unordered and irregular nature, exacerbated by sensor noise and occlusions, introduces unique challenges for machine learning based methodologies. To combat these issues, diverse strategies have been developed, including converting to a format that has orderliness, extracting local geometry, and permutation-invariant or self-attention-based processing. In this paper, our focus is directed towards deep learning models for three fundamental tasks in 3D vision: point cloud classification, part segmentation, and semantic segmentation. We begin by formally defining point cloud data, followed by an in-depth discussion on its structural characteristics. Then, we categorize notable works based on their backbone structure and evaluate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.