PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies
Guocheng Qian, Yuchen Li, Houwen Peng, Jinjie Mai, Hasan Abed Al Kader, Hammoud, Mohamed Elhoseiny, Bernard Ghanem

TL;DR
This paper revisits PointNet++ by systematically studying training and scaling strategies, leading to significant performance improvements and the development of PointNeXt, a scalable and efficient model that surpasses current state-of-the-art methods in 3D classification and segmentation.
Contribution
The paper introduces improved training strategies for PointNet++, proposes a new scalable architecture called PointNeXt, and demonstrates its superior performance on multiple 3D understanding benchmarks.
Findings
PointNet++ accuracy on ScanObjectNN improved from 77.9% to 86.1%.
PointNeXt achieves 87.7% accuracy on ScanObjectNN, surpassing PointMLP.
PointNeXt attains 74.9% mean IoU on S3DIS, setting a new state-of-the-art.
Abstract
PointNet++ is one of the most influential neural architectures for point cloud understanding. Although the accuracy of PointNet++ has been largely surpassed by recent networks such as PointMLP and Point Transformer, we find that a large portion of the performance gain is due to improved training strategies, i.e. data augmentation and optimization techniques, and increased model sizes rather than architectural innovations. Thus, the full potential of PointNet++ has yet to be explored. In this work, we revisit the classical PointNet++ through a systematic study of model training and scaling strategies, and offer two major contributions. First, we propose a set of improved training strategies that significantly improve PointNet++ performance. For example, we show that, without any change in architecture, the overall accuracy (OA) of PointNet++ on ScanObjectNN object classification can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
Topics3D Surveying and Cultural Heritage · Image Processing and 3D Reconstruction · 3D Shape Modeling and Analysis
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Label Smoothing · Softmax · Byte Pair Encoding · Adam · Dropout · Residual Connection
