The 2nd Place Solution from the 3D Semantic Segmentation Track in the 2024 Waymo Open Dataset Challenge
Qing Wu

TL;DR
This paper presents MixSeg3D, a novel LiDAR data augmentation method combining advanced mixing strategies with a strong segmentation model, leading to improved 3D semantic segmentation performance in autonomous driving.
Contribution
Introduction of MixSeg3D, a new data augmentation approach using LaserMix and PolarMix with MinkUNet, enhancing 3D segmentation accuracy for autonomous vehicle perception.
Findings
Achieved 2nd place in the 2024 Waymo Open Dataset Challenge.
Demonstrated superiority of MixSeg3D over baseline and prior methods.
Validated effectiveness of scene-scale LiDAR data mixing strategies.
Abstract
3D semantic segmentation is one of the most crucial tasks in driving perception. The ability of a learning-based model to accurately perceive dense 3D surroundings often ensures the safe operation of autonomous vehicles. However, existing LiDAR-based 3D semantic segmentation databases consist of sequentially acquired LiDAR scans that are long-tailed and lack training diversity. In this report, we introduce MixSeg3D, a sophisticated combination of the strong point cloud segmentation model with advanced 3D data mixing strategies. Specifically, our approach integrates the MinkUNet family with LaserMix and PolarMix, two scene-scale data augmentation methods that blend LiDAR point clouds along the ego-scene's inclination and azimuth directions. Through empirical experiments, we demonstrate the superiority of MixSeg3D over the baseline and prior arts. Our team achieved 2nd place in the 3D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis
