MDT3D: Multi-Dataset Training for LiDAR 3D Object Detection Generalization
Louis Soum-Fontez, Jean-Emmanuel Deschaud, Fran\c{c}ois Goulette

TL;DR
This paper introduces MDT3D, a multi-dataset training approach that enhances the generalization of LiDAR 3D object detection models across different environments and sensor configurations by leveraging multiple datasets and novel augmentation techniques.
Contribution
The paper proposes a new multi-dataset training framework with label mapping and cross-dataset augmentation to improve 3D object detection robustness across diverse datasets.
Findings
Improved detection accuracy across multiple datasets.
Effective label mapping reduces dataset discrepancies.
Cross-dataset object injection enhances model robustness.
Abstract
Supervised 3D Object Detection models have been displaying increasingly better performance in single-domain cases where the training data comes from the same environment and sensor as the testing data. However, in real-world scenarios data from the target domain may not be available for finetuning or for domain adaptation methods. Indeed, 3D object detection models trained on a source dataset with a specific point distribution have shown difficulties in generalizing to unseen datasets. Therefore, we decided to leverage the information available from several annotated source datasets with our Multi-Dataset Training for 3D Object Detection (MDT3D) method to increase the robustness of 3D object detection models when tested in a new environment with a different sensor configuration. To tackle the labelling gap between datasets, we used a new label mapping based on coarse labels.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
