A workflow for generating synthetic LiDAR datasets in simulation environments
Abhishek Phadke, Shakib Mahmud Dipto, Pratip Rana

TL;DR
This paper introduces a simulation workflow using CoppeliaSim to generate synthetic LiDAR datasets with ground truth, supporting autonomous vehicle perception and sensor security research, while discussing limitations and future enhancements.
Contribution
A novel, automated simulation pipeline for creating synchronized multimodal LiDAR datasets with ground truth, enabling research in perception and security vulnerabilities.
Findings
Generated large-scale synthetic LiDAR point clouds and images
Demonstrated potential for evaluating security threats like spoofing
Discussed limitations and future improvements in realism and scalability
Abstract
This paper presents a simulation workflow for generating synthetic LiDAR datasets to support autonomous vehicle perception, robotics research, and sensor security analysis. Leveraging the CoppeliaSim simulation environment and its Python API, we integrate time-of-flight LiDAR, image sensors, and two dimensional scanners onto a simulated vehicle platform operating within an urban scenario. The workflow automates data capture, storage, and annotation across multiple formats (PCD, PLY, CSV), producing synchronized multimodal datasets with ground truth pose information. We validate the pipeline by generating large-scale point clouds and corresponding RGB and depth imagery. The study examines potential security vulnerabilities in LiDAR data, such as adversarial point injection and spoofing attacks, and demonstrates how synthetic datasets can facilitate the evaluation of defense strategies.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
