A workflow for generating synthetic LiDAR datasets in simulation environments

Abhishek Phadke; Shakib Mahmud Dipto; Pratip Rana

arXiv:2506.17378·cs.RO·June 24, 2025

A workflow for generating synthetic LiDAR datasets in simulation environments

Abhishek Phadke, Shakib Mahmud Dipto, Pratip Rana

PDF

TL;DR

This paper introduces a simulation workflow using CoppeliaSim to generate synthetic LiDAR datasets with ground truth, supporting autonomous vehicle perception and sensor security research, while discussing limitations and future enhancements.

Contribution

A novel, automated simulation pipeline for creating synchronized multimodal LiDAR datasets with ground truth, enabling research in perception and security vulnerabilities.

Findings

01

Generated large-scale synthetic LiDAR point clouds and images

02

Demonstrated potential for evaluating security threats like spoofing

03

Discussed limitations and future improvements in realism and scalability

Abstract

This paper presents a simulation workflow for generating synthetic LiDAR datasets to support autonomous vehicle perception, robotics research, and sensor security analysis. Leveraging the CoppeliaSim simulation environment and its Python API, we integrate time-of-flight LiDAR, image sensors, and two dimensional scanners onto a simulated vehicle platform operating within an urban scenario. The workflow automates data capture, storage, and annotation across multiple formats (PCD, PLY, CSV), producing synchronized multimodal datasets with ground truth pose information. We validate the pipeline by generating large-scale point clouds and corresponding RGB and depth imagery. The study examines potential security vulnerabilities in LiDAR data, such as adversarial point injection and spoofing attacks, and demonstrates how synthetic datasets can facilitate the evaluation of defense strategies.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.