An LLM-driven Scenario Generation Pipeline Using an Extended Scenic DSL for Autonomous Driving Safety Validation

Fida Khandaker Safa; Yupeng Jiang; Xi Zheng

arXiv:2602.20644·cs.SE·February 25, 2026

An LLM-driven Scenario Generation Pipeline Using an Extended Scenic DSL for Autonomous Driving Safety Validation

Fida Khandaker Safa, Yupeng Jiang, Xi Zheng

PDF

Open Access

TL;DR

This paper introduces a scalable pipeline that leverages GPT-4o and an extended Scenic DSL to automatically convert crash reports into executable simulation scenarios for autonomous driving safety testing, improving accuracy and variability capture.

Contribution

It presents a novel intermediate Scenic DSL layer and a pipeline that enhances scenario generation accuracy and scalability compared to prior direct text-to-scenario methods.

Findings

01

100% correctness in environmental and road attributes extraction

02

97-98% accuracy in trajectory extraction

03

Successfully triggered traffic violations in 2,000 simulated scenarios

Abstract

Real-world crash reports, which combine textual summaries and sketches, are valuable for scenario-based testing of autonomous driving systems (ADS). However, current methods cannot effectively translate this multimodal data into precise, executable simulation scenarios, hindering the scalability of ADS safety validation. In this work, we propose a scalable and verifiable pipeline that uses a large language model (GPT-4o mini) and a probabilistic intermediate representation (an Extended Scenic domain-specific language) to automatically extract semantic scenario configurations from crash reports and generate corresponding simulation-ready scenarios. Unlike earlier approaches such as ScenicNL and LCTGen (which generate scenarios directly from text) or TARGET (which uses deterministic mappings from traffic rules), our method introduces an intermediate Scenic DSL layer to separate high-level…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning · Human-Automation Interaction and Safety