Learning Traffic Crashes as Language: Datasets, Benchmarks, and What-if Causal Analyses
Zhiwen Fan, Pu Wang, Yang Zhao, Yibo Zhao, Boris Ivanovic, Zhangyang, Wang, Marco Pavone, Hao Frank Yang

TL;DR
This paper introduces CrashEvent, a large-scale traffic crash dataset, and proposes CrashLLM, a large language model-based approach that improves accident outcome predictions and enables nuanced what-if traffic safety analyses.
Contribution
The paper presents a novel large-scale crash dataset and leverages large language models for detailed accident outcome prediction and interpretability in traffic safety analysis.
Findings
F1 score improved from 34.9% to 53.8% with CrashLLM
CrashLLM predicts crash types, severity, and injuries effectively
Provides a new benchmark and dataset for traffic crash analysis
Abstract
The increasing rate of road accidents worldwide results not only in significant loss of life but also imposes billions financial burdens on societies. Current research in traffic crash frequency modeling and analysis has predominantly approached the problem as classification tasks, focusing mainly on learning-based classification or ensemble learning methods. These approaches often overlook the intricate relationships among the complex infrastructure, environmental, human and contextual factors related to traffic crashes and risky situations. In contrast, we initially propose a large-scale traffic crash language dataset, named CrashEvent, summarizing 19,340 real-world crash reports and incorporating infrastructure data, environmental and traffic textual and visual information in Washington State. Leveraging this rich dataset, we further formulate the crash event feature learning as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques
