Towards Reliable and Interpretable Traffic Crash Pattern Prediction and Safety Interventions Using Customized Large Language Models
Yang Zhao, Pu Wang, Yibo Zhao, Hongru Du, Hao Frank Yang

TL;DR
This paper introduces TrafficSafe, a large language model-based framework that predicts traffic crashes and interprets contributing factors by leveraging multi-modal data, significantly improving prediction accuracy and providing actionable insights for safety interventions.
Contribution
The paper presents a novel LLM-based approach for crash prediction and interpretability, integrating multi-modal data and introducing a feature attribution method for traffic safety analysis.
Findings
42% average improvement in F1-score over baselines
Alcohol-impaired driving is the leading factor in severe crashes
TrafficSafe Attribution identifies pivotal features for model training
Abstract
Predicting crash events is crucial for understanding crash distributions and their contributing factors, thereby enabling the design of proactive traffic safety policy interventions. However, existing methods struggle to interpret the complex interplay among various sources of traffic crash data, including numeric characteristics, textual reports, crash imagery, environmental conditions, and driver behavior records. As a result, they often fail to capture the rich semantic information and intricate interrelationships embedded in these diverse data sources, limiting their ability to identify critical crash risk factors. In this research, we propose TrafficSafe, a framework that adapts LLMs to reframe crash prediction and feature attribution as text-based reasoning. A multi-modal crash dataset including 58,903 real-world reports together with belonged infrastructure, environmental,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Natural Language Processing Techniques · Topic Modeling
