Towards Reliable and Interpretable Traffic Crash Pattern Prediction and Safety Interventions Using Customized Large Language Models

Yang Zhao; Pu Wang; Yibo Zhao; Hongru Du; Hao Frank Yang

arXiv:2505.12545·cs.CL·May 22, 2025

Towards Reliable and Interpretable Traffic Crash Pattern Prediction and Safety Interventions Using Customized Large Language Models

Yang Zhao, Pu Wang, Yibo Zhao, Hongru Du, Hao Frank Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces TrafficSafe, a large language model-based framework that predicts traffic crashes and interprets contributing factors by leveraging multi-modal data, significantly improving prediction accuracy and providing actionable insights for safety interventions.

Contribution

The paper presents a novel LLM-based approach for crash prediction and interpretability, integrating multi-modal data and introducing a feature attribution method for traffic safety analysis.

Findings

01

42% average improvement in F1-score over baselines

02

Alcohol-impaired driving is the leading factor in severe crashes

03

TrafficSafe Attribution identifies pivotal features for model training

Abstract

Predicting crash events is crucial for understanding crash distributions and their contributing factors, thereby enabling the design of proactive traffic safety policy interventions. However, existing methods struggle to interpret the complex interplay among various sources of traffic crash data, including numeric characteristics, textual reports, crash imagery, environmental conditions, and driver behavior records. As a result, they often fail to capture the rich semantic information and intricate interrelationships embedded in these diverse data sources, limiting their ability to identify critical crash risk factors. In this research, we propose TrafficSafe, a framework that adapts LLMs to reframe crash prediction and feature attribution as text-based reasoning. A multi-modal crash dataset including 58,903 real-world reports together with belonged infrastructure, environmental,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Puw242/TrafficSafe
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic Prediction and Management Techniques · Natural Language Processing Techniques · Topic Modeling