Broadening Access to Transportation Safety Data with Generative AI: A Schema-Grounded Framework for Spatial Natural Language Queries

Mahdi Azhdari; Eric J. Gonzales

arXiv:2605.21712·cs.CL·May 22, 2026

Broadening Access to Transportation Safety Data with Generative AI: A Schema-Grounded Framework for Spatial Natural Language Queries

Mahdi Azhdari, Eric J. Gonzales

PDF

1 Repo

TL;DR

This paper introduces a schema-grounded natural language interface leveraging generative AI to make transportation safety data more accessible, reliable, and reproducible for diverse stakeholders.

Contribution

It presents a novel framework that interprets natural language queries into deterministic spatial operations, enhancing data access while maintaining schema integrity.

Findings

01

All queries executed successfully on the Massachusetts database.

02

Validation layer corrected errors in 29% of evaluation queries.

03

The approach improves accessibility and trustworthiness of transportation safety data.

Abstract

Transportation safety analysis requires integrating crash records, roadway attributes, and geospatial data through GIS-based workflows, but access remains uneven across agencies and community stakeholders. Technical prerequisites create a gap between analytical tools central to safety planning and the practitioners able to use them. Local agencies, school committees, and residents may have safety concerns but limited capacity to retrieve, filter, map, and analyze relevant data. Generative AI offers a way to narrow this divide, but its public-sector use raises questions about reliability, reproducibility, and governance. This paper presents a schema-grounded natural language interface for transportation safety analysis, using a large language model (LLM) to interpret user intent while preserving deterministic, reviewable execution against an authoritative database. User queries are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mahdi-azhdari/road-safety-nlq
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.