TL;DR
This paper introduces a schema-grounded natural language interface leveraging generative AI to make transportation safety data more accessible, reliable, and reproducible for diverse stakeholders.
Contribution
It presents a novel framework that interprets natural language queries into deterministic spatial operations, enhancing data access while maintaining schema integrity.
Findings
All queries executed successfully on the Massachusetts database.
Validation layer corrected errors in 29% of evaluation queries.
The approach improves accessibility and trustworthiness of transportation safety data.
Abstract
Transportation safety analysis requires integrating crash records, roadway attributes, and geospatial data through GIS-based workflows, but access remains uneven across agencies and community stakeholders. Technical prerequisites create a gap between analytical tools central to safety planning and the practitioners able to use them. Local agencies, school committees, and residents may have safety concerns but limited capacity to retrieve, filter, map, and analyze relevant data. Generative AI offers a way to narrow this divide, but its public-sector use raises questions about reliability, reproducibility, and governance. This paper presents a schema-grounded natural language interface for transportation safety analysis, using a large language model (LLM) to interpret user intent while preserving deterministic, reviewable execution against an authoritative database. User queries are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
