# Situational perception in distracted driving: an agentic multi-modal LLM framework

**Authors:** Ahmad M. Nazar, Mohamed Y. Selim, Ashraf Gaffar, Daji Qiao

PMC · DOI: 10.3389/frai.2025.1669937 · Frontiers in Artificial Intelligence · 2025-10-15

## TL;DR

This paper introduces a new AI system that helps distracted drivers by providing real-time alerts based on camera and GPS data.

## Contribution

The novel contribution is an agentic multi-modal LLM framework for real-time, context-aware driver alerts during distraction.

## Key findings

- The system achieves 85.7% semantic correctness in driver alerts.
- Average response latency is 1.74 seconds, meeting real-time safety requirements.
- The framework outperforms conventional ML-based systems in synthesizing multi-modal data.

## Abstract

Distracted driving is a significant public safety concern, causing thousands of accidents annually. While most driver assistance systems emphasize distraction detection, they fail to deliver real-time environmental perception and context-aware interventions.

We propose a large language model (LLM)-driven intervention framework that assumes distraction is pre-detected and dynamically integrates camera and GPS inputs to generate verbal driver alerts. The framework employs an agentic design, where specialized tools handle object detection, speed limits, live traffic conditions, and weather data. Structured orchestration ensures information is fused efficiently, balancing accuracy with conciseness to avoid overwhelming the driver.

Evaluation of the system demonstrates high performance, with semantic intervention correctness of 85.7% and an average response latency of 1.74 s. Compared to conventional ML-based driver assistance approaches, our framework effectively synthesizes multi-modal environmental data and produces actionable alerts in real time.

These findings highlight the potential of LLM-driven, multi-modal reasoning for distracted driving intervention. Integrating specialized agents and structured orchestration improves situational awareness, maintains concise communication, and meets real-time safety requirements. This proof-of-concept establishes a pathway for deploying intelligent, AI-driven driver support systems in safety-critical applications.

## Full-text entities

- **Diseases:** hallucination (MESH:D006212), deaths (MESH:D003643), injuries (MESH:D014947), LLM (MESH:D007806), crashes (MESH:C536029)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12568598/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12568598/full.md

## References

19 references — full list in the complete paper: https://tomesphere.com/paper/PMC12568598/full.md

---
Source: https://tomesphere.com/paper/PMC12568598