Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models
Yizhou Huang, Yihua Cheng, Kezhi Wang

TL;DR
This paper presents a framework deploying large language models on edge devices for autonomous driving, enabling efficient scene narration and reasoning with improved performance and response speed.
Contribution
It introduces a novel edge-based LLM deployment framework with a multi-modal prompt strategy for driving behavior narration and reasoning.
Findings
LLMs on edge devices achieve satisfactory response speeds
The multi-modal prompt strategy enhances narration and reasoning performance
Experiments on OpenDV-Youtube dataset show significant performance improvements
Abstract
Deep learning architectures with powerful reasoning capabilities have driven significant advancements in autonomous driving technology. Large language models (LLMs) applied in this field can describe driving scenes and behaviors with a level of accuracy similar to human perception, particularly in visual tasks. Meanwhile, the rapid development of edge computing, with its advantage of proximity to data sources, has made edge devices increasingly important in autonomous driving. Edge devices process data locally, reducing transmission delays and bandwidth usage, and achieving faster response times. In this work, we propose a driving behavior narration and reasoning framework that applies LLMs to edge devices. The framework consists of multiple roadside units, with LLMs deployed on each unit. These roadside units collect road data and communicate via 5G NSR/NR networks. Our experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Advanced Text Analysis Techniques · Web Data Mining and Analysis
