Efficient Driving Behavior Narration and Reasoning on Edge Device Using   Large Language Models

Yizhou Huang; Yihua Cheng; Kezhi Wang

arXiv:2409.20364·cs.AI·October 1, 2024

Efficient Driving Behavior Narration and Reasoning on Edge Device Using Large Language Models

Yizhou Huang, Yihua Cheng, Kezhi Wang

PDF

Open Access

TL;DR

This paper presents a framework deploying large language models on edge devices for autonomous driving, enabling efficient scene narration and reasoning with improved performance and response speed.

Contribution

It introduces a novel edge-based LLM deployment framework with a multi-modal prompt strategy for driving behavior narration and reasoning.

Findings

01

LLMs on edge devices achieve satisfactory response speeds

02

The multi-modal prompt strategy enhances narration and reasoning performance

03

Experiments on OpenDV-Youtube dataset show significant performance improvements

Abstract

Deep learning architectures with powerful reasoning capabilities have driven significant advancements in autonomous driving technology. Large language models (LLMs) applied in this field can describe driving scenes and behaviors with a level of accuracy similar to human perception, particularly in visual tasks. Meanwhile, the rapid development of edge computing, with its advantage of proximity to data sources, has made edge devices increasingly important in autonomous driving. Edge devices process data locally, reducing transmission delays and bandwidth usage, and achieving faster response times. In this work, we propose a driving behavior narration and reasoning framework that applies LLMs to edge devices. The framework consists of multiple roadside units, with LLMs deployed on each unit. These roadside units collect road data and communicate via 5G NSR/NR networks. Our experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic Prediction and Management Techniques · Advanced Text Analysis Techniques · Web Data Mining and Analysis