VLM-Auto: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes
Ziang Guo, Zakhar Yagudin, Artem Lykov, Mikhail Konenkov, Dzmitry, Tsetserukou

TL;DR
VLM-Auto is a novel autonomous driving assistant system that leverages visual language models to understand complex road scenes and adapt driving behaviors, demonstrating high accuracy in simulated and real-world environments.
Contribution
The paper introduces VLM-Auto, a new system integrating visual language models for autonomous driving, along with a dataset for fine-tuning and validation in simulation and real-world scenarios.
Findings
Achieved 97.82% average precision on dataset labels in CARLA simulation.
Achieved 96.97% prediction accuracy in night and gloomy real-world scenes.
Demonstrated effective use of VLM in autonomous driving with low computational resources.
Abstract
Recent research on Large Language Models for autonomous driving shows promise in planning and control. However, high computational demands and hallucinations still challenge accurate trajectory prediction and control signal generation. Deterministic algorithms offer reliability but lack adaptability to complex driving scenarios and struggle with context and uncertainty. To address this problem, we propose VLM-Auto, a novel autonomous driving assistant system to empower the autonomous vehicles with adjustable driving behaviors based on the understanding of road scenes. A pipeline involving the CARLA simulator and Robot Operating System 2 (ROS2) verifying the effectiveness of our system is presented, utilizing a single Nvidia 4090 24G GPU while exploiting the capacity of textual output of the Visual Language Model (VLM). Besides, we also contribute a dataset containing an image set and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Human-Automation Interaction and Safety
