Prospective Role of Foundation Models in Advancing Autonomous Vehicles
Jianhua Wu, Bingzhao Gao, Jincheng Gao, Jianhao Yu, Hongqing Chu,, Qiankun Yu, Xun Gong, Yi Chang, H. Eric Tseng, Hong Chen, Jie Chen

TL;DR
This paper explores how large-scale Foundation Models can significantly improve autonomous vehicle systems by enhancing scene understanding, reasoning, data augmentation, and simulation, thereby advancing safety and reliability.
Contribution
It synthesizes current applications and future trends of Foundation Models in autonomous driving, highlighting their potential to address long-tail distribution issues and improve safety.
Findings
FMs enhance scene understanding and reasoning in autonomous driving.
FMs can generate plausible rare driving scenarios for training.
FMs improve prediction of road user behaviors.
Abstract
With the development of artificial intelligence and breakthroughs in deep learning, large-scale Foundation Models (FMs), such as GPT, Sora, etc., have achieved remarkable results in many fields including natural language processing and computer vision. The application of FMs in autonomous driving holds considerable promise. For example, they can contribute to enhancing scene understanding and reasoning. By pre-training on rich linguistic and visual data, FMs can understand and interpret various elements in a driving scene, and provide cognitive reasoning to give linguistic and action instructions for driving decisions and planning. Furthermore, FMs can augment data based on the understanding of driving scenarios to provide feasible scenes of those rare occurrences in the long tail distribution that are unlikely to be encountered during routine driving and data collection. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Autonomous Vehicle Technology and Safety · Geological Modeling and Analysis
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Layer Normalization · Dense Connections · Weight Decay · Multi-Head Attention · Cosine Annealing · Attention Dropout · Dropout
