GLAD: Grounded Layered Autonomous Driving for Complex Service Tasks
Yan Ding, Cheng Cui, Xiaohan Zhang, and Shiqi Zhang

TL;DR
GLAD is a layered planning framework for autonomous vehicles that integrates user preferences, safety, and motion costs, enabling efficient and safe complex service tasks in urban driving.
Contribution
The paper introduces a novel layered architecture with tight coupling for autonomous driving, incorporating visual grounding and system optimization for complex service requests.
Findings
Outperforms existing baselines in simulations
Efficiently handles complex multi-point service requests
Integrates perceptual learning with layered planning
Abstract
Given the current point-to-point navigation capabilities of autonomous vehicles, researchers are looking into complex service requests that require the vehicles to visit multiple points of interest. In this paper, we develop a layered planning framework, called GLAD, for complex service requests in autonomous urban driving. There are three layers for service-level, behavior-level, and motion-level planning. The layered framework is unique in its tight coupling, where the different layers communicate user preferences, safety estimates, and motion costs for system optimization. GLAD is visually grounded by perceptual learning from a dataset of 13.8k instances collected from driving behaviors. GLAD enables autonomous vehicles to efficiently and safely fulfill complex service requests. Experimental results from abstract and full simulation show that our system outperforms a few competitive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Video Surveillance and Tracking Methods · Multimodal Machine Learning Applications
