Efficient Perception, Planning, and Control Algorithm for Vision-Based Automated Vehicles
Der-Hau Lee

TL;DR
This paper presents an efficient vision-based autonomous vehicle control system using a monocular camera and inexpensive sensors, combining a multi-task neural network with rapid motion planning algorithms for real-time driving.
Contribution
The study introduces a novel framework integrating MTUNet with CILQR and VPC for efficient, map-free autonomous driving using only low-cost sensors and achieving high-speed processing.
Findings
MTUNet achieves 40 FPS for multiple tasks on 228x228 images.
The VPC algorithm reduces steering latency below actuator latency.
The system outperforms traditional methods on curvy roads.
Abstract
Autonomous vehicles have limited computational resources and thus require efficient control systems. The cost and size of sensors have limited the development of self-driving cars. To overcome these restrictions, this study proposes an efficient framework for the operation of vision-based automatic vehicles; the framework requires only a monocular camera and a few inexpensive radars. The proposed algorithm comprises a multi-task UNet (MTUNet) network for extracting image features and constrained iterative linear quadratic regulator (CILQR) and vision predictive control (VPC) modules for rapid motion planning and control. MTUNet is designed to simultaneously solve lane line segmentation, the ego vehicle's heading angle regression, road type classification, and traffic object detection tasks at approximately 40 FPS for 228 x 228 pixel RGB input images. The CILQR controllers then use the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications · Video Surveillance and Tracking Methods
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
