Efficient Perception, Planning, and Control Algorithm for Vision-Based   Automated Vehicles

Der-Hau Lee

arXiv:2209.07042·cs.RO·June 26, 2024

Efficient Perception, Planning, and Control Algorithm for Vision-Based Automated Vehicles

Der-Hau Lee

PDF

Open Access

TL;DR

This paper presents an efficient vision-based autonomous vehicle control system using a monocular camera and inexpensive sensors, combining a multi-task neural network with rapid motion planning algorithms for real-time driving.

Contribution

The study introduces a novel framework integrating MTUNet with CILQR and VPC for efficient, map-free autonomous driving using only low-cost sensors and achieving high-speed processing.

Findings

01

MTUNet achieves 40 FPS for multiple tasks on 228x228 images.

02

The VPC algorithm reduces steering latency below actuator latency.

03

The system outperforms traditional methods on curvy roads.

Abstract

Autonomous vehicles have limited computational resources and thus require efficient control systems. The cost and size of sensors have limited the development of self-driving cars. To overcome these restrictions, this study proposes an efficient framework for the operation of vision-based automatic vehicles; the framework requires only a monocular camera and a few inexpensive radars. The proposed algorithm comprises a multi-task UNet (MTUNet) network for extracting image features and constrained iterative linear quadratic regulator (CILQR) and vision predictive control (VPC) modules for rapid motion planning and control. MTUNet is designed to simultaneously solve lane line segmentation, the ego vehicle's heading angle regression, road type classification, and traffic object detection tasks at approximately 40 FPS for 228 x 228 pixel RGB input images. The CILQR controllers then use the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications · Video Surveillance and Tracking Methods

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings