QuadBEV: An Efficient Quadruple-Task Perception Framework via   Bird's-Eye-View Representation

Yuxin Li; Yiheng Li; Xulei Yang; Mengying Yu; Zihang Huang; Xiaojun; Wu; Chai Kiat Yeo

arXiv:2410.06516·cs.RO·October 10, 2024

QuadBEV: An Efficient Quadruple-Task Perception Framework via Bird's-Eye-View Representation

Yuxin Li, Yiheng Li, Xulei Yang, Mengying Yu, Zihang Huang, Xiaojun, Wu, Chai Kiat Yeo

PDF

Open Access

TL;DR

QuadBEV is an efficient multitask perception framework for autonomous driving that integrates four key tasks into a shared system, reducing computational load and improving real-world applicability.

Contribution

It introduces a shared backbone architecture for four perception tasks, addressing multitask learning challenges and enhancing efficiency for resource-constrained environments.

Findings

01

Reduces redundant computations in perception tasks

02

Demonstrates robustness and effectiveness in experiments

03

Suitable for embedded autonomous driving systems

Abstract

Bird's-Eye-View (BEV) perception has become a vital component of autonomous driving systems due to its ability to integrate multiple sensor inputs into a unified representation, enhancing performance in various downstream tasks. However, the computational demands of BEV models pose challenges for real-world deployment in vehicles with limited resources. To address these limitations, we propose QuadBEV, an efficient multitask perception framework that leverages the shared spatial and contextual information across four key tasks: 3D object detection, lane detection, map segmentation, and occupancy prediction. QuadBEV not only streamlines the integration of these tasks using a shared backbone and task-specific heads but also addresses common multitask learning challenges such as learning rate sensitivity and conflicting task objectives. Our framework reduces redundant computations, thereby…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection