FB-BEV: BEV Representation from Forward-Backward View Transformations

Zhiqi Li; Zhiding Yu; Wenhai Wang; Anima Anandkumar; Tong Lu; Jose M.; Alvarez

arXiv:2308.02236·cs.CV·August 21, 2023·6 cites

FB-BEV: BEV Representation from Forward-Backward View Transformations

Zhiqi Li, Zhiding Yu, Wenhai Wang, Anima Anandkumar, Tong Lu, Jose M., Alvarez

PDF

Open Access 1 Repo

TL;DR

FB-BEV introduces a novel view transformation module that combines forward and backward projections to improve BEV perception, achieving state-of-the-art results on nuScenes dataset.

Contribution

The paper proposes a new forward-backward view transformation module that enhances BEV feature quality by leveraging the strengths of both existing paradigms.

Findings

01

Achieves 62.4% NDS on nuScenes test set.

02

Outperforms existing BEV perception methods.

03

Provides a unified framework for view transformation.

Abstract

View Transformation Module (VTM), where transformations happen between multi-view image features and Bird-Eye-View (BEV) representation, is a crucial step in camera-based BEV perception systems. Currently, the two most prominent VTM paradigms are forward projection and backward projection. Forward projection, represented by Lift-Splat-Shoot, leads to sparsely projected BEV features without post-processing. Backward projection, with BEVFormer being an example, tends to generate false-positive BEV features from incorrect projections due to the lack of utilization on depth. To address the above limitations, we propose a novel forward-backward view transformation module. Our approach compensates for the deficiencies in both existing methods, allowing them to enhance each other to obtain higher quality BEV representations mutually. We instantiate the proposed module with FB-BEV, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nvlabs/fb-bev
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Image Processing Techniques and Applications