LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with   Instance Representation

Weijie Ma; Jingwei Jiang; Yang Yang; Zehui Chen; Hao Chen

arXiv:2411.06173·cs.CV·November 20, 2024

LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation

Weijie Ma, Jingwei Jiang, Yang Yang, Zehui Chen, Hao Chen

PDF

Open Access 1 Repo

TL;DR

LSSInst enhances BEV perception in camera-only 3D detection by integrating instance representations with existing LSS frameworks, improving detail retention and overall accuracy in autonomous driving scenarios.

Contribution

The paper introduces a two-stage detector with an instance adaptor that improves geometric detail preservation in LSS-based BEV perception methods.

Findings

01

Boosts performance on nuScenes benchmark

02

Enhances geometric detail retention in BEV representations

03

Outperforms current LSS-based state-of-the-art methods

Abstract

With the attention gained by camera-only 3D object detection in autonomous driving, methods based on Bird-Eye-View (BEV) representation especially derived from the forward view transformation paradigm, i.e., lift-splat-shoot (LSS), have recently seen significant progress. The BEV representation formulated by the frustum based on depth distribution prediction is ideal for learning the road structure and scene layout from multi-view images. However, to retain computational efficiency, the compressed BEV representation such as in resolution and axis is inevitably weak in retaining the individual geometric details, undermining the methodological generality and applicability. With this in mind, to compensate for the missing details and utilize multi-view geometry constraints, we propose LSSInst, a two-stage object detector incorporating BEV and instance representations in tandem. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

weijiemax/lssinst
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection

MethodsSoftmax · Attention Is All You Need