DecoratingFusion: A LiDAR-Camera Fusion Network with the Combination of   Point-level and Feature-level Fusion

Zixuan Yin; Han Sun; Ningzhong Liu; Huiyu Zhou; Jiaquan Shen

arXiv:2501.00220·cs.CV·January 3, 2025

DecoratingFusion: A LiDAR-Camera Fusion Network with the Combination of Point-level and Feature-level Fusion

Zixuan Yin, Han Sun, Ningzhong Liu, Huiyu Zhou, Jiaquan Shen

PDF

TL;DR

DecoratingFusion introduces a novel LiDAR-camera fusion network that combines point-level and feature-level fusion, utilizing hard associations from calibration matrices to improve 3D detection accuracy in autonomous driving.

Contribution

It innovatively integrates hard point-level associations with feature-level fusion, enhancing interpretability and detection performance over existing methods.

Findings

01

Outperforms state-of-the-art methods on KITTI and Waymo datasets.

02

Effectively combines hard and soft associations for improved 3D detection.

03

Demonstrates superior accuracy in autonomous driving scenarios.

Abstract

Lidars and cameras play essential roles in autonomous driving, offering complementary information for 3D detection. The state-of-the-art fusion methods integrate them at the feature level, but they mostly rely on the learned soft association between point clouds and images, which lacks interpretability and neglects the hard association between them. In this paper, we combine feature-level fusion with point-level fusion, using hard association established by the calibration matrices to guide the generation of object queries. Specifically, in the early fusion stage, we use the 2D CNN features of images to decorate the point cloud data, and employ two independent sparse convolutions to extract the decorated point cloud features. In the mid-level fusion stage, we initialize the queries with a center heatmap and embed the predicted class labels as auxiliary information into the queries,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSparse Convolutions · Heatmap