Kaninfradet3D:A Road-side Camera-LiDAR Fusion 3D Perception Model based on Nonlinear Feature Extraction and Intrinsic Correlation
Pei Liu (1), Nanfang Zheng (2), Yiqun Li (2), Junlan Chen (2), Ziyuan, Pu (2) ((1) Intelligent Transportation Thrust, Systems Hub, The Hong Kong, University of Science, Technology (Guangzhou), (2) Transportation,, Southeast University)

TL;DR
Kaninfradet3D introduces a novel roadside 3D perception model that leverages nonlinear feature extraction and intrinsic correlation through Kolmogorov-Arnold Networks to improve fusion of camera and LiDAR data.
Contribution
The paper proposes an optimized feature extraction and fusion framework using KAN layers and cross-attention, enhancing roadside perception accuracy.
Findings
+9.87 mAP improvement on TUMTraf Intersection Dataset
+10.64 mAP improvement on TUMTraf Intersection Dataset (second viewpoint)
+1.40 mAP improvement on TUMTraf V2X dataset
Abstract
With the development of AI-assisted driving, numerous methods have emerged for ego-vehicle 3D perception tasks, but there has been limited research on roadside perception. With its ability to provide a global view and a broader sensing range, the roadside perspective is worth developing. LiDAR provides precise three-dimensional spatial information, while cameras offer semantic information. These two modalities are complementary in 3D detection. However, adding camera data does not increase accuracy in some studies since the information extraction and fusion procedure is not sufficiently reliable. Recently, Kolmogorov-Arnold Networks (KANs) have been proposed as replacements for MLPs, which are better suited for high-dimensional, complex data. Both the camera and the LiDAR provide high-dimensional information, and employing KANs should enhance the extraction of valuable features to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote Sensing and LiDAR Applications · Advanced Neural Network Applications · Robotics and Sensor-Based Localization
Methods+ ( 1 ) ⟷ 805 ⟷ ( 330 ) ⟷ 4056|How do I file a complaint with Expedia?
