SpikeSMOKE: Spiking Neural Networks for Monocular 3D Object Detection with Cross-Scale Gated Coding
Xuemei Chen, Huamin Wang, Jing Peng, Hangchi Shen, Shukai Duan, Shiping Wen, Tingwen Huang

TL;DR
This paper introduces SpikeSMOKE, a low-power spiking neural network architecture for monocular 3D object detection, featuring a novel Cross-Scale Gating Coding mechanism to enhance feature representation and reduce energy consumption.
Contribution
The paper proposes SpikeSMOKE with a new CSGC mechanism and a lightweight residual block, advancing low-power 3D detection with improved accuracy and efficiency over existing models.
Findings
SpikeSMOKE achieves higher AP scores on KITTI dataset.
The CSGC mechanism improves feature representation.
SpikeSMOKE-L reduces parameters by 3x and computation by 10x.
Abstract
With the wide application of 3D object detection in some fields such as autonomous driving, its energy consumption is constantly increasing, making the research on low-power consumption alternatives a key research area. The spiking neural networks (SNNs), possessing low-power consumption characteristics, offer a novel solution for this research. Consequently, we apply SNNs to monocular 3D object detection and propose the SpikeSMOKE architecture, which represents a new attempt at low-power monocular 3D object detection. It's well known that the discrete signals of SNNs can lead to information loss compared to artificial neural networks (ANNs), which limits their feature representation capabilities. To solve this problem, inspired by the synaptic filtering mechanism of biological neurons, we propose a new Cross-Scale Gating Coding Mechanism (CSGC), which can enhance feature representation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsResidual Connection · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Residual Block
