Temporal-Aware Spiking Transformer Hashing Based on 3D-DWT
Zihao Mei, Jianhao Li, Bolin Zhang, Chong Wang, Lijun Guo, Guoqi Li, Jiangbo Qian

TL;DR
This paper introduces Spikinghash, a novel energy-efficient supervised hashing method using spiking neural networks, hierarchical 3D-DWT, and self-attention to improve dynamic vision sensor data retrieval.
Contribution
It presents a hierarchical lightweight spiking neural network with 3D-DWT and self-attention for effective, low-energy data hashing and a new dynamic soft similarity loss for improved retrieval accuracy.
Findings
Achieves state-of-the-art retrieval performance on multiple datasets.
Demonstrates low energy consumption and fewer parameters compared to existing methods.
Effectively captures spatiotemporal features using 3D-DWT and self-attention.
Abstract
With the rapid growth of dynamic vision sensor (DVS) data, constructing a low-energy, efficient data retrieval system has become an urgent task. Hash learning is one of the most important retrieval technologies which can keep the distance between hash codes consistent with the distance between DVS data. As spiking neural networks (SNNs) can encode information through spikes, they demonstrate great potential in promoting energy efficiency. Based on the binary characteristics of SNNs, we first propose a novel supervised hashing method named Spikinghash with a hierarchical lightweight structure. Spiking WaveMixer (SWM) is deployed in shallow layers, utilizing a multilevel 3D discrete wavelet transform (3D-DWT) to decouple spatiotemporal features into various low-frequency and high frequency components, and then employing efficient spectral feature fusion. SWM can effectively capture the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
