Spiking Neural Networks with Temporal Attention-Guided Adaptive Fusion for imbalanced Multi-modal Learning
Jiangrong Shen, Yulin Xie, Qi Xu, Gang Pan, Huajin Tang, Badong Chen

TL;DR
This paper introduces a novel temporal attention-guided adaptive fusion framework for multimodal spiking neural networks, improving handling of modality imbalance and temporal misalignment, and achieving state-of-the-art results with energy efficiency.
Contribution
The paper proposes a dynamic fusion module and a balanced loss function for multimodal SNNs, enabling hierarchical temporal integration and improved convergence across modalities.
Findings
Achieved state-of-the-art accuracy on multiple datasets.
Enhanced energy efficiency in multimodal SNNs.
Faster convergence and better temporal alignment than baseline models.
Abstract
Multimodal spiking neural networks (SNNs) hold significant potential for energy-efficient sensory processing but face critical challenges in modality imbalance and temporal misalignment. Current approaches suffer from uncoordinated convergence speeds across modalities and static fusion mechanisms that ignore time-varying cross-modal interactions. We propose the temporal attention-guided adaptive fusion framework for multimodal SNNs with two synergistic innovations: 1) The Temporal Attention-guided Adaptive Fusion (TAAF) module that dynamically assigns importance scores to fused spiking features at each timestep, enabling hierarchical integration of temporally heterogeneous spike-based features; 2) The temporal adaptive balanced fusion loss that modulates learning rates per modality based on the above attention scores, preventing dominant modalities from monopolizing optimization. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · EEG and Brain-Computer Interfaces · Speech and Audio Processing
MethodsSoftmax · Attention Is All You Need
