S$^2$M-Former: Spiking Symmetric Mixing Branchformer for Brain Auditory Attention Detection

Jiaqi Wang; Zhengyu Ma; Xiongri Shen; Chenlin Zhou; Leilei Zhao; Han Zhang; Yi Zhong; Siqi Cai; Zhenxi Song; Zhiguo Zhang

arXiv:2508.05164·cs.LG·November 12, 2025

S$^2$M-Former: Spiking Symmetric Mixing Branchformer for Brain Auditory Attention Detection

Jiaqi Wang, Zhengyu Ma, Xiongri Shen, Chenlin Zhou, Leilei Zhao, Han Zhang, Yi Zhong, Siqi Cai, Zhenxi Song, Zhiguo Zhang

PDF

TL;DR

S$^2$M-Former introduces a brain-inspired spiking neural network architecture for auditory attention detection, achieving high accuracy with significantly reduced power consumption and parameters, suitable for neuro-steered hearing devices.

Contribution

The paper presents a novel spiking symmetric mixing framework with lightweight token sequences, enhancing EEG feature learning while reducing energy use and parameters.

Findings

01

Achieves state-of-the-art accuracy on three AAD benchmarks.

02

Reduces power consumption by 5.8× compared to recent ANN methods.

03

Lowers model parameters by 14.7×, improving efficiency.

Abstract

Auditory attention detection (AAD) aims to decode listeners' focus in complex auditory environments from electroencephalography (EEG) recordings, which is crucial for developing neuro-steered hearing devices. Despite recent advancements, EEG-based AAD remains hindered by the absence of synergistic frameworks that can fully leverage complementary EEG features under energy-efficiency constraints. We propose S $^{2}$ M-Former, a novel spiking symmetric mixing framework to address this limitation through two key innovations: i) Presenting a spike-driven symmetric architecture composed of parallel spatial and frequency branches with mirrored modular design, leveraging biologically plausible token-channel mixers to enhance complementary learning across branches; ii) Introducing lightweight 1D token sequences to replace conventional 3D operations, reducing parameters by 14.7 $\times$ . The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.