SWIM: Short-Window CNN Integrated with Mamba for EEG-Based Auditory   Spatial Attention Decoding

Ziyang Zhang; Andrew Thwaites; Alexandra Woolgar; Brian Moore; Chao; Zhang

arXiv:2409.19884·eess.AS·November 28, 2024

SWIM: Short-Window CNN Integrated with Mamba for EEG-Based Auditory Spatial Attention Decoding

Ziyang Zhang, Andrew Thwaites, Alexandra Woolgar, Brian Moore, Chao, Zhang

PDF

Open Access 1 Repo

TL;DR

SWIM is a novel EEG-based auditory spatial attention decoding model combining short-window CNN and sequence modeling, achieving state-of-the-art accuracy without speech envelopes.

Contribution

The paper introduces SWIM, a new model integrating a short-window CNN with Mamba for improved EEG-based auditory attention decoding, surpassing previous methods.

Findings

01

Achieved 86.2% accuracy in attention decoding

02

Reduced classification errors by 31%

03

Outperformed previous state-of-the-art results

Abstract

In complex auditory environments, the human auditory system possesses the remarkable ability to focus on a specific speaker while disregarding others. In this study, a new model named SWIM, a short-window convolution neural network (CNN) integrated with Mamba, is proposed for identifying the locus of auditory attention (left or right) from electroencephalography (EEG) signals without relying on speech envelopes. SWIM consists of two parts. The first is a short-window CNN (SW $_{CNN}$ ), which acts as a short-term EEG feature extractor and achieves a final accuracy of 84.9% in the leave-one-speaker-out setup on the widely used KUL dataset. This improvement is due to the use of an improved CNN structure, data augmentation, multitask training, and model combination. The second part, Mamba, is a sequence model first applied to auditory spatial attention decoding to leverage the long-term…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

windowso/swim-asad
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBlind Source Separation Techniques · Image and Signal Denoising Methods · Neural Networks and Applications

MethodsSoftmax · Attention Is All You Need · Focus · Convolution · Mamba: Linear-Time Sequence Modeling with Selective State Spaces