Sample-level EEG-based Selective Auditory Attention Decoding with Markov Switching Models
Yuanyuan Yao, Simon Geirnaert, Tinne Tuytelaars, Alexander Bertrand

TL;DR
This paper introduces a novel Markov switching model that integrates attention decoding and smoothing for EEG signals, enabling faster and more accurate sample-level detection of auditory attention switches.
Contribution
It presents a unified probabilistic framework for attention decoding and smoothing, improving temporal resolution without sacrificing accuracy.
Findings
Achieves comparable accuracy to HMM post-processing.
Enables faster detection of attention switches.
Provides a sample-level decoding approach.
Abstract
Selective auditory attention decoding aims to identify the speaker of interest from listeners' neural signals, such as electroencephalography (EEG), in the presence of multiple concurrent speakers. Most existing methods operate at the window level, facing a trade-off between temporal resolution and decoding accuracy. Recent work has shown that hidden Markov model (HMM)-based post-processing can smooth window-level decoder outputs to improve this trade-off. Instead of using a separate smoothing step, we propose to integrate the decoding and smoothing components into a single probabilistic framework using a Markov switching model (MSM). It directly models the relationship between the EEG and speech envelopes under each attention state while incorporating the temporal dynamics of attention. This formulation enables sample-level attention decoding, with model parameters and attention states…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Emotion and Mood Recognition · Speech and Audio Processing
