Post-processing of EEG-based Auditory Attention Decoding Decisions via Hidden Markov Models

Nicolas Heintz; Tom Francart; Alexander Bertrand

arXiv:2506.24024·eess.SP·July 1, 2025

Post-processing of EEG-based Auditory Attention Decoding Decisions via Hidden Markov Models

Nicolas Heintz, Tom Francart, Alexander Bertrand

PDF

Open Access

TL;DR

This paper introduces a hidden Markov model to enhance EEG-based auditory attention decoding, significantly improving accuracy and responsiveness in both real-time and offline scenarios by modeling attention stability.

Contribution

The novel integration of a hidden Markov model with existing AAD algorithms to better capture attention dynamics and improve decoding performance.

Findings

01

HMM significantly improves AAD accuracy in real-time and offline settings.

02

HMM outperforms existing postprocessing methods in accuracy and responsiveness.

03

Performance depends on window length, switching frequency, and initial AAD accuracy.

Abstract

Auditory attention decoding (AAD) algorithms exploit brain signals, such as electroencephalography (EEG), to identify which speaker a listener is focusing on in a multi-speaker environment. While state-of-the-art AAD algorithms can identify the attended speaker on short time windows, their predictions are often too inaccurate for practical use. In this work, we propose augmenting AAD with a hidden Markov model (HMM) that models the temporal structure of attention. More specifically, the HMM relies on the fact that a subject is much less likely to switch attention than to keep attending the same speaker at any moment in time. We show how a HMM can significantly improve existing AAD algorithms in both causal (real-time) and non-causal (offline) settings. We further demonstrate that HMMs outperform existing postprocessing approaches in both accuracy and responsiveness, and explore how…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEEG and Brain-Computer Interfaces · Emotion and Mood Recognition · Speech and Audio Processing