Relating EEG to continuous speech using deep neural networks: a review
Corentin Puffay, Bernd Accou, Lies Bollens, Mohammad Jalilpour Monesi,, Jonas Vanthornhout, Hugo Van hamme, Tom Francart

TL;DR
This review paper discusses deep learning approaches to relate EEG signals to continuous speech, highlighting methodological issues and proposing standards for model evaluation in this emerging research area.
Contribution
It provides a comprehensive review of 29 studies on deep learning for EEG-speech relation, identifying common pitfalls and emphasizing the need for standardized benchmarks.
Findings
Identified biases in cross-validation and data leakage issues.
Highlighted the lack of standardized datasets and evaluation metrics.
Emphasized the importance of good practices for model analysis.
Abstract
Objective. When a person listens to continuous speech, a corresponding response is elicited in the brain and can be recorded using electroencephalography (EEG). Linear models are presently used to relate the EEG recording to the corresponding speech signal. The ability of linear models to find a mapping between these two signals is used as a measure of neural tracking of speech. Such models are limited as they assume linearity in the EEG-speech relationship, which omits the nonlinear dynamics of the brain. As an alternative, deep learning models have recently been used to relate EEG to continuous speech. Approach. This paper reviews and comments on deep-learning-based studies that relate EEG to continuous speech in single- or multiple-speakers paradigms. We point out recurrent methodological pitfalls and the need for a standard benchmark of model analysis. Main results. We gathered 29…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Blind Source Separation Techniques · Neural dynamics and brain function
