Relating the fundamental frequency of speech with EEG using a dilated convolutional network
Corentin Puffay, Jana Van Canneyt, Jonas Vanthornhout, Hugo Van Hamme,, Tom Francart

TL;DR
This paper introduces a dilated convolutional neural network model that relates speech features, including fundamental frequency and speech envelope, to EEG signals, demonstrating improved neural tracking and potential for hearing diagnosis.
Contribution
The study presents a novel nonlinear dilated convolutional model that jointly analyzes speech envelope and fundamental frequency to better relate speech features to EEG signals.
Findings
Combining f0 and speech envelope improves EEG prediction accuracy.
The model generalizes well to unseen subjects.
Neural tracking of f0 is evidenced by the model.
Abstract
To investigate how speech is processed in the brain, we can model the relation between features of a natural speech signal and the corresponding recorded electroencephalogram (EEG). Usually, linear models are used in regression tasks. Either EEG is predicted, or speech is reconstructed, and the correlation between predicted and actual signal is used to measure the brain's decoding ability. However, given the nonlinear nature of the brain, the modeling ability of linear models is limited. Recent studies introduced nonlinear models to relate the speech envelope to EEG. We set out to include other features of speech that are not coded in the envelope, notably the fundamental frequency of the voice (f0). F0 is a higher-frequency feature primarily coded at the brainstem to midbrain level. We present a dilated-convolutional model to provide evidence of neural tracking of the f0. We show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Blind Source Separation Techniques · Neural dynamics and brain function
