Extracting Different Levels of Speech Information from EEG Using an   LSTM-Based Model

Mohammad Jalilpour Monesi; Bernd Accou; Tom Francart; Hugo Van Hamme

arXiv:2106.09622·eess.AS·June 18, 2021

Extracting Different Levels of Speech Information from EEG Using an LSTM-Based Model

Mohammad Jalilpour Monesi, Bernd Accou, Tom Francart, Hugo Van Hamme

PDF

2 Repos

TL;DR

This paper investigates how an LSTM-based neural network decodes various levels of speech information from EEG signals, revealing that it primarily utilizes features like silences, intensity, and phonetic classes, with mel spectrograms providing the best performance.

Contribution

The study analyzes the specific speech features that an LSTM model uses to decode EEG signals, highlighting the importance of certain features like mel spectrograms for accurate speech information extraction.

Findings

01

The model exploits silences, intensity, and broad phonetic classes from EEG.

02

Mel spectrogram features yield the highest decoding accuracy at 84%.

03

Different speech features contribute variably to EEG-based speech decoding.

Abstract

Decoding the speech signal that a person is listening to from the human brain via electroencephalography (EEG) can help us understand how our auditory system works. Linear models have been used to reconstruct the EEG from speech or vice versa. Recently, Artificial Neural Networks (ANNs) such as Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) based architectures have outperformed linear models in modeling the relation between EEG and speech. Before attempting to use these models in real-world applications such as hearing tests or (second) language comprehension assessment we need to know what level of speech information is being utilized by these models. In this study, we aim to analyze the performance of an LSTM-based model using different levels of speech features. The task of the model is to determine which of two given speech segments is matched with the recorded…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.