Learning Subject-Invariant Representations from Speech-Evoked EEG Using   Variational Autoencoders

Lies Bollens; Tom Francart; Hugo Van Hamme

arXiv:2207.00323·eess.AS·July 25, 2022

Learning Subject-Invariant Representations from Speech-Evoked EEG Using Variational Autoencoders

Lies Bollens, Tom Francart, Hugo Van Hamme

PDF

TL;DR

This paper introduces a novel variational autoencoder approach to extract subject-invariant features from speech-evoked EEG data, enhancing generalization and classification accuracy across subjects.

Contribution

It adapts factorized hierarchical variational autoencoders to disentangle subject and content features in EEG, improving cross-subject speech processing.

Findings

01

Subject accuracy reaches 98.96% on subject latent space.

02

Content classification accuracy reaches 62.91%.

03

Disentangled representations improve EEG-based speech understanding.

Abstract

The electroencephalogram (EEG) is a powerful method to understand how the brain processes speech. Linear models have recently been replaced for this purpose with deep neural networks and yield promising results. In related EEG classification fields, it is shown that explicitly modeling subject-invariant features improves generalization of models across subjects and benefits classification accuracy. In this work, we adapt factorized hierarchical variational autoencoders to exploit parallel EEG recordings of the same stimuli. We model EEG into two disentangled latent spaces. Subject accuracy reaches 98.96% and 1.60% on respectively the subject and content latent space, whereas binary content classification experiments reach an accuracy of 51.51% and 62.91% on respectively the subject and content latent space.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.