Teaching Wav2Vec2 the Language of the Brain
Tobias Fiedler, Leon Hermann, Florian M\"uller, Sarel Cohen, Peter, Chin, Tobias Friedrich, Eilon Vaadia

TL;DR
This paper demonstrates that pre-trained speech recognition models like Wav2Vec2 can be adapted to decode spoken language from brain activity, significantly improving accuracy over training from scratch or freezing the model.
Contribution
It introduces a transfer learning approach that leverages Wav2Vec2's pre-trained weights for brain data decoding, showing improved performance in speech-to-brain decoding tasks.
Findings
Full fine-tuning with pre-trained weights achieves CER of 18.54%.
Transfer learning from speech recognition models enhances brain decoding accuracy.
Pre-trained models outperform training from scratch or freezing the model.
Abstract
The decoding of continuously spoken speech from neuronal activity has the potential to become an important clinical solution for paralyzed patients. Deep Learning Brain Computer Interfaces (BCIs) have recently successfully mapped neuronal activity to text contents in subjects who attempted to formulate speech. However, only small BCI datasets are available. In contrast, labeled data and pre-trained models for the closely related task of speech recognition from audio are widely available. One such model is Wav2Vec2 which has been trained in a self-supervised fashion to create meaningful representations of speech audio data. In this study, we show that patterns learned by Wav2Vec2 are transferable to brain data. Specifically, we replace its audio feature extractor with an untrained Brain Feature Extractor (BFE) model. We then execute full fine-tuning with pre-trained weights for Wav2Vec2,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems · Educational Tools and Methods
