TL;DR
This study demonstrates that MEG signals can decode phonetic information with high accuracy during speech production, highlighting the importance of neural oscillations in speech-related brain activity and informing brain-computer interface development.
Contribution
It introduces a comparative analysis of machine learning models for phoneme decoding from MEG signals during speech production and perception, emphasizing the significance of low-frequency oscillations.
Findings
Higher decoding accuracy during speech production (76.6%) than passive listening (~51%)
Elastic Net classifier outperformed neural networks in decoding tasks
Delta and Theta frequency bands significantly contributed to phoneme decoding
Abstract
Understanding the neural mechanisms underlying speech production is essential for both advancing cognitive neuroscience theory and developing practical communication technologies. In this study, we investigated magnetoencephalography signals to decode phones from brain activity during speech production and perception (passive listening and voice playback) tasks. Using a dataset comprising 17 participants, we performed pairwise phone classification, extending our analysis to 15 phonetic pairs. Multiple machine learning approaches, including regularized linear models and neural network architectures, were compared to determine their effectiveness in decoding phonetic information. Our results demonstrate significantly higher decoding accuracy during speech production (76.6%) compared to passive listening and playback modalities (~51%), emphasizing the richer neural information available…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
