TL;DR
This paper demonstrates that deep learning models significantly improve the decoding of speech from neural signals in the human sensorimotor cortex, revealing hierarchical neural structures and frequency band contributions.
Contribution
The study introduces the application of deep networks for neural data analysis, outperforming linear models in speech prediction and elucidating neural hierarchical structure and frequency band relevance.
Findings
Deep networks outperform baseline models in speech prediction accuracy.
Hierarchical structure in neural data is revealed through model confusions.
High-gamma frequency band contains most relevant information for speech decoding.
Abstract
A fundamental challenge in neuroscience is to understand what structure in the world is represented in spatially distributed patterns of neural activity from multiple single-trial measurements. This is often accomplished by learning a simple, linear transformations between neural features and features of the sensory stimuli or motor task. While successful in some early sensory processing areas, linear mappings are unlikely to be ideal tools for elucidating nonlinear, hierarchical representations of higher-order brain areas during complex tasks, such as the production of speech by humans. Here, we apply deep networks to predict produced speech syllables from cortical surface electric potentials recorded from human sensorimotor cortex. We found that deep networks had higher decoding prediction accuracy compared to baseline models, and also exhibited greater improvements in accuracy with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
