Sound reconstruction from human brain activity via a generative model with brain-like auditory features
Jong-Yun Park, Mitsuaki Tsukamoto, Misato Tanaka, Yukiyasu Kamitani

TL;DR
This study introduces a novel method combining brain decoding and generative audio models to reconstruct arbitrary sounds from human brain activity, capturing perceptual content and generalizing across sound categories.
Contribution
It leverages hierarchical auditory features and an audio transformer to improve sound reconstruction from fMRI data, demonstrating perceptual accuracy and generalization.
Findings
Hierarchical DNN features are better decoded than spectrotemporal features.
Reconstructed sounds capture perceptual content and quality.
Reconstruction reflects attended sounds over unattended ones.
Abstract
The successful reconstruction of perceptual experiences from human brain activity has provided insights into the neural representations of sensory experiences. However, reconstructing arbitrary sounds has been avoided due to the complexity of temporal sequences in sounds and the limited resolution of neuroimaging modalities. To overcome these challenges, leveraging the hierarchical nature of brain auditory processing could provide a path toward reconstructing arbitrary sounds. Previous studies have indicated a hierarchical homology between the human auditory system and deep neural network (DNN) models. Furthermore, advancements in audio-generative models enable to transform compressed representations back into high-resolution sounds. In this study, we introduce a novel sound reconstruction method that combines brain decoding of auditory features with an audio-generative model. Using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural dynamics and brain function · Neuroscience and Music Perception · Hearing Loss and Rehabilitation
