EffortNet: A Deep Learning Framework for Objective Assessment of Speech Enhancement Technologies Using EEG-Based Alpha Oscillations
Ching-Chih Sung, Cheng-Hung Hsin, Yu-Anne Shiah, Bo-Jyun Lin, Yi-Xuan Lai, Chia-Ying Lee, Yu-Te Wang, Borchin Su, and Yu Tsao

TL;DR
EffortNet is a deep learning framework that decodes listening effort from EEG alpha oscillations, enabling personalized assessment of speech enhancement technologies with high accuracy and adaptability.
Contribution
This work introduces EffortNet, combining self-supervised, incremental, and transfer learning to improve EEG-based listening effort classification across individuals.
Findings
Alpha oscillations are valid biomarkers of listening effort.
EffortNet achieves 80.9% accuracy with limited new subject data.
Transformer-enhanced speech produces neural responses closer to clean speech.
Abstract
This paper presents EffortNet, a novel deep learning framework for decoding individual listening effort from electroencephalography (EEG) during speech comprehension. Listening effort represents a significant challenge in speech-hearing research, particularly for aging populations and those with hearing impairment. We collected 64-channel EEG data from 122 participants during speech comprehension under four conditions: clean, noisy, MMSE-enhanced, and Transformer-enhanced speech. Statistical analyses confirmed that alpha oscillations (8-13 Hz) exhibited significantly higher power during noisy speech processing compared to clean or enhanced conditions, confirming their validity as objective biomarkers of listening effort. To address the substantial inter-individual variability in EEG signals, EffortNet integrates three complementary learning paradigms: self-supervised learning to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHearing Loss and Rehabilitation · EEG and Brain-Computer Interfaces · Speech and Audio Processing
