Speech and music source separation for cochlear implant users: front-end and end-to-end approach
Sina Tahmasebi, Waldo Nogueira

TL;DR
This study compares deep learning methods to improve speech and music perception for cochlear implant users in noisy environments.
Contribution
The study evaluates front-end and end-to-end DNN-based source separation approaches for cochlear implant users in speech and music tasks.
Findings
End-to-end DNNs outperformed front-end models in speech understanding tasks.
Front-end models scored higher in music appreciation for cochlear implant users.
Objective metrics and listening experiments were used to assess model performance.
Abstract
A cochlear implant (CI) is a surgically implanted neuroprosthetic device designed to restore auditory perception in individuals with profound sensorineural hearing loss. While CI users generally demonstrate good speech intelligibility in quiet listening environments, their performance significantly declines in the presence of competing sound sources. Moreover, music perception and appreciation remain limited for many CI users. These limitations are largely attributed to the inadequate representation of pitch information, which is critical for both music and speech stream segregation in complex auditory scenes. To address these challenges, source separation techniques have been increasingly employed to enhance target speech and isolate singing voices in music. Previous research has shown that CI users report greater music enjoyment when vocals are enhanced relative to the accompanying…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHearing Loss and Rehabilitation · Speech and Audio Processing · Voice and Speech Disorders
