Speech and music source separation for cochlear implant users: front-end and end-to-end approach

Sina Tahmasebi; Waldo Nogueira

PMC · DOI: 10.3389/fnins.2025.1696669 · January 13, 2026

Open Access

TL;DR

This study compares deep learning methods to improve speech and music perception for cochlear implant users in noisy environments.

Contribution

The study evaluates front-end and end-to-end DNN-based source separation approaches for cochlear implant users in speech and music tasks.

Findings

01. End-to-end DNNs outperformed front-end models in speech understanding tasks.

02. Front-end models scored higher in music appreciation for cochlear implant users.

03. Objective metrics and listening experiments were used to assess model performance.

Abstract

A cochlear implant (CI) is a surgically implanted neuroprosthetic device designed to restore auditory perception in individuals with profound sensorineural hearing loss. While CI users generally demonstrate good speech intelligibility in quiet listening environments, their performance significantly declines in the presence of competing sound sources. Moreover, music perception and appreciation remain limited for many CI users. These limitations are largely attributed to the inadequate representation of pitch information, which is critical for both music and speech stream segregation in complex auditory scenes. To address these challenges, source separation techniques have been increasingly employed to enhance target speech and isolate singing voices in music. Previous research has shown that CI users report greater music enjoyment when vocals are enhanced relative to the accompanying…

Linked entities

Diseases (1): sensorineural hearing loss

Taxonomy

Topics: Hearing Loss and Rehabilitation · Speech and Audio Processing · Voice and Speech Disorders