Towards Automated Single Channel Source Separation using Neural Networks

Arpita Gang; Pravesh Biyani; Akshay Soni

arXiv:1806.08086·eess.AS·June 22, 2018·1 cites

Towards Automated Single Channel Source Separation using Neural Networks

Arpita Gang, Pravesh Biyani, Akshay Soni

PDF

Open Access

TL;DR

This paper introduces a flexible framework for single channel source separation that enhances existing algorithms by focusing on one source at a time, automates hyper-parameter tuning, and demonstrates improved SDR and SAR results.

Contribution

It proposes a generic, scalable framework for single source separation that improves performance and automates hyper-parameter tuning, applicable to any existing SCSS algorithm.

Findings

01

Improved SDR and SAR performance on neural network based separation.

02

Framework scales well with more than two sources.

03

Automates hyper-parameter tuning for practical use.

Abstract

Many applications of single channel source separation (SCSS) including automatic speech recognition (ASR), hearing aids etc. require an estimation of only one source from a mixture of many sources. Treating this special case as a regular SCSS problem where in all constituent sources are given equal priority in terms of reconstruction may result in a suboptimal separation performance. In this paper, we tackle the one source separation problem by suitably modifying the orthodox SCSS framework and focus only on one source at a time. The proposed approach is a generic framework that can be applied to any existing SCSS algorithm, improves performance, and scales well when there are more than two sources in the mixture unlike most existing SCSS methods. Additionally, existing SCSS algorithms rely on fine hyper-parameter tuning hence making them difficult to use in practice. Our framework…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Blind Source Separation Techniques · Advanced Adaptive Filtering Techniques