Multi-Band Multi-Resolution Fully Convolutional Neural Networks for   Singing Voice Separation

Emad M. Grais; Fei Zhao; Mark D. Plumbley

arXiv:1910.09266·cs.SD·October 22, 2019

Multi-Band Multi-Resolution Fully Convolutional Neural Networks for Singing Voice Separation

Emad M. Grais, Fei Zhao, Mark D. Plumbley

PDF

1 Repo

TL;DR

This paper introduces a multi-band multi-resolution convolutional neural network that processes different frequency bands of audio spectrograms with tailored resolutions, improving singing voice separation performance efficiently.

Contribution

It proposes a novel neural network architecture that processes spectrogram frequency bands with different resolutions and filter configurations, enhancing separation accuracy.

Findings

01

Outperforms existing deep neural networks in singing voice separation

02

Uses fewer parameters while achieving better results

03

Effectively captures low and high frequency information separately

Abstract

Deep neural networks with convolutional layers usually process the entire spectrogram of an audio signal with the same time-frequency resolutions, number of filters, and dimensionality reduction scale. According to the constant-Q transform, good features can be extracted from audio signals if the low frequency bands are processed with high frequency resolution filters and the high frequency bands with high time resolution filters. In the spectrogram of a mixture of singing voices and music signals, there is usually more information about the voice in the low frequency bands than the high frequency bands. These raise the need for processing each part of the spectrogram differently. In this paper, we propose a multi-band multi-resolution fully convolutional neural network (MBR-FCN) for singing voice separation. The MBR-FCN processes the frequency bands that have more information about the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sevagh/xumx-slicq
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.