Dilated Convolution with Dilated GRU for Music Source Separation
Jen-Yu Liu, Yi-Hsuan Yang

TL;DR
This paper introduces a novel neural network architecture combining dilated convolutions with a modified GRU, called Dilated GRU, to improve music source separation by capturing wider context efficiently.
Contribution
It proposes the Dilated GRU, a new recurrent unit that enhances context reach and computational efficiency in music source separation models.
Findings
Achieves comparable or better separation quality than state-of-the-art models.
Effectively captures wider context with fewer recurrent steps.
Runs faster due to parallelizable operations.
Abstract
Stacked dilated convolutions used in Wavenet have been shown effective for generating high-quality audios. By replacing pooling/striding with dilation in convolution layers, they can preserve high-resolution information and still reach distant locations. Producing high-resolution predictions is also crucial in music source separation, whose goal is to separate different sound sources while maintaining the quality of the separated sounds. Therefore, this paper investigates using stacked dilated convolutions as the backbone for music source separation. However, while stacked dilated convolutions can reach wider context than standard convolutions, their effective receptive fields are still fixed and may not be wide enough for complex music audio signals. To reach information at remote locations, we propose to combine dilated convolution with a modified version of gated recurrent units…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Music and Audio Processing · Music Technology and Sound Studies
MethodsMixture of Logistic Distributions · Dilated Causal Convolution · Dilated Convolution · WaveNet · Convolution · Gated Recurrent Unit
