Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET)

Junyu Chen; Susmitha Vekkot; Pancham Shukla

arXiv:2309.08684 · eess.AS · March 20, 2024

TL;DR

This paper introduces DTTNet, a lightweight deep learning model for music source separation. It reaches 10.12 dB cSDR on vocals, compared with 10.01 dB reported for Bandsplit RNN (BSRNN), while using 86.7% fewer parameters.

Contribution

The paper presents a novel lightweight architecture, DTTNet, combining dual-path modules and TFC-TDF UNet, improving efficiency while maintaining high separation quality.

Findings

01

DTTNet achieves 10.12 dB cSDR on vocals.

02

DTTNet uses 86.7% fewer parameters than BSRNN.

03

The paper also assesses DTTNet's pattern-specific performance and its generalization to intricate audio patterns.

Abstract

Music source separation (MSS) aims to extract 'vocals', 'drums', 'bass' and 'other' tracks from a piece of mixed music. While deep learning methods have shown impressive results, there is a trend toward larger models. In our paper, we introduce a novel and lightweight architecture called DTTNet, which is based on Dual-Path Module and Time-Frequency Convolutions Time-Distributed Fully-connected UNet (TFC-TDF UNet). DTTNet achieves 10.12 dB cSDR on 'vocals' compared to 10.01 dB reported for Bandsplit RNN (BSRNN) but with 86.7% fewer parameters. We also assess pattern-specific performance and model generalization for intricate audio patterns.
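
The abstract reports its results as cSDR in dB but does not define the metric on this page. As a minimal sketch, assuming cSDR follows the common chunk-level convention (signal-to-distortion ratio computed on fixed-length chunks of a track, then summarized by the median), the idea can be illustrated as below. The function names `sdr_db` and `chunk_sdr_db` are hypothetical, and the plain energy-ratio SDR used here is a simplification; standard evaluation toolkits compute a more elaborate SDR, so numbers from this sketch are not comparable to the paper's.

```python
import math
from statistics import median


def sdr_db(reference, estimate, eps=1e-10):
    """Plain energy-ratio SDR in dB: 10 * log10(||s||^2 / ||s - s_hat||^2).

    Assumption: this simple definition stands in for the metric used in
    the paper, which is not specified on this page.
    """
    signal_energy = sum(r * r for r in reference)
    error_energy = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    # eps keeps the ratio finite for silent or perfectly estimated chunks.
    return 10.0 * math.log10((signal_energy + eps) / (error_energy + eps))


def chunk_sdr_db(reference, estimate, chunk_size):
    """Median SDR over non-overlapping chunks of chunk_size samples.

    Hypothetical helper: a trailing partial chunk is dropped.
    """
    if len(reference) != len(estimate):
        raise ValueError("reference and estimate must have the same length")
    scores = []
    for start in range(0, len(reference) - chunk_size + 1, chunk_size):
        ref_chunk = reference[start:start + chunk_size]
        est_chunk = estimate[start:start + chunk_size]
        scores.append(sdr_db(ref_chunk, est_chunk))
    if not scores:
        raise ValueError("signal is shorter than one chunk")
    return median(scores)


if __name__ == "__main__":
    # Toy signals: the estimate is the reference scaled by 0.9.
    reference = [1.0, -1.0] * 4
    estimate = [0.9, -0.9] * 4
    print(round(chunk_sdr_db(reference, estimate, chunk_size=4), 2))
```

In practice the chunk size would correspond to a fixed duration (for example one second of samples at the track's sample rate), and a dataset-level score would aggregate the per-track values; both choices are assumptions here rather than details taken from the paper.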

Taxonomy

Topics: Speech and Audio Processing · Music and Audio Processing · Speech Recognition and Synthesis