Towards robust music source separation on loud commercial music

Chang-Bin Jeon; Kyogu Lee

arXiv:2208.14355 · cs.SD · August 31, 2022 · 1 citation


TL;DR

This paper investigates how the loudness and compression in modern commercial music cause domain mismatch issues in source separation models, and proposes LimitAug data augmentation to improve robustness across domains.

Contribution

The study introduces out-of-domain datasets mimicking modern music mastering and proposes LimitAug, a novel data augmentation method, to enhance model robustness against domain mismatch.

Findings

01. Performance drops significantly on out-of-domain datasets.

02. LimitAug improves robustness and in-domain performance.

03. Proposed method mitigates effects of loudness and compression.

Abstract

Commercial music today is far louder and has a much more heavily compressed dynamic range than music of the past. Yet these characteristics have not been thoroughly considered in music source separation, resulting in a domain mismatch between the laboratory and the real world. In this paper, we confirm that this domain mismatch negatively affects the performance of music source separation networks. We first create out-of-domain evaluation datasets, musdb-L and XL, by mimicking the music mastering process. Then, we quantitatively verify that the performance of state-of-the-art algorithms deteriorates significantly on our datasets. Lastly, we propose the LimitAug data augmentation method to reduce the domain mismatch, which applies an online limiter during the training data sampling process. We confirmed that it not only alleviates the performance degradation on our…
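The abstract describes LimitAug only as applying an online limiter while training data is sampled. The snippet below is a minimal sketch of that idea under stated assumptions, not the authors' implementation. The function names (`limiter_gain`, `limitaug_sample`), the random boost range, the threshold, and the choice to apply the mixture's gain curve to every stem are all hypothetical. The limiter itself is a deliberately simple peak limiter with instant attack and exponential release.

```python
import numpy as np


def limiter_gain(x, threshold=0.5, release=0.999):
    """Gain curve of a toy peak limiter (instant attack, exponential release).

    x is a mono signal of shape (num_samples,). All parameters are
    illustrative assumptions, not values taken from the paper.
    """
    peak = np.maximum(np.abs(x), 1e-12)
    # Largest gain (at most 1) that keeps each sample within the threshold.
    needed = np.minimum(1.0, threshold / peak)
    gain = np.empty_like(needed)
    g = 1.0
    for n, target in enumerate(needed):
        # Recover slowly toward unity gain, but clamp down at once on peaks.
        g = min(target, 1.0 - release * (1.0 - g))
        gain[n] = g
    return gain


def limitaug_sample(stems, rng, max_boost_db=12.0, threshold=0.5):
    """Build one (mixture, targets) training pair with a limiter applied.

    stems has shape (num_sources, num_samples). A random boost (hypothetical
    range) pushes the mixture into the limiter. The mixture's gain curve is
    then applied to every stem so the targets still sum to the mixture.
    """
    boost = 10.0 ** (rng.uniform(0.0, max_boost_db) / 20.0)
    boosted = stems * boost
    mixture = boosted.sum(axis=0)
    gain = limiter_gain(mixture, threshold)
    return mixture * gain, boosted * gain


rng = np.random.default_rng(0)
stems = rng.standard_normal((4, 2000)) * 0.3  # stand-in for four mono stems
mixture, targets = limitaug_sample(stems, rng)
```

Because the same per-sample gain is applied to the mixture and to each stem, the limited targets remain an exact decomposition of the limited mixture. A real training pipeline would more likely use a production-quality limiter and loudness-based gain targets.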



Taxonomy

Topics: Speech and Audio Processing · Music and Audio Processing · Acoustic Wave Phenomena Research