On loss functions and evaluation metrics for music source separation

Enric Gus\'o; Jordi Pons; Santiago Pascual; Joan Serr\`a

arXiv:2202.07968·cs.SD·February 17, 2022

On loss functions and evaluation metrics for music source separation

Enric Gus\'o, Jordi Pons, Santiago Pascual, Joan Serr\`a

PDF

Open Access

TL;DR

This paper systematically benchmarks various loss functions for music source separation, evaluates their effectiveness as metrics, and highlights the limitations of standard evaluation methods, proposing alternatives based on these losses.

Contribution

It provides a comprehensive comparison of loss functions for music source separation and explores their potential as evaluation metrics, addressing limitations of current standards.

Findings

01

Certain loss functions outperform traditional metrics in separation quality.

02

Some loss-based metrics correlate better with subjective listening tests.

03

Standard SNR metrics can be misleading in specific scenarios.

Abstract

We investigate which loss functions provide better separations via benchmarking an extensive set of those for music source separation. To that end, we first survey the most representative audio source separation losses we identified, to later consistently benchmark them in a controlled experimental setup. We also explore using such losses as evaluation metrics, via cross-correlating them with the results of a subjective test. Based on the observation that the standard signal-to-distortion ratio metric can be misleading in some scenarios, we study alternative evaluation metrics based on the considered losses.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Advanced Adaptive Filtering Techniques · Acoustic Wave Phenomena Research