TL;DR
This paper introduces a unified gradient reweighting method that allows control over the distribution of source separation results, improving robustness, focus on specific sound classes, and convergence speed.
Contribution
A simple, flexible gradient reweighting scheme that biases deep learning models towards desired result distributions in audio source separation tasks.
Findings
Enables control over model robustness and focus on specific sound classes.
Improves convergence speed by emphasizing easier examples.
Balances worst-case and average performance effectively.
Abstract
Recent deep learning approaches have shown great improvement in audio source separation tasks. However, the vast majority of such work is focused on improving average separation performance, often neglecting to examine or control the distribution of the results. In this paper, we propose a simple, unified gradient reweighting scheme, with a lightweight modification to bias the learning process of a model and steer it towards a certain distribution of results. More specifically, we reweight the gradient updates of each batch, using a user-specified probability distribution. We apply this method to various source separation tasks, in order to shift the operating point of the models towards different objectives. We demonstrate different parameterizations of our unified reweighting scheme can be used towards addressing several real-world problems, such as unreliable separation estimates.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
