SDR -- Medium Rare with Fast Computations
Robin Scheibler

TL;DR
This paper introduces a fast, numerically accurate algorithm for bss eval metrics in source separation, significantly reducing computation time while maintaining acceptable accuracy, enabling more efficient neural network training.
Contribution
The paper presents a novel, fast iterative method for computing bss eval metrics based on subspace angles, fixing previous implementation shortcomings and improving efficiency.
Findings
Speed-up of up to two orders of magnitude in metric computation.
Approximate solver maintains acceptable accuracy for most applications.
Longer distortion filters can improve neural network training.
Abstract
We revisit the widely used bss eval metrics for source separation with an eye out for performance. We propose a fast algorithm fixing shortcomings of publicly available implementations. First, we show that the metrics are fully specified by the squared cosine of just two angles between estimate and reference subspaces. Second, large linear systems are involved. However, they are structured, and we apply a fast iterative method based on conjugate gradient descent. The complexity of this step is thus reduced by a factor quadratic in the distortion filter size used in bss eval, usually 512. In experiments, we assess speed and numerical accuracy. Not only is the loss of accuracy due to the approximate solver acceptable for most applications, but the speed-up is up to two orders of magnitude in some, not so extreme, cases. We confirm that our implementation can train neural networks, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Seismic Imaging and Inversion Techniques · Non-Destructive Testing Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
