Does k Matter? k-NN Hubness Analysis for Kernel Additive Modelling Vocal Separation
Delia Fano Yela, Dan Stowell, Mark Sandler

TL;DR
This paper investigates the impact of the parameter k in kernel additive modelling for vocal separation, proposing a graph theory-based method to automatically optimize k using k-NN hubness, leading to improved separation performance.
Contribution
It introduces a novel graph theory-based method to automatically optimize the k parameter in KAM for source separation, specifically vocal separation.
Findings
k-NN hubness effectively guides k selection
Optimized k improves separation quality
Method reduces computational cost
Abstract
Kernel Additive Modelling (KAM) is a framework for source separation aiming to explicitly model inherent properties of sound sources to help with their identification and separation. KAM separates a given source by applying robust statistics on the selection of time-frequency bins obtained through a source-specific kernel, typically the k-NN function. Even though the parameter k appears to be key for a successful separation, little discussion on its influence or optimisation can be found in the literature. Here we propose a novel method, based on graph theory statistics, to automatically optimise in a vocal separation task. We introduce the k-NN hubness as an indicator to find a tailored k at a low computational cost. Subsequently, we evaluate our method in comparison to the common approach to choose k. We further discuss the influence and importance of this parameter with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
