DisCover: Disentangled Music Representation Learning for Cover Song Identification
Jiahao Xun, Shengyu Zhang, Yanting Yang, Jieming Zhu, Liqun Deng, Zhou, Zhao, Zhenhua Dong, Ruiqi Li, Lichao Zhang, Fei Wu

TL;DR
DisCover introduces a novel disentangled music representation learning framework for cover song identification, effectively separating version-specific and invariant features to improve identification accuracy amidst high variances.
Contribution
The paper proposes a new framework with knowledge-guided and adversarial modules to disentangle factors, addressing intra- and inter-version biases in cover song identification.
Findings
DisCover outperforms existing methods in cover song identification accuracy.
Disentanglement significantly improves invariant music representation learning.
The framework effectively blocks intra- and inter-version effects.
Abstract
In the field of music information retrieval (MIR), cover song identification (CSI) is a challenging task that aims to identify cover versions of a query song from a massive collection. Existing works still suffer from high intra-song variances and inter-song correlations, due to the entangled nature of version-specific and version-invariant factors in their modeling. In this work, we set the goal of disentangling version-specific and version-invariant factors, which could make it easier for the model to learn invariant music representations for unseen query songs. We analyze the CSI task in a disentanglement view with the causal graph technique, and identify the intra-version and inter-version effects biasing the invariant learning. To block these effects, we propose the disentangled music representation learning framework (DisCover) for CSI. DisCover consists of two critical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Diverse Musicological Studies · Music Technology and Sound Studies
MethodsKernel Density Matrices
