Leave-One-EquiVariant: Alleviating invariance-related information loss in contrastive music representations
Julien Guinot, Elio Quinton, Gy\"orgy Fazekas

TL;DR
This paper introduces LOEV, a flexible framework for contrastive music representation learning that preserves task-relevant information by selectively controlling invariances, enhancing performance on specific MIR tasks.
Contribution
The paper proposes the LOEV framework and its variant LOEV++, which enable task-adaptive invariance control and disentangled representations in contrastive music learning.
Findings
LOEV improves performance on augmentation-related MIR tasks.
LOEV++ achieves disentangled, attribute-specific representations.
The approach alleviates information loss due to invariance constraints.
Abstract
Contrastive learning has proven effective in self-supervised musical representation learning, particularly for Music Information Retrieval (MIR) tasks. However, reliance on augmentation chains for contrastive view generation and the resulting learnt invariances pose challenges when different downstream tasks require sensitivity to certain musical attributes. To address this, we propose the Leave One EquiVariant (LOEV) framework, which introduces a flexible, task-adaptive approach compared to previous work by selectively preserving information about specific augmentations, allowing the model to maintain task-relevant equivariances. We demonstrate that LOEV alleviates information loss related to learned invariances, improving performance on augmentation related tasks and retrieval without sacrificing general representation quality. Furthermore, we introduce a variant of LOEV, LOEV++,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeuroscience and Music Perception · Music and Audio Processing · Music Technology and Sound Studies
