Less is more: Faster and better music version identification with   embedding distillation

Furkan Yesiler; Joan Serr\`a; Emilia G\'omez

arXiv:2010.03284·cs.SD·October 8, 2020

Less is more: Faster and better music version identification with embedding distillation

Furkan Yesiler, Joan Serr\`a, Emilia G\'omez

PDF

Open Access 1 Repo

TL;DR

This paper introduces data distillation techniques to reduce embedding size in music version identification, achieving smaller, faster, and more accurate systems suitable for real-world applications.

Contribution

It proposes novel and existing distillation methods to significantly compress embeddings while improving accuracy in cover song detection.

Findings

01

Embeddings reduced by 99% in size.

02

Achieved up to 3% accuracy improvement.

03

Enabled real-time retrieval on standard hardware.

Abstract

Version identification systems aim to detect different renditions of the same underlying musical composition (loosely called cover songs). By learning to encode entire recordings into plain vector embeddings, recent systems have made significant progress in bridging the gap between accuracy and scalability, which has been a key challenge for nearly two decades. In this work, we propose to further narrow this gap by employing a set of data distillation techniques that reduce the embedding dimensionality of a pre-trained state-of-the-art model. We compare a wide range of techniques and propose new ones, from classical dimensionality reduction to more sophisticated distillation schemes. With those, we obtain 99% smaller embeddings that, moreover, yield up to a 3% accuracy increase. Such small embeddings can have an important impact in retrieval time, up to the point of making a real-world…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

furkanyesiler/re-move
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing