Large-Scale Music Annotation and Retrieval: Learning to Rank in Joint   Semantic Spaces

Jason Weston; Samy Bengio; Philippe Hamel

arXiv:1105.5196·cs.LG·March 19, 2015·20 cites

Large-Scale Music Annotation and Retrieval: Learning to Rank in Joint Semantic Spaces

Jason Weston, Samy Bengio, Philippe Hamel

PDF

Open Access

TL;DR

This paper introduces a scalable multi-task learning approach that models audio, artist names, and tags in a shared semantic space to improve music annotation and retrieval across large datasets, outperforming baseline methods.

Contribution

It presents a novel scalable method that jointly learns semantic relationships in music data using multi-task learning, capturing meaningful similarities efficiently.

Findings

01

Outperforms baseline methods in accuracy

02

Faster and uses less memory than existing approaches

03

Learns an interpretable semantic space

Abstract

Music prediction tasks range from predicting tags given a song or clip of audio, predicting the name of the artist, or predicting related songs given a song, clip, artist name or tag. That is, we are interested in every semantic relationship between the different musical concepts in our database. In realistically sized databases, the number of songs is measured in the hundreds of thousands or more, and the number of artists in the tens of thousands or more, providing a considerable challenge to standard machine learning techniques. In this work, we propose a method that scales to such datasets which attempts to capture the semantic similarities between the database items by modeling audio, artist names, and tags in a single low-dimensional semantic space. This choice of space is learnt by optimizing the set of prediction tasks of interest jointly using multi-task learning. Our method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Diverse Musicological Studies

Methods1cycle learning rate scheduling policy