Melon Playlist Dataset: a public dataset for audio-based playlist   generation and music tagging

Andres Ferraro; Yuntae Kim; Soohyeon Lee; Biho Kim; Namjun Jo; Semi; Lim; Suyon Lim; Jungtaek Jang; Sehwan Kim; Xavier Serra; Dmitry Bogdanov

arXiv:2102.00201·cs.SD·February 2, 2021

Melon Playlist Dataset: a public dataset for audio-based playlist generation and music tagging

Andres Ferraro, Yuntae Kim, Soohyeon Lee, Biho Kim, Namjun Jo, Semi, Lim, Suyon Lim, Jungtaek Jang, Sehwan Kim, Xavier Serra, Dmitry Bogdanov

PDF

1 Repo

TL;DR

The Melon Playlist Dataset offers a large, publicly available collection of audio representations and annotations from Korean streaming service Melon, enabling research in music tagging, playlist continuation, and representation learning.

Contribution

It introduces a comprehensive, publicly accessible dataset of mel-spectrograms and playlists, facilitating advancements in music information retrieval and addressing cold-start challenges.

Findings

01

Dataset includes 649,091 tracks and 148,826 playlists

02

Supports tasks like auto-tagging and playlist continuation

03

Provides baseline for cold-start music recommendation

Abstract

One of the main limitations in the field of audio signal processing is the lack of large public datasets with audio representations and high-quality annotations due to restrictions of copyrighted commercial music. We present Melon Playlist Dataset, a public dataset of mel-spectrograms for 649,091tracks and 148,826 associated playlists annotated by 30,652 different tags. All the data is gathered from Melon, a popular Korean streaming service. The dataset is suitable for music information retrieval tasks, in particular, auto-tagging and automatic playlist continuation. Even though the latter can be addressed by collaborative filtering approaches, audio provides opportunities for research on track suggestions and building systems resistant to the cold-start problem, for which we provide a baseline. Moreover, the playlists and the annotations included in the Melon Playlist Dataset make it…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

andrebola/icassp2021
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Methodstravel james