TL;DR
The Melon Playlist Dataset offers a large, publicly available collection of audio representations and annotations from Korean streaming service Melon, enabling research in music tagging, playlist continuation, and representation learning.
Contribution
It introduces a comprehensive, publicly accessible dataset of mel-spectrograms and playlists, facilitating advancements in music information retrieval and addressing cold-start challenges.
Findings
Dataset includes 649,091 tracks and 148,826 playlists
Supports tasks like auto-tagging and playlist continuation
Provides baseline for cold-start music recommendation
Abstract
One of the main limitations in the field of audio signal processing is the lack of large public datasets with audio representations and high-quality annotations due to restrictions of copyrighted commercial music. We present Melon Playlist Dataset, a public dataset of mel-spectrograms for 649,091tracks and 148,826 associated playlists annotated by 30,652 different tags. All the data is gathered from Melon, a popular Korean streaming service. The dataset is suitable for music information retrieval tasks, in particular, auto-tagging and automatic playlist continuation. Even though the latter can be addressed by collaborative filtering approaches, audio provides opportunities for research on track suggestions and building systems resistant to the cold-start problem, for which we provide a baseline. Moreover, the playlists and the annotations included in the Melon Playlist Dataset make it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodstravel james
