DiVa: An Iterative Framework to Harvest More Diverse and Valid Labels from User Comments for Music
Hongru Liang (1), Jingyao Liu (1), Yuanxin Xiang (1), Jiachen Du (2),, Lanjun Zhou (2), Shushen Pan (2), Wenqiang Lei (1) ((1) Sichuan, University, (2) Tencent Music Entertainment)

TL;DR
This paper introduces DiVa, an iterative framework that leverages user comments to automatically harvest more diverse and valid music labels, addressing the limitations of existing methods that lack label diversity.
Contribution
DiVa is a novel iterative approach that uses pseudo-labels and a joint score function to extract diverse labels from user comments for music annotation.
Findings
DiVa outperforms state-of-the-art solutions in label diversity.
The framework effectively captures labels missed by gold annotations.
Experimental results demonstrate the superiority of DiVa on a densely annotated dataset.
Abstract
Towards sufficient music searching, it is vital to form a complete set of labels for each song. However, current solutions fail to resolve it as they cannot produce diverse enough mappings to make up for the information missed by the gold labels. Based on the observation that such missing information may already be presented in user comments, we propose to study the automated music labeling in an essential but under-explored setting, where the model is required to harvest more diverse and valid labels from the users' comments given limited gold labels. To this end, we design an iterative framework (DiVa) to harvest more verse and lid labels from user comments for music. The framework makes a classifier able to form complete sets of labels for songs via pseudo-labels inferred from pre-trained classifiers and a novel joint score function. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methodsfail
