Music Similarity Calculation of Individual Instrumental Sounds Using Metric Learning
Yuka Hashizume, Li Li, Tomoki Toda

TL;DR
This paper introduces a novel weakly supervised metric learning approach to calculate music similarity focusing on individual instrumental sounds, enhancing music recommendation systems' flexibility and accuracy.
Contribution
It proposes a new method for learning similarity metrics for individual instrumental sounds without tag information, using triplet loss based on source separation and weak supervision.
Findings
Learned unique similarity metrics for individual instrumental sources.
Using specific instrumental sounds can improve similarity measurement accuracy.
Performance decreases when using separated instrumental sounds.
Abstract
The criteria for measuring music similarity are important for developing a flexible music recommendation system. Some data-driven methods have been proposed to calculate music similarity from only music signals, such as metric learning based on a triplet loss using tag information on each musical piece. However, the resulting music similarity metric usually captures the entire piece of music, i.e., the mixing of various instrumental sound sources, limiting the capability of the music recommendation system, e.g., it is difficult to search for a musical piece containing similar drum sounds. Towards the development of a more flexible music recommendation system, we propose a music similarity calculation method that focuses on individual instrumental sound sources in a musical piece. By fully exploiting the potential of data-driven methods for our proposed method, we employ weakly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies
