Multilingual Music Genre Embeddings for Effective Cross-Lingual Music Item Annotation
Elena V. Epure, Guillaume Salha, Romain Hennequin

TL;DR
This paper introduces a method for cross-lingual music genre annotation using multilingual embeddings, enabling translation of genre tags across languages without needing a parallel corpus, thus improving music information retrieval.
Contribution
The paper proposes a novel approach to multilingual music genre embedding and translation, overcoming the limitations of previous methods restricted to English and lacking parallel corpora.
Findings
Effective cross-lingual genre translation in multiple languages
Outperforms previous baseline in English multi-source translation
Public release of multilingual data and code
Abstract
Annotating music items with music genres is crucial for music recommendation and information retrieval, yet challenging given that music genres are subjective concepts. Recently, in order to explicitly consider this subjectivity, the annotation of music items was modeled as a translation task: predict for a music item its music genres within a target vocabulary or taxonomy (tag system) from a set of music genre tags originating from other tag systems. However, without a parallel corpus, previous solutions could not handle tag systems in other languages, being limited to the English-language only. Here, by learning multilingual music genre embeddings, we enable cross-lingual music genre translation without relying on a parallel corpus. First, we apply compositionality functions on pre-trained word embeddings to represent multi-word tags.Second, we adapt the tag representations to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Topic Modeling · Speech Recognition and Synthesis
