Personalized Dynamic Music Emotion Recognition with Dual-Scale Attention-Based Meta-Learning
Dengming Zhang, Weitao You, Ziheng Liu, Lingyun Sun, Pei Chen

TL;DR
This paper introduces a novel dual-scale attention meta-learning approach for dynamic music emotion recognition that effectively captures long-term dependencies and personalizes emotion prediction with minimal data.
Contribution
It proposes a dual-scale attention transformer and a task construction strategy for personalized emotion recognition, advancing the state-of-the-art in DMER and PDMER.
Findings
Achieves state-of-the-art results in traditional DMER.
Effectively predicts personalized emotions with only one annotation sample.
Improves long-term dependency modeling in music emotion recognition.
Abstract
Dynamic Music Emotion Recognition (DMER) aims to predict the emotion of different moments in music, playing a crucial role in music information retrieval. The existing DMER methods struggle to capture long-term dependencies when dealing with sequence data, which limits their performance. Furthermore, these methods often overlook the influence of individual differences on emotion perception, even though everyone has their own personalized emotional perception in the real world. Motivated by these issues, we explore more effective sequence processing methods and introduce the Personalized DMER (PDMER) problem, which requires models to predict emotions that align with personalized perception. Specifically, we propose a Dual-Scale Attention-Based Meta-Learning (DSAML) method. This method fuses features from a dual-scale feature extractor and captures both short and long-term dependencies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing
MethodsSoftmax · Attention Is All You Need · ALIGN
