Controllable Embedding Transformation for Mood-Guided Music Retrieval
Julia Wilkins, Jaehun Kim, Matthew E. P. Davies, Juan Pablo Bello, Matthew C. McCallum

TL;DR
This paper introduces a novel framework for mood-guided music retrieval that enables controlled transformation of music embeddings to modify mood while preserving other attributes, improving personalization in recommendation systems.
Contribution
It presents a new embedding transformation method guided by mood labels, with a sampling mechanism for proxy targets and a joint training objective, outperforming baselines.
Findings
Strong mood transformation performance demonstrated on two datasets.
Better preservation of genre and instrumentation compared to training-free baselines.
Establishes controllable embedding transformation as a promising paradigm for personalized music retrieval.
Abstract
Music representations are the backbone of modern recommendation systems, powering playlist generation, similarity search, and personalized discovery. Yet most embeddings offer little control for adjusting a single musical attribute, e.g., changing only the mood of a track while preserving its genre or instrumentation. In this work, we address the problem of controllable music retrieval through embedding-based transformation, where the objective is to retrieve songs that remain similar to a seed track but are modified along one chosen dimension. We propose a novel framework for mood-guided music embedding transformation, which learns a mapping from a seed audio embedding to a target embedding guided by mood labels, while preserving other musical attributes. Because mood cannot be directly altered in the seed audio, we introduce a sampling mechanism that retrieves proxy targets to balance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
