MR-FlowDPO: Multi-Reward Direct Preference Optimization for Flow-Matching Text-to-Music Generation

Alon Ziv; Sanyuan Chen; Andros Tjandra; Yossi Adi; Wei-Ning Hsu; Bowen Shi

arXiv:2512.10264·cs.SD·December 16, 2025

TL;DR

MR-FlowDPO is a multi-reward Direct Preference Optimization (DPO) framework for flow-matching text-to-music models. It uses rewards for text alignment, audio production quality, and semantic consistency, both to build preference data and to condition text prompts, with the aim of improving alignment with human preferences.

Contribution

The paper presents MR-FlowDPO, a method that combines multiple musical rewards with Direct Preference Optimization (DPO) to improve the output quality of flow-matching music generation models.

Findings

01. Significantly improves music quality and alignment with human preferences.

02. Outperforms baselines in audio quality, text alignment, and musicality.

03. Enhances rhythmic stability using a novel semantic self-supervised scoring mechanism.

Abstract

A key challenge in music generation models is their lack of direct alignment with human preferences, as music evaluation is inherently subjective and varies widely across individuals. We introduce MR-FlowDPO, a novel approach that enhances flow-matching-based music generation models, a major class of modern music generative models, using Direct Preference Optimization (DPO) with multiple musical rewards. The rewards are crafted to assess music quality across three key dimensions: text alignment, audio production quality, and semantic consistency, utilizing scalable off-the-shelf models for each reward prediction. We employ these rewards in two ways: (i) by constructing preference data for DPO and (ii) by integrating the rewards into text prompting. To address the ambiguity in musicality evaluation, we propose a novel scoring mechanism leveraging semantic self-supervised…
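The two uses of the rewards named in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and not taken from the paper: the names `Sample`, `build_preference_pairs`, and `reward_conditioned_prompt` are hypothetical, the pairing rule (one sample must be at least as good on all three reward axes and better on at least one) is just one plausible way to turn several scores into a chosen/rejected pair, and the bucketed prompt tags assume scores normalized to [0, 1].

```python
from dataclasses import dataclass
from itertools import combinations

# The three reward dimensions listed in the abstract (key names are hypothetical).
REWARD_AXES = ("text_alignment", "production_quality", "semantic_consistency")


@dataclass(frozen=True)
class Sample:
    prompt: str
    audio_id: str
    rewards: dict  # axis name -> float score from some off-the-shelf predictor


def dominates(a, b, margin=0.0):
    """True if `a` is no worse than `b` on every axis and better by more than
    `margin` on at least one (an assumed rule, not the paper's)."""
    no_worse = all(a.rewards[k] >= b.rewards[k] for k in REWARD_AXES)
    better = any(a.rewards[k] - b.rewards[k] > margin for k in REWARD_AXES)
    return no_worse and better


def build_preference_pairs(samples, margin=0.0):
    """Return (prompt, chosen_id, rejected_id) triples among same-prompt samples."""
    by_prompt = {}
    for s in samples:
        by_prompt.setdefault(s.prompt, []).append(s)
    pairs = []
    for prompt, group in by_prompt.items():
        for a, b in combinations(group, 2):
            if dominates(a, b, margin):
                pairs.append((prompt, a.audio_id, b.audio_id))
            elif dominates(b, a, margin):
                pairs.append((prompt, b.audio_id, a.audio_id))
            # Pairs with conflicting rewards are skipped as ambiguous.
    return pairs


def reward_conditioned_prompt(prompt, rewards, n_bins=5):
    """Append coarse reward buckets to the text prompt (scores assumed in [0, 1])."""
    tags = []
    for k in REWARD_AXES:
        clipped = min(max(rewards[k], 0.0), 1.0)
        bucket = min(int(clipped * n_bins), n_bins - 1)
        tags.append(f"{k}={bucket}")
    return f"{prompt} [{', '.join(tags)}]"
```

In this sketch, samples whose rewards disagree (one is better on text alignment, the other on production quality) produce no pair, which trades dataset size for cleaner preference labels. At inference time, a prompt tagged with the highest buckets would ask a model trained on such tagged prompts for high-reward output.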

Taxonomy

Topics: Music Technology and Sound Studies · Music and Audio Processing · Artificial Intelligence in Games