Improving AI-generated music with user-guided training

Vishwa Mohan Singh; Sai Anirudh Aryasomayajula; Ahan Chatterjee; Beste Aydemir; Rifat Mehreen Amin

arXiv:2506.04852·cs.SD·June 6, 2025

TL;DR

This paper introduces a human-in-the-loop training method for AI music generation, using user feedback and genetic algorithms to iteratively improve music quality and personalization.

Contribution

It presents a novel approach combining user ratings and genetic algorithms to fine-tune AI music models based on subjective user preferences.

Findings

01. Average rating increased by 0.2 after the first iteration

02. The second iteration added a further increase of 0.39 in ratings

03. Demonstrates iterative improvement in user satisfaction

Abstract

AI music generation has advanced rapidly, with diffusion and autoregressive models enabling high-fidelity outputs. These tools can alter styles, mix instruments, or isolate individual instruments. Since sound can be visualized as spectrograms, image-generation algorithms can be applied to generate novel music. However, these algorithms are typically trained on fixed datasets, which makes it challenging for them to interpret and respond to user input accurately. This is especially problematic because music is highly subjective and requires a level of personalization that image generation does not provide. In this work, we propose a human-computation approach to gradually improve the performance of these algorithms based on user interactions. The human-computation element involves aggregating and selecting user ratings to use as the loss function for fine-tuning the model. We employ a…
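The summary above describes aggregating user ratings and using them, together with a genetic algorithm, to steer the model. The following is a minimal sketch of one such rating-driven generation step, not the authors' implementation. Everything in it is an assumption made for illustration: each candidate is represented by a hypothetical "genome" (a plain list of floats standing in for whatever the model exposes, such as a latent vector or a parameter subset), fitness is the mean user rating, and the names `aggregate_ratings`, `select_parents`, and `next_generation` are invented here.

```python
import random
from statistics import mean


def aggregate_ratings(ratings_by_candidate):
    """Average each candidate's user ratings; unrated candidates are skipped."""
    return {cid: mean(r) for cid, r in ratings_by_candidate.items() if r}


def select_parents(fitness, k):
    """Return the ids of the k candidates with the highest mean rating."""
    return sorted(fitness, key=fitness.get, reverse=True)[:k]


def crossover(a, b, rng):
    """Uniform crossover: each gene is taken from one parent at random."""
    return [x if rng.random() < 0.5 else y for x, y in zip(a, b)]


def mutate(genome, rng, sigma=0.1):
    """Add small Gaussian noise to every gene (sigma is an assumed value)."""
    return [g + rng.gauss(0.0, sigma) for g in genome]


def next_generation(population, ratings_by_candidate, k=2, seed=0):
    """Build a new population of the same size from user-rated candidates.

    population: dict mapping candidate id -> genome (list of floats).
    ratings_by_candidate: dict mapping candidate id -> list of user ratings.
    """
    rng = random.Random(seed)
    fitness = aggregate_ratings(ratings_by_candidate)
    parents = select_parents(fitness, k)
    if len(parents) < 2:
        raise ValueError("need at least two rated candidates to breed")

    # Elitism: the best-rated candidates survive unchanged.
    new_population = {pid: population[pid] for pid in parents}

    # Fill the remaining slots with mutated offspring of two distinct parents.
    for i in range(len(population) - len(parents)):
        pa, pb = rng.sample(parents, 2)
        child = crossover(population[pa], population[pb], rng)
        new_population[f"child_{i}"] = mutate(child, rng)
    return new_population


# Hypothetical usage: four candidates, one of which received no ratings.
population = {"a": [0.0, 0.0], "b": [1.0, 1.0], "c": [2.0, 2.0], "d": [3.0, 3.0]}
ratings = {"a": [4, 5], "b": [2, 3], "c": [5, 5], "d": []}
new_population = next_generation(population, ratings, k=2, seed=0)
```

In this sketch the two best-rated candidates are kept as they are and the rest of the population is replaced by their offspring. Repeating the step with fresh ratings each round gives the kind of iterative loop the summary refers to.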

Taxonomy

Topics: Music Technology and Sound Studies · Generative Adversarial Networks and Image Synthesis · Music and Audio Processing

Methods: Diffusion