Music Generation using Human-In-The-Loop Reinforcement Learning

Aju Ani Justus

arXiv:2501.15304·cs.SD·January 28, 2025

Music Generation using Human-In-The-Loop Reinforcement Learning

Aju Ani Justus

PDF

Open Access

TL;DR

This paper introduces a real-time music generation system that uses human feedback within a reinforcement learning framework, guided by music theory principles, to iteratively improve compositions.

Contribution

It develops a novel Human-In-The-Loop Reinforcement Learning framework tailored for music, integrating music theory constraints with an episodic Q-learning algorithm.

Findings

01

System generates musical compositions with iterative human feedback.

02

Framework effectively incorporates music theory into reinforcement learning.

03

Produces compositions aligned with user preferences.

Abstract

This paper presents an approach that combines Human-In-The-Loop Reinforcement Learning (HITL RL) with principles derived from music theory to facilitate real-time generation of musical compositions. HITL RL, previously employed in diverse applications such as modelling humanoid robot mechanics and enhancing language models, harnesses human feedback to refine the training process. In this study, we develop a HILT RL framework that can leverage the constraints and principles in music theory. In particular, we propose an episodic tabular Q-learning algorithm with an epsilon-greedy exploration policy. The system generates musical tracks (compositions), continuously enhancing its quality through iterative human-in-the-loop feedback. The reward function for this process is the subjective musical taste of the user.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic Technology and Sound Studies · Muscle activation and electromyography studies · Tactile and Sensory Interactions

MethodsQ-Learning