A Controllable Perceptual Feature Generative Model for Melody Harmonization via Conditional Variational Autoencoder

Dengyun Huang; Yonghua Zhu

arXiv:2511.14600·cs.SD·November 19, 2025

A Controllable Perceptual Feature Generative Model for Melody Harmonization via Conditional Variational Autoencoder

Dengyun Huang, Yonghua Zhu

PDF

Open Access

TL;DR

This paper introduces CPFG-Net, a neural network that predicts perceptual features and generates harmonically coherent chords for melody harmonization, emphasizing controllability, musical expressiveness, and creativity in symbolic music generation.

Contribution

The paper presents a novel controllable generative model for melody harmonization using perceptual features, along with a new dataset and a transformation algorithm for chord inference.

Findings

01

State-of-the-art perceptual feature prediction accuracy

02

Demonstrated musical expressiveness and creativity in chord inference

03

Model can be extended to audio-based music generation

Abstract

While Large Language Models (LLMs) make symbolic music generation increasingly accessible, producing music with distinctive composition and rich expressiveness remains a significant challenge. Many studies have introduced emotion models to guide the generative process. However, these approaches still fall short of delivering novelty and creativity. In the field of Music Information Retrieval (MIR), auditory perception is recognized as a key dimension of musical experience, offering insights into both compositional intent and emotional patterns. To this end, we propose a neural network named CPFG-Net, along with a transformation algorithm that maps perceptual feature values to chord representations, enabling melody harmonization. The system can controllably predict sequences of perceptual features and tonal structures from given melodies, and subsequently generate harmonically coherent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic Technology and Sound Studies · Music and Audio Processing · Neuroscience and Music Perception