A Controllable Perceptual Feature Generative Model for Melody Harmonization via Conditional Variational Autoencoder
Dengyun Huang, Yonghua Zhu

TL;DR
This paper introduces CPFG-Net, a neural network that predicts perceptual features and generates harmonically coherent chords for melody harmonization, emphasizing controllability, musical expressiveness, and creativity in symbolic music generation.
Contribution
The paper presents a novel controllable generative model for melody harmonization using perceptual features, along with a new dataset and a transformation algorithm for chord inference.
Findings
State-of-the-art perceptual feature prediction accuracy
Demonstrated musical expressiveness and creativity in chord inference
Model can be extended to audio-based music generation
Abstract
While Large Language Models (LLMs) make symbolic music generation increasingly accessible, producing music with distinctive composition and rich expressiveness remains a significant challenge. Many studies have introduced emotion models to guide the generative process. However, these approaches still fall short of delivering novelty and creativity. In the field of Music Information Retrieval (MIR), auditory perception is recognized as a key dimension of musical experience, offering insights into both compositional intent and emotional patterns. To this end, we propose a neural network named CPFG-Net, along with a transformation algorithm that maps perceptual feature values to chord representations, enabling melody harmonization. The system can controllably predict sequences of perceptual features and tonal structures from given melodies, and subsequently generate harmonically coherent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic Technology and Sound Studies · Music and Audio Processing · Neuroscience and Music Perception
