Hierarchical Symbolic Pop Music Generation with Graph Neural Networks
Wen Qing Lim, Jinhua Liang, Huan Zhang

TL;DR
This paper introduces a hierarchical graph neural network approach for generating polyphonic Chinese pop music, capturing both rhythmic and structural aspects to produce coherent long-term compositions.
Contribution
It presents a novel multi-graph method and a two-step VAE-based framework for modeling both phrase-level and song-level structures in music generation.
Findings
Models learn structural nuances like chord and pitch distributions.
Generated music maintains coherence in rhythm and long-term structure.
Approach extends graph-based music generation to polyphonic and structured compositions.
Abstract
Music is inherently made up of complex structures, and representing them as graphs helps to capture multiple levels of relationships. While music generation has been explored using various deep generation techniques, research on graph-related music generation is sparse. Earlier graph-based music generation worked only on generating melodies, and recent works to generate polyphonic music do not account for longer-term structure. In this paper, we explore a multi-graph approach to represent both the rhythmic patterns and phrase structure of Chinese pop music. Consequently, we propose a two-step approach that aims to generate polyphonic music with coherent rhythm and long-term structure. We train two Variational Auto-Encoder networks - one on a MIDI dataset to generate 4-bar phrases, and another on song structure labels to generate full song structure. Our work shows that the models are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Human Motion and Animation
