Music SketchNet: Controllable Music Generation via Factorized Representations of Pitch and Rhythm
Ke Chen, Cheng-i Wang, Taylor Berg-Kirkpatrick, Shlomo Dubnov

TL;DR
Music SketchNet is a neural network framework that enables controllable, guided music generation by factorizing pitch and rhythm, allowing for partial specification and context-aware completion of monophonic musical pieces.
Contribution
The paper introduces SketchVAE for explicit pitch and rhythm factorization and two discriminative architectures for guided music completion, advancing controllable music generation techniques.
Findings
Outperforms state-of-the-art models in music completion tasks
Successfully incorporates user-specified pitch and rhythm snippets
Achieves better objective and subjective results in evaluations
Abstract
Drawing an analogy with automatic image completion systems, we propose Music SketchNet, a neural network framework that allows users to specify partial musical ideas guiding automatic music generation. We focus on generating the missing measures in incomplete monophonic musical pieces, conditioned on surrounding context, and optionally guided by user-specified pitch and rhythm snippets. First, we introduce SketchVAE, a novel variational autoencoder that explicitly factorizes rhythm and pitch contour to form the basis of our proposed model. Then we introduce two discriminative architectures, SketchInpainter and SketchConnector, that in conjunction perform the guided music completion, filling in representations for the missing measures conditioned on surrounding context and user-specified snippets. We evaluate SketchNet on a standard dataset of Irish folk music and compare with models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Neuroscience and Music Perception
MethodsSolana Customer Service Number +1-833-534-1729
