Generating Music with a Self-Correcting Non-Chronological Autoregressive Model
Wayne Chi, Prachi Kumar, Suri Yaddanapudi, Rahul Suresh, Umut Isik

TL;DR
This paper introduces a self-correcting, non-chronological autoregressive model for music generation that improves quality and control by representing music as edit events and fixing previous errors during generation.
Contribution
It presents a novel self-correcting, non-chronological autoregressive approach that enhances music generation quality and offers finer control compared to existing methods.
Findings
Outperforms orderless NADE and Gibbs sampling in quality metrics
Reduces error accumulation during music generation
Enables finer note-by-note control in collaborative composition
Abstract
We describe a novel approach for generating music using a self-correcting, non-chronological, autoregressive model. We represent music as a sequence of edit events, each of which denotes either the addition or removal of a note---even a note previously generated by the model. During inference, we generate one edit event at a time using direct ancestral sampling. Our approach allows the model to fix previous mistakes such as incorrectly sampled notes and prevent accumulation of errors which autoregressive models are prone to have. Another benefit is a finer, note-by-note control during human and AI collaborative composition. We show through quantitative metrics and human survey evaluation that our approach generates better results than orderless NADE and Gibbs sampling approaches.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Topic Modeling
