JS Fake Chorales: a Synthetic Dataset of Polyphonic Music with Human Annotation
Omar Peracha

TL;DR
This paper introduces JS Fake Chorales, a large synthetic dataset of polyphonic music generated by a new algorithm, which can enhance music modeling research and is nearly indistinguishable from real Bach chorales in human evaluations.
Contribution
The paper presents a novel synthetic dataset of polyphonic music, generated by a learning-based algorithm, and demonstrates its effectiveness in improving polyphonic music modeling.
Findings
Respondents only 7% better than random at distinguishing fake from real chorales.
Augmenting training data with JS Fake Chorales improves state-of-the-art validation loss.
The dataset enables scalable, high-quality synthetic polyphonic music generation.
Abstract
High-quality datasets for learning-based modelling of polyphonic symbolic music remain less readily-accessible at scale than in other domains, such as language modelling or image classification. Deep learning algorithms show great potential for enabling the widespread use of interactive music generation technology in consumer applications, but the lack of large-scale datasets remains a bottleneck for the development of algorithms that can consistently generate high-quality outputs. We propose that models with narrow expertise can serve as a source of high-quality scalable synthetic data, and open-source the JS Fake Chorales, a dataset of 500 pieces generated by a new learning-based algorithm, provided in MIDI form. We take consecutive outputs from the algorithm and avoid cherry-picking in order to validate the potential to further scale this dataset on-demand. We conduct an online…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Diverse Musicological Studies
