The MIDI Degradation Toolkit: Symbolic Music Augmentation and Correction
Andrew McLeod, James Owers, Kazuyoshi Yoshii

TL;DR
The MIDI Degradation Toolkit (MDTK) introduces functions to generate degraded MIDI data with errors, facilitating the development of models for error detection, classification, and correction to improve music transcription and robustness of MIDI-based systems.
Contribution
We present MDTK, a toolkit for creating degraded MIDI datasets and tasks for error detection and correction, enabling research on improving music transcription and robustness of MIDI systems.
Findings
Created ACME v1.0 dataset with degraded MIDI excerpts
Proposed four tasks for error detection, classification, localization, correction
Toolkit supports dynamic degradation during training
Abstract
In this paper, we introduce the MIDI Degradation Toolkit (MDTK), containing functions which take as input a musical excerpt (a set of notes with pitch, onset time, and duration), and return a "degraded" version of that excerpt with some error (or errors) introduced. Using the toolkit, we create the Altered and Corrupted MIDI Excerpts dataset version 1.0 (ACME v1.0), and propose four tasks of increasing difficulty to detect, classify, locate, and correct the degradations. We hypothesize that models trained for these tasks can be useful in (for example) improving automatic music transcription performance if applied as a post-processing step. To that end, MDTK includes a script that measures the distribution of different types of errors in a transcription, and creates a degraded dataset with similar properties. MDTK's degradations can also be applied dynamically to a dataset during…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech Recognition and Synthesis · Music Technology and Sound Studies
