Music Transcription by Deep Learning with Data and "Artificial Semantic" Augmentation

Vladyslav Sarnatskyi; Vadym Ovcharenko; Mariia Tkachenko; Sergii Stirenko; Yuri Gordienko; Anis Rojbi

arXiv:1712.03228 · cs.SD · December 12, 2017

TL;DR

This paper explores deep learning techniques for music transcription, introducing data and "artificial semantic" augmentation methods to improve note recognition accuracy for monophonic and polyphonic music.

Contribution

It proposes novel data augmentation strategies, including "artificial semantic" augmentation, to enhance deep learning performance in music transcription tasks.

Findings

01. Data augmentation improves recognition accuracy.

02. Artificial semantic augmentation increases training data diversity.

03. Enhanced methods outperform previous approaches.

Abstract

This progress paper presents previous results on single-note recognition by deep learning. Several approaches to data augmentation and "artificial semantic" augmentation are proposed to improve the efficiency of deep learning for monophonic and polyphonic note recognition, by increasing the dimensions of the training data and by applying lossless and lossy transformations to it.

Taxonomy

Topics: Music and Audio Processing · Music Technology and Sound Studies · Diverse Musicological Studies