# Multitask Learning for Polyphonic Piano Transcription, a Case Study

**Authors:** Rainer Kelz, Sebastian B\"ock, Gerhard Widmer

arXiv: 1902.04390 · 2019-02-13

## TL;DR

This paper explores how multitask learning with multiple prediction targets affects the accuracy of polyphonic piano transcription using CNNs on the MAESTRO dataset.

## Contribution

It investigates the impact of adding auxiliary prediction tasks in CNN-based models for polyphonic piano transcription.

## Key findings

- Additional objectives improve transcription accuracy.
- Different CNN architectures respond variably to multitask learning.
- Quantitative analysis on the MAESTRO dataset demonstrates performance differences.

## Abstract

Viewing polyphonic piano transcription as a multitask learning problem, where we need to simultaneously predict onsets, intermediate frames and offsets of notes, we investigate the performance impact of additional prediction targets, using a variety of suitable convolutional neural network architectures. We quantify performance differences of additional objectives on the large MAESTRO dataset.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.04390/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/1902.04390/full.md

## References

19 references — full list in the complete paper: https://tomesphere.com/paper/1902.04390/full.md

---
Source: https://tomesphere.com/paper/1902.04390