A Lightweight Instrument-Agnostic Model for Polyphonic Note   Transcription and Multipitch Estimation

Rachel M. Bittner; Juan Jos\'e Bosch; David Rubinstein; Gabriel; Meseguer-Brocal; Sebastian Ewert

arXiv:2203.09893·cs.SD·May 13, 2022

A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation

Rachel M. Bittner, Juan Jos\'e Bosch, David Rubinstein, Gabriel, Meseguer-Brocal, Sebastian Ewert

PDF

1 Repo 1 Models

TL;DR

This paper introduces a lightweight, instrument-agnostic neural network for polyphonic music transcription that jointly predicts onsets, pitches, and note activations, achieving competitive accuracy with less complexity.

Contribution

The authors propose a simple, multi-output neural model capable of generalizing across instruments, including vocals, with improved accuracy over baseline methods.

Findings

01

Outperforms comparable baseline in note estimation accuracy.

02

Achieves frame-level accuracy close to specialized state-of-the-art systems.

03

Supports polyphonic transcription for a wide variety of instruments.

Abstract

Automatic Music Transcription (AMT) has been recognized as a key enabling technology with a wide range of applications. Given the task's complexity, best results have typically been reported for systems focusing on specific settings, e.g. instrument-specific systems tend to yield improved results over instrument-agnostic methods. Similarly, higher accuracy can be obtained when only estimating frame-wise $f_{0}$ values and neglecting the harder note event detection. Despite their high accuracy, such specialized systems often cannot be deployed in the real-world. Storage and network constraints prohibit the use of multiple specialized models, while memory and run-time constraints limit their complexity. In this paper, we propose a lightweight neural network for musical instrument transcription, which supports polyphonic outputs and generalizes to a wide variety of instruments (including…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

spotify/basic-pitch
tfOfficial

Models

🤗
spotify/basic-pitch
model· ♡ 24
♡ 24

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.