Modeling Musical Onset Probabilities via Neural Distribution Learning

Jaesung Huh; Egil Martinsson; Adrian Kim; Jung-Woo Ha

arXiv:2002.03559·cs.SD·February 11, 2020·1 cites

Modeling Musical Onset Probabilities via Neural Distribution Learning

Jaesung Huh, Egil Martinsson, Adrian Kim, Jung-Woo Ha

PDF

Open Access

TL;DR

This paper introduces a neural density prediction model for musical onset detection, estimating time-to-event and time-since-event distributions from mel-spectrograms using CNNs, achieving competitive results on the Bock dataset.

Contribution

It presents a novel sequential density prediction approach for modeling musical onsets with CNNs, advancing the state-of-the-art in onset detection.

Findings

01

Achieved comparable results to existing deep-learning models.

02

Successfully modeled TTE and TSE distributions from spectrograms.

03

Demonstrated effectiveness on the Bock dataset.

Abstract

Musical onset detection can be formulated as a time-to-event (TTE) or time-since-event (TSE) prediction task by defining music as a sequence of onset events. Here we propose a novel method to model the probability of onsets by introducing a sequential density prediction model. The proposed model estimates TTE & TSE distributions from mel-spectrograms using convolutional neural networks (CNNs) as a density predictor. We evaluate our model on the Bock dataset show-ing comparable results to previous deep-learning models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Neuroscience and Music Perception · Music Technology and Sound Studies