Semi-supervised Neural Chord Estimation Based on a Variational   Autoencoder with Latent Chord Labels and Features

Yiming Wu; Tristan Carsault; Eita Nakamura; Kazuyoshi Yoshii

arXiv:2005.07091·cs.SD·September 9, 2020·1 cites

Semi-supervised Neural Chord Estimation Based on a Variational Autoencoder with Latent Chord Labels and Features

Yiming Wu, Tristan Carsault, Eita Nakamura, Kazuyoshi Yoshii

PDF

Open Access 1 Repo

TL;DR

This paper introduces a semi-supervised neural chord estimation method using a variational autoencoder that effectively leverages both annotated and unannotated music data, incorporating prior knowledge of chord sequences.

Contribution

It proposes a unified generative-discriminative model with a variational autoencoder framework for improved chord estimation, integrating Markov priors and semi-supervised learning.

Findings

01

Performance improved with Markov prior regularization.

02

Semi-supervised learning enhances accuracy with unannotated data.

03

Generative model regularization benefits supervised ACE.

Abstract

This paper describes a statistically-principled semi-supervised method of automatic chord estimation (ACE) that can make effective use of music signals regardless of the availability of chord annotations. The typical approach to ACE is to train a deep classification model (neural chord estimator) in a supervised manner by using only annotated music signals. In this discriminative approach, prior knowledge about chord label sequences (model output) has scarcely been taken into account. In contrast, we propose a unified generative and discriminative approach in the framework of amortized variational inference. More specifically, we formulate a deep generative model that represents the generative process of chroma vectors (observed variables) from discrete labels and continuous features (latent variables), which are assumed to follow a Markov model favoring self-transitions and a standard…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Xiao-Ming/VAEChordEstimation
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing

MethodsSolana Customer Service Number +1-833-534-1729