Semi-supervised Neural Chord Estimation Based on a Variational Autoencoder with Latent Chord Labels and Features
Yiming Wu, Tristan Carsault, Eita Nakamura, Kazuyoshi Yoshii

TL;DR
This paper introduces a semi-supervised neural chord estimation method using a variational autoencoder that effectively leverages both annotated and unannotated music data, incorporating prior knowledge of chord sequences.
Contribution
It proposes a unified generative-discriminative model with a variational autoencoder framework for improved chord estimation, integrating Markov priors and semi-supervised learning.
Findings
Performance improved with Markov prior regularization.
Semi-supervised learning enhances accuracy with unannotated data.
Generative model regularization benefits supervised ACE.
Abstract
This paper describes a statistically-principled semi-supervised method of automatic chord estimation (ACE) that can make effective use of music signals regardless of the availability of chord annotations. The typical approach to ACE is to train a deep classification model (neural chord estimator) in a supervised manner by using only annotated music signals. In this discriminative approach, prior knowledge about chord label sequences (model output) has scarcely been taken into account. In contrast, we propose a unified generative and discriminative approach in the framework of amortized variational inference. More specifically, we formulate a deep generative model that represents the generative process of chroma vectors (observed variables) from discrete labels and continuous features (latent variables), which are assumed to follow a Markov model favoring self-transitions and a standard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing
MethodsSolana Customer Service Number +1-833-534-1729
