Learning Musical Relations using Gated Autoencoders

Stefan Lattner; Maarten Grachten; Gerhard Widmer

arXiv:1708.05325·cs.SD·August 18, 2017

Learning Musical Relations using Gated Autoencoders

Stefan Lattner, Maarten Grachten, Gerhard Widmer

PDF

Open Access

TL;DR

This paper investigates the use of Gated Autoencoders to learn and represent musical transformations, demonstrating their effectiveness over RBMs in capturing content-invariant relations for music analysis.

Contribution

It introduces the application of Gated Autoencoders for modeling musical transformations, showing their ability to learn content-invariant representations for structured music analysis.

Findings

01

GAEs outperform RBMs in learning musical transformations

02

GAEs produce content-invariant representations of musical relations

03

Models like GAEs can enhance music analysis systems

Abstract

Music is usually highly structured and it is still an open question how to design models which can successfully learn to recognize and represent musical structure. A fundamental problem is that structurally related patterns can have very distinct appearances, because the structural relationships are often based on transformations of musical material, like chromatic or diatonic transposition, inversion, retrograde, or rhythm change. In this preliminary work, we study the potential of two unsupervised learning techniques - Restricted Boltzmann Machines (RBMs) and Gated Autoencoders (GAEs) - to capture pre-defined transformations from constructed data pairs. We evaluate the models by using the learned representations as inputs in a discriminative task where for a given type of transformation (e.g. diatonic transposition), the specific relation between two musical patterns must be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Generative Adversarial Networks and Image Synthesis · Music Technology and Sound Studies