Learning Musical Relations using Gated Autoencoders
Stefan Lattner, Maarten Grachten, Gerhard Widmer

TL;DR
This paper investigates the use of Gated Autoencoders to learn and represent musical transformations, demonstrating their effectiveness over RBMs in capturing content-invariant relations for music analysis.
Contribution
It introduces the application of Gated Autoencoders for modeling musical transformations, showing their ability to learn content-invariant representations for structured music analysis.
Findings
GAEs outperform RBMs in learning musical transformations
GAEs produce content-invariant representations of musical relations
Models like GAEs can enhance music analysis systems
Abstract
Music is usually highly structured and it is still an open question how to design models which can successfully learn to recognize and represent musical structure. A fundamental problem is that structurally related patterns can have very distinct appearances, because the structural relationships are often based on transformations of musical material, like chromatic or diatonic transposition, inversion, retrograde, or rhythm change. In this preliminary work, we study the potential of two unsupervised learning techniques - Restricted Boltzmann Machines (RBMs) and Gated Autoencoders (GAEs) - to capture pre-defined transformations from constructed data pairs. We evaluate the models by using the learned representations as inputs in a discriminative task where for a given type of transformation (e.g. diatonic transposition), the specific relation between two musical patterns must be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Generative Adversarial Networks and Image Synthesis · Music Technology and Sound Studies
