Deep rank-based transposition-invariant distances on musical sequences
Ga\"etan Hadjeres, Frank Nielsen

TL;DR
This paper proposes a data-driven, transposition-invariant distance measure for symbolic musical sequences, combining learned features with a handcrafted rank distance, improving robustness and perceptual relevance in music analysis.
Contribution
It introduces a novel hybrid distance that is invariant to transpositions and less dependent on musical encoding, applicable across different musical styles.
Findings
Effective on Bach chorale melodies dataset
Achieves perceptually meaningful similarity assessments
Less sensitive to encoding variations
Abstract
Distances on symbolic musical sequences are needed for a variety of applications, from music retrieval to automatic music generation. These musical sequences belong to a given corpus (or style) and it is obvious that a good distance on musical sequences should take this information into account; being able to define a distance ex nihilo which could be applicable to all music styles seems implausible. A distance could also be invariant under some transformations, such as transpositions, so that it can be used as a distance between musical motives rather than musical sequences. However, to our knowledge, none of the approaches to devise musical distances seem to address these issues. This paper introduces a method to build transposition-invariant distances on symbolic musical sequences which are learned from data. It is a hybrid distance which combines learned feature representations of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies
