ChordSync: Conformer-Based Alignment of Chord Annotations to Music Audio
Andrea Poltronieri, Valentina Presutti, Martín Rocamora

TL;DR
ChordSync is a conformer-based model that aligns chord annotations with music audio without requiring weak alignment, enabling better use of online chord data for MIR and music education.
Contribution
We introduce ChordSync, a novel conformer-based approach that aligns chord annotations with audio directly, providing a pre-trained model and library for easy application.
Findings
Enables accurate alignment of chord annotations with audio.
Facilitates creation of diverse, annotated music datasets.
Enhances music education through synchronized annotations.
Abstract
In the Western music tradition, chords are the main constituent components of harmony, a fundamental dimension of music. Despite its relevance for several Music Information Retrieval (MIR) tasks, chord-annotated audio datasets are limited and need more diversity. One way to improve those resources is to leverage the large number of chord annotations available online, but this requires aligning them with music audio. However, existing audio-to-score alignment techniques, which typically rely on Dynamic Time Warping (DTW), fail to address this challenge, as they require weakly aligned data for precise synchronisation. In this paper, we introduce ChordSync, a novel conformer-based model designed to seamlessly align chord annotations with audio, eliminating the need for weak alignment. We also provide a pre-trained model and a user-friendly library, enabling users to synchronise chord…
Taxonomy
Topics: Music and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing
Methods: ALIGN
