A new definition of the distortion matrix for an audio-to-score alignment system
A. J. Mu\~noz-Montoro, P. Vera-Candeas, D. Suarez-Dou, R. Cortina

TL;DR
This paper introduces a novel distortion matrix definition for an audio-to-score alignment system based on DTW, utilizing note combinations and spectral pattern learning to improve alignment accuracy.
Contribution
It proposes a new way to define the distortion matrix by arranging score info in note combinations and learning spectral patterns, enhancing DTW-based alignment.
Findings
Improved alignment accuracy with the new distortion matrix.
Effective spectral pattern learning for note combinations.
Enhanced robustness of score following system.
Abstract
In this paper we present a new definition of the distortion matrix for a score following framework based on DTW. The proposal consists of arranging the score information in a sequence of note combinations and learning a spectral pattern for each combination using instrument models. Then, the distortion matrix is computed using these spectral patterns and a novel decomposition of the input signal.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies
