Pairing Real-Time Piano Transcription with Symbol-level Tracking for Precise and Robust Score Following
Silvan Peter, Patricia Hu, Gerhard Widmer

TL;DR
This paper introduces a hybrid real-time music tracking system that combines audio-to-note transcription with symbol-level score tracking, outperforming traditional audio-only methods in precision and robustness.
Contribution
The paper presents a novel symbolic domain approach for real-time score following, integrating audio transcription with a new symbol-level tracker, improving over existing audio-only systems.
Findings
Outperforms audio-only methods in tracking accuracy
Enhances robustness in score following
Demonstrates effectiveness of symbolic domain conversion
Abstract
Real-time music tracking systems follow a musical performance and at any time report the current position in a corresponding score. Most existing methods approach this problem exclusively in the audio domain, typically using online time warping (OLTW) techniques on incoming audio and an audio representation of the score. Audio OLTW techniques have seen incremental improvements both in features and model heuristics which reached a performance plateau in the past ten years. We argue that converting and representing the performance in the symbolic domain -- thereby transforming music tracking into a symbolic task -- can be a more effective approach, even when the domain transformation is imperfect. Our music tracking system combines two real-time components: one handling audio-to-note transcription and the other a novel symbol-level tracker between transcribed input and score. We compare…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Time Series Analysis and Forecasting · Video Analysis and Summarization
