The SMC Blind Spot: A Failure Mode Analysis of State-of-the-Art Beat Tracking
Jaehoon Ahn, Tae Gum Hwang, Moon-Ryul Jung

TL;DR
This paper analyzes failure modes of state-of-the-art neural network models in beat tracking on the SMC dataset, revealing key issues like octave errors and tempo inference limitations.
Contribution
It identifies specific failure modes in current models and suggests targeted improvements such as diversified training data and multi-hypothesis tempo estimation.
Findings
State-of-the-art models exhibit octave and continuity errors.
Default tempo constraints cause incorrect tempo predictions.
Fundamental oversights hinder performance on challenging datasets.
Abstract
Over the past two decades, the task of musical beat tracking has transitioned from heuristic onset detection algorithms to highly capable deep neural networks (DNN). Although DNN-based beat tracking models achieve near-perfect performance on mainstream, percussive datasets, the SMC dataset has stubbornly yielded low F-measure scores. By testing how well state-of-the-art models detect beats on individual tracks in the SMC dataset, we identify three distinct failure modes: octave errors, continuity errors, and complete tracking failure where all metrics fall below 0.3. We reveal that state-of-the-art models tend to generate "confident-but-wrong" activations. Furthermore, we show that the standard DBN's default minimum tempo of 55 BPM prevents it from inferring the correct tempo for 21\% of SMC tracks, forcing double-tempo predictions on slow music. By exposing such fundamental oversights,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
