KuiSCIMA v2.0: Improved Baselines, Calibration, and Cross-Notation Generalization for Historical Chinese Music Notations in Jiang Kui's Baishidaoren Gequ
Tristan Repolusk, Eduardo Veas

TL;DR
This paper advances optical music recognition for historical Chinese notations by developing a robust, well-calibrated model that outperforms human transcribers and ensures cross-edition generalization, significantly aiding cultural preservation.
Contribution
It introduces improved recognition models, calibration techniques, and a comprehensive dataset extension for historical Chinese music notations, enhancing accuracy and robustness over prior work.
Findings
CER reduced from 10.4% to 7.1% for suzipu
CER of 0.9% achieved for l"ul"upu
Models outperform human transcribers in accuracy
Abstract
Optical Music Recognition (OMR) for historical Chinese musical notations, such as suzipu and l\"ul\"upu, presents unique challenges due to high class imbalance and limited training data. This paper introduces significant advancements in OMR for Jiang Kui's influential collection Baishidaoren Gequ from 1202. In this work, we develop and evaluate a character recognition model for scarce imbalanced data. We improve upon previous baselines by reducing the Character Error Rate (CER) from 10.4% to 7.1% for suzipu, despite working with 77 highly imbalanced classes, and achieve a remarkable CER of 0.9% for l\"ul\"upu. Our models outperform human transcribers, with an average human CER of 15.9% and a best-case CER of 7.6%. We employ temperature scaling to achieve a well-calibrated model with an Expected Calibration Error (ECE) below 0.0162. Using a leave-one-edition-out cross-validation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Handwritten Text Recognition Techniques · Music Technology and Sound Studies
