Region-based Layout Analysis of Music Score Images
Francisco J. Castellanos, Carlos Garrido-Munoz, Antonio R\'ios-Vila,, Jorge Calvo-Zaragoza

TL;DR
This paper conducts a comprehensive experimental study on region-based layout analysis for optical music recognition, highlighting the importance of model choice, evaluation metrics, and introducing a semi-synthetic data generation method.
Contribution
It provides an extensive analysis of neural architectures for music score layout analysis and proposes a semi-synthetic data generation technique to improve performance with limited data.
Findings
Model performance significantly impacts transcription accuracy.
Evaluation metrics may not reflect the final OMR system performance.
Semi-synthetic data generation enables state-of-the-art results with less labeled data.
Abstract
The Layout Analysis (LA) stage is of vital importance to the correct performance of an Optical Music Recognition (OMR) system. It identifies the regions of interest, such as staves or lyrics, which must then be processed in order to transcribe their content. Despite the existence of modern approaches based on deep learning, an exhaustive study of LA in OMR has not yet been carried out with regard to the precision of different models, their generalization to different domains or, more importantly, their impact on subsequent stages of the pipeline. This work focuses on filling this gap in literature by means of an experimental study of different neural architectures, music document types and evaluation scenarios. The need for training data has also led to a proposal for a new semi-synthetic data generation technique that enables the efficient applicability of LA approaches in real…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing
