Experimenting active and sequential learning in a medieval music manuscript

Sachin Sharma (GSSI); Federico Simonetta (GSSI); Michele Flammini (GSSI)

arXiv:2507.15633·cs.CV·July 22, 2025

Experimenting active and sequential learning in a medieval music manuscript

Sachin Sharma (GSSI), Federico Simonetta (GSSI), Michele Flammini (GSSI)

PDF

TL;DR

This study explores active and sequential learning techniques for optical music recognition in medieval manuscripts, demonstrating that uncertainty-based AL may not always be effective in data-scarce scenarios.

Contribution

It introduces a preliminary application of AL and SL methods to medieval music manuscripts using YOLOv8, highlighting limitations of uncertainty-based AL in such contexts.

Findings

01

Uncertainty-based AL was ineffective on the medieval manuscript dataset.

02

Comparable accuracy to full supervision achieved with fewer labeled samples.

03

Highlights need for more usable AL methods in data-scarcity scenarios.

Abstract

Optical Music Recognition (OMR) is a cornerstone of music digitization initiatives in cultural heritage, yet it remains limited by the scarcity of annotated data and the complexity of historical manuscripts. In this paper, we present a preliminary study of Active Learning (AL) and Sequential Learning (SL) tailored for object detection and layout recognition in an old medieval music manuscript. Leveraging YOLOv8, our system selects samples with the highest uncertainty (lowest prediction confidence) for iterative labeling and retraining. Our approach starts with a single annotated image and successfully boosts performance while minimizing manual labeling. Experimental results indicate that comparable accuracy to fully supervised training can be achieved with significantly fewer labeled examples. We test the methodology as a preliminary investigation on a novel dataset offered to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.