GAN-based Content-Conditioned Generation of Handwritten Musical Symbols

Gerard Asbert; Pau Torras; Lei Kang; Alicia Forn\'es; Josep Llad\'os

arXiv:2510.17869·cs.CV·October 22, 2025

GAN-based Content-Conditioned Generation of Handwritten Musical Symbols

Gerard Asbert, Pau Torras, Lei Kang, Alicia Forn\'es, Josep Llad\'os

PDF

Open Access

TL;DR

This paper introduces a GAN-based method to generate realistic handwritten musical symbols, aiming to improve optical music recognition by augmenting training data with synthetic, high-fidelity samples.

Contribution

It presents a novel music symbol-level GAN that produces realistic handwritten musical symbols and assembles them into full scores, advancing synthetic data generation for OMR.

Findings

01

Generated symbols show high visual realism

02

Synthetic scores can potentially enhance recognition models

03

Progress in realistic handwritten score synthesis

Abstract

The field of Optical Music Recognition (OMR) is currently hindered by the scarcity of real annotated data, particularly when dealing with handwritten historical musical scores. In similar fields, such as Handwritten Text Recognition, it was proven that synthetic examples produced with image generation techniques could help to train better-performing recognition architectures. This study explores the generation of realistic, handwritten-looking scores by implementing a music symbol-level Generative Adversarial Network (GAN) and assembling its output into a full score using the Smashcima engraving software. We have systematically evaluated the visual fidelity of these generated samples, concluding that the generated symbols exhibit a high degree of realism, marking significant progress in synthetic score generation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Handwritten Text Recognition Techniques · Generative Adversarial Networks and Image Synthesis