Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale Adversarial Pre-training

Zijian Zhao

arXiv:2407.08306·cs.SD·June 27, 2025

TL;DR

This paper introduces Adversarial-MidiBERT, a novel pre-training approach for symbolic music understanding that adaptively masks tokens to improve contextual learning and reduce bias, outperforming traditional methods.

Contribution

The paper proposes a new adversarial masking strategy for pre-training models in symbolic music understanding, enhancing contextual capture and reducing bias compared to standard MLM methods.
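The contrast between standard MLM masking and an adaptive strategy can be illustrated with a minimal Python sketch. It is not the paper's implementation: `random_mask` shows the uniform random masking that standard MLM relies on, while `scored_mask` is a hypothetical stand-in in which a per-token `scores` list plays the role of whatever a learned masker would output. The function names, the `[MASK]` placeholder, and the 15% ratio are all assumptions made for illustration.

```python
import random

MASK = "[MASK]"  # placeholder mask token (assumed name, not from the paper)


def _num_to_mask(length, ratio):
    # Mask at least one position when possible, never more than the sequence holds.
    return min(length, max(1, round(length * ratio)))


def random_mask(tokens, ratio=0.15, seed=None):
    """Standard MLM-style masking: hide a uniformly random subset of positions."""
    rng = random.Random(seed)
    n = _num_to_mask(len(tokens), ratio)
    positions = sorted(rng.sample(range(len(tokens)), n))
    chosen = set(positions)
    masked = [MASK if i in chosen else t for i, t in enumerate(tokens)]
    return masked, positions


def scored_mask(tokens, scores, ratio=0.15):
    """Adaptive masking sketch: hide the positions with the highest scores.

    `scores` is a hypothetical per-token preference, standing in for the
    output of a learned masker; how the paper derives it is not shown here.
    """
    if len(scores) != len(tokens):
        raise ValueError("scores must have one entry per token")
    n = _num_to_mask(len(tokens), ratio)
    ranked = sorted(range(len(tokens)), key=lambda i: scores[i], reverse=True)
    positions = sorted(ranked[:n])
    chosen = set(positions)
    masked = [MASK if i in chosen else t for i, t in enumerate(tokens)]
    return masked, positions
```

With random masking, every position is equally likely to be hidden regardless of whether its context can recover it; in the scored variant, the choice of positions is delegated to whatever produces `scores`, which is where a learned, adversarially trained masker would plug in.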

Findings

01. Achieves superior performance across four SMU tasks.

02. Effectively reduces bias associated with random masking.

03. Demonstrates the benefit of adaptive masking in music understanding models.

Abstract

As a crucial aspect of Music Information Retrieval (MIR), Symbolic Music Understanding (SMU) has garnered significant attention for its potential to assist both musicians and enthusiasts in learning and creating music. Recently, pre-trained language models have been widely adopted in SMU due to the substantial similarities between symbolic music and natural language, as well as the ability of these models to leverage limited music data effectively. However, some studies have shown that common pre-training methods such as the Masked Language Model (MLM) may introduce bias, such as racial discrimination in Natural Language Processing (NLP), and degrade the performance of downstream tasks; the same problem occurs in SMU. This bias often arises when masked tokens cannot be inferred from their context, forcing the model to overfit the training set instead of generalizing. To address this challenge, we…

Peer Reviews

No public reviews on file for this paper yet.

Code & Models

Repositories

RS2002/Adversarial-MidiBERT
PyTorch · Official

Models

🤗 RS2002/Adversarial-MidiBERT
model · 26 dl

Taxonomy

Topics: Music Technology and Sound Studies · Music and Audio Processing · Diverse Musicological Studies

Methods: Softmax · Attention Is All You Need · Sparse Evolutionary Training