Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale Adversarial Pre-training
Zijian Zhao

TL;DR
This paper introduces Adversarial-MidiBERT, a novel pre-training approach for symbolic music understanding that adaptively masks tokens to improve contextual learning and reduce bias, outperforming traditional methods.
Contribution
The paper proposes a new adversarial masking strategy for pre-training models in symbolic music understanding, enhancing contextual capture and reducing bias compared to standard MLM methods.
Findings
Achieves superior performance across four SMU tasks.
Effectively reduces bias associated with random masking.
Demonstrates the benefit of adaptive masking in music understanding models.
Abstract
As a crucial aspect of Music Information Retrieval (MIR), Symbolic Music Understanding (SMU) has garnered significant attention for its potential to assist both musicians and enthusiasts in learning and creating music. Recently, pre-trained language models have been widely adopted in SMU due to the substantial similarities between symbolic music and natural language, as well as the ability of these models to leverage limited music data effectively. However, some studies have shown the common pre-trained methods like Mask Language Model (MLM) may introduce bias issues like racism discrimination in Natural Language Process (NLP) and affects the performance of downstream tasks, which also happens in SMU. This bias often arises when masked tokens cannot be inferred from their context, forcing the model to overfit the training set instead of generalizing. To address this challenge, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic Technology and Sound Studies · Music and Audio Processing · Diverse Musicological Studies
MethodsSoftmax · Attention Is All You Need · Sparse Evolutionary Training
