Supervised Learning for Game Music Segmentation
Shangxuan Luo, Joshua Reiss

TL;DR
This paper investigates supervised learning approaches, combining CNNs and RNNs, for music structure segmentation in game music, aiming to improve understanding and generation of musical structure.
Contribution
It introduces a supervised learning method for music segmentation using a new dataset, achieving comparable results to unsupervised methods with less training data.
Findings
Supervised model performs comparably to state-of-the-art unsupervised methods.
Created a new annotated dataset of 309 game music segments.
Combines CNNs and RNNs for effective music structure segmentation.
Abstract
At present, neural network-based models, including transformers, struggle to generate memorable and readily comprehensible music from unified and repetitive musical material due to a lack of understanding of musical structure. Consequently, these models are rarely employed by the games industry. It is hypothesised by many scholars that the modelling of musical structure may inform models at a higher level, thereby enhancing the quality of music generation. The aim of this study is to explore the performance of supervised learning methods in the task of structural segmentation, which is the initial step in music structure modelling. An audio game music dataset with 309 structural annotations was created to train the proposed method, which combines convolutional neural networks and recurrent neural networks, achieving performance comparable to the state-of-the-art unsupervised learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Artificial Intelligence in Games
