From Note-Level to Chord-Level Neural Network Models for Voice Separation in Symbolic Music
Patrick Gray, Razvan Bunescu

TL;DR
This paper introduces neural network models for voice separation in symbolic music, capable of handling complex voice-leading scenarios by classifying notes at note or chord levels, and demonstrates their effectiveness on popular and Bach music.
Contribution
It proposes a flexible voice separation task and compares note-level and chord-level neural models trained on annotated data, advancing automatic voice separation techniques.
Findings
Chord-level model outperforms note-level model in accuracy.
Both neural models surpass traditional envelope-based methods.
Models effectively separate voices in complex polyphonic music, including Bach.
Abstract
Music is often experienced as a progression of concurrent streams of notes, or voices. The degree to which this happens depends on the position along a voice-leading continuum, ranging from monophonic, to homophonic, to polyphonic, which complicates the design of automatic voice separation models. We address this continuum by defining voice separation as the task of decomposing music into streams that exhibit both a high degree of external perceptual separation from the other streams and a high degree of internal perceptual consistency. The proposed voice separation task allows for a voice to diverge to multiple voices and also for multiple voices to converge to the same voice. Equipped with this flexible task definition, we manually annotated a corpus of popular music and used it to train neural networks that assign notes to voices either separately for each note in a chord…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Music and Audio Processing · Music Technology and Sound Studies
