TL;DR
PiRhDy introduces a hierarchical framework for creating comprehensive symbolic music embeddings that incorporate pitch, rhythm, and dynamics, improving performance in music understanding tasks.
Contribution
The paper presents a novel hierarchical framework, PiRhDy, that effectively integrates multi-faceted musical features into embeddings considering melodic and harmonic contexts.
Findings
Significant improvement in melody completion accuracy.
Enhanced performance in accompaniment suggestion.
Effective genre classification results.
Abstract
Definitive embeddings remain a fundamental challenge of computational musicology for symbolic music in deep learning today. Analogous to natural language, music can be modeled as a sequence of tokens. This motivates the majority of existing solutions to explore the utilization of word embedding models to build music embeddings. However, music differs from natural languages in two key aspects: (1) musical token is multi-faceted -- it comprises of pitch, rhythm and dynamics information; and (2) musical context is two-dimensional -- each musical token is dependent on both melodic and harmonic contexts. In this work, we provide a comprehensive solution by proposing a novel framework named PiRhDy that integrates pitch, rhythm, and dynamics information seamlessly. PiRhDy adopts a hierarchical strategy which can be decomposed into two steps: (1) token (i.e., note event) modeling, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
