Calliope -- A Polyphonic Music Transformer
Andrea Valenti, Stefano Berti, Davide Bacciu

TL;DR
Calliope introduces a Transformer-based autoencoder that effectively models multi-track polyphonic music, significantly improving sequence reconstruction and generation, especially for long sequences, demonstrating the suitability of Transformers for complex musical data.
Contribution
The paper presents Calliope, a novel Transformer-based autoencoder specifically designed for efficient polyphonic music modeling, advancing the state of the art.
Findings
Improves sequence reconstruction accuracy.
Enhances music generation quality.
Performs well on long musical sequences.
Abstract
The polyphonic nature of music makes the application of deep learning to music modelling a challenging task. On the other hand, the Transformer architecture seems to be a good fit for this kind of data. In this work, we present Calliope, a novel autoencoder model based on Transformers for the efficient modelling of multi-track sequences of polyphonic music. The experiments show that our model is able to improve the state of the art on musical sequence reconstruction and generation, with remarkably good results especially on long sequences.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Adam · Label Smoothing · Residual Connection · Dense Connections · Softmax
