MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling
Drew Edwards, Xavier Riley, Pedro Sarmento, Simon Dixon

TL;DR
This paper presents a deep learning Transformer-based model for generating guitar tablatures from symbolic music, outperforming existing methods and validated through a user study with guitarists.
Contribution
Introduces a novel Transformer-based masked language model for guitar tablature inference, trained on a large dataset and fine-tuned for improved accuracy.
Findings
Model outperforms existing algorithms in tablature accuracy
User study shows higher playability ratings for generated tablatures
Pre-training on large dataset enhances model performance
Abstract
Guitar tablatures enrich the structure of traditional music notation by assigning each note to a string and fret of a guitar in a particular tuning, indicating precisely where to play the note on the instrument. The problem of generating tablature from a symbolic music representation involves inferring this string and fret assignment per note across an entire composition or performance. On the guitar, multiple string-fret assignments are possible for most pitches, which leads to a large combinatorial space that prevents exhaustive search approaches. Most modern methods use constraint-based dynamic programming to minimize some cost function (e.g.\ hand position movement). In this work, we introduce a novel deep learning solution to symbolic guitar tablature estimation. We train an encoder-decoder Transformer model in a masked language modeling paradigm to assign notes to strings. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Sparse Evolutionary Training · Linear Layer · Residual Connection · Multi-Head Attention · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Softmax · Absolute Position Encodings
