Automatic Stroke Classification of Tabla Accompaniment in Hindustani Vocal Concert Audio
Rohit M. A., Preeti Rao

TL;DR
This paper develops an instrument-independent automatic stroke classification system for tabla in Hindustani vocal concerts, utilizing acoustic features and data augmentation to improve accuracy across different instruments.
Contribution
It introduces a novel approach for instrument-independent tabla stroke classification using acoustic features and data augmentation techniques, especially pitch-shifting.
Findings
Pitch-shifting augmentation improves classification robustness.
Instrument dependence of features affects system performance.
Task-specific data augmentation enhances model generalization.
Abstract
The tabla is a unique percussion instrument due to the combined harmonic and percussive nature of its timbre, and the contrasting harmonic frequency ranges of its two drums. This allows a tabla player to uniquely emphasize parts of the rhythmic cycle (theka) in order to mark the salient positions. An analysis of the loudness dynamics and timing deviations at various cycle positions is an important part of musicological studies on the expressivity in tabla accompaniment. To achieve this at a corpus-level, and not restrict it to the few recordings that manual annotation can afford, it is helpful to have access to an automatic tabla transcription system. Although a few systems have been built by training models on labeled tabla strokes, the achieved accuracy does not necessarily carry over to unseen instruments. In this article, we report our work towards building an instrument-independent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies
