A Dataset for Automatic Vocal Mode Classification

Reemt Hinrichs; Sonja Stephan; Alexander Lange; J\"orn Ostermann

arXiv:2601.18339·cs.SD·April 30, 2026

A Dataset for Automatic Vocal Mode Classification

Reemt Hinrichs, Sonja Stephan, Alexander Lange, J\"orn Ostermann

PDF

1 Repo

TL;DR

This paper introduces a new dataset of vocal mode recordings for automatic classification, including annotations and baseline results, to advance singing teaching technology.

Contribution

A novel, annotated vocal mode dataset with over 13,000 samples from professional singers, enabling improved automatic classification methods.

Findings

01

Achieved 81.3% balanced accuracy with ResNet18.

02

Dataset includes 3,752 unique samples from four singers.

03

Provides baseline classification results for future research.

Abstract

The Complete Vocal Technique (CVT) is a school of singing developed in the past decades by Cathrin Sadolin et al.. CVT groups the use of the voice into so called vocal modes, namely Neutral, Curbing, Overdrive and Edge. Knowledge of the desired vocal mode can be helpful for singing students. Automatic classification of vocal modes can thus be important for technology-assisted singing teaching. Previously, automatic classification of vocal modes has been attempted without major success, potentially due to a lack of data. Therefore, we recorded a novel vocal mode dataset consisting of sustained vowels recorded from four singers, three of which professional singers with more than five years of CVT-experience. The dataset covers the entire vocal range of the subjects, totaling 3,752 unique samples. By using four microphones, thereby offering a natural data augmentation, the dataset consists…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://zenodo.org/records/14276415
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.