Machine Learning Approaches to Vocal Register Classification in Contemporary Male Pop Music

Alexander Kim; Charlotte Botha

arXiv:2505.11378·cs.SD·August 22, 2025

Machine Learning Approaches to Vocal Register Classification in Contemporary Male Pop Music

Alexander Kim, Charlotte Botha

PDF

Open Access

TL;DR

This paper introduces two machine learning methods, SVM and CNN, for classifying vocal registers in male pop music using mel-spectrogram features, aiding singers and vocal analysis tools.

Contribution

It presents novel machine learning approaches for vocal register classification in pop music, including practical integration and a new software tool called AVRA.

Findings

01

Support Vector Machine achieved high classification accuracy.

02

Convolutional Neural Network demonstrated robust performance.

03

Methods support broader voice type and genre analysis.

Abstract

For singers of all experience levels, one of the most daunting challenges in learning technical repertoire is navigating placement and vocal register in and around the passagio (passage between chest voice and head voice registers). Particularly in pop music, where a single artist may use a variety of timbre's and textures to achieve a desired quality, it can be difficult to identify what vocal register within the vocal range a singer is using. This paper presents two methods for classifying vocal registers in an audio signal of male pop music through the analysis of textural features of mel-spectrogram images. Additionally, we will discuss the practical integration of these models for vocal analysis tools, and introduce a concurrently developed software called AVRA which stands for Automatic Vocal Register Analysis. Our proposed methods achieved consistent classification of vocal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Voice and Speech Disorders · Emotion and Mood Recognition