Objective Measurements of Voice Quality
Hira Dhamyal, Rita Singh

TL;DR
This paper develops objective formulae to quantify voice quality by linking subjective descriptors to measurable signal properties, validated against datasets with subjective labels.
Contribution
It introduces 24 formulae for voice sub-qualities based on 25 signal metrics, bridging subjective descriptors with objective measurements.
Findings
Formulae accurately reflect subjective voice quality labels
Validation confirms the correlation between formulae and perceived qualities
Provides a standardized approach for objective voice quality assessment
Abstract
The quality of human voice plays an important role across various fields like music, speech therapy, and communication, yet it lacks a universally accepted, objective definition. Instead, voice quality is referred to using subjective descriptors like "rough", "breathy" etc. Despite this subjectivity, extensive research across disciplines has linked these voice qualities to specific information about the speaker, such as health, physiological traits, and others. Current machine learning approaches for voice profiling rely on data-driven analysis without fully incorporating these established correlations, due to their qualitative nature. This paper aims to objectively quantify voice quality by synthesizing formulaic representations from past findings that correlate voice qualities to signal-processing metrics. We introduce formulae for 24 voice sub-qualities based on 25 signal properties,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis
