I-vector Based Within Speaker Voice Quality Identification on connected speech
Chuyao Feng, Eva van Leer, Mackenzie Lee Curtis, David V. Anderson

TL;DR
This study demonstrates that i-vector based systems significantly improve the automatic classification of different voice qualities in connected speech, which can aid in voice therapy and training.
Contribution
It introduces an i-vector based approach for differentiating voice qualities within the same speaker, outperforming traditional acoustic measures.
Findings
i-vector system achieved 97.5% accuracy
traditional system achieved 77.2% accuracy
i-vector approach better captures voice quality features
Abstract
Voice disorders affect a large portion of the population, especially heavy voice users such as teachers or call-center workers. Most voice disorders can be treated effectively with behavioral voice therapy, which teaches patients to replace problematic, habituated voice production mechanics with optimal voice production technique(s), yielding improved voice quality. However, treatment often fails because patients have difficulty differentiating their habitual voice from the target technique independently, when clinician feedback is unavailable between therapy sessions. Therefore, with the long term aim to extend clinician feedback to extra-clinical settings, we built two systems that automatically differentiate various voice qualities produced by the same individual. We hypothesized that 1) a system based on i-vectors could classify these qualities as if they represent different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVoice and Speech Disorders · Speech Recognition and Synthesis · Phonetics and Phonology Research
