I-vector Based Within Speaker Voice Quality Identification on connected   speech

Chuyao Feng; Eva van Leer; Mackenzie Lee Curtis; David V. Anderson

arXiv:2102.07307·cs.SD·February 16, 2021

I-vector Based Within Speaker Voice Quality Identification on connected speech

Chuyao Feng, Eva van Leer, Mackenzie Lee Curtis, David V. Anderson

PDF

Open Access

TL;DR

This study demonstrates that i-vector based systems significantly improve the automatic classification of different voice qualities in connected speech, which can aid in voice therapy and training.

Contribution

It introduces an i-vector based approach for differentiating voice qualities within the same speaker, outperforming traditional acoustic measures.

Findings

01

i-vector system achieved 97.5% accuracy

02

traditional system achieved 77.2% accuracy

03

i-vector approach better captures voice quality features

Abstract

Voice disorders affect a large portion of the population, especially heavy voice users such as teachers or call-center workers. Most voice disorders can be treated effectively with behavioral voice therapy, which teaches patients to replace problematic, habituated voice production mechanics with optimal voice production technique(s), yielding improved voice quality. However, treatment often fails because patients have difficulty differentiating their habitual voice from the target technique independently, when clinician feedback is unavailable between therapy sessions. Therefore, with the long term aim to extend clinician feedback to extra-clinical settings, we built two systems that automatically differentiate various voice qualities produced by the same individual. We hypothesized that 1) a system based on i-vectors could classify these qualities as if they represent different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVoice and Speech Disorders · Speech Recognition and Synthesis · Phonetics and Phonology Research