Quantifying the effect of speech pathology on automatic and human   speaker verification

Bence Mark Halpern; Thomas Tienkamp; Wen-Chin Huang; Lester Phillip; Violeta; Teja Rebernik; Sebastiaan de Visscher; Max Witjes; Martijn Wieling,; Defne Abur; Tomoki Toda

arXiv:2406.06208·cs.SD·June 11, 2024

Quantifying the effect of speech pathology on automatic and human speaker verification

Bence Mark Halpern, Thomas Tienkamp, Wen-Chin Huang, Lester Phillip, Violeta, Teja Rebernik, Sebastiaan de Visscher, Max Witjes, Martijn Wieling,, Defne Abur, Tomoki Toda

PDF

Open Access

TL;DR

This study examines how speech pathology, due to oral cancer surgery, affects automatic and human speaker verification performance, revealing negative impacts and correlations with speech severity, alongside perceptual comparisons.

Contribution

It provides new insights into the impact of speech pathology on speaker verification systems using parallel pre- and post-surgery datasets.

Findings

01

Pathological speech reduces ASV accuracy

02

Speech severity correlates with decreased ASV performance

03

Moderate agreement between perceptual and objective severity scores

Abstract

This study investigates how surgical intervention for speech pathology (specifically, as a result of oral cancer surgery) impacts the performance of an automatic speaker verification (ASV) system. Using two recently collected Dutch datasets with parallel pre and post-surgery audio from the same speaker, NKI-OC-VC and SPOKE, we assess the extent to which speech pathology influences ASV performance, and whether objective/subjective measures of speech severity are correlated with the performance. Finally, we carry out a perceptual study to compare judgements of ASV and human listeners. Our findings reveal that pathological speech negatively affects ASV performance, and the severity of the speech is negatively correlated with the performance. There is a moderate agreement in perceptual and objective scores of speaker similarity and severity, however, we could not clearly establish in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and dialogue systems