Speaker discrimination in humans and machines: Effects of speaking style variability
Amber Afshan, Jody Kreiman, and Abeer Alwan

TL;DR
This study compares human and machine speaker discrimination performance across different speaking styles, revealing that both perform better with style-matched speech and that combining their responses enhances accuracy, with humans outperforming machines in certain conditions.
Contribution
The paper provides a comparative analysis of human and machine speaker discrimination, highlighting differences in approach and performance across speaking styles, and demonstrates the benefit of fusing their responses.
Findings
Humans outperform machines in style-matched conditions for read speech.
Fusing human and machine responses improves speaker discrimination accuracy.
Both humans and machines perform better with style-matched speech.
Abstract
Does speaking style variation affect humans' ability to distinguish individuals from their voices? How do humans compare with automatic systems designed to discriminate between voices? In this paper, we attempt to answer these questions by comparing human and machine speaker discrimination performance for read speech versus casual conversations. Thirty listeners were asked to perform a same versus different speaker task. Their performance was compared to a state-of-the-art x-vector/PLDA-based automatic speaker verification system. Results showed that both humans and machines performed better with style-matched stimuli, and human performance was better when listeners were native speakers of American English. Native listeners performed better than machines in the style-matched conditions (EERs of 6.96% versus 14.35% for read speech, and 15.12% versus 19.87%, for conversations), but for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods7 Fastest Ways to Call American Airlines Reservations Number (USA Guide)
