Playing a Part: Speaker Verification at the Movies

Andrew Brown; Jaesung Huh; Arsha Nagrani; Joon Son Chung; Andrew Zisserman

arXiv:2010.15716 · cs.SD · February 12, 2021

TL;DR

This paper evaluates how well current speaker recognition models perform on movie speech data with disguises and domain differences, introduces a new challenging dataset, and explores domain adaptation techniques.

Contribution

Introduces VoxMovies, a new challenging dataset for speaker recognition in movies, and benchmarks model performance with domain adaptation methods.

Findings

01. Speaker verification and identification performance drops steeply on movie speech.

02. Domain adaptation improves accuracy, but a gap remains.

03. The VoxMovies dataset highlights the challenges of speaker recognition on real-world, disguised speech.

Abstract

The goal of this work is to investigate the performance of popular speaker recognition models on speech segments from movies, where often actors intentionally disguise their voice to play a character. We make the following three contributions: (i) We collect a novel, challenging speaker recognition dataset called VoxMovies, with speech for 856 identities from almost 4000 movie clips. VoxMovies contains utterances with varying emotion, accents and background noise, and therefore comprises an entirely different domain to the interview-style, emotionally calm utterances in current speaker recognition datasets such as VoxCeleb; (ii) We provide a number of domain adaptation evaluation sets, and benchmark the performance of state-of-the-art speaker recognition models on these evaluation pairs. We demonstrate that both speaker verification and identification performance drops steeply on this…
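The benchmark described in the abstract scores pairs of utterances as same-speaker or different-speaker. Below is a minimal sketch of how such verification pairs are commonly scored. It assumes cosine similarity between fixed-length speaker embeddings and equal error rate (EER) as the metric; neither choice is stated in the visible portion of the abstract, and the function names are hypothetical rather than taken from the VoxMovies repository.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def equal_error_rate(scores, labels):
    """Approximate EER for verification trials.

    scores: similarity score per trial pair (higher = more likely same speaker)
    labels: 1 for a same-speaker pair, 0 for a different-speaker pair

    Sweeps the decision threshold over the observed scores and returns the
    mean of the false-accept and false-reject rates at the threshold where
    the two rates are closest.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative pair")

    best_gap, eer = None, None
    for threshold in sorted(set(scores)):
        # A pair is accepted as same-speaker when its score >= threshold.
        false_accepts = sum(
            1 for s, l in zip(scores, labels) if l == 0 and s >= threshold
        )
        false_rejects = sum(
            1 for s, l in zip(scores, labels) if l == 1 and s < threshold
        )
        far = false_accepts / n_neg
        frr = false_rejects / n_pos
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Toy trial list with made-up scores (illustrative only, not paper results).
toy_scores = [0.9, 0.3, 0.6, 0.1]
toy_labels = [1, 1, 0, 0]
toy_eer = equal_error_rate(toy_scores, toy_labels)
```

A domain shift of the kind the abstract describes would show up here as same-speaker scores sliding toward the different-speaker range, which raises the EER.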

Code & Models

Repositories

JaesungHuh/VoxMovies (PyTorch)
