Parameter-Free Attentive Scoring for Speaker Verification

Jason Pelecanos; Quan Wang; Yiling Huang; Ignacio Lopez Moreno

arXiv:2203.05642·cs.SD·March 7, 2023

Parameter-Free Attentive Scoring for Speaker Verification

Jason Pelecanos, Quan Wang, Yiling Huang, Ignacio Lopez Moreno

PDF

1 Repo

TL;DR

This paper introduces a parameter-free attentive scoring method for speaker verification, inspired by Transformer attention, which improves performance without needing a parametric scoring model.

Contribution

It proposes a novel, parameter-free attentive scoring mechanism based on scaled dot product attention for speaker verification tasks.

Findings

01

Improves average EER by 10% over cosine similarity baseline

02

Demonstrates effectiveness across four different tasks

03

Explores various normalization and pooling strategies

Abstract

This paper presents a novel study of parameter-free attentive scoring for speaker verification. Parameter-free scoring provides the flexibility of comparing speaker representations without the need of an accompanying parametric scoring model. Inspired by the attention component in Transformer neural networks, we propose a variant of the scaled dot product attention mechanism to compare enrollment and test segment representations. In addition, this work explores the effect on performance of (i) different types of normalization, (ii) independent versus tied query/key estimation, (iii) varying the number of key-value pairs and (iv) pooling multiple enrollment utterance statistics. Experimental results for a 4 task average show that a simple parameter-free attentive scoring mechanism can improve the average EER by 10% over the best cosine similarity baseline.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google/speaker-id/tree/master/lingvo
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Dense Connections · Residual Connection · Layer Normalization · Absolute Position Encodings · Adam · Label Smoothing