Optimization of data-driven filterbank for automatic speaker   verification

Susanta Sarangi; Md Sahidullah; Goutam Saha

arXiv:2007.10729·eess.AS·July 22, 2020

Optimization of data-driven filterbank for automatic speaker verification

Susanta Sarangi, Md Sahidullah, Goutam Saha

PDF

TL;DR

This paper introduces a novel data-driven filterbank design for speaker verification that optimizes filter parameters from speech data, leading to improved discriminative features and better verification accuracy compared to traditional methods.

Contribution

The paper proposes a new filterbank optimization method using PCA and speech-signal-based frequency warping, requiring limited unlabeled data, and demonstrates its superiority over existing filterbanks.

Findings

01

9.75% relative EER reduction with VoxCeleb1 and i-vector system

02

4.43% relative EER reduction with x-vector system

03

Fusion with MFCC further improves performance

Abstract

Most of the speech processing applications use triangular filters spaced in mel-scale for feature extraction. In this paper, we propose a new data-driven filter design method which optimizes filter parameters from a given speech data. First, we introduce a frame-selection based approach for developing speech-signal-based frequency warping scale. Then, we propose a new method for computing the filter frequency responses by using principal component analysis (PCA). The main advantage of the proposed method over the recently introduced deep learning based methods is that it requires very limited amount of unlabeled speech-data. We demonstrate that the proposed filterbank has more speaker discriminative power than commonly used mel filterbank as well as existing data-driven filterbank. We conduct automatic speaker verification (ASV) experiments with different corpora using various…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.