Speaker recognition by means of a combination of linear and nonlinear   predictive models

Marcos Faundez-Zanuy

arXiv:2203.03190·cs.SD·March 8, 2022

Speaker recognition by means of a combination of linear and nonlinear predictive models

Marcos Faundez-Zanuy

PDF

Open Access

TL;DR

This paper enhances speaker recognition accuracy by combining linear and nonlinear predictive models with LPCC features, showing significant error rate reductions and proposing an efficient computational algorithm.

Contribution

It introduces a novel combination of predictive models with LPCC for improved speaker recognition performance.

Findings

01

Error rate reduced from 6.31% to 3.68% with nonlinear residuals.

02

Linear prediction residuals improve accuracy by 2.63%.

03

Proposes an efficient algorithm for computational reduction.

Abstract

This paper deals the combination of nonlinear predictive models with classical LPCC parameterization for speaker recognition. It is shown that the combination of both a measure defined over LPCC coefficients and a measure defined over predictive analysis residual signal gives rise to an improvement over the classical method that considers only the LPCC coefficients. If the residual signal is obtained from a linear prediction analysis, the improvement is 2.63% (error rate drops from 6.31% to 3.68%) and if it is computed through a nonlinear predictive neural nets based model, the improvement is 3.68%. An efficient algorithm for reducing the computational burden is also proposed.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Fuzzy Logic and Control Systems · Fault Detection and Control Systems