LIA system description for NIST SRE 2016

Mickael Rouvier; Pierre-Michel Bousquet; Moez Ajili; Waad Ben Kheder,; Driss Matrouf; Jean-Fran\c{c}ois Bonastre

arXiv:1612.05168·cs.SD·December 16, 2016·5 cites

LIA system description for NIST SRE 2016

Mickael Rouvier, Pierre-Michel Bousquet, Moez Ajili, Waad Ben Kheder,, Driss Matrouf, Jean-Fran\c{c}ois Bonastre

PDF

Open Access

TL;DR

This paper details the LIA speaker recognition system for NIST SRE 2016, combining eight sub-systems based on i-vector/PLDA with various features and data-shifting techniques, fused at score level.

Contribution

It introduces a multi-sub-system speaker recognition system utilizing diverse feature extraction and data-shifting methods, optimized for NIST SRE 2016 evaluation.

Findings

01

Achieved competitive speaker recognition performance.

02

Demonstrated effectiveness of fusion of multiple sub-systems.

03

Validated robustness of diverse feature and data-shifting combinations.

Abstract

This paper describes the LIA speaker recognition system developed for the Speaker Recognition Evaluation (SRE) campaign. Eight sub-systems are developed, all based on a state-of-the-art approach: i-vector/PLDA which represents the mainstream technique in text-independent speaker recognition. These sub-systems differ: on the acoustic feature extraction front-end (MFCC, PLP), at the i-vector extraction stage (UBM, DNN or two-feats posteriors) and finally on the data-shifting (IDVC, mean-shifting). The submitted system is a fusion at the score-level of these eight sub-systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing