Sparse Architectures for Text-Independent Speaker Verification Using   Deep Neural Networks

Sara Sedighi; Shayan Ramhormozi

arXiv:1805.07628·cs.SD·August 13, 2018

Sparse Architectures for Text-Independent Speaker Verification Using Deep Neural Networks

Sara Sedighi, Shayan Ramhormozi

PDF

Open Access

TL;DR

This paper explores structured sparsity in deep neural networks for text-independent speaker verification, demonstrating that pruning can enhance verification accuracy and reduce computational demands.

Contribution

It introduces structured sparsity enforcement in DNNs for speaker verification, showing that pruning can improve performance by mitigating overfitting.

Findings

01

Sparsity enforcement improves verification accuracy.

02

Pruned models require less computational power.

03

Sparsity prevents overfitting in deep networks.

Abstract

Network pruning is of great importance due to the elimination of the unimportant weights or features activated due to the network over-parametrization. Advantages of sparsity enforcement include preventing the overfitting and speedup. Considering a large number of parameters in deep architectures, network compression becomes of critical importance due to the required huge amount of computational power. In this work, we impose structured sparsity for speaker verification which is the validation of the query speaker compared to the speaker gallery. We will show that the mere sparsity enforcement can improve the verification results due to the possible initial overfitting in the network.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing