# Deep Neural Network Embeddings with Gating Mechanisms for   Text-Independent Speaker Verification

**Authors:** Lanhua You, Wu Guo, Lirong Dai, Jun Du

arXiv: 1903.12092 · 2019-04-05

## TL;DR

This paper introduces gating mechanisms in deep neural networks for speaker verification, enhancing frame-level representations and pooling strategies to improve performance on NIST datasets.

## Contribution

It proposes a gated convolutional neural network and a novel gated-attention pooling method for more effective speaker verification.

## Key findings

- GCNN outperforms traditional TDNN in feature representation.
- Gated-attention pooling improves verification accuracy.
- Results show significant performance gains on NIST SRE datasets.

## Abstract

In this paper, gating mechanisms are applied in deep neural network (DNN) training for x-vector-based text-independent speaker verification. First, a gated convolution neural network (GCNN) is employed for modeling the frame-level embedding layers. Compared with the time-delay DNN (TDNN), the GCNN can obtain more expressive frame-level representations through carefully designed memory cell and gating mechanisms. Moreover, we propose a novel gated-attention statistics pooling strategy in which the attention scores are shared with the output gate. The gated-attention statistics pooling combines both gating and attention mechanisms into one framework; therefore, we can capture more useful information in the temporal pooling layer. Experiments are carried out using the NIST SRE16 and SRE18 evaluation datasets. The results demonstrate the effectiveness of the GCNN and show that the proposed gated-attention statistics pooling can further improve the performance.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.12092/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/1903.12092/full.md

## References

23 references — full list in the complete paper: https://tomesphere.com/paper/1903.12092/full.md

---
Source: https://tomesphere.com/paper/1903.12092