Staircase Network: structural language identification via hierarchical   attentive units

Trung Ngo Trong; Ville Hautam\"aki; Kristiina Jokinen

arXiv:1804.11067·cs.AI·May 1, 2018

Staircase Network: structural language identification via hierarchical attentive units

Trung Ngo Trong, Ville Hautam\"aki, Kristiina Jokinen

PDF

TL;DR

The paper introduces a hierarchical neural network with attentive units for language identification, leveraging meta-information and auxiliary tasks to improve accuracy and generalization over existing methods.

Contribution

It proposes a novel staircase architecture that incorporates hierarchical meta-information and auxiliary tasks for robust language recognition.

Findings

01

Outperforms state-of-the-art i-vector methods on multiple corpora.

02

Effectively handles class imbalance and channel variability.

03

Enhances generalization through hierarchical learning and auxiliary tasks.

Abstract

Language recognition system is typically trained directly to optimize classification error on the target language labels, without using the external, or meta-information in the estimation of the model parameters. However labels are not independent of each other, there is a dependency enforced by, for example, the language family, which affects negatively on classification. The other external information sources (e.g. audio encoding, telephony or video speech) can also decrease classification accuracy. In this paper, we attempt to solve these issues by constructing a deep hierarchical neural network, where different levels of meta-information are encapsulated by attentive prediction units and also embedded into the training progress. The proposed method learns auxiliary tasks to obtain robust internal representation and to construct a variant of attentive units within the hierarchical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.