Exploiting Nontrivial Connectivity for Automatic Speech Recognition

Marius Paraschiv; Lasse Borgholt; Tycho Max Sylvester Tax; Marco Singh; and Lars Maal{\o}e

arXiv:1711.10271·cs.SD·November 29, 2017

Exploiting Nontrivial Connectivity for Automatic Speech Recognition

Marius Paraschiv, Lasse Borgholt, Tycho Max Sylvester Tax, Marco Singh, and Lars Maal{\o}e

PDF

Open Access

TL;DR

This paper compares different deep network architectures and demonstrates their effective application to automatic speech recognition, leading to significant performance improvements.

Contribution

It introduces the application of residual, densely-connected, and highway networks to speech recognition, showing their advantages over traditional models.

Findings

01

Residual, dense, and highway networks improve speech recognition accuracy.

02

Deep networks with nontrivial connectivity address vanishing gradients.

03

Significant performance gains over existing models in speech recognition.

Abstract

Nontrivial connectivity has allowed the training of very deep networks by addressing the problem of vanishing gradients and offering a more efficient method of reusing parameters. In this paper we make a comparison between residual networks, densely-connected networks and highway networks on an image classification task. Next, we show that these methodologies can easily be deployed into automatic speech recognition and provide significant improvements to existing models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech Recognition and Synthesis · Speech and Audio Processing

MethodsHighway networks