Language Modeling with Highway LSTM

Gakuto Kurata; Bhuvana Ramabhadran; George Saon; Abhinav Sethy

arXiv:1709.06436·cs.CL·September 20, 2017·1 cites

Language Modeling with Highway LSTM

Gakuto Kurata, Bhuvana Ramabhadran, George Saon, Abhinav Sethy

PDF

Open Access

TL;DR

This paper introduces Highway LSTM, an extension of standard LSTM with highway networks inside, which increases depth and improves language modeling performance in speech recognition tasks.

Contribution

The paper proposes Highway LSTM models with highway networks integrated inside LSTM units, enhancing depth and accuracy over traditional LSTM language models.

Findings

01

HW-LSTM improves speech recognition accuracy.

02

Achieves new best performance on Switchboard and CallHome tasks.

Abstract

Language models (LMs) based on Long Short Term Memory (LSTM) have shown good gains in many automatic speech recognition tasks. In this paper, we extend an LSTM by adding highway networks inside an LSTM and use the resulting Highway LSTM (HW-LSTM) model for language modeling. The added highway networks increase the depth in the time dimension. Since a typical LSTM has two internal states, a memory cell and a hidden state, we compare various types of HW-LSTM by adding highway networks onto the memory cell and/or the hidden state. Experimental results on English broadcast news and conversational telephone speech recognition show that the proposed HW-LSTM LM improves speech recognition accuracy on top of a strong LSTM LM baseline. We report 5.1% and 9.9% on the Switchboard and CallHome subsets of the Hub5 2000 evaluation, which reaches the best performance numbers reported on these tasks to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Music and Audio Processing

MethodsHighway networks · Sigmoid Activation · Tanh Activation · Long Short-Term Memory