A Parallelizable Lattice Rescoring Strategy with Neural Language Models

Ke Li; Daniel Povey; Sanjeev Khudanpur

arXiv:2103.05081·eess.AS·March 10, 2021

A Parallelizable Lattice Rescoring Strategy with Neural Language Models

Ke Li, Daniel Povey, Sanjeev Khudanpur

PDF

Open Access 1 Repo

TL;DR

This paper introduces a parallelizable lattice rescoring method using neural language models that improves efficiency and lattice compactness in speech recognition, facilitating easier integration with existing systems.

Contribution

It presents a novel parallel computation strategy and lattice expansion algorithm that enhance neural LM rescoring efficiency and lattice compactness.

Findings

01

Achieves comparable recognition performance to baseline methods.

02

Generates more compact lattices, reducing computational complexity.

03

Simplifies integration of neural LMs with Kaldi.

Abstract

This paper proposes a parallel computation strategy and a posterior-based lattice expansion algorithm for efficient lattice rescoring with neural language models (LMs) for automatic speech recognition. First, lattices from first-pass decoding are expanded by the proposed posterior-based lattice expansion algorithm. Second, each expanded lattice is converted into a minimal list of hypotheses that covers every arc. Each hypothesis is constrained to be the best path for at least one arc it includes. For each lattice, the neural LM scores of the minimal list are computed in parallel and are then integrated back to the lattice in the rescoring stage. Experiments on the Switchboard dataset show that the proposed rescoring strategy obtains comparable recognition performance and generates more compact lattices than a competitive baseline method. Furthermore, the parallel rescoring method offers…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kaldi-asr/kaldi
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Natural Language Processing Techniques · Speech and Audio Processing