Use of Knowledge Graph in Rescoring the N-Best List in Automatic Speech Recognition
Ashwini Jaya Kumar, Camilo Morales, Maria-Esther Vidal, Christoph, Schmidt, S\"oren Auer

TL;DR
This paper introduces a novel approach to rescoring N-best lists in automatic speech recognition by leveraging semantic relatedness computed through knowledge graph embeddings, enhancing recognition accuracy.
Contribution
It applies semantic web techniques, specifically TransE embeddings, to improve N-best list rescoring in ASR, which is a new approach in the field.
Findings
Semantic relatedness improves rescoring accuracy
Knowledge graph embeddings enhance recognition performance
Novel application of semantic web in ASR
Abstract
With the evolution of neural network based methods, automatic speech recognition (ASR) field has been advanced to a level where building an application with speech interface is a reality. In spite of these advances, building a real-time speech recogniser faces several problems such as low recognition accuracy, domain constraint, and out-of-vocabulary words. The low recognition accuracy problem is addressed by improving the acoustic model, language model, decoder and by rescoring the N-best list at the output of the decoder. We are considering the N-best list rescoring approach to improve the recognition accuracy. Most of the methods in the literature use the grammatical, lexical, syntactic and semantic connection between the words in a recognised sentence as a feature to rescore. In this paper, we have tried to see the semantic relatedness between the words in a sentence to rescore the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks
