Speech-Driven Text Retrieval: Using Target IR Collections for Statistical Language Model Adaptation in Speech Recognition

Atsushi Fujii; Katunobu Itou; Tetsuya Ishikawa

arXiv:cs/0206037 · cs.CL · May 23, 2007 · 5 cites

Open Access

TL;DR

This paper presents a method to improve speech-driven text retrieval by adapting speech recognition language models to target collections, enhancing both recognition and retrieval accuracy in practical applications.

Contribution

It introduces a novel approach to adapt statistical language models for speech recognition based on target collections, specifically for speech-driven text retrieval.

Findings

01. Adaptation improves recognition accuracy

02. Enhanced retrieval performance observed

03. Effective with existing test collections

Abstract

Speech recognition has of late become a practical technology for real world applications. Aiming at speech-driven text retrieval, which facilitates retrieving information with spoken queries, we propose a method to integrate speech recognition and retrieval methods. Since users speak contents related to a target collection, we adapt statistical language models used for speech recognition based on the target collection, so as to improve both the recognition and retrieval accuracy. Experiments using existing test collections combined with dictated queries showed the effectiveness of our method.
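The adaptation idea in the abstract can be illustrated with a toy sketch. The abstract does not give the model details, so everything below is an assumption made for illustration: the add-one smoothed bigram model, the whitespace tokenizer, the three-sentence collection, and the helper names `train_bigram_lm`, `log_prob`, and `pick_hypothesis` are all hypothetical and are not the authors' implementation. The sketch only shows why a language model estimated from the target collection tends to prefer recognition hypotheses that resemble that collection's wording.

```python
import math
from collections import Counter


def train_bigram_lm(documents):
    """Count unigrams and bigrams over a whitespace-tokenized collection."""
    unigrams, bigrams = Counter(), Counter()
    for doc in documents:
        tokens = ["<s>"] + doc.lower().split() + ["</s>"]
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    return unigrams, bigrams


def log_prob(sentence, unigrams, bigrams):
    """Add-one smoothed bigram log-probability of a sentence."""
    vocab_size = len(unigrams) + 1  # +1 reserves mass for unseen words
    tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
    total = 0.0
    for prev, word in zip(tokens, tokens[1:]):
        numerator = bigrams[(prev, word)] + 1
        denominator = unigrams[prev] + vocab_size
        total += math.log(numerator / denominator)
    return total


def pick_hypothesis(hypotheses, unigrams, bigrams):
    """Return the hypothesis the collection-trained model scores highest."""
    return max(hypotheses, key=lambda h: log_prob(h, unigrams, bigrams))


# Toy stand-in for the target IR collection (assumed data).
collection = [
    "speech recognition for text retrieval",
    "language model adaptation for speech recognition",
    "text retrieval with spoken queries",
]
unigrams, bigrams = train_bigram_lm(collection)

# Two acoustically confusable candidate transcriptions (assumed data).
hypotheses = ["speech recognition", "peach wreck ignition"]
best = pick_hypothesis(hypotheses, unigrams, bigrams)
print(best)
```

In a real recognizer the language model score would be combined with an acoustic score rather than used alone, and a practical system would use a stronger smoothing scheme than add-one.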

Taxonomy

Topics: Speech Recognition and Synthesis · Speech and dialogue systems · Music and Audio Processing