Processing Self Corrections in a speech to speech system

Joerg Spilker; Martin Klarner; Guenther Goerz

arXiv:cs/0008016·cs.CL·May 23, 2007

Processing Self Corrections in a speech to speech system

Joerg Spilker, Martin Klarner, Guenther Goerz

PDF

Open Access

TL;DR

This paper introduces a multi-level framework for detecting and correcting speech repairs in spontaneous dialogues by cascading filters across acoustics, lexis, syntax, and semantics to improve spoken language systems.

Contribution

It presents a novel integrated approach combining acoustic, lexical, syntactic, and semantic information with cascading filters for effective speech repair correction.

Findings

01

Improved detection of speech repairs in spontaneous speech.

02

Enhanced correction accuracy through multi-level filtering.

03

Integration of acoustic and linguistic features boosts system robustness.

Abstract

Speech repairs occur often in spontaneous spoken dialogues. The ability to detect and correct those repairs is necessary for any spoken language system. We present a framework to detect and correct speech repairs where all relevant levels of information, i.e., acoustics, lexis, syntax and semantics can be integrated. The basic idea is to reduce the search space for repairs as soon as possible by cascading filters that involve more and more features. At first an acoustic module generates hypotheses about the existence of a repair. Second a stochastic model suggests a correction for every hypothesis. Well scored corrections are inserted as new paths in the word lattice. Finally a lattice parser decides on accepting the rep air.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems