Robust Spoken Language Understanding with RL-based Value Error Recovery

Chen Liu; Su Zhu; Lu Chen; Kai Yu

arXiv:2009.03095·cs.CL·September 8, 2020

Robust Spoken Language Understanding with RL-based Value Error Recovery

Chen Liu, Su Zhu, Lu Chen, Kai Yu

PDF

Open Access

TL;DR

This paper introduces a robust SLU framework that combines input adaptation with rule-based value error recovery, optimized via reinforcement learning to improve extraction accuracy from ASR-transcribed speech.

Contribution

It proposes a novel integrated SLU framework that jointly utilizes rule-based error recovery and RL-based optimization, enhancing robustness against ASR errors.

Findings

01

Significant improvement over baselines on CATSLU dataset

02

Effective integration of value error recovery with RL optimization

03

Enhanced robustness of SLU in noisy speech scenarios

Abstract

Spoken Language Understanding (SLU) aims to extract structured semantic representations (e.g., slot-value pairs) from speech recognized texts, which suffers from errors of Automatic Speech Recognition (ASR). To alleviate the problem caused by ASR-errors, previous works may apply input adaptations to the speech recognized texts, or correct ASR errors in predicted values by searching the most similar candidates in pronunciation. However, these two methods are applied separately and independently. In this work, we propose a new robust SLU framework to guide the SLU input adaptation with a rule-based value error recovery module. The framework consists of a slot tagging model and a rule-based value error recovery module. We pursue on an adapted slot tagging model which can extract potential slot-value pairs mentioned in ASR hypotheses and is suitable for the existing value error recovery…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis