Design Considerations For Hypothesis Rejection Modules In Spoken Language Understanding Systems
Aman Alok, Rahul Gupta, Shankar Ananthakrishnan

TL;DR
This paper explores two designs for hypothesis rejection modules in Spoken Language Understanding systems, comparing their effectiveness and analyzing the impact of incorporating ASR features to improve rejection accuracy.
Contribution
It introduces and compares two hypothesis rejection schemes for SLU systems, highlighting their similarities, differences, and the benefits of including ASR features.
Findings
Both schemes achieve similar rejection performance (~2.5% FRR at ~4.5% FAR)
Using all features improves system performance
Incorporating ASR features reduces FRR to 1.9% at 3.8% FAR
Abstract
Spoken Language Understanding (SLU) systems typically consist of a set of machine learning models that operate in conjunction to produce an SLU hypothesis. The generated hypothesis is then sent to downstream components for further action. However, it is desirable to discard an incorrect hypothesis before sending it downstream. In this work, we present two designs for SLU hypothesis rejection modules: (i) scheme R1 that performs rejection on domain specific SLU hypothesis and, (ii) scheme R2 that performs rejection on hypothesis generated from the overall SLU system. Hypothesis rejection modules in both schemes reject/accept a hypothesis based on features drawn from the utterance directed to the SLU system, the associated SLU hypothesis and SLU confidence score. Our experiments suggest that both the schemes yield similar results (scheme R1: 2.5% FRR @ 4.5% FAR, scheme R2: 2.5% FRR @ 4.6%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
