A Re-ranker Scheme for Integrating Large Scale NLU models
Chengwei Su, Rahul Gupta, Shankar Ananthakrishnan, Spyros Matsoukas

TL;DR
This paper introduces a novel re-ranker for large-scale NLU systems that improves hypothesis selection, calibration, and domain modularity, leading to better cross-domain performance and asynchronous training capabilities.
Contribution
It proposes a new re-ranker strategy with specialized loss functions that enhances hypothesis accuracy, calibration, and supports independent domain training in large-scale NLU systems.
Findings
Reduced top hypothesis error rate
Achieved cross-domain calibration
Supported independent domain dataset training
Abstract
Large scale Natural Language Understanding (NLU) systems are typically trained on large quantities of data, requiring a fast and scalable training strategy. A typical design for NLU systems consists of domain-level NLU modules (domain classifier, intent classifier and named entity recognizer). Hypotheses (NLU interpretations consisting of various intent+slot combinations) from these domain specific modules are typically aggregated with another downstream component. The re-ranker integrates outputs from domain-level recognizers, returning a scored list of cross domain hypotheses. An ideal re-ranker will exhibit the following two properties: (a) it should prefer the most relevant hypothesis for the given input as the top hypothesis and, (b) the interpretation scores corresponding to each hypothesis produced by the re-ranker should be calibrated. Calibration allows the final NLU…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems
