Uncertainty Determines the Adequacy of the Mode and the Tractability of Decoding in Sequence-to-Sequence Models
Felix Stahlberg, Ilia Kulikov, Shankar Kumar

TL;DR
This paper investigates how intrinsic uncertainty in NLP tasks influences the effectiveness of sequence-to-sequence models, revealing that high ambiguity affects search strategies and model behavior, and introduces a new exact n-best search algorithm.
Contribution
The study quantifies sentence-level uncertainty in NLP tasks and demonstrates its impact on search and model performance, proposing a novel exact n-best search method for uncertain tasks.
Findings
High uncertainty leads to more beam search errors.
Mode inadequacy is prominent in highly ambiguous tasks.
The new n-best search algorithm improves decoding accuracy.
Abstract
In many natural language processing (NLP) tasks the same input (e.g. source sentence) can have multiple possible outputs (e.g. translations). To analyze how this ambiguity (also known as intrinsic uncertainty) shapes the distribution learned by neural sequence models we measure sentence-level uncertainty by computing the degree of overlap between references in multi-reference test sets from two different NLP tasks: machine translation (MT) and grammatical error correction (GEC). At both the sentence- and the task-level, intrinsic uncertainty has major implications for various aspects of search such as the inductive biases in beam search and the complexity of exact search. In particular, we show that well-known pathologies such as a high number of beam search errors, the inadequacy of the mode, and the drop in system performance with large beam sizes apply to tasks with high level of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Machine Learning and Data Classification
