Modeling Confidence in Sequence-to-Sequence Models
Jan Niehues, Ngoc-Quan Pham

TL;DR
This paper introduces methods to assess the confidence of sequence-to-sequence models by measuring similarity between training and test conditions, improving confidence estimation for tasks like machine translation and speech recognition.
Contribution
It proposes novel confidence estimation techniques based on similarity measures, including internal alignment models, applicable across multiple sequence-to-sequence tasks.
Findings
Improved segment-level confidence estimation in machine translation.
Enhanced confidence annotation for source tokens.
Identified 60% of errors by examining 20% of speech recognition data.
Abstract
Recently, significant improvements have been achieved in various natural language processing tasks using neural sequence-to-sequence models. While aiming for the best generation quality is important, ultimately it is also necessary to develop models that can assess the quality of their output. In this work, we propose to use the similarity between training and test conditions as a measure for models' confidence. We investigate methods solely using the similarity as well as methods combining it with the posterior probability. While traditionally only target tokens are annotated with confidence measures, we also investigate methods to annotate source tokens with confidence. By learning an internal alignment model, we can significantly improve confidence projection over using state-of-the-art external alignment tools. We evaluate the proposed methods on downstream confidence estimation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest
