Confidence Estimation in Structured Prediction

Avihai Mejer; Koby Crammer

arXiv:1111.1386·cs.LG·November 8, 2011·2 cites

Confidence Estimation in Structured Prediction

Avihai Mejer, Koby Crammer

PDF

Open Access

TL;DR

This paper introduces methods to estimate confidence levels in structured prediction models that are not probabilistic, enhancing their utility in tasks like sequence labeling and dependency parsing.

Contribution

It proposes novel techniques to compute confidence scores for non-probabilistic structured prediction algorithms, enabling better error detection and active learning.

Findings

01

Confidence estimates reflect true correctness probability.

02

Methods improve detection of mislabeled words.

03

Enhance active learning and trade-off between recall and precision.

Abstract

Structured classification tasks such as sequence labeling and dependency parsing have seen much interest by the Natural Language Processing and the machine learning communities. Several online learning algorithms were adapted for structured tasks such as Perceptron, Passive- Aggressive and the recently introduced Confidence-Weighted learning . These online algorithms are easy to implement, fast to train and yield state-of-the-art performance. However, unlike probabilistic models like Hidden Markov Model and Conditional random fields, these methods generate models that output merely a prediction with no additional information regarding confidence in the correctness of the output. In this work we fill the gap proposing few alternatives to compute the confidence in the output of non-probabilistic algorithms.We show how to compute confidence estimates in the prediction such that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Algorithms