Learning to Reject with a Fixed Predictor: Application to   Decontextualization

Christopher Mohri; Daniel Andor; Eunsol Choi; Michael Collins

arXiv:2301.09044·cs.LG·February 1, 2023

Learning to Reject with a Fixed Predictor: Application to Decontextualization

Christopher Mohri, Daniel Andor, Eunsol Choi, Michael Collins

PDF

Open Access

TL;DR

This paper introduces a new approach for classification with a reject option, specifically applied to decontextualization in NLP, with theoretical guarantees and improved performance on a new dataset.

Contribution

It proposes a novel problem formulation and surrogate loss function for classification with reject options, along with a theoretical analysis and application to decontextualization.

Findings

01

Significant 25% coverage improvement over baselines

02

Error rate halved while maintaining high coverage

03

Approaching the theoretical performance limit

Abstract

We study the problem of classification with a reject option for a fixed predictor, applicable in natural language processing. We introduce a new problem formulation for this scenario, and an algorithm minimizing a new surrogate loss function. We provide a complete theoretical analysis of the surrogate loss function with a strong $H$ -consistency guarantee. For evaluation, we choose the decontextualization task, and provide a manually-labelled dataset of $2, 000$ examples. Our algorithm significantly outperforms the baselines considered, with a $\sim 25%$ improvement in coverage when halving the error rate, which is only $\sim 3%$ away from the theoretical limit.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Machine Learning and Algorithms