Learning to Reject with a Fixed Predictor: Application to Decontextualization
Christopher Mohri, Daniel Andor, Eunsol Choi, Michael Collins

TL;DR
This paper introduces a new approach for classification with a reject option, specifically applied to decontextualization in NLP, with theoretical guarantees and improved performance on a new dataset.
Contribution
It proposes a novel problem formulation and surrogate loss function for classification with reject options, along with a theoretical analysis and application to decontextualization.
Findings
Significant 25% coverage improvement over baselines
Error rate halved while maintaining high coverage
Approaching the theoretical performance limit
Abstract
We study the problem of classification with a reject option for a fixed predictor, applicable in natural language processing. We introduce a new problem formulation for this scenario, and an algorithm minimizing a new surrogate loss function. We provide a complete theoretical analysis of the surrogate loss function with a strong -consistency guarantee. For evaluation, we choose the decontextualization task, and provide a manually-labelled dataset of examples. Our algorithm significantly outperforms the baselines considered, with a improvement in coverage when halving the error rate, which is only away from the theoretical limit.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Machine Learning and Algorithms
