Conformal Predictor for Improving Zero-shot Text Classification Efficiency

Prafulla Kumar Choubey, Yu Bai, Chien-Sheng Wu, Wenhao Liu, Nazneen Rajani

arXiv:2210.12619·cs.CL·October 25, 2022

TL;DR

This paper introduces a conformal predictor to efficiently narrow down candidate labels in zero-shot text classification, significantly reducing inference time while maintaining high accuracy.

Contribution

It proposes a novel use of conformal prediction to improve the efficiency of cross-encoder zero-shot models without sacrificing performance.

Findings

01. Inference time reduced by over 22% on multiple datasets.
02. Prediction sets maintain 99% coverage, ensuring high reliability.
03. Method applicable to intent and topic classification tasks.

Abstract

Pre-trained language models (PLMs) have been shown effective for zero-shot (0shot) text classification. 0shot models based on natural language inference (NLI) and next sentence prediction (NSP) employ cross-encoder architecture and infer by making a forward pass through the model for each label-text pair separately. This increases the computational cost to make inferences linearly in the number of labels. In this work, we improve the efficiency of such cross-encoder-based 0shot models by restricting the number of likely labels using another fast base classifier-based conformal predictor (CP) calibrated on samples labeled by the 0shot model. Since a CP generates prediction sets with coverage guarantees, it reduces the number of target labels without excluding the most probable label based on the 0shot model. We experiment with three intent and two topic classification datasets. With a…
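The pipeline described in the abstract can be illustrated with a minimal split-conformal sketch. It rests on several assumptions that do not come from the paper: the nonconformity score is taken to be one minus the base classifier's probability for a label, the threshold is the usual finite-sample quantile, and `base_probs_fn` and `zero_shot_score_fn` are hypothetical callables standing in for the fast base classifier and the cross-encoder 0shot model. The paper's actual scores and base classifiers may differ.

```python
import numpy as np

def calibrate_threshold(cal_probs, cal_labels, alpha=0.01):
    """Split-conformal threshold for a target error rate alpha.

    cal_probs:  (n, K) base-classifier probabilities on calibration texts.
    cal_labels: (n,) label indices assigned by the 0shot model (not gold labels).
    """
    n = len(cal_labels)
    # Assumed nonconformity score: 1 - probability of the 0shot-assigned label.
    scores = 1.0 - cal_probs[np.arange(n), cal_labels]
    # Finite-sample corrected rank: the ceil((n + 1) * (1 - alpha))-th smallest score.
    k = int(np.ceil((n + 1) * (1 - alpha)))
    if k > n:
        return np.inf  # too few calibration samples: keep every label
    return np.sort(scores)[k - 1]

def prediction_set(probs, qhat):
    """Indices of labels whose nonconformity score does not exceed the threshold."""
    return np.flatnonzero(1.0 - probs <= qhat)

def classify(text, labels, base_probs_fn, zero_shot_score_fn, qhat):
    """Run the expensive 0shot scorer only on the labels kept by the CP."""
    probs = base_probs_fn(text)
    candidates = prediction_set(probs, qhat)
    if candidates.size == 0:
        # Fallback (an assumption of this sketch): keep the base classifier's top label.
        candidates = np.array([int(np.argmax(probs))])
    scores = [zero_shot_score_fn(text, labels[i]) for i in candidates]
    best = candidates[int(np.argmax(scores))]
    return labels[best], len(candidates)
```

In this sketch the number of cross-encoder forward passes per text drops from the full label count to the size of the prediction set, which is the second value `classify` returns. Under the standard exchangeability assumption, the label the 0shot model would have chosen falls inside the set with probability at least 1 - alpha, so a smaller alpha keeps more labels and saves less time.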

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

Methods: Balanced Selection