Conformal Predictor for Improving Zero-shot Text Classification Efficiency
Prafulla Kumar Choubey, Yu Bai, Chien-Sheng Wu, Wenhao Liu, Nazneen, Rajani

TL;DR
This paper introduces a conformal predictor to efficiently narrow down candidate labels in zero-shot text classification, significantly reducing inference time while maintaining high accuracy.
Contribution
It proposes a novel use of conformal prediction to improve the efficiency of cross-encoder zero-shot models without sacrificing performance.
Findings
Inference time reduced by over 22% on multiple datasets.
Prediction sets maintain 99% coverage, ensuring high reliability.
Method applicable to intent and topic classification tasks.
Abstract
Pre-trained language models (PLMs) have been shown effective for zero-shot (0shot) text classification. 0shot models based on natural language inference (NLI) and next sentence prediction (NSP) employ cross-encoder architecture and infer by making a forward pass through the model for each label-text pair separately. This increases the computational cost to make inferences linearly in the number of labels. In this work, we improve the efficiency of such cross-encoder-based 0shot models by restricting the number of likely labels using another fast base classifier-based conformal predictor (CP) calibrated on samples labeled by the 0shot model. Since a CP generates prediction sets with coverage guarantees, it reduces the number of target labels without excluding the most probable label based on the 0shot model. We experiment with three intent and two topic classification datasets. With a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsBalanced Selection
