ConEntail: An Entailment-based Framework for Universal Zero and Few Shot Classification with Supervised Contrastive Pretraining

Ranran Haoran Zhang; Aysa Xuemo Fan; Rui Zhang

arXiv:2210.07587·cs.CL·February 14, 2023

TL;DR

ConEntail is a universal classification framework that combines an entailment-based meta-task with supervised contrastive pretraining. It improves average performance over baselines by 9.4% in the zero shot setting and by 3.5% in the few shot setting across diverse datasets.

Contribution

The paper presents a novel entailment-based meta-task and a supervised contrastive pretraining approach for universal zero and few shot classification, pretrained on 57 annotated datasets.
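To make the phrase "supervised contrastive pretraining" concrete, here is a minimal sketch of a generic supervised contrastive loss, in which examples that share a label are pulled together and all others are pushed apart. It is an illustration only: this page does not give ConEntail's exact objective, and the function name `supervised_contrastive_loss`, the `temperature` value, and the toy embeddings are assumptions, not taken from the paper.

```python
import math


def supervised_contrastive_loss(embeddings, labels, temperature=0.1):
    """Generic supervised contrastive loss (illustrative, not ConEntail's exact loss)."""
    # L2-normalize each vector so a dot product equals cosine similarity.
    normed = []
    for vec in embeddings:
        norm = math.sqrt(sum(x * x for x in vec))
        normed.append([x / norm for x in vec])

    n = len(normed)
    total, anchors = 0.0, 0
    for i in range(n):
        # Positives are the other examples carrying the same label as anchor i.
        positives = [j for j in range(n) if j != i and labels[j] == labels[i]]
        if not positives:
            continue  # an anchor without a same-label partner is skipped
        # Temperature-scaled similarity of anchor i to every other example.
        sims = {
            j: sum(a * b for a, b in zip(normed[i], normed[j])) / temperature
            for j in range(n)
            if j != i
        }
        log_denom = math.log(sum(math.exp(s) for s in sims.values()))
        # Average negative log-probability of picking a positive for this anchor.
        total += -sum(sims[p] - log_denom for p in positives) / len(positives)
        anchors += 1
    return total / anchors if anchors else 0.0


# Toy data (assumed): two tight clusters in 2-D.
toy_embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
aligned = supervised_contrastive_loss(toy_embeddings, [0, 0, 1, 1])
misaligned = supervised_contrastive_loss(toy_embeddings, [0, 1, 0, 1])
print(aligned < misaligned)
```

When the labels agree with the geometric clusters the loss is close to zero, and when they cut across the clusters it is much larger, which is the signal such a pretraining objective would optimize.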

Findings

01. Outperforms baselines by an average of 9.4% in the zero shot setting

02. Achieves an average improvement of 3.5% in few shot settings

03. Exploits 57 annotated datasets for pretraining

Abstract

A universal classification model aims to generalize to diverse classification tasks in both zero and few shot settings. A promising way toward universal classification is to cast heterogeneous data formats into a dataset-agnostic "meta-task" (e.g., textual entailment, question answering) then pretrain a model on the combined meta dataset. The existing work is either pretrained on specific subsets of classification tasks, or pretrained on both classification and generation data but the model could not fulfill its potential in universality and reliability. These also leave a massive amount of annotated data under-exploited. To fill these gaps, we propose ConEntail, a new framework for universal zero and few shot classification with supervised contrastive pretraining. Our unified meta-task for classification is based on nested entailment. It can be interpreted as "Does sentence a entails…
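The idea of casting a classification dataset into an entailment-style meta-task can be sketched as follows. This is a simplified, single-level illustration under stated assumptions: the hypothesis template, the function names `to_entailment_pairs` and `predict_label`, and the caller-supplied `score_fn` are hypothetical, and the paper's nested entailment formulation is not reproduced here.

```python
def to_entailment_pairs(text, gold_label, candidate_labels,
                        template="This example is about {label}."):
    """Turn one labeled example into (premise, hypothesis, entailed) records.

    The template is an assumed placeholder, not the paper's wording.
    """
    return [
        {
            "premise": text,
            "hypothesis": template.format(label=label),
            "entailed": label == gold_label,  # only the gold label is entailed
        }
        for label in candidate_labels
    ]


def predict_label(text, candidate_labels, score_fn,
                  template="This example is about {label}."):
    """Pick the label whose hypothesis gets the highest entailment score.

    score_fn(premise, hypothesis) -> float is a stand-in for a trained model.
    """
    return max(
        candidate_labels,
        key=lambda label: score_fn(text, template.format(label=label)),
    )


pairs = to_entailment_pairs(
    "The striker scored twice in the final.",
    gold_label="sports",
    candidate_labels=["sports", "politics", "finance"],
)
for pair in pairs:
    print(pair["hypothesis"], pair["entailed"])
```

Because every dataset is reduced to the same premise and hypothesis format, examples from tasks with different label sets can be pooled into one pretraining corpus, and an unseen label set can be scored at inference time simply by writing new hypotheses.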

Code & Models

Repositories

psunlpgroup/ConEntail
PyTorch · Official

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications