Zero-Shot Text Classification via Self-Supervised Tuning

Chaoqun Liu; Wenxuan Zhang; Guizhen Chen; Xiaobao Wu; Anh Tuan Luu,; Chip Hong Chang; Lidong Bing

arXiv:2305.11442·cs.CL·May 26, 2023·2 cites

Zero-Shot Text Classification via Self-Supervised Tuning

Chaoqun Liu, Wenxuan Zhang, Guizhen Chen, Xiaobao Wu, Anh Tuan Luu,, Chip Hong Chang, Lidong Bing

PDF

Open Access 1 Repo 4 Models

TL;DR

This paper introduces a self-supervised tuning method for language models that improves zero-shot text classification by leveraging unlabeled data and a novel first sentence prediction task, outperforming existing approaches on multiple benchmarks.

Contribution

The paper proposes a new self-supervised learning paradigm for zero-shot classification that reduces prompt sensitivity and eliminates the need for annotated task data.

Findings

01

Outperforms state-of-the-art on 7 out of 10 tasks

02

Less sensitive to prompt design

03

Effective with unlabeled data

Abstract

Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data, called self-supervised tuning. By exploring the inherent structure of free texts, we propose a new learning objective called first sentence prediction to bridge the gap between unlabeled data and text classification tasks. After tuning the model to learn to predict the first sentence in a paragraph based on the rest, the model is able to conduct zero-shot inference on unseen tasks such as topic classification and sentiment analysis. Experimental results show that our model outperforms the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

damo-nlp-sg/sstuning
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification