Extreme Zero-Shot Learning for Extreme Text Classification

Yuanhao Xiong; Wei-Cheng Chang; Cho-Jui Hsieh; Hsiang-Fu Yu; Inderjit; Dhillon

arXiv:2112.08652·cs.LG·December 17, 2021

Extreme Zero-Shot Learning for Extreme Text Classification

Yuanhao Xiong, Wei-Cheng Chang, Cho-Jui Hsieh, Hsiang-Fu Yu, Inderjit, Dhillon

PDF

Open Access 1 Repo

TL;DR

This paper introduces MACLR, a novel self-supervised pre-training method for extreme zero-shot text classification that effectively learns from raw text without supervision, outperforming existing methods in accuracy.

Contribution

The paper proposes MACLR, a new contrastive pre-training approach for EZ-XMC that leverages raw text and techniques like multi-scale clustering and label regularization, enabling superior zero-shot performance.

Findings

01

MACLR achieves 5-10% higher precision and recall on EZ-XMC datasets.

02

Pre-trained encoder improves with limited supervision in few-shot scenarios.

03

Fine-tuning enhances performance further with minimal labeled data.

Abstract

The eXtreme Multi-label text Classification (XMC) problem concerns finding most relevant labels for an input text instance from a large label set. However, the XMC setup faces two challenges: (1) it is not generalizable to predict unseen labels in dynamic environments, and (2) it requires a large amount of supervised (instance, label) pairs, which can be difficult to obtain for emerging domains. Recently, the generalized zero-shot XMC (GZ-XMC) setup has been studied and ZestXML is proposed accordingly to handle the unseen labels, which still requires a large number of annotated (instance, label) pairs. In this paper, we consider a more practical scenario called Extreme Zero-Shot XMC (EZ-XMC), in which no supervision is needed and merely raw text of instances and labels are accessible. Few-Shot XMC (FS-XMC), an extension to EZ-XMC with limited supervision is also investigated. To learn…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amzn/pecos
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Domain Adaptation and Few-Shot Learning · Speech Recognition and Synthesis