Retrieval Augmented Zero-Shot Text Classification

Tassallah Abdullahi; Ritambhara Singh; Carsten Eickhoff

arXiv:2406.15241·cs.IR·June 28, 2024

TL;DR

QZero is a training-free method that improves zero-shot text classification by reformulating each query with supporting categories retrieved from Wikipedia. The embedding model is never retrained, which makes the approach especially useful in resource-constrained settings.

Contribution

The paper introduces QZero, a knowledge augmentation technique that reformulates queries using Wikipedia retrieval to boost zero-shot classification performance without additional training.

Findings

01. QZero improves classification accuracy by at least 5% on news and medical datasets.

02. It enables small embedding models to match the performance of larger models.

03. QZero provides insights into query context and topic relevance.

Abstract

Zero-shot text learning enables text classifiers to handle unseen classes efficiently, alleviating the need for task-specific training data. A simple approach often relies on comparing embeddings of query (text) to those of potential classes. However, the embeddings of a simple query sometimes lack rich contextual information, which hinders the classification performance. Traditionally, this has been addressed by improving the embedding model with expensive training. We introduce QZero, a novel training-free knowledge augmentation approach that reformulates queries by retrieving supporting categories from Wikipedia to improve zero-shot text classification performance. Our experiments across six diverse datasets demonstrate that QZero enhances performance for state-of-the-art static and contextual embedding models without the need for retraining. Notably, in News and medical topic…
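The idea described in the abstract, comparing a query embedding to class embeddings and first reformulating the query with retrieved categories, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper or its repository: the bag-of-words `embed` function stands in for a real embedding model, `ARTICLES` is a hypothetical two-entry stand-in for a Wikipedia index, and the function names are invented. The sketch also assumes the reformulated query consists only of the retrieved category names.

```python
import math
from collections import Counter

def embed(text):
    """Toy bag-of-words embedding; a stand-in for a real embedding model."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

# Hypothetical mini knowledge base: (article text, article categories).
ARTICLES = [
    ("the striker scored a late goal in the cup final", ["football", "sports"]),
    ("the central bank raised interest rates again", ["economics", "business"]),
]

def retrieve_categories(query, k=1):
    """Return the categories of the k articles most similar to the query."""
    q = embed(query)
    ranked = sorted(ARTICLES, key=lambda art: cosine(q, embed(art[0])), reverse=True)
    categories = []
    for _, article_categories in ranked[:k]:
        categories.extend(article_categories)
    return categories

def classify(text, labels):
    """Plain zero-shot baseline: pick the label whose embedding is closest."""
    vector = embed(text)
    return max(labels, key=lambda label: cosine(vector, embed(label)))

def reformulated_classify(query, labels, k=1):
    """Classify the retrieved categories instead of the raw query text."""
    reformulated = " ".join(retrieve_categories(query, k))
    return classify(reformulated, labels)

if __name__ == "__main__":
    labels = ["business", "sports"]
    query = "the striker scored a goal"
    # The raw query shares no token with either label, so the baseline
    # scores tie at zero and it falls back to the first label.
    print(classify(query, labels))               # business
    print(reformulated_classify(query, labels))  # sports
```

In this toy setting the raw query carries no signal about the label names, while the retrieved categories do, which is the gap the abstract attributes to context-poor query embeddings. A realistic version would swap in a proper retriever and embedding model.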

Code & Models

Repositories

rsinghlab/qzero (official)
