In-Context Learning for Extreme Multi-Label Classification

Karel D'Oosterlinck; Omar Khattab; François Remy; Thomas Demeester; Chris Develder; Christopher Potts

arXiv:2401.12178·cs.CL·January 23, 2024·5 cites

TL;DR

This paper introduces a novel in-context learning approach for extreme multi-label classification that leverages a multi-step interaction between language models and retrievers, achieving state-of-the-art results without fine-tuning.

Contribution

The paper presents the $\texttt{Infer--Retrieve--Rank}$ program, implemented in the $\texttt{DSPy}$ programming model, which enables effective multi-label classification from only tens of labeled examples with no fine-tuning and can be optimized separately for each dataset.

Findings

01

Achieved state-of-the-art results on three benchmarks (HOUSE, TECH, TECHWOLF).

02

Attained competitive performance on a diverse benchmark (BioDEX).

03

Requires only tens of labeled examples and no fine-tuning.

Abstract

Multi-label classification problems with thousands of classes are hard to solve with in-context learning alone, as language models (LMs) might lack prior knowledge about the precise classes or how to assign them, and it is generally infeasible to demonstrate every class in a prompt. We propose a general program, $\texttt{Infer--Retrieve--Rank}$, that defines multi-step interactions between LMs and retrievers to efficiently tackle such problems. We implement this program using the $\texttt{DSPy}$ programming model, which specifies in-context systems in a declarative manner, and use $\texttt{DSPy}$ optimizers to tune it towards specific datasets by bootstrapping only tens of few-shot examples. Our primary extreme classification program, optimized separately for each task, attains state-of-the-art results across three benchmarks (HOUSE, TECH, TECHWOLF). We apply the same program to a…
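To make the three-step structure concrete, here is a minimal toy sketch of an infer, retrieve, rank pipeline in plain Python. It is not the authors' DSPy implementation: `toy_infer` and `toy_rank` are hypothetical stand-ins for the two LM calls, the bag-of-words `retrieve` function stands in for a real retriever, and the label set and input text are invented for illustration.

```python
import math
from collections import Counter
from typing import Callable, List


def tokenize(text: str) -> List[str]:
    # Lowercase and split on whitespace after dropping simple punctuation.
    return text.lower().replace(",", " ").replace(".", " ").split()


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two bag-of-words vectors.
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(queries: List[str], label_space: List[str], k: int) -> List[str]:
    # Map free-text queries onto the real label space: score each label by
    # its best similarity to any query and keep the top-k non-zero matches.
    scored = []
    for label in label_space:
        label_vec = Counter(tokenize(label))
        score = max(
            (cosine(Counter(tokenize(q)), label_vec) for q in queries),
            default=0.0,
        )
        scored.append((score, label))
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [label for score, label in scored[:k] if score > 0]


def infer_retrieve_rank(
    text: str,
    label_space: List[str],
    infer_lm: Callable[[str], List[str]],
    rank_lm: Callable[[str, List[str]], List[str]],
    k: int = 3,
) -> List[str]:
    queries = infer_lm(text)                         # 1. Infer: guess label-like phrases
    candidates = retrieve(queries, label_space, k)   # 2. Retrieve: ground them in real labels
    return rank_lm(text, candidates)                 # 3. Rank: reorder the candidates


def toy_infer(text: str) -> List[str]:
    # Hypothetical stand-in for an LM call that proposes label-like queries.
    phrases = []
    if "python" in text.lower():
        phrases.append("python programming")
    if "team" in text.lower():
        phrases.append("team leadership")
    return phrases


def toy_rank(text: str, candidates: List[str]) -> List[str]:
    # Hypothetical stand-in for an LM reranker: prefer candidates that share
    # more words with the input (stable sort keeps retrieval order on ties).
    words = set(tokenize(text))
    return sorted(candidates, key=lambda c: -len(words & set(tokenize(c))))


# Invented label space and input, for illustration only.
labels = ["Python programming", "Java programming", "Team leadership", "Budget planning"]
predicted = infer_retrieve_rank(
    "Writes Python services and leads a small team.", labels, toy_infer, toy_rank
)
print(predicted)
```

The point of the split is that the first LM call never needs to know the exact label names: the retriever grounds its guesses in the real label space, so only a handful of candidates reach the final ranking step instead of thousands of classes.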

Taxonomy

Topics: Text and Document Classification Technologies · Domain Adaptation and Few-Shot Learning · Topic Modeling