In-Context Learning for Extreme Multi-Label Classification
Karel D'Oosterlinck, Omar Khattab, Fran\c{c}ois Remy, Thomas, Demeester, Chris Develder, Christopher Potts

TL;DR
This paper introduces a novel in-context learning approach for extreme multi-label classification that leverages a multi-step interaction between language models and retrievers, achieving state-of-the-art results without fine-tuning.
Contribution
The paper presents the $ exttt{Infer--Retrieve--Rank}$ program and the $ exttt{DSPy}$ model, enabling effective multi-label classification with minimal examples and no fine-tuning, adaptable to various datasets.
Findings
Achieved state-of-the-art results on three benchmarks (HOUSE, TECH, TECHWOLF).
Attained competitive performance on a diverse benchmark (BioDEX).
Requires only tens of labeled examples and no fine-tuning.
Abstract
Multi-label classification problems with thousands of classes are hard to solve with in-context learning alone, as language models (LMs) might lack prior knowledge about the precise classes or how to assign them, and it is generally infeasible to demonstrate every class in a prompt. We propose a general program, , that defines multi-step interactions between LMs and retrievers to efficiently tackle such problems. We implement this program using the programming model, which specifies in-context systems in a declarative manner, and use optimizers to tune it towards specific datasets by bootstrapping only tens of few-shot examples. Our primary extreme classification program, optimized separately for each task, attains state-of-the-art results across three benchmarks (HOUSE, TECH, TECHWOLF). We apply the same program to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Domain Adaptation and Few-Shot Learning · Topic Modeling
