In-Context Probing: Toward Building Robust Classifiers via Probing Large Language Models
Afra Amini, Massimiliano Ciaramita

TL;DR
This paper introduces In-Context Probing (ICP), a method that enhances the robustness of classifiers built on large language models by probing representations instead of decoding outputs, showing improved stability and efficiency especially with small datasets.
Contribution
The paper proposes In-Context Probing, a novel approach that improves classifier robustness and performance on small datasets by probing representations rather than decoding outputs.
Findings
ICP is more robust to instruction variations.
ICP performs comparably or better than fine-tuning.
ICP is effective with fewer than 100 training examples.
Abstract
Large language models are able to learn new tasks in context, where they are provided with instructions and a few annotated examples. However, the effectiveness of in-context learning is dependent on the provided context, and the performance on a downstream task can vary considerably, depending on the instruction. Importantly, such dependency on the context can surface in unpredictable ways, e.g., a seemingly more informative instruction might lead to a worse performance. In this paper, we propose an alternative approach, which we term In-Context Probing (ICP). Similar to in-context learning, we contextualize the representation of the input with an instruction, but instead of decoding the output prediction, we probe the contextualized representation to predict the label. Through a series of experiments on a diverse set of classification tasks, we show that in-context probing is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Machine Learning and Data Classification
