Designing Informative Metrics for Few-Shot Example Selection

Rishabh Adiga; Lakshminarayanan Subramanian; Varun Chandrasekaran

arXiv:2403.03861·cs.CL·August 1, 2024·1 cites

Designing Informative Metrics for Few-Shot Example Selection

Rishabh Adiga, Lakshminarayanan Subramanian, Varun Chandrasekaran

PDF

Open Access 1 Video

TL;DR

This paper introduces a complexity-based method for selecting examples in few-shot learning with pretrained language models, improving performance on sequence tagging tasks without training dedicated selection models.

Contribution

It presents a novel complexity-based prompt selection approach that aligns test and example complexity, achieving state-of-the-art results in few-shot NER tasks.

Findings

01

Achieves 5% F1 improvement on CoNLL2003 with GPT-4.

02

Large gains (up to 28.85 points) on smaller models like GPT-j-6B.

03

Effective without training dedicated selection models.

Abstract

Pretrained language models (PLMs) have shown remarkable few-shot learning capabilities when provided with properly formatted examples. However, selecting the "best" examples remains an open challenge. We propose a complexity-based prompt selection approach for sequence tagging tasks. This approach avoids the training of a dedicated model for selection of examples, and instead uses certain metrics to align the syntactico-semantic complexity of test sentences and examples. We use both sentence- and word-level metrics to match the complexity of examples to the (test) sentence being considered. Our results demonstrate that our approach extracts greater performance from PLMs: it achieves state-of-the-art performance on few-shot NER, achieving a 5% absolute improvement in F1 score on the CoNLL2003 dataset for GPT-4. We also see large gains of upto 28.85 points (F1/Acc.) in smaller models like…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Designing Informative Metrics for Few-Shot Example Selection· underline

Taxonomy

TopicsAdvanced Multi-Objective Optimization Algorithms · Manufacturing Process and Optimization

MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Dropout · Softmax · Dense Connections · Label Smoothing · Adam · Absolute Position Encodings · Residual Connection