Few-shot learning for sentence pair classification and its applications   in software engineering

Robert Kraig Helmeczi; Mucahit Cevik; Savas Y{\i}ld{\i}r{\i}m

arXiv:2306.08058·cs.SE·June 16, 2023·2 cites

Few-shot learning for sentence pair classification and its applications in software engineering

Robert Kraig Helmeczi, Mucahit Cevik, Savas Y{\i}ld{\i}r{\i}m

PDF

Open Access

TL;DR

This paper compares different few-shot learning methods for sentence pair classification using BERT-based models, demonstrating that PET performs strongly and can approach full-data performance with limited labeled examples in software engineering tasks.

Contribution

It provides a comprehensive empirical comparison of fine-tuning, PET, and SetFit for BERT models in few-shot settings within software engineering applications.

Findings

01

PET achieves near full-data performance with few hundred labeled examples

02

SetFit and fine-tuning are less effective in low-data regimes

03

Empirical analysis identifies high-performance techniques for specific tasks

Abstract

Few-shot learning-the ability to train models with access to limited data-has become increasingly popular in the natural language processing (NLP) domain, as large language models such as GPT and T0 have been empirically shown to achieve high performance in numerous tasks with access to just a handful of labeled examples. Smaller language models such as BERT and its variants have also been shown to achieve strong performance with just a handful of labeled examples when combined with few-shot learning algorithms like pattern-exploiting training (PET) and SetFit. The focus of this work is to investigate the performance of alternative few-shot learning approaches with BERT-based models. Specifically, vanilla fine-tuning, PET and SetFit are compared for numerous BERT-based checkpoints over an array of training set sizes. To facilitate this investigation, applications of few-shot learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Machine Learning and Algorithms