Few-shot learning for sentence pair classification and its applications in software engineering
Robert Kraig Helmeczi, Mucahit Cevik, Savas Y{\i}ld{\i}r{\i}m

TL;DR
This paper compares different few-shot learning methods for sentence pair classification using BERT-based models, demonstrating that PET performs strongly and can approach full-data performance with limited labeled examples in software engineering tasks.
Contribution
It provides a comprehensive empirical comparison of fine-tuning, PET, and SetFit for BERT models in few-shot settings within software engineering applications.
Findings
PET achieves near full-data performance with few hundred labeled examples
SetFit and fine-tuning are less effective in low-data regimes
Empirical analysis identifies high-performance techniques for specific tasks
Abstract
Few-shot learning-the ability to train models with access to limited data-has become increasingly popular in the natural language processing (NLP) domain, as large language models such as GPT and T0 have been empirically shown to achieve high performance in numerous tasks with access to just a handful of labeled examples. Smaller language models such as BERT and its variants have also been shown to achieve strong performance with just a handful of labeled examples when combined with few-shot learning algorithms like pattern-exploiting training (PET) and SetFit. The focus of this work is to investigate the performance of alternative few-shot learning approaches with BERT-based models. Specifically, vanilla fine-tuning, PET and SetFit are compared for numerous BERT-based checkpoints over an array of training set sizes. To facilitate this investigation, applications of few-shot learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Machine Learning and Algorithms
