Robustness of Demonstration-based Learning Under Limited Data Scenario

Hongxin Zhang; Yanzhe Zhang; Ruiyi Zhang; Diyi Yang

arXiv:2210.10693·cs.CL·October 20, 2022

Robustness of Demonstration-based Learning Under Limited Data Scenario

Hongxin Zhang, Yanzhe Zhang, Ruiyi Zhang, Diyi Yang

PDF

Open Access 1 Repo

TL;DR

This paper investigates the robustness of demonstration-based learning in language models under limited data conditions, revealing that even random demonstrations can improve performance and confidence, with factors like length and relevance affecting outcomes.

Contribution

It provides a systematic analysis of how different types of demonstrations influence model robustness and performance in few-shot learning scenarios.

Findings

01

Random token demonstrations still improve few-shot learning.

02

Demonstration length and token relevance significantly impact performance.

03

Demonstrations increase model confidence in superficial patterns.

Abstract

Demonstration-based learning has shown great potential in stimulating pretrained language models' ability under limited data scenario. Simply augmenting the input with some demonstrations can significantly improve performance on few-shot NER. However, why such demonstrations are beneficial for the learning process remains unclear since there is no explicit alignment between the demonstrations and the predictions. In this paper, we design pathological demonstrations by gradually removing intuitively useful information from the standard ones to take a deep dive of the robustness of demonstration-based sequence labeling and show that (1) demonstrations composed of random tokens still make the model a better few-shot learner; (2) the length of random demonstrations and the relevance of random tokens are the main factors affecting the performance; (3) demonstrations increase the confidence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

salt-nlp/robustdemo
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification