FewCLUE: A Chinese Few-shot Learning Evaluation Benchmark

Liang Xu; Xiaojing Lu; Chenyang Yuan; Xuanwei Zhang; Huilin Xu; Hu; Yuan; Guoao Wei; Xiang Pan; Xin Tian; Libo Qin; Hu Hai

arXiv:2107.07498·cs.CL·September 30, 2021·28 cites

FewCLUE: A Chinese Few-shot Learning Evaluation Benchmark

Liang Xu, Xiaojing Lu, Chenyang Yuan, Xuanwei Zhang, Huilin Xu, Hu, Yuan, Guoao Wei, Xiang Pan, Xin Tian, Libo Qin, Hu Hai

PDF

Open Access 1 Repo 3 Models

TL;DR

This paper introduces FewCLUE, a comprehensive benchmark for evaluating few-shot learning methods on Chinese NLP tasks, providing insights into method effectiveness and a platform for future research progress.

Contribution

It presents the first extensive Chinese few-shot learning benchmark, evaluates multiple methods, and offers tools and a leaderboard to advance Chinese NLP research.

Findings

01

Performance of few-shot methods varies with pre-trained models.

02

PET and P-tuning outperform others with specific models.

03

Benchmark facilitates fair comparison and future research.

Abstract

Pretrained Language Models (PLMs) have achieved tremendous success in natural language understanding tasks. While different learning schemes -- fine-tuning, zero-shot, and few-shot learning -- have been widely explored and compared for languages such as English, there is comparatively little work in Chinese to fairly and comprehensively evaluate and compare these methods and thus hinders cumulative progress. In this paper, we introduce the Chinese Few-shot Learning Evaluation Benchmark (FewCLUE), the first comprehensive few-shot evaluation benchmark in Chinese. It includes nine tasks, ranging from single-sentence and sentence-pair classification tasks to machine reading comprehension tasks. We systematically evaluate five state-of-the-art (SOTA) few-shot learning methods (including PET, ADAPET, LM-BFF, P-tuning and EFL), and compare their performance with fine-tuning and zero-shot…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CLUEbenchmark/FewCLUE
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · ERNIE · Linear Layer · Attention Dropout · Weight Decay · Linear Warmup With Linear Decay · Residual Connection · Softmax