FewCLUE: A Chinese Few-shot Learning Evaluation Benchmark
Liang Xu, Xiaojing Lu, Chenyang Yuan, Xuanwei Zhang, Huilin Xu, Hu, Yuan, Guoao Wei, Xiang Pan, Xin Tian, Libo Qin, Hu Hai

TL;DR
This paper introduces FewCLUE, a comprehensive benchmark for evaluating few-shot learning methods on Chinese NLP tasks, providing insights into method effectiveness and a platform for future research progress.
Contribution
It presents the first extensive Chinese few-shot learning benchmark, evaluates multiple methods, and offers tools and a leaderboard to advance Chinese NLP research.
Findings
Performance of few-shot methods varies with pre-trained models.
PET and P-tuning outperform others with specific models.
Benchmark facilitates fair comparison and future research.
Abstract
Pretrained Language Models (PLMs) have achieved tremendous success in natural language understanding tasks. While different learning schemes -- fine-tuning, zero-shot, and few-shot learning -- have been widely explored and compared for languages such as English, there is comparatively little work in Chinese to fairly and comprehensively evaluate and compare these methods and thus hinders cumulative progress. In this paper, we introduce the Chinese Few-shot Learning Evaluation Benchmark (FewCLUE), the first comprehensive few-shot evaluation benchmark in Chinese. It includes nine tasks, ranging from single-sentence and sentence-pair classification tasks to machine reading comprehension tasks. We systematically evaluate five state-of-the-art (SOTA) few-shot learning methods (including PET, ADAPET, LM-BFF, P-tuning and EFL), and compare their performance with fine-tuning and zero-shot…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · ERNIE · Linear Layer · Attention Dropout · Weight Decay · Linear Warmup With Linear Decay · Residual Connection · Softmax
