A Challenging Benchmark for Low-Resource Learning

Yudong Wang; Chang Ma; Qingxiu Dong; Lingpeng Kong; Jingjing Xu

arXiv:2303.03840 · cs.CL · March 10, 2023 · 1 citation

TL;DR

This paper introduces hardBench, a challenging benchmark with 11 datasets to better evaluate neural networks' robustness in low-resource settings, revealing significant performance gaps and weaknesses.

Contribution

The paper presents hardBench, a new benchmark covering diverse datasets, and provides a theoretical analysis of low-resource learning difficulties, highlighting existing models' limitations.

Findings

01

Neural networks perform poorly on hardBench, exposing robustness issues.

02

Pre-trained models do not improve on hardBench despite performing better on traditional benchmarks.

03

A significant performance gap remains between models and human-level understanding.

Abstract

With promising yet saturated results in high-resource settings, low-resource datasets have gradually become popular benchmarks for evaluating the learning ability of advanced neural networks (e.g., BigBench, superGLUE). Some models even surpass humans according to benchmark test results. However, we find that there exists a set of hard examples in low-resource settings that challenge neural networks but are not well evaluated, which leads to overestimated performance. We first give a theoretical analysis of the factors that make low-resource learning difficult. This analysis motivates us to propose hardBench, a challenging benchmark that better evaluates learning ability; it covers 11 datasets, including 3 computer vision (CV) datasets and 8 natural language processing (NLP) datasets. Experiments on a wide range of models show that neural networks, even pre-trained language models, have…

Code & Models

Repositories

qian2333/hard-bench
PyTorch · Official

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications

Methods: Test