Small-Bench NLP: Benchmark for small single GPU trained models in   Natural Language Processing

Kamal Raj Kanakarajan; Bhuvana Kundumani; Malaikannan; Sankarasubbu

arXiv:2109.10847·cs.LG·September 24, 2021

Small-Bench NLP: Benchmark for small single GPU trained models in Natural Language Processing

Kamal Raj Kanakarajan, Bhuvana Kundumani, Malaikannan, Sankarasubbu

PDF

Open Access 1 Repo

TL;DR

This paper introduces Small-Bench NLP, a benchmark for evaluating small, resource-efficient NLP models trained on a single GPU, facilitating accessible research and innovation in the field.

Contribution

The paper presents a new benchmark and leaderboard for small NLP models, along with a competitive ELECTRA-DeBERTa model that performs comparably to larger models.

Findings

01

Small models can achieve high performance on NLP tasks.

02

The ELECTRA-DeBERTa (15M) model attains an 81.53 average score on the benchmark.

03

The benchmark enables resource-constrained researchers to experiment effectively.

Abstract

Recent progress in the Natural Language Processing domain has given us several State-of-the-Art (SOTA) pretrained models which can be finetuned for specific tasks. These large models with billions of parameters trained on numerous GPUs/TPUs over weeks are leading in the benchmark leaderboards. In this paper, we discuss the need for a benchmark for cost and time effective smaller models trained on a single GPU. This will enable researchers with resource constraints experiment with novel and innovative ideas on tokenization, pretraining tasks, architecture, fine tuning methods etc. We set up Small-Bench NLP, a benchmark for small efficient neural language models trained on a single GPU. Small-Bench NLP benchmark comprises of eight NLP tasks on the publicly available GLUE datasets and a leaderboard to track the progress of the community. Our ELECTRA-DeBERTa (15M parameters) small model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

smallbenchnlp/benchmark
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis