Small-Bench NLP: Benchmark for small single GPU trained models in Natural Language Processing
Kamal Raj Kanakarajan, Bhuvana Kundumani, Malaikannan, Sankarasubbu

TL;DR
This paper introduces Small-Bench NLP, a benchmark for evaluating small, resource-efficient NLP models trained on a single GPU, facilitating accessible research and innovation in the field.
Contribution
The paper presents a new benchmark and leaderboard for small NLP models, along with a competitive ELECTRA-DeBERTa model that performs comparably to larger models.
Findings
Small models can achieve high performance on NLP tasks.
The ELECTRA-DeBERTa (15M) model attains an 81.53 average score on the benchmark.
The benchmark enables resource-constrained researchers to experiment effectively.
Abstract
Recent progress in the Natural Language Processing domain has given us several State-of-the-Art (SOTA) pretrained models which can be finetuned for specific tasks. These large models with billions of parameters trained on numerous GPUs/TPUs over weeks are leading in the benchmark leaderboards. In this paper, we discuss the need for a benchmark for cost and time effective smaller models trained on a single GPU. This will enable researchers with resource constraints experiment with novel and innovative ideas on tokenization, pretraining tasks, architecture, fine tuning methods etc. We set up Small-Bench NLP, a benchmark for small efficient neural language models trained on a single GPU. Small-Bench NLP benchmark comprises of eight NLP tasks on the publicly available GLUE datasets and a leaderboard to track the progress of the community. Our ELECTRA-DeBERTa (15M parameters) small model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
