On Importance of Pruning and Distillation for Efficient Low Resource NLP

Aishwarya Mirashi; Purva Lingayat; Srushti Sonavane; Tejas Padhiyar,; Raviraj Joshi; Geetanjali Kale

arXiv:2409.14162·cs.CL·September 24, 2024

On Importance of Pruning and Distillation for Efficient Low Resource NLP

Aishwarya Mirashi, Purva Lingayat, Srushti Sonavane, Tejas Padhiyar,, Raviraj Joshi, Geetanjali Kale

PDF

Open Access

TL;DR

This paper investigates techniques like pruning and distillation to improve the efficiency of transformer models for low-resource languages, specifically Marathi, achieving significant speedups while maintaining accuracy.

Contribution

It introduces a combined approach of pruning and distillation tailored for Marathi NLP, demonstrating substantial efficiency gains with minimal accuracy loss.

Findings

01

25% pruning plus knowledge distillation yields 2.56x speedup

02

Efficiency techniques maintain baseline accuracy in Marathi NLP

03

Optimized models reduce environmental impact of NLP tasks

Abstract

The rise of large transformer models has revolutionized Natural Language Processing, leading to significant advances in tasks like text classification. However, this progress demands substantial computational resources, escalating training duration, and expenses with larger model sizes. Efforts have been made to downsize and accelerate English models (e.g., Distilbert, MobileBert). Yet, research in this area is scarce for low-resource languages. In this study, we explore the case of the low-resource Indic language Marathi. Leveraging the marathi-topic-all-doc-v2 model as our baseline, we implement optimization techniques to reduce computation time and memory usage. Our focus is on enhancing the efficiency of Marathi transformer models while maintaining top-tier accuracy and reducing computational demands. Using the MahaNews document classification dataset and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies

MethodsMovement Pruning · Pruning · Knowledge Distillation · Focus