On Importance of Pruning and Distillation for Efficient Low Resource NLP
Aishwarya Mirashi, Purva Lingayat, Srushti Sonavane, Tejas Padhiyar,, Raviraj Joshi, Geetanjali Kale

TL;DR
This paper investigates techniques like pruning and distillation to improve the efficiency of transformer models for low-resource languages, specifically Marathi, achieving significant speedups while maintaining accuracy.
Contribution
It introduces a combined approach of pruning and distillation tailored for Marathi NLP, demonstrating substantial efficiency gains with minimal accuracy loss.
Findings
25% pruning plus knowledge distillation yields 2.56x speedup
Efficiency techniques maintain baseline accuracy in Marathi NLP
Optimized models reduce environmental impact of NLP tasks
Abstract
The rise of large transformer models has revolutionized Natural Language Processing, leading to significant advances in tasks like text classification. However, this progress demands substantial computational resources, escalating training duration, and expenses with larger model sizes. Efforts have been made to downsize and accelerate English models (e.g., Distilbert, MobileBert). Yet, research in this area is scarce for low-resource languages. In this study, we explore the case of the low-resource Indic language Marathi. Leveraging the marathi-topic-all-doc-v2 model as our baseline, we implement optimization techniques to reduce computation time and memory usage. Our focus is on enhancing the efficiency of Marathi transformer models while maintaining top-tier accuracy and reducing computational demands. Using the MahaNews document classification dataset and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
MethodsMovement Pruning · Pruning · Knowledge Distillation · Focus
