Experimental Evaluation of Deep Learning models for Marathi Text Classification
Atharva Kulkarni, Meet Mandhane, Manali Likhitkar, Gayatri Kshirsagar,, Jayashree Jagdale, Raviraj Joshi

TL;DR
This paper evaluates various deep learning models, including CNN, LSTM, ULMFiT, and BERT, for Marathi text classification, comparing their performance on publicly available datasets to guide future Marathi NLP research.
Contribution
It provides a comprehensive comparison of deep learning models and embeddings for Marathi text classification, highlighting that simple CNN and LSTM models perform comparably to BERT.
Findings
CNN and LSTM with FastText embeddings perform on par with BERT models.
Pre-trained Marathi embeddings enhance model performance.
The study offers insights into effective NLP approaches for Marathi language.
Abstract
The Marathi language is one of the prominent languages used in India. It is predominantly spoken by the people of Maharashtra. Over the past decade, the usage of language on online platforms has tremendously increased. However, research on Natural Language Processing (NLP) approaches for Marathi text has not received much attention. Marathi is a morphologically rich language and uses a variant of the Devanagari script in the written form. This works aims to provide a comprehensive overview of available resources and models for Marathi text classification. We evaluate CNN, LSTM, ULMFiT, and BERT based models on two publicly available Marathi text classification datasets and present a comparative analysis. The pre-trained Marathi fast text word embeddings by Facebook and IndicNLP are used in conjunction with word-based models. We show that basic single layer models based on CNN and LSTM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Weight Tying · Activation Regularization · Embedding Dropout · DropConnect · Temporal Activation Regularization · Variational Dropout · ASGD Weight-Dropped LSTM · Slanted Triangular Learning Rates · Discriminative Fine-Tuning
