An Attention Ensemble Approach for Efficient Text Classification of   Indian Languages

Atharva Kulkarni; Amey Hengle; Rutuja Udyawar

arXiv:2102.10275·cs.CL·February 23, 2021

An Attention Ensemble Approach for Efficient Text Classification of Indian Languages

Atharva Kulkarni, Amey Hengle, Rutuja Udyawar

PDF

Open Access

TL;DR

This paper introduces a hybrid CNN-BiLSTM attention ensemble model for efficient text classification of Marathi, an Indian language, achieving state-of-the-art results in technical domain identification tasks.

Contribution

It presents a novel hybrid ensemble approach combining CNN and BiLSTM with attention for resource-constrained Indian language NLP tasks.

Findings

01

Achieved 89.57% validation accuracy and 0.8875 F1-score.

02

Outperformed baseline models and other teams in shared task.

03

Secured the best system submission with 64.26% test accuracy.

Abstract

The recent surge of complex attention-based deep learning architectures has led to extraordinary results in various downstream NLP tasks in the English language. However, such research for resource-constrained and morphologically rich Indian vernacular languages has been relatively limited. This paper proffers team SPPU\_AKAH's solution for the TechDOfication 2020 subtask-1f: which focuses on the coarse-grained technical domain identification of short text documents in Marathi, a Devanagari script-based Indian language. Availing the large dataset at hand, a hybrid CNN-BiLSTM attention ensemble model is proposed that competently combines the intermediate sentence representations generated by the convolutional neural network and the bidirectional long short-term memory, leading to efficient text classification. Experimental results show that the proposed model outperforms various baseline…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text and Document Classification Technologies