Multi-Task Text Classification using Graph Convolutional Networks for Large-Scale Low Resource Language
Mounika Marreddy, Subba Reddy Oota, Lakshmi Sireesha Vakada, Venkata, Charan Chinni, Radhika Mamidi

TL;DR
This paper introduces a multi-task graph convolutional network approach for Telugu language text classification, demonstrating significant improvements over existing models across four NLP tasks with a newly created dataset.
Contribution
The paper proposes a novel supervised graph reconstruction method, MT-Text GCN, for multi-task Telugu NLP, addressing resource scarcity and morphological complexity.
Findings
MT-Text GCN outperforms pretrained embeddings and Transformer models.
Achieves high F1-scores: SA (0.84), EI (0.55), HS (0.83), SAR (0.66).
Provides a new Telugu dataset, TEL-NLP.
Abstract
Graph Convolutional Networks (GCN) have achieved state-of-art results on single text classification tasks like sentiment analysis, emotion detection, etc. However, the performance is achieved by testing and reporting on resource-rich languages like English. Applying GCN for multi-task text classification is an unexplored area. Moreover, training a GCN or adopting an English GCN for Indian languages is often limited by data availability, rich morphological variation, syntax, and semantic differences. In this paper, we study the use of GCN for the Telugu language in single and multi-task settings for four natural language processing (NLP) tasks, viz. sentiment analysis (SA), emotion identification (EI), hate-speech (HS), and sarcasm detection (SAR). In order to evaluate the performance of GCN with one of the Indian languages, Telugu, we analyze the GCN based models with extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Text and Document Classification Technologies
MethodsXLM-R · Linear Layer · Absolute Position Encodings · Multi-Head Attention · Layer Normalization · Residual Connection · Attention Is All You Need · Softmax · Label Smoothing · Adam
