Code-Mixed Telugu-English Hate Speech Detection
Santhosh Kakarla, Gautama Shastry Bulusu Venkata

TL;DR
This paper explores transformer-based models for hate speech detection in Telugu, demonstrating that translation to English and multilingual models like Hindi-Abusive-MuRIL enhance accuracy in low-resource language NLP tasks.
Contribution
It introduces the use of LoRA fine-tuning and translation strategies with transformer models to improve hate speech detection in Telugu, a low-resource language.
Findings
Translation improves model performance in Telugu hate speech detection.
Hindi-Abusive-MuRIL outperforms other models in accuracy and F1 score.
Multilingual processing enhances hate speech classification in low-resource languages.
Abstract
Hate speech detection in low-resource languages like Telugu is a growing challenge in NLP. This study investigates transformer-based models, including TeluguHateBERT, HateBERT, DeBERTa, Muril, IndicBERT, Roberta, and Hindi-Abusive-MuRIL, for classifying hate speech in Telugu. We fine-tune these models using Low-Rank Adaptation (LoRA) to optimize efficiency and performance. Additionally, we explore a multilingual approach by translating Telugu text into English using Google Translate to assess its impact on classification accuracy. Our experiments reveal that most models show improved performance after translation, with DeBERTa and Hindi-Abusive-MuRIL achieving higher accuracy and F1 scores compared to training directly on Telugu text. Notably, Hindi-Abusive-MuRIL outperforms all other models in both the original Telugu dataset and the translated dataset, demonstrating its robustness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
MethodsHow do I file a dispute with Expedia?*DisputeFastService · DeBERTa
