TL;DR
This paper describes the problemConquero team's submission of transformer-based and soft label approaches for multilingual offensive language identification in social media at SemEval-2020, achieving competitive ranks across three sub-tasks.
Contribution
The paper introduces transformer fine-tuning and soft label methods for multilingual offensive language detection, demonstrating their effectiveness in a shared task setting.
Findings
Transformer models achieved competitive rankings across sub-tasks.
Soft label approach provided an alternative method for offense target identification.
Transformer-based models outperformed baseline approaches in several languages.
Abstract
In this paper, we present various systems submitted by our team problemConquero for SemEval-2020 Shared Task 12 Multilingual Offensive Language Identification in Social Media. We participated in all the three sub-tasks of OffensEval-2020, and our final submissions during the evaluation phase included transformer-based approaches and a soft label-based approach. BERT based fine-tuned models were submitted for each language of sub-task A (offensive tweet identification). RoBERTa based fine-tuned model for sub-task B (automatic categorization of offense types) was submitted. We submitted two models for sub-task C (offense target identification), one using soft labels and the other using BERT based fine-tuned model. Our ranks for sub-task A were Greek-19 out of 37, Turkish-22 out of 46, Danish-26 out of 39, Arabic-39 out of 53, and English-20 out of 85. We achieved a rank of 28 out of 43…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · WordPiece · Linear Warmup With Linear Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Dense Connections · Layer Normalization · Residual Connection · Adam · Multi-Head Attention
