UoT-UWF-PartAI at SemEval-2021 Task 5: Self Attention Based Bi-GRU with   Multi-Embedding Representation for Toxicity Highlighter

Hamed Babaei Giglou; Taher Rahgooy; Mostafa Rahgouy; Jafar Razmara

arXiv:2104.13164·cs.CL·April 28, 2021

UoT-UWF-PartAI at SemEval-2021 Task 5: Self Attention Based Bi-GRU with Multi-Embedding Representation for Toxicity Highlighter

Hamed Babaei Giglou, Taher Rahgooy, Mostafa Rahgouy, Jafar Razmara

PDF

TL;DR

This paper introduces a token-level toxicity detection model using self-attention and multi-embedding representations, combining GPT-2, GloVe, and RoBERTa embeddings, achieving promising results in highlighting toxic spans.

Contribution

It presents a novel token-level toxicity detection model based on self-attention and multi-embedding representations, which improves span detection accuracy.

Findings

01

Effective detection of toxic spans demonstrated

02

Multi-embedding approach enhances token representation

03

Model outperforms previous token-level toxicity methods

Abstract

Toxic Spans Detection(TSD) task is defined as highlighting spans that make a text toxic. Many works have been done to classify a given comment or document as toxic or non-toxic. However, none of those proposed models work at the token level. In this paper, we propose a self-attention-based bidirectional gated recurrent unit(BiGRU) with a multi-embedding representation of the tokens. Our proposed model enriches the representation by a combination of GPT-2, GloVe, and RoBERTa embeddings, which led to promising results. Experimental results show that our proposed approach is very effective in detecting span tokens.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Cosine Annealing · Linear Warmup With Linear Decay · WordPiece · Residual Connection · Softmax · Attention Dropout