LT@Helsinki at SemEval-2020 Task 12: Multilingual or language-specific   BERT?

Marc P\`amies; Emily \"Ohman; Kaisla Kajava; J\"org Tiedemann

arXiv:2008.00805·cs.CL·August 4, 2020

LT@Helsinki at SemEval-2020 Task 12: Multilingual or language-specific BERT?

Marc P\`amies, Emily \"Ohman, Kaisla Kajava, J\"org Tiedemann

PDF

TL;DR

This paper describes how the LT@Helsinki team used BERT models fine-tuned on specific datasets to achieve state-of-the-art results in multilingual offensive language detection tasks at SemEval-2020.

Contribution

The paper demonstrates the effectiveness of BERT for multilingual offensive language identification and provides models fine-tuned for specific sub-tasks.

Findings

01

BERT achieved state-of-the-art results in offensive language detection.

02

Fine-tuning BERT on OLID and SOLID datasets is effective.

03

Multilingual BERT models perform well across different languages.

Abstract

This paper presents the different models submitted by the LT@Helsinki team for the SemEval 2020 Shared Task 12. Our team participated in sub-tasks A and C; titled offensive language identification and offense target identification, respectively. In both cases we used the so-called Bidirectional Encoder Representation from Transformer (BERT), a model pre-trained by Google and fine-tuned by us on the OLID and SOLID datasets. The results show that offensive tweet classification is one of several language-based tasks where BERT can achieve state-of-the-art results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · WordPiece · Linear Warmup With Linear Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Attention Is All You Need · Label Smoothing · Adam