UPB at SemEval-2021 Task 7: Adversarial Multi-Task Learning for   Detecting and Rating Humor and Offense

R\u{a}zvan-Alexandru Sm\u{a}du; Dumitru-Clementin Cercel; Mihai; Dascalu

arXiv:2104.06063·cs.CL·April 14, 2021

UPB at SemEval-2021 Task 7: Adversarial Multi-Task Learning for Detecting and Rating Humor and Offense

R\u{a}zvan-Alexandru Sm\u{a}du, Dumitru-Clementin Cercel, Mihai, Dascalu

PDF

TL;DR

This paper presents AMTL-Humor, an adversarial multi-task neural network leveraging BERTweet, BiLSTM, and Capsule layers to detect and rate humor and offense in texts, achieving high accuracy and F1-scores on SemEval-2021 tasks.

Contribution

It introduces a novel adversarial multi-task learning approach with an ensemble model for humor and offense detection and rating, improving performance over existing methods.

Findings

01

Achieved 95.66% F1-score and 94.70% accuracy on humor detection.

02

Attained RMSE scores of 0.6200 and 0.5318 on humor rating and offense detection.

03

Ensemble of configurations outperforms individual models.

Abstract

Detecting humor is a challenging task since words might share multiple valences and, depending on the context, the same words can be even used in offensive expressions. Neural network architectures based on Transformer obtain state-of-the-art results on several Natural Language Processing tasks, especially text classification. Adversarial learning, combined with other techniques such as multi-task learning, aids neural models learn the intrinsic properties of data. In this work, we describe our adversarial multi-task network, AMTL-Humor, used to detect and rate humor and offensive texts from Task 7 at SemEval-2021. Each branch from the model is focused on solving a related task, and consists of a BiLSTM layer followed by Capsule layers, on top of BERTweet used for generating contextualized embeddings. Our best model consists of an ensemble of all tested configurations, and achieves a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMulti-Head Attention · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dense Connections · Attention Is All You Need · Softmax · Tanh Activation · Adam · Sigmoid Activation