UPB at SemEval-2021 Task 7: Adversarial Multi-Task Learning for Detecting and Rating Humor and Offense
R\u{a}zvan-Alexandru Sm\u{a}du, Dumitru-Clementin Cercel, Mihai, Dascalu

TL;DR
This paper presents AMTL-Humor, an adversarial multi-task neural network leveraging BERTweet, BiLSTM, and Capsule layers to detect and rate humor and offense in texts, achieving high accuracy and F1-scores on SemEval-2021 tasks.
Contribution
It introduces a novel adversarial multi-task learning approach with an ensemble model for humor and offense detection and rating, improving performance over existing methods.
Findings
Achieved 95.66% F1-score and 94.70% accuracy on humor detection.
Attained RMSE scores of 0.6200 and 0.5318 on humor rating and offense detection.
Ensemble of configurations outperforms individual models.
Abstract
Detecting humor is a challenging task since words might share multiple valences and, depending on the context, the same words can be even used in offensive expressions. Neural network architectures based on Transformer obtain state-of-the-art results on several Natural Language Processing tasks, especially text classification. Adversarial learning, combined with other techniques such as multi-task learning, aids neural models learn the intrinsic properties of data. In this work, we describe our adversarial multi-task network, AMTL-Humor, used to detect and rate humor and offensive texts from Task 7 at SemEval-2021. Each branch from the model is focused on solving a related task, and consists of a BiLSTM layer followed by Capsule layers, on top of BERTweet used for generating contextualized embeddings. Our best model consists of an ensemble of all tested configurations, and achieves a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dense Connections · Attention Is All You Need · Softmax · Tanh Activation · Adam · Sigmoid Activation
