The Naughtyformer: A Transformer Understands Offensive Humor

Leonard Tang; Alexander Cai; Steve Li; Jason Wang

arXiv:2211.14369·cs.CL·November 29, 2022·1 cites

The Naughtyformer: A Transformer Understands Offensive Humor

Leonard Tang, Alexander Cai, Steve Li, Jason Wang

PDF

Open Access 1 Datasets

TL;DR

This paper introduces Naughtyformer, a Transformer-based model trained on a Reddit-filtered jokes dataset to classify humor subtypes, especially offensive jokes, demonstrating superior detection of offensiveness over existing methods.

Contribution

The paper presents a novel dataset and a Transformer model specifically designed for classifying humor subtypes, focusing on offensive humor detection.

Findings

01

Naughtyformer outperforms state-of-the-art methods in detecting offensive jokes.

02

The dataset enables nuanced classification of humor subtypes.

03

Transformer-based approach improves accuracy in humor subtype detection.

Abstract

Jokes are intentionally written to be funny, but not all jokes are created the same. Some jokes may be fit for a classroom of kindergarteners, but others are best reserved for a more mature audience. While recent work has shown impressive results on humor detection in text, here we instead investigate the more nuanced task of detecting humor subtypes, especially of the less innocent variety. To that end, we introduce a novel jokes dataset filtered from Reddit and solve the subtype classification task using a finetuned Transformer dubbed the Naughtyformer. Moreover, we show that our model is significantly better at detecting offensiveness in jokes compared to state-of-the-art methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

metaeval/offensive-humor
dataset· 23 dl
23 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHumor Studies and Applications · Comics and Graphic Narratives

MethodsAttention Is All You Need · Layer Normalization · Softmax · Adam · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Label Smoothing · Absolute Position Encodings · Linear Layer