TL;DR
This paper investigates how transformer models detect humor in aligned sentence pairs, revealing that a single attention head can identify humorous words, providing insights into the model's interpretability.
Contribution
It introduces a challenging aligned dataset for humor detection and demonstrates that transformers can recognize humor with high accuracy, while also analyzing model mechanisms.
Findings
Transformer models achieve 78% accuracy on aligned humor detection.
A single attention head learns to identify humorous words.
Insights into how transformers recognize humor through attention analysis.
Abstract
The automatic detection of humor poses a grand challenge for natural language processing. Transformer-based systems have recently achieved remarkable results on this task, but they usually (1)~were evaluated in setups where serious vs humorous texts came from entirely different sources, and (2)~focused on benchmarking performance without providing insights into how the models work. We make progress in both respects by training and analyzing transformer-based humor recognition models on a recently introduced dataset consisting of minimal pairs of aligned sentences, one serious, the other humorous. We find that, although our aligned dataset is much harder than previous datasets, transformer-based models recognize the humorous sentence in an aligned pair with high accuracy (78%). In a careful error analysis, we characterize easy vs hard instances. Finally, by analyzing attention weights,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
