TL;DR
This paper investigates the use of large neural language models and ensembles to detect and rate humor and offensiveness in texts, addressing the challenge of subjectivity and data scarcity in this domain.
Contribution
It demonstrates that large neural models can effectively capture humor and offense nuances, achieving competitive results on SemEval-2021 Task 7 despite limited labeled data.
Findings
Achieved third place in subtask 1b
Ranked around top 33% overall
Models effectively capture humor and offense nuances
Abstract
Humor and Offense are highly subjective due to multiple word senses, cultural knowledge, and pragmatic competence. Hence, accurately detecting humorous and offensive texts has several compelling use cases in Recommendation Systems and Personalized Content Moderation. However, due to the lack of an extensive labeled dataset, most prior works in this domain haven't explored large neural models for subjective humor understanding. This paper explores whether large neural models and their ensembles can capture the intricacies associated with humor/offense detection and rating. Our experiments on the SemEval-2021 Task 7: HaHackathon show that we can develop reasonable humor and offense detection systems with such models. Our models are ranked third in subtask 1b and consistently ranked around the top 33% of the leaderboard for the remaining subtasks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
