TL;DR
This paper develops sarcasm detection for Slovenian, a less-resourced language, using translation and large language models, showing that larger models and ensembling improve detection accuracy close to human agreement.
Contribution
It introduces a novel approach for sarcasm detection in a less-resourced language using translation and large transformer models, demonstrating effectiveness of model size and ensembling.
Findings
Larger models outperform smaller ones in sarcasm detection.
Ensembling models slightly improves performance.
Best model achieves an F1-score of 0.765, near human agreement.
Abstract
The sarcasm detection task in natural language processing tries to classify whether an utterance is sarcastic or not. It is related to sentiment analysis since it often inverts surface sentiment. Because sarcastic sentences are highly dependent on context, and they are often accompanied by various non-verbal cues, the task is challenging. Most of related work focuses on high-resourced languages like English. To build a sarcasm detection dataset for a less-resourced language, such as Slovenian, we leverage two modern techniques: a machine translation specific medium-size transformer model, and a very large generative language model. We explore the viability of translated datasets and how the size of a pretrained transformer affects its ability to detect sarcasm. We train ensembles of detection models and evaluate models' performance. The results show that larger models generally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
