Modelling Adjectival Modification Effects on Semantic Plausibility
Anna Golub, Beate Zywietz, and Annerose Eichel

TL;DR
This paper investigates how adjectival modifications influence the perceived plausibility of events, using the ADEPT benchmark, and evaluates the performance of transformer-based models, revealing current limitations and the importance of balanced evaluation.
Contribution
It introduces a novel approach using sentence transformers for modeling plausibility changes due to adjectival modifications and emphasizes the need for realistic evaluation methods.
Findings
Transformers and sentence transformers struggle with the task.
Sentence transformers underperform compared to RoBERTa.
Balanced evaluation is crucial for trustworthy results.
Abstract
While the task of assessing the plausibility of events such as ''news is relevant'' has been addressed by a growing body of work, less attention has been paid to capturing changes in plausibility as triggered by event modification. Understanding changes in plausibility is relevant for tasks such as dialogue generation, commonsense reasoning, and hallucination detection as it allows to correctly model, for example, ''gentle sarcasm'' as a sign of closeness rather than unkindness among friends [9]. In this work, we tackle the ADEPT challenge benchmark [6] consisting of 16K English sentence pairs differing by exactly one adjectival modifier. Our modeling experiments provide a conceptually novel method by using sentence transformers, and reveal that both they and transformer-based models struggle with the task at hand, and sentence transformers - despite their conceptual alignment with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
