Generating Diverse Negations from Affirmative Sentences
Darian Rodriguez Vasquez, Afroditi Papadaki

TL;DR
This paper introduces NegVerse, a novel method for generating diverse negations from affirmative sentences to improve negation understanding in language models, addressing dataset limitations and enhancing negation diversity and quality.
Contribution
NegVerse provides a new rule-based approach with prompt tuning and filtering to generate diverse, high-quality negations, surpassing existing methods in diversity and syntactic preservation.
Findings
NegVerse produces negations with higher lexical similarity to original sentences.
It achieves better syntactic preservation compared to baseline methods.
NegVerse enhances negation diversity in generated datasets.
Abstract
Despite the impressive performance of large language models across various tasks, they often struggle with reasoning under negated statements. Negations are important in real-world applications as they encode negative polarity in verb phrases, clauses, or other expressions. Nevertheless, they are underrepresented in current benchmarks, which mainly include basic negation forms and overlook more complex ones, resulting in insufficient data for training a language model. In this work, we propose NegVerse, a method that tackles the lack of negation datasets by producing a diverse range of negation types from affirmative sentences, including verbal, non-verbal, and affixal forms commonly found in English text. We provide new rules for masking parts of sentences where negations are most likely to occur, based on syntactic structure and use a frozen baseline LLM and prompt tuning to generate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
