Mastering Negation: Boosting Grounding Models via Grouped Opposition-Based Learning

Zesheng Yang; Xi Jiang; Bingzhang Hu; Weili Guan; Runmin Cong; Guo-Jun Qi; and Feng Zheng

arXiv:2603.12606·cs.CV·March 16, 2026

Mastering Negation: Boosting Grounding Models via Grouped Opposition-Based Learning

Zesheng Yang, Xi Jiang, Bingzhang Hu, Weili Guan, Runmin Cong, Guo-Jun Qi, and Feng Zheng

PDF

Open Access

TL;DR

This paper introduces D-Negation, a dataset with positive and negative semantic annotations, and a grouped opposition-based learning framework that improves vision-language grounding models' ability to interpret negation and complex expressions.

Contribution

The paper presents a novel dataset and a structured learning approach that explicitly models negation, enhancing grounding models' understanding of negative semantics.

Findings

01

Up to 4.4 mAP improvement on positive semantics

02

Up to 5.7 mAP improvement on negative semantics

03

Enhanced robustness and localization accuracy in grounding models

Abstract

Current vision-language detection and grounding models predominantly focus on prompts with positive semantics and often struggle to accurately interpret and ground complex expressions containing negative semantics. A key reason for this limitation is the lack of high-quality training data that explicitly captures discriminative negative samples and negation-aware language descriptions. To address this challenge, we introduce D-Negation, a new dataset that provides objects annotated with both positive and negative semantic descriptions. Building upon the observation that negation reasoning frequently appears in natural language, we further propose a grouped opposition-based learning framework that learns negation-aware representations from limited samples. Specifically, our method organizes opposing semantic descriptions from D-Negation into structured groups and formulates two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Topic Modeling