Predictive Reasoning with Augmented Anomaly Contrastive Learning for Compositional Visual Relations

Chengtai Li; Yuting He; Jianfeng Ren; Ruibin Bai; Yitian Zhao; Heng Yu; Xudong Jiang

arXiv:2603.01125·cs.CV·March 3, 2026

Predictive Reasoning with Augmented Anomaly Contrastive Learning for Compositional Visual Relations

Chengtai Li, Yuting He, Jianfeng Ren, Ruibin Bai, Yitian Zhao, Heng Yu, Xudong Jiang

PDF

Open Access

TL;DR

This paper introduces PR-A$^2$CL, a novel framework for compositional visual reasoning that combines anomaly contrastive learning with a predict-and-verify paradigm, significantly improving performance on complex CVR tasks.

Contribution

The paper proposes a new method integrating augmented anomaly contrastive learning with a rule-based reasoning paradigm for better compositional visual relation understanding.

Findings

01

Outperforms state-of-the-art models on SVRT, CVR, and MC$^2$R datasets.

02

Effectively models complex compositional rules.

03

Enhances generalization in visual reasoning tasks.

Abstract

While visual reasoning for simple analogies has received significant attention, compositional visual relations (CVR) remain relatively unexplored due to their greater complexity. To solve CVR tasks, we propose Predictive Reasoning with Augmented Anomaly Contrastive Learning (PR-A $^{2}$ CL), \ie, to identify an outlier image given three other images that follow the same compositional rules. To address the challenge of modelling abundant compositional rules, an Augmented Anomaly Contrastive Learning is designed to distil discriminative and generalizable features by maximizing similarity among normal instances while minimizing similarity between normal and anomalous outliers. More importantly, a predict-and-verify paradigm is introduced for rule-based reasoning, in which a series of Predictive Anomaly Reasoning Blocks (PARBs) iteratively leverage features from three out of the four images to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI)