Predictive Reasoning with Augmented Anomaly Contrastive Learning for Compositional Visual Relations
Chengtai Li, Yuting He, Jianfeng Ren, Ruibin Bai, Yitian Zhao, Heng Yu, Xudong Jiang

TL;DR
This paper introduces PR-A$^2$CL, a novel framework for compositional visual reasoning that combines anomaly contrastive learning with a predict-and-verify paradigm, significantly improving performance on complex CVR tasks.
Contribution
The paper proposes a new method integrating augmented anomaly contrastive learning with a rule-based reasoning paradigm for better compositional visual relation understanding.
Findings
Outperforms state-of-the-art models on SVRT, CVR, and MC$^2$R datasets.
Effectively models complex compositional rules.
Enhances generalization in visual reasoning tasks.
Abstract
While visual reasoning for simple analogies has received significant attention, compositional visual relations (CVR) remain relatively unexplored due to their greater complexity. To solve CVR tasks, we propose Predictive Reasoning with Augmented Anomaly Contrastive Learning (PR-ACL), \ie, to identify an outlier image given three other images that follow the same compositional rules. To address the challenge of modelling abundant compositional rules, an Augmented Anomaly Contrastive Learning is designed to distil discriminative and generalizable features by maximizing similarity among normal instances while minimizing similarity between normal and anomalous outliers. More importantly, a predict-and-verify paradigm is introduced for rule-based reasoning, in which a series of Predictive Anomaly Reasoning Blocks (PARBs) iteratively leverage features from three out of the four images to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI)
