Learning Perceptual Inference by Contrasting
Chi Zhang, Baoxiong Jia, Feng Gao, Yixin Zhu, Hongjing Lu, Song-Chun, Zhu

TL;DR
This paper introduces CoPINet, a novel contrastive learning-based model that significantly improves machine reasoning on Raven's Progressive Matrices by leveraging perceptual inference and permutation invariance.
Contribution
The paper proposes CoPINet, a new model that combines contrastive learning and perceptual inference, achieving state-of-the-art results on RPM datasets for reasoning tasks.
Findings
CoPINet outperforms previous models on RPM datasets.
Contrastive learning enhances reasoning capabilities in AI.
Permutation-invariant design improves generalization.
Abstract
"Thinking in pictures," [1] i.e., spatial-temporal reasoning, effortless and instantaneous for humans, is believed to be a significant ability to perform logical induction and a crucial factor in the intellectual history of technology development. Modern Artificial Intelligence (AI), fueled by massive datasets, deeper models, and mighty computation, has come to a stage where (super-)human-level performances are observed in certain specific tasks. However, current AI's ability in "thinking in pictures" is still far lacking behind. In this work, we study how to improve machines' reasoning ability on one challenging task of this kind: Raven's Progressive Matrices (RPM). Specifically, we borrow the very idea of "contrast effects" from the field of psychology, cognition, and education to design and train a permutation-invariant model. Inspired by cognitive studies, we equip our model with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCognitive Science and Mapping · Image Retrieval and Classification Techniques · Neural Networks and Applications
