Visual Concept-Metaconcept Learning

Chi Han; Jiayuan Mao; Chuang Gan; Joshua B. Tenenbaum; Jiajun Wu

arXiv:2002.01464 · cs.CV · February 5, 2020 · 21 cites

TL;DR

This paper introduces VCML, a model that jointly learns visual concepts and metaconcepts, enabling better generalization and learning from limited or noisy data by exploiting their bidirectional relationship.

Contribution

The paper proposes a novel joint learning framework for concepts and metaconcepts that leverages their bidirectional connection to improve visual understanding and generalization.

Findings

1. VCML predicts metaconcept relations between unseen pairs of concepts.

2. Metaconcept knowledge improves visual concept learning from limited, noisy, and biased data.

3. Evaluation on both synthetic and real-world datasets supports these claims.

Abstract

Humans reason with concepts and metaconcepts: we recognize red and green from visual input; we also understand that they describe the same property of objects (i.e., the color). In this paper, we propose the visual concept-metaconcept learner (VCML) for joint learning of concepts and metaconcepts from images and associated question-answer pairs. The key is to exploit the bidirectional connection between visual concepts and metaconcepts. Visual representations provide grounding cues for predicting relations between unseen pairs of concepts. Knowing that red and green describe the same property of objects, we generalize to the fact that cube and sphere also describe the same property of objects, since they both categorize the shape of objects. Meanwhile, knowledge about metaconcepts empowers visual concept learning from limited, noisy, and even biased data. From just a few examples of…

Code & Models

Repositories

Glaciohound/VCML (PyTorch, official)

Taxonomy

Topics: Machine Learning in Bioinformatics · Multimodal Machine Learning Applications