FALCON: Fast Visual Concept Learning by Integrating Images, Linguistic descriptions, and Conceptual Relations

Lingjie Mei; Jiayuan Mao; Ziqi Wang; Chuang Gan; Joshua B. Tenenbaum

arXiv:2203.16639 · cs.CV · April 1, 2022 · 6 cites

TL;DR

FALCON is a meta-learning framework that rapidly learns new visual concepts by integrating visual data, linguistic descriptions, and conceptual relations, enabling reasoning about unseen images with minimal examples.

Contribution

The paper introduces FALCON, a novel approach that combines multi-modal data streams and box embeddings for quick visual concept learning from limited data.

Findings

01 Effective on synthetic datasets

02 Demonstrates generalization to real-world data

03 Supports reasoning about unseen images

Abstract

We present a meta-learning framework for learning new visual concepts quickly, from just one or a few examples, guided by multiple naturally occurring data streams: simultaneously looking at images, reading sentences that describe the objects in the scene, and interpreting supplemental sentences that relate the novel concept with other concepts. The learned concepts support downstream applications, such as answering questions by reasoning about unseen images. Our model, namely FALCON, represents individual visual concepts, such as colors and shapes, as axis-aligned boxes in a high-dimensional space (the "box embedding space"). Given an input image and its paired sentence, our model first resolves the referential expression in the sentence and associates the novel concept with particular objects in the scene. Next, our model interprets supplemental sentences to relate the novel concept…
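To make the idea of representing concepts as axis-aligned boxes more concrete, the following is a minimal sketch using hard (non-learned) boxes in two dimensions. It is an illustration under stated assumptions, not the FALCON implementation: the `Box` class, the `entailment_score` function, and the example coordinates for `color`, `red`, and `blue` are all hypothetical, and the volume-ratio score is a generic way to read containment between boxes rather than the paper's own formulation.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Box:
    """Axis-aligned box described by its lower and upper corners (hypothetical helper)."""
    lower: Sequence[float]
    upper: Sequence[float]

    def volume(self) -> float:
        # Product of side lengths; an empty box (lower > upper) has volume 0.
        vol = 1.0
        for lo, hi in zip(self.lower, self.upper):
            vol *= max(hi - lo, 0.0)
        return vol

    def intersect(self, other: "Box") -> "Box":
        # The intersection of two axis-aligned boxes is itself an axis-aligned box.
        return Box(
            tuple(max(a, b) for a, b in zip(self.lower, other.lower)),
            tuple(min(a, b) for a, b in zip(self.upper, other.upper)),
        )

    def contains_point(self, point: Sequence[float]) -> bool:
        # An object embedding "has" the concept if it falls inside the box.
        return all(lo <= x <= hi for lo, x, hi in zip(self.lower, point, self.upper))


def entailment_score(child: Box, parent: Box) -> float:
    """Fraction of the child's volume that lies inside the parent (1.0 means fully contained)."""
    child_vol = child.volume()
    if child_vol == 0.0:
        return 0.0
    return child.intersect(parent).volume() / child_vol


# Assumed toy coordinates: "red" and "blue" are kinds of "color" and do not overlap.
color = Box((0.0, 0.0), (4.0, 4.0))
red = Box((1.0, 1.0), (2.0, 2.0))
blue = Box((3.0, 3.0), (4.0, 4.0))

print(entailment_score(red, color))   # 1.0: red lies entirely inside color
print(entailment_score(red, blue))    # 0.0: red and blue are disjoint
print(entailment_score(color, red))   # 0.0625: only 1/16 of color is red
print(red.contains_point((1.5, 1.5))) # True
```

In this toy setting, a relation such as "red is a kind of color" corresponds to one box sitting inside another, and mutually exclusive concepts correspond to disjoint boxes. A learned model would typically use smoothed, differentiable versions of these operations so that box corners can be trained by gradient descent.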

Videos

FALCON: Fast Visual Concept Learning by Integrating Images, Linguistic descriptions, and Conceptual Relations · SlidesLive

Taxonomy

Topics: Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques