Concept Generalization in Visual Representation Learning

Mert Bulent Sariyildiz; Yannis Kalantidis; Diane Larlus; Karteek; Alahari

arXiv:2012.05649·cs.CV·September 13, 2021

Concept Generalization in Visual Representation Learning

Mert Bulent Sariyildiz, Yannis Kalantidis, Diane Larlus, Karteek, Alahari

PDF

1 Repo

TL;DR

This paper introduces ImageNet-CoG, a benchmark that measures how well visual models generalize to unseen concepts based on semantic relationships, revealing the impact of architecture and training methods.

Contribution

It proposes a new semantic-aware benchmark for concept generalization and provides a large-scale analysis of various models' performance on it.

Findings

01

Semantic relationships influence generalization performance.

02

Model architecture and training data significantly affect concept transferability.

03

Transformer models outperform convolutional ones in generalization tasks.

Abstract

Measuring concept generalization, i.e., the extent to which models trained on a set of (seen) visual concepts can be leveraged to recognize a new set of (unseen) concepts, is a popular way of evaluating visual representations, especially in a self-supervised learning framework. Nonetheless, the choice of unseen concepts for such an evaluation is usually made arbitrarily, and independently from the seen concepts used to train representations, thus ignoring any semantic relationships between the two. In this paper, we argue that the semantic relationships between seen and unseen concepts affect generalization performance and propose ImageNet-CoG, a novel benchmark on the ImageNet-21K (IN-21K) dataset that enables measuring concept generalization in a principled way. Our benchmark leverages expert knowledge that comes from WordNet in order to define a sequence of unseen IN-21K concept sets…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

naver/cog
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution