Towards Recognizing New Semantic Concepts in New Visual Domains

Massimiliano Mancini

arXiv:2012.09058·cs.CV·December 17, 2020·1 cites

Towards Recognizing New Semantic Concepts in New Visual Domains

Massimiliano Mancini

PDF

Open Access 1 Repo

TL;DR

This thesis explores methods for enabling deep learning models to recognize new semantic concepts across unseen visual domains by transferring knowledge, extending pretrained models, and handling semantic shifts without access to original training data.

Contribution

It introduces novel approaches for domain generalization, knowledge transfer without original data, and recognizing unseen concepts in multiple domains, advancing open-world visual recognition.

Findings

01

Variants of batch-normalization improve domain adaptation and generalization.

02

Proposed methods enable incremental learning without catastrophic forgetting.

03

Domain and semantic mixing approach shows promise for recognizing unseen concepts.

Abstract

Deep learning models heavily rely on large scale annotated datasets for training. Unfortunately, datasets cannot capture the infinite variability of the real world, thus neural networks are inherently limited by the restricted visual and semantic information contained in their training set. In this thesis, we argue that it is crucial to design deep architectures that can operate in previously unseen visual domains and recognize novel semantic concepts. In the first part of the thesis, we describe different solutions to enable deep models to generalize to new visual domains, by transferring knowledge from a labeled source domain(s) to a domain (target) where no labeled data are available. We will show how variants of batch-normalization (BN) can be applied to different scenarios, from domain adaptation when source and target are mixtures of multiple latent domains, to domain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mancinimassimiliano/adagraph
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications