Towards Recognizing New Semantic Concepts in New Visual Domains
Massimiliano Mancini

TL;DR
This thesis explores methods for enabling deep learning models to recognize new semantic concepts across unseen visual domains by transferring knowledge, extending pretrained models, and handling semantic shifts without access to original training data.
Contribution
It introduces novel approaches for domain generalization, knowledge transfer without original data, and recognizing unseen concepts in multiple domains, advancing open-world visual recognition.
Findings
Variants of batch-normalization improve domain adaptation and generalization.
Proposed methods enable incremental learning without catastrophic forgetting.
Domain and semantic mixing approach shows promise for recognizing unseen concepts.
Abstract
Deep learning models heavily rely on large scale annotated datasets for training. Unfortunately, datasets cannot capture the infinite variability of the real world, thus neural networks are inherently limited by the restricted visual and semantic information contained in their training set. In this thesis, we argue that it is crucial to design deep architectures that can operate in previously unseen visual domains and recognize novel semantic concepts. In the first part of the thesis, we describe different solutions to enable deep models to generalize to new visual domains, by transferring knowledge from a labeled source domain(s) to a domain (target) where no labeled data are available. We will show how variants of batch-normalization (BN) can be applied to different scenarios, from domain adaptation when source and target are mixtures of multiple latent domains, to domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
