Learning to see across Domains and Modalities

Fabio Maria Carlucci

arXiv:1902.04992·cs.CV·February 14, 2019

TL;DR

This paper explores transfer learning techniques for visual object recognition, focusing on domain adaptation and cross-modality transfer, including RGB-D recognition in robotics, to improve model performance with limited data.

Contribution

It introduces new methods for unsupervised domain adaptation and cross-modality transfer learning, addressing challenges in robotic perception with depth data.

Findings

1. Effective feature and image transfer methods for domain adaptation.
2. Successful use of synthetic data for depth modality recognition.
3. Cross-modality transfer learning improves RGB-D recognition accuracy.

Abstract

Deep learning has raised hopes and expectations as a general solution for many applications; indeed it has proven effective, but it has also shown a strong dependence on large quantities of data. Luckily, it has been shown that, even when data is scarce, a successful model can be trained by reusing prior knowledge. Thus, developing techniques for transfer learning, in its broadest definition, is a crucial element towards the deployment of effective and accurate intelligent systems. This thesis will focus on a family of transfer learning methods applied to the task of visual object recognition, specifically image classification. Transfer learning is a general term, and specific settings have been given specific names: when the learner has access only to unlabeled data from a target domain and labeled data from a different domain (the source), the problem is known as that of…

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications