Deep Transfer Learning with Joint Adaptation Networks

Mingsheng Long; Han Zhu; Jianmin Wang; Michael I. Jordan

arXiv:1605.06636·cs.LG·August 18, 2017·115 cites

Deep Transfer Learning with Joint Adaptation Networks

Mingsheng Long, Han Zhu, Jianmin Wang, Michael I. Jordan

PDF

Open Access 1 Repo

TL;DR

This paper introduces Joint Adaptation Networks (JAN), a deep transfer learning model that aligns joint distributions of multiple layers across domains using JMMD, achieving state-of-the-art results.

Contribution

JAN is the first to align joint distributions of multiple layers in deep networks for transfer learning, improving adaptation performance.

Findings

01

Achieves state-of-the-art results on standard datasets.

02

Effectively aligns joint distributions across domains.

03

Uses adversarial training with JMMD for domain adaptation.

Abstract

Deep networks have been successfully applied to learn transferable features for adapting models from a source domain to a different target domain. In this paper, we present joint adaptation networks (JAN), which learn a transfer network by aligning the joint distributions of multiple domain-specific layers across domains based on a joint maximum mean discrepancy (JMMD) criterion. Adversarial training strategy is adopted to maximize JMMD such that the distributions of the source and target domains are made more distinguishable. Learning can be performed by stochastic gradient descent with the gradients computed by back-propagation in linear-time. Experiments testify that our model yields state of the art results on standard datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thuml/Transfer-Learning-Library
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Speech Recognition and Synthesis