Transfer Learning with Pre-trained Conditional Generative Models
Shin'ya Yamaguchi, Sekitoshi Kanai, Atsutoshi Kumagai, Daiki Chijiwa,, Hisashi Kashima

TL;DR
This paper introduces a transfer learning approach that leverages pre-trained conditional generative models to transfer knowledge without requiring overlapping labels, source data access, or identical target architectures, outperforming traditional methods.
Contribution
The paper proposes a novel transfer learning method using deep generative models with pseudo pre-training and semi-supervised learning stages, removing common assumptions in transfer learning.
Findings
Outperforms scratch training baselines.
Outperforms knowledge distillation methods.
Effective without source data access or label overlap.
Abstract
Transfer learning is crucial in training deep neural networks on new target tasks. Current transfer learning methods always assume at least one of (i) source and target task label spaces overlap, (ii) source datasets are available, and (iii) target network architectures are consistent with source ones. However, holding these assumptions is difficult in practical settings because the target task rarely has the same labels as the source task, the source dataset access is restricted due to storage costs and privacy, and the target architecture is often specialized to each task. To transfer source knowledge without these assumptions, we propose a transfer learning method that uses deep generative models and is composed of the following two stages: pseudo pre-training (PP) and pseudo semi-supervised learning (P-SSL). PP trains a target architecture with an artificial dataset synthesized by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Speech Recognition and Synthesis · Music and Audio Processing
