Transfer Learning with Pre-trained Conditional Generative Models

Shin'ya Yamaguchi; Sekitoshi Kanai; Atsutoshi Kumagai; Daiki Chijiwa; Hisashi Kashima

arXiv:2204.12833 · cs.LG · February 21, 2025 · 1 citation

Open Access

TL;DR

This paper introduces a transfer learning approach that uses pre-trained conditional generative models to transfer knowledge without requiring overlapping labels, source data access, or identical target architectures. It reports outperforming scratch training and knowledge distillation baselines.

Contribution

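The paper proposes a transfer learning method that uses deep generative models in two stages, pseudo pre-training (PP) and pseudo semi-supervised learning (P-SSL). The method drops three common transfer learning assumptions: overlapping source and target label spaces, access to source datasets, and target architectures consistent with the source ones.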

Findings

01. Outperforms scratch training baselines.

02. Outperforms knowledge distillation methods.

03. Effective without source data access or label overlap.

Abstract

Transfer learning is crucial in training deep neural networks on new target tasks. Current transfer learning methods always assume at least one of (i) source and target task label spaces overlap, (ii) source datasets are available, and (iii) target network architectures are consistent with source ones. However, holding these assumptions is difficult in practical settings because the target task rarely has the same labels as the source task, the source dataset access is restricted due to storage costs and privacy, and the target architecture is often specialized to each task. To transfer source knowledge without these assumptions, we propose a transfer learning method that uses deep generative models and is composed of the following two stages: pseudo pre-training (PP) and pseudo semi-supervised learning (P-SSL). PP trains a target architecture with an artificial dataset synthesized by…
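
The pseudo pre-training idea named in the abstract, training a target architecture on an artificial dataset, can be illustrated with a toy sketch. This is not the authors' implementation, and the abstract excerpt above is truncated, so every detail here is an assumption: `toy_conditional_generator` is a hypothetical stand-in for a pre-trained conditional generative model, a linear softmax classifier stands in for the target architecture, and the labels, sample counts, and hyperparameters are made up for illustration.

```python
import math
import random


def toy_conditional_generator(label, rng):
    """Hypothetical stand-in for a pre-trained conditional generative model:
    returns a 2-D sample whose mean depends on the conditioning label."""
    centers = {0: (-2.0, 0.0), 1: (2.0, 0.0), 2: (0.0, 2.0)}
    cx, cy = centers[label]
    return [cx + rng.gauss(0.0, 0.5), cy + rng.gauss(0.0, 0.5)]


def synthesize_dataset(generator, labels, n_per_label, rng):
    """Build an artificial labeled dataset by sampling the generator per label."""
    data = [(generator(y, rng), y) for y in labels for _ in range(n_per_label)]
    rng.shuffle(data)
    return data


def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict_proba(weights, x):
    # weights[k] = [w1, w2, bias] for class k
    return softmax([w[0] * x[0] + w[1] * x[1] + w[2] for w in weights])


def pseudo_pretrain(data, n_classes, epochs=20, lr=0.1):
    """Train the toy 'target architecture' (linear softmax classifier)
    on synthesized samples with plain SGD on the cross-entropy loss."""
    weights = [[0.0, 0.0, 0.0] for _ in range(n_classes)]
    for _ in range(epochs):
        for x, y in data:
            probs = predict_proba(weights, x)
            for k in range(n_classes):
                grad = probs[k] - (1.0 if k == y else 0.0)
                weights[k][0] -= lr * grad * x[0]
                weights[k][1] -= lr * grad * x[1]
                weights[k][2] -= lr * grad
    return weights


def accuracy(weights, data):
    correct = 0
    for x, y in data:
        probs = predict_proba(weights, x)
        if probs.index(max(probs)) == y:
            correct += 1
    return correct / len(data)


rng = random.Random(0)  # fixed seed so the sketch is reproducible
synthetic = synthesize_dataset(toy_conditional_generator, [0, 1, 2], 100, rng)
pretrained = pseudo_pretrain(synthetic, n_classes=3)
train_acc = accuracy(pretrained, synthetic)
```

In this sketch no source data is touched: the only inputs are the generator and a list of labels to condition on. The second stage, P-SSL, is not sketched because the excerpt above does not describe it in full.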

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Speech Recognition and Synthesis · Music and Audio Processing