Deep Transfer Learning: Model Framework and Error Analysis
Yuling Jiao, Huazhen Lin, Yuchen Luo, Jerry Zhijian Yang

TL;DR
This paper introduces a deep transfer learning framework that leverages multi-domain data to improve downstream task performance, with features for automatic domain feature identification and enhanced interpretability, supported by theoretical error analysis and empirical validation.
Contribution
It proposes a novel deep transfer learning framework with automatic feature identification and interpretability, along with theoretical error bounds and empirical validation.
Findings
Significantly improves convergence rates for Lipschitz functions in downstream tasks.
Reduces error bounds from no transfer to partial and complete transfer scenarios.
Validated effectiveness on image classification and regression datasets.
Abstract
This paper presents a framework for deep transfer learning, which aims to leverage information from multi-domain upstream data with a large number of samples to a single-domain downstream task with a considerably smaller number of samples , where , in order to enhance performance on downstream task. Our framework offers several intriguing features. First, it allows the existence of both shared and domain-specific features across multi-domain data and provides a framework for automatic identification, achieving precise transfer and utilization of information. Second, the framework explicitly identifies upstream features that contribute to downstream tasks, establishing clear relationships between upstream domains and downstream tasks, thereby enhancing interpretability. Error analysis shows that our framework can significantly improve the convergence rate for learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
