A High-Dimensional Statistical Method for Optimizing Transfer Quantities in Multi-Source Transfer Learning
Qingyue Zhang, Haohao Fu, Guanbo Huang, Yaoyuan Liang, Chang Chu, Tianren Peng, Yanru Wu, Qi Li, Yang Li, Shao-Lun Huang

TL;DR
This paper introduces a theoretical framework and an algorithm for optimizing the number of source samples used in multi-source transfer learning, improving training efficiency and accuracy.
Contribution
It provides a novel high-dimensional statistical method to determine optimal transfer sample quantities and develops an architecture-agnostic algorithm implementing these insights.
Findings
The proposed algorithm outperforms state-of-the-art methods in accuracy.
It enhances data efficiency in multi-source transfer learning.
Experimental results validate the theoretical framework.
Abstract
Multi-source transfer learning provides an effective solution to data scarcity in real-world supervised learning scenarios by leveraging multiple source tasks. In this field, existing works typically use all available samples from sources in training, which constrains their training efficiency and may lead to suboptimal results. To address this, we propose a theoretical framework that answers the question: what is the optimal quantity of source samples needed from each source task to jointly train the target model? Specifically, we introduce a generalization error measure based on K-L divergence, and minimize it based on high-dimensional statistical analysis to determine the optimal transfer quantity for each source task. Additionally, we develop an architecture-agnostic and data-efficient algorithm OTQMS to implement our theoretical results for target model training in multi-source…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and ELM · Domain Adaptation and Few-Shot Learning
