Task-group Relatedness and Generalization Bounds for Regularized Multi-task Learning
Chao Zhang, Dacheng Tao, Tao Hu, Xiang Li

TL;DR
This paper provides theoretical analysis of regularized multi-task learning, establishing conditions under which it outperforms single-task learning and guarantees task-wise consistency based on task relatedness measures.
Contribution
It introduces new measures of task relatedness and derives generalization bounds for RMTL in a vector-valued framework, addressing key questions on performance and consistency.
Findings
RMTL can outperform STL with fewer task samples under certain relatedness conditions.
The paper establishes generalization bounds for RMTL based on task relatedness measures.
Provides conditions ensuring the consistency of each task during simultaneous learning.
Abstract
In this paper, we study the generalization performance of regularized multi-task learning (RMTL) in a vector-valued framework, where MTL is considered as a learning process for vector-valued functions. We are mainly concerned with two theoretical questions: 1) under what conditions does RMTL perform better with a smaller task sample size than STL? 2) under what conditions is RMTL generalizable and can guarantee the consistency of each task during simultaneous learning? In particular, we investigate two types of task-group relatedness: the observed discrepancy-dependence measure (ODDM) and the empirical discrepancy-dependence measure (EDDM), both of which detect the dependence between two groups of multiple related tasks (MRTs). We then introduce the Cartesian product-based uniform entropy number (CPUEN) to measure the complexities of vector-valued function classes. By applying the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Sparse and Compressive Sensing Techniques · Machine Learning and ELM
