Conic Multi-Task Classification
Cong Li, Michael Georgiopoulos, Georgios C. Anagnostopoulos

TL;DR
This paper introduces Conic Multi-Task Learning (MTL), providing a theoretical generalization bound that explains its advantages over traditional Average MTL and proposing a new model that optimizes this bound.
Contribution
It derives the first theoretical generalization bound for Conic MTL, showing that optimal combination coefficients may differ from equal weighting, and proposes a model to optimize these coefficients.
Findings
Conic MTL can outperform Average MTL based on the generalization bound.
The optimal combination coefficients are not necessarily equal, challenging previous heuristics.
Experimental results validate the proposed model's effectiveness.
Abstract
Traditionally, Multi-task Learning (MTL) models optimize the average of task-related objective functions, which is an intuitive approach and which we will be referring to as Average MTL. However, a more general framework, referred to as Conic MTL, can be formulated by considering conic combinations of the objective functions instead; in this framework, Average MTL arises as a special case, when all combination coefficients equal 1. Although the advantage of Conic MTL over Average MTL has been shown experimentally in previous works, no theoretical justification has been provided to date. In this paper, we derive a generalization bound for the Conic MTL method, and demonstrate that the tightest bound is not necessarily achieved, when all combination coefficients equal 1; hence, Average MTL may not always be the optimal choice, and it is important to consider Conic MTL. As a byproduct of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Machine Learning and ELM
