DMTG: One-Shot Differentiable Multi-Task Grouping
Yuan Gao, Shuguo Jiang, Moran Li, Jin-Gang Yu, Gui-Song, Xia

TL;DR
DMTG introduces a fully differentiable, one-shot method for multi-task grouping that efficiently identifies task groups and trains models simultaneously, outperforming sequential approaches.
Contribution
It proposes a novel differentiable pruning approach for multi-task grouping that improves efficiency and reduces bias compared to prior sequential methods.
Findings
Improves training efficiency over sequential methods.
Effectively exploits high-order task affinity.
Achieves promising performance on CelebA and Taskonomy datasets.
Abstract
We aim to address Multi-Task Learning (MTL) with a large number of tasks by Multi-Task Grouping (MTG). Given N tasks, we propose to simultaneously identify the best task groups from 2^N candidates and train the model weights simultaneously in one-shot, with the high-order task-affinity fully exploited. This is distinct from the pioneering methods which sequentially identify the groups and train the model weights, where the group identification often relies on heuristics. As a result, our method not only improves the training efficiency, but also mitigates the objective bias introduced by the sequential procedures that potentially lead to a suboptimal solution. Specifically, we formulate MTG as a fully differentiable pruning problem on an adaptive network architecture determined by an underlying Categorical distribution. To categorize N tasks into K groups (represented by K encoder…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Graph Neural Networks · Multimodal Machine Learning Applications
MethodsSparse Evolutionary Training · Pruning
