DMTG: One-Shot Differentiable Multi-Task Grouping

Yuan Gao; Shuguo Jiang; Moran Li; Jin-Gang Yu; Gui-Song; Xia

arXiv:2407.05082·cs.LG·July 9, 2024

DMTG: One-Shot Differentiable Multi-Task Grouping

Yuan Gao, Shuguo Jiang, Moran Li, Jin-Gang Yu, Gui-Song, Xia

PDF

Open Access 1 Repo 1 Video

TL;DR

DMTG introduces a fully differentiable, one-shot method for multi-task grouping that efficiently identifies task groups and trains models simultaneously, outperforming sequential approaches.

Contribution

It proposes a novel differentiable pruning approach for multi-task grouping that improves efficiency and reduces bias compared to prior sequential methods.

Findings

01

Improves training efficiency over sequential methods.

02

Effectively exploits high-order task affinity.

03

Achieves promising performance on CelebA and Taskonomy datasets.

Abstract

We aim to address Multi-Task Learning (MTL) with a large number of tasks by Multi-Task Grouping (MTG). Given N tasks, we propose to simultaneously identify the best task groups from 2^N candidates and train the model weights simultaneously in one-shot, with the high-order task-affinity fully exploited. This is distinct from the pioneering methods which sequentially identify the groups and train the model weights, where the group identification often relies on heuristics. As a result, our method not only improves the training efficiency, but also mitigates the objective bias introduced by the sequential procedures that potentially lead to a suboptimal solution. Specifically, we formulate MTG as a fully differentiable pruning problem on an adaptive network architecture determined by an underlying Categorical distribution. To categorize N tasks into K groups (represented by K encoder…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ethanygao/dmtg
pytorchOfficial

Videos

DMTG: One-Shot Differentiable Multi-Task Grouping· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Graph Neural Networks · Multimodal Machine Learning Applications

MethodsSparse Evolutionary Training · Pruning