Giving each task what it needs -- leveraging structured sparsity for tailored multi-task learning
Richa Upadhyay, Ronald Phlypo, Rajkumar Saini, Marcus Liwicki

TL;DR
This paper introduces Layer-Optimized Multi-Task (LOMT) models that leverage structured sparsity to select optimal features and layers for each task, improving multi-task learning performance especially in resource-constrained environments.
Contribution
The work proposes a novel two-step approach using structured sparsity to identify task-specific layers and decoders, enhancing multi-task learning efficiency and effectiveness.
Findings
LOMT models outperform conventional MTL models on NYU-v2 and CelebAMask-HD datasets.
Structured sparsity effectively identifies optimal layers for individual tasks.
Tailored architecture reduces redundancy and improves task-specific feature utilization.
Abstract
In the Multi-task Learning (MTL) framework, every task demands distinct feature representations, ranging from low-level to high-level attributes. It is vital to address the specific (feature/parameter) needs of each task, especially in computationally constrained environments. This work, therefore, introduces Layer-Optimized Multi-Task (LOMT) models that utilize structured sparsity to refine feature selection for individual tasks and enhance the performance of all tasks in a multi-task scenario. Structured or group sparsity systematically eliminates parameters from trivial channels and, sometimes, eventually, entire layers within a convolution neural network during training. Consequently, the remaining layers provide the most optimal features for a given task. In this two-step approach, we subsequently leverage this sparsity-induced optimal layer information to build the LOMT models by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
MethodsConvolution · Feature Selection
