Giving each task what it needs -- leveraging structured sparsity for   tailored multi-task learning

Richa Upadhyay; Ronald Phlypo; Rajkumar Saini; Marcus Liwicki

arXiv:2406.03048·cs.CV·September 6, 2024

Giving each task what it needs -- leveraging structured sparsity for tailored multi-task learning

Richa Upadhyay, Ronald Phlypo, Rajkumar Saini, Marcus Liwicki

PDF

Open Access 1 Repo

TL;DR

This paper introduces Layer-Optimized Multi-Task (LOMT) models that leverage structured sparsity to select optimal features and layers for each task, improving multi-task learning performance especially in resource-constrained environments.

Contribution

The work proposes a novel two-step approach using structured sparsity to identify task-specific layers and decoders, enhancing multi-task learning efficiency and effectiveness.

Findings

01

LOMT models outperform conventional MTL models on NYU-v2 and CelebAMask-HD datasets.

02

Structured sparsity effectively identifies optimal layers for individual tasks.

03

Tailored architecture reduces redundancy and improves task-specific feature utilization.

Abstract

In the Multi-task Learning (MTL) framework, every task demands distinct feature representations, ranging from low-level to high-level attributes. It is vital to address the specific (feature/parameter) needs of each task, especially in computationally constrained environments. This work, therefore, introduces Layer-Optimized Multi-Task (LOMT) models that utilize structured sparsity to refine feature selection for individual tasks and enhance the performance of all tasks in a multi-task scenario. Structured or group sparsity systematically eliminates parameters from trivial channels and, sometimes, eventually, entire layers within a convolution neural network during training. Consequently, the remaining layers provide the most optimal features for a given task. In this two-step approach, we subsequently leverage this sparsity-induced optimal layer information to build the LOMT models by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ricupa/layer-optimized-multi-task-model
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning

MethodsConvolution · Feature Selection