Multi-Objective Optimization for Sparse Deep Multi-Task Learning

S. S. Hotegni; M. Berkemeier; S. Peitz

arXiv:2308.12243·cs.LG·March 27, 2024·1 cites

Multi-Objective Optimization for Sparse Deep Multi-Task Learning

S. S. Hotegni, M. Berkemeier, S. Peitz

PDF

Open Access 1 Repo

TL;DR

This paper introduces a multi-objective optimization algorithm for deep multi-task learning that effectively balances conflicting criteria like task performance and sparsity, enabling adaptive model sparsification during training.

Contribution

It presents a novel scalarization-based optimization method that finds all optimal solutions and facilitates sparsity in deep multi-task models using standard optimizers.

Findings

01

Adaptive sparsification during training is possible without significant performance loss.

02

The method effectively balances multiple conflicting objectives in deep learning.

03

Experimental results demonstrate improved model efficiency with maintained accuracy.

Abstract

Different conflicting optimization criteria arise naturally in various Deep Learning scenarios. These can address different main tasks (i.e., in the setting of Multi-Task Learning), but also main and secondary tasks such as loss minimization versus sparsity. The usual approach is a simple weighting of the criteria, which formally only works in the convex setting. In this paper, we present a Multi-Objective Optimization algorithm using a modified Weighted Chebyshev scalarization for training Deep Neural Networks (DNNs) with respect to several tasks. By employing this scalarization technique, the algorithm can identify all optimal solutions of the original problem while reducing its complexity to a sequence of single-objective problems. The simplified problems are then solved using an Augmented Lagrangian method, enabling the use of popular optimization techniques such as Adam and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

salomonhotegni/mdmtn
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification

MethodsAdam · Focus