Multi-Objective Optimization for Sparse Deep Multi-Task Learning
S. S. Hotegni, M. Berkemeier, S. Peitz

TL;DR
This paper introduces a multi-objective optimization algorithm for deep multi-task learning that effectively balances conflicting criteria like task performance and sparsity, enabling adaptive model sparsification during training.
Contribution
It presents a novel scalarization-based optimization method that finds all optimal solutions and facilitates sparsity in deep multi-task models using standard optimizers.
Findings
Adaptive sparsification during training is possible without significant performance loss.
The method effectively balances multiple conflicting objectives in deep learning.
Experimental results demonstrate improved model efficiency with maintained accuracy.
Abstract
Different conflicting optimization criteria arise naturally in various Deep Learning scenarios. These can address different main tasks (i.e., in the setting of Multi-Task Learning), but also main and secondary tasks such as loss minimization versus sparsity. The usual approach is a simple weighting of the criteria, which formally only works in the convex setting. In this paper, we present a Multi-Objective Optimization algorithm using a modified Weighted Chebyshev scalarization for training Deep Neural Networks (DNNs) with respect to several tasks. By employing this scalarization technique, the algorithm can identify all optimal solutions of the original problem while reducing its complexity to a sequence of single-objective problems. The simplified problems are then solved using an Augmented Lagrangian method, enabling the use of popular optimization techniques such as Adam and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
MethodsAdam · Focus
