Scalable Pareto Front Approximation for Deep Multi-Objective Learning

Michael Ruchte; Josif Grabocka

arXiv:2103.13392·cs.LG·October 15, 2021

Scalable Pareto Front Approximation for Deep Multi-Objective Learning

Michael Ruchte, Josif Grabocka

PDF

1 Repo

TL;DR

This paper introduces a scalable method for approximating the Pareto front in deep multi-objective learning by conditioning networks on preferences, achieving high-quality solutions efficiently.

Contribution

It proposes a novel approach that conditions neural networks directly on preferences and maintains a well-spread Pareto front, significantly reducing training overhead.

Findings

01

Achieves state-of-the-art Pareto front quality

02

Computes Pareto fronts significantly faster

03

Scales to large networks with minimal overhead

Abstract

Multi-objective optimization (MOO) is a prevalent challenge for Deep Learning, however, there exists no scalable MOO solution for truly deep neural networks. Prior work either demand optimizing a new network for every point on the Pareto front, or induce a large overhead to the number of trainable parameters by using hyper-networks conditioned on modifiable preferences. In this paper, we propose to condition the network directly on these preferences by augmenting them to the feature space. Furthermore, we ensure a well-spread Pareto front by penalizing the solutions to maintain a small angle to the preference vector. In a series of experiments, we demonstrate that our Pareto fronts achieve state-of-the-art quality despite being computed significantly faster. Furthermore, we showcase the scalability as our method approximates the full Pareto front on the CelebA dataset with an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ruchtem/cosmos
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsPointwise Convolution · Depthwise Convolution · Depthwise Separable Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · Batch Normalization · Squeeze-and-Excitation Block · Dense Connections · Inverted Residual Block · Dropout