Pareto Navigation Gradient Descent: a First-Order Algorithm for   Optimization in Pareto Set

Mao Ye; Qiang Liu

arXiv:2110.08713·math.OC·September 5, 2022·1 cites

Pareto Navigation Gradient Descent: a First-Order Algorithm for Optimization in Pareto Set

Mao Ye, Qiang Liu

PDF

Open Access

TL;DR

This paper introduces Pareto Navigation Gradient Descent, a first-order algorithm that efficiently finds Pareto models optimizing a reference criterion, applicable to large-scale multi-task learning problems.

Contribution

It presents the first practical first-order method for OPT-in-Pareto, enabling efficient optimization in complex, non-convex multi-objective settings.

Findings

01

Efficiently finds Pareto models with a first-order method.

02

Demonstrates effectiveness on multi-task learning problems.

03

Provides theoretical convergence guarantees.

Abstract

Many modern machine learning applications, such as multi-task learning, require finding optimal model parameters to trade-off multiple objective functions that may conflict with each other. The notion of the Pareto set allows us to focus on the set of (often infinite number of) models that cannot be strictly improved. But it does not provide an actionable procedure for picking one or a few special models to return to practical users. In this paper, we consider \emph{optimization in Pareto set (OPT-in-Pareto)}, the problem of finding Pareto models that optimize an extra reference criterion function within the Pareto set. This function can either encode a specific preference from the users, or represent a generic diversity measure for obtaining a set of diversified Pareto models that are representative of the whole Pareto set. Unfortunately, despite being a highly useful framework,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Model Reduction and Neural Networks · Gaussian Processes and Bayesian Inference