Learning Gradients of Convex Functions with Monotone Gradient Networks

Shreyas Chaudhari; Srinivasa Pranav; José M. F. Moura

arXiv:2301.10862·cs.LG·March 21, 2023

Open Access

TL;DR

This paper introduces two neural network architectures, C-MGN and M-MGN, that directly learn the gradients of convex functions. Compared with state-of-the-art methods, they are reported to be easier to train, more accurate, and more parameter-efficient, and they are applied to learning optimal transport mappings.

Contribution

The paper proposes monotone gradient neural networks that learn the gradients of convex functions with fewer parameters and better accuracy than existing methods.

Findings

01. Easier to train than state-of-the-art methods

02. More accurate in learning monotone gradient fields

03. Successfully applied to optimal transport mappings

Abstract

While much effort has been devoted to deriving and analyzing effective convex formulations of signal processing problems, the gradients of convex functions also have critical applications ranging from gradient-based optimization to optimal transport. Recent works have explored data-driven methods for learning convex objective functions, but learning their monotone gradients is seldom studied. In this work, we propose C-MGN and M-MGN, two monotone gradient neural network architectures for directly learning the gradients of convex functions. We show that, compared to state-of-the-art methods, our networks are easier to train, learn monotone gradient fields more accurately, and use significantly fewer parameters. We further demonstrate their ability to learn optimal transport mappings to augment driving image data.
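To make the term "monotone gradient" concrete, the sketch below checks the defining property of the gradient of a convex function: a map g is monotone when (g(x) - g(y)) · (x - y) >= 0 for all x and y. The map used here, g(x) = Wᵀ tanh(Wx + b), is an illustrative assumption and not the C-MGN or M-MGN architecture from the paper; it is simply the gradient of the convex function f(x) = Σᵢ log cosh(wᵢᵀx + bᵢ). The names `monotone_map`, `W`, and `b`, and all sizes, are hypothetical.

```python
import numpy as np


def monotone_map(x, W, b):
    """Illustrative monotone map g(x) = W^T tanh(W x + b).

    Not the paper's architecture. Its Jacobian, W^T diag(1 - tanh^2(W x + b)) W,
    is positive semidefinite, so g is the gradient of a convex function.
    """
    return W.T @ np.tanh(W @ x + b)


rng = np.random.default_rng(0)
W = rng.standard_normal((8, 3))  # hypothetical hidden width 8, input dimension 3
b = rng.standard_normal(8)

# Empirically check monotonicity on random pairs of points.
xs = rng.standard_normal((1000, 3))
ys = rng.standard_normal((1000, 3))
gaps = np.array([
    (monotone_map(x, W, b) - monotone_map(y, W, b)) @ (x - y)
    for x, y in zip(xs, ys)
])
min_gap = gaps.min()
print("smallest monotonicity gap:", min_gap)
```

The smallest gap should be nonnegative up to floating-point error, because tanh is increasing and each term (tanh(uᵢ) - tanh(vᵢ))(uᵢ - vᵢ) is therefore nonnegative. An unconstrained network offers no such guarantee, which is one way to read the motivation for architectures that are monotone by construction.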

Taxonomy

Topics: Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning