Learning Gradients of Convex Functions with Monotone Gradient Networks
Shreyas Chaudhari, Srinivasa Pranav, Jos\'e M. F. Moura

TL;DR
This paper introduces two neural network architectures, C-MGN and M-MGN, designed to learn the gradients of convex functions, demonstrating improved training efficiency and accuracy, with applications in optimal transport mapping.
Contribution
The paper proposes novel monotone gradient neural networks that effectively learn convex function gradients with fewer parameters and better accuracy than existing methods.
Findings
Easier to train than state-of-the-art methods
More accurate in learning monotone gradient fields
Successfully applied to optimal transport mappings
Abstract
While much effort has been devoted to deriving and analyzing effective convex formulations of signal processing problems, the gradients of convex functions also have critical applications ranging from gradient-based optimization to optimal transport. Recent works have explored data-driven methods for learning convex objective functions, but learning their monotone gradients is seldom studied. In this work, we propose C-MGN and M-MGN, two monotone gradient neural network architectures for directly learning the gradients of convex functions. We show that, compared to state of the art methods, our networks are easier to train, learn monotone gradient fields more accurately, and use significantly fewer parameters. We further demonstrate their ability to learn optimal transport mappings to augment driving image data.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning
