Leveraging Intrinsic Gradient Information for Further Training of   Differentiable Machine Learning Models

Chris McDonagh; Xi Chen

arXiv:2112.00094·cs.LG·January 17, 2022

Leveraging Intrinsic Gradient Information for Further Training of Differentiable Machine Learning Models

Chris McDonagh, Xi Chen

PDF

Open Access

TL;DR

This paper introduces methods to utilize derivative information from models to enhance the accuracy and performance of differentiable machine learning models across various applications.

Contribution

It generalizes the use of gradient information and provides practical methods to leverage it for improving ML models, including GANs, neural network tuning, and linear regression regularization.

Findings

01

Gradient information improves model accuracy

02

Enhanced GAN performance with GI

03

Efficient neural network complexity tuning

Abstract

Designing models that produce accurate predictions is the fundamental objective of machine learning (ML). This work presents methods demonstrating that when the derivatives of target variables (outputs) with respect to inputs can be extracted from processes of interest, e.g., neural networks (NN) based surrogate models, they can be leveraged to further improve the accuracy of differentiable ML models. This paper generalises the idea and provides practical methodologies that can be used to leverage gradient information (GI) across a variety of applications including: (1) Improving the performance of generative adversarial networks (GANs); (2) efficiently tuning NN model complexity; (3) regularising linear regressions. Numerical results show that GI can effective enhance ML models with existing datasets, demonstrating its value for a variety of applications.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis

MethodsLinear Regression