Leveraging Intrinsic Gradient Information for Further Training of Differentiable Machine Learning Models
Chris McDonagh, Xi Chen

TL;DR
This paper introduces methods to utilize derivative information from models to enhance the accuracy and performance of differentiable machine learning models across various applications.
Contribution
It generalizes the use of gradient information and provides practical methods to leverage it for improving ML models, including GANs, neural network tuning, and linear regression regularization.
Findings
Gradient information improves model accuracy
Enhanced GAN performance with GI
Efficient neural network complexity tuning
Abstract
Designing models that produce accurate predictions is the fundamental objective of machine learning (ML). This work presents methods demonstrating that when the derivatives of target variables (outputs) with respect to inputs can be extracted from processes of interest, e.g., neural networks (NN) based surrogate models, they can be leveraged to further improve the accuracy of differentiable ML models. This paper generalises the idea and provides practical methodologies that can be used to leverage gradient information (GI) across a variety of applications including: (1) Improving the performance of generative adversarial networks (GANs); (2) efficiently tuning NN model complexity; (3) regularising linear regressions. Numerical results show that GI can effective enhance ML models with existing datasets, demonstrating its value for a variety of applications.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis
MethodsLinear Regression
