Sobolev Training for Neural Networks

Wojciech Marian Czarnecki; Simon Osindero; Max Jaderberg; Grzegorz Świrszcz; Razvan Pascanu

arXiv:1706.04859 · cs.LG · July 27, 2017 · 60 cites

Open Access

TL;DR

Sobolev Training improves neural network function approximation by training on target derivatives as well as target values, with reported gains in accuracy and generalization for regression, policy distillation, and synthetic gradient applications.

Contribution

The paper introduces Sobolev Training, a novel method that integrates derivative information into neural network training to improve performance and data efficiency.

Findings

01. Improved accuracy in regression tasks.

02. Enhanced generalization in policy distillation.

03. Effective in large-scale synthetic gradient applications.

Abstract

At the heart of deep learning we aim to use neural networks as function approximators - training them to produce outputs from inputs in emulation of a ground truth function or data creation process. In many cases we only have access to input-output pairs from the ground truth; however, it is becoming more common to have access to derivatives of the target output with respect to the input - for example when the ground truth function is itself a neural network, such as in network compression or distillation. Generally these target derivatives are not computed, or are ignored. This paper introduces Sobolev Training for neural networks, which is a method for incorporating these target derivatives in addition to the target values while training. By optimising neural networks to not only approximate the function's outputs but also the function's derivatives we encode additional information…
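The idea described in the abstract, matching target derivatives in addition to target values, can be illustrated with a minimal sketch. This is not the paper's implementation: a cubic polynomial stands in for the neural network so that the input derivative and the parameter gradient can be written by hand, and the names `sobolev_loss`, `sobolev_grad`, `train`, the weight `lam`, and the target sin(x) are all assumptions chosen for illustration.

```python
import math

def poly(coeffs, x):
    """Value of the model sum_k coeffs[k] * x**k (a stand-in for a network)."""
    return sum(a * x**k for k, a in enumerate(coeffs))

def poly_dx(coeffs, x):
    """Derivative of the model output with respect to its input x."""
    return sum(k * a * x**(k - 1) for k, a in enumerate(coeffs) if k > 0)

def sobolev_loss(coeffs, xs, ys, dys, lam=1.0):
    """Mean squared error on values plus lam times mean squared error on derivatives."""
    total = 0.0
    for x, y, dy in zip(xs, ys, dys):
        total += (poly(coeffs, x) - y) ** 2 + lam * (poly_dx(coeffs, x) - dy) ** 2
    return total / len(xs)

def sobolev_grad(coeffs, xs, ys, dys, lam=1.0):
    """Gradient of sobolev_loss with respect to the model coefficients."""
    grad = [0.0] * len(coeffs)
    for x, y, dy in zip(xs, ys, dys):
        r_val = poly(coeffs, x) - y      # residual on the function value
        r_der = poly_dx(coeffs, x) - dy  # residual on the input derivative
        for k in range(len(coeffs)):
            d_der = k * x**(k - 1) if k > 0 else 0.0
            grad[k] += 2.0 * (r_val * x**k + lam * r_der * d_der)
    return [g / len(xs) for g in grad]

def train(xs, ys, dys, degree=3, lam=1.0, lr=0.02, steps=2000):
    """Plain gradient descent on the combined value-and-derivative loss."""
    coeffs = [0.0] * (degree + 1)
    for _ in range(steps):
        g = sobolev_grad(coeffs, xs, ys, dys, lam)
        coeffs = [a - lr * gk for a, gk in zip(coeffs, g)]
    return coeffs

# Assumed ground truth: sin(x), whose derivative cos(x) is available exactly.
xs = [-1.0 + 0.2 * i for i in range(11)]
ys = [math.sin(x) for x in xs]
dys = [math.cos(x) for x in xs]
coeffs = train(xs, ys, dys)
```

Setting `lam=0.0` recovers ordinary value-only regression, which makes the two objectives easy to compare on the same data. With a real network, the input derivative and its parameter gradient would normally come from automatic differentiation rather than hand-written formulas.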

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Machine Learning and Data Classification