Enhancing approximation abilities of neural networks by training derivatives
V.I. Avrutskiy

TL;DR
This paper introduces a method to improve neural network approximation accuracy by incorporating derivatives into the training process, significantly enhancing precision especially for differential equations.
Contribution
It presents a novel training approach that uses target derivatives in the cost function, enabling neural networks to achieve much higher accuracy with a GPU-efficient gradient calculation.
Findings
Achieves 140-1000 times more accurate approximation with equal operations.
Enables solving differential equations with 13 times better accuracy.
Allows larger grid steps in differential equation solutions.
Abstract
A method to increase the precision of feedforward networks is proposed. It requires a prior knowledge of a target function derivatives of several orders and uses this information in gradient based training. Forward pass calculates not only the values of the output layer of a network but also their derivatives. The deviations of those derivatives from the target ones are used in an extended cost function and then backward pass calculates the gradient of the extended cost with respect to weights, which can then be used by any weights update algorithm. Despite a substantial increase in arithmetic operations per pattern (if compared to the conventional training), the extended cost allows to obtain 140--1000 times more accurate approximation for simple cases if the total number of operations is equal. This precision also happens to be out of reach for the regular cost function. The method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
