TL;DR
This paper introduces a novel discontinuous deep neural network architecture designed to handle piecewise continuous functions, with theoretical approximation guarantees and a specialized semi-supervised training method, validated on real and synthetic datasets.
Contribution
The paper proposes a new discontinuous neural network model with approximation guarantees and a tailored semi-supervised training procedure for piecewise continuous functions.
Findings
The model achieves universal approximation in the space of piecewise continuous functions.
The semi-supervised training procedure effectively trains the discontinuous neural network.
Experimental results demonstrate the model's performance on financial and synthetic datasets.
Abstract
Most stochastic gradient descent algorithms can optimize neural networks that are sub-differentiable in their parameters; however, this implies that the neural network's activation function must exhibit a degree of continuity which limits the neural network model's uniform approximation capacity to continuous functions. This paper focuses on the case where the discontinuities arise from distinct sub-patterns, each defined on different parts of the input space. We propose a new discontinuous deep neural network model trainable via a decoupled two-step procedure that avoids passing gradient updates through the network's only and strategically placed, discontinuous unit. We provide approximation guarantees for our architecture in the space of bounded continuous functions and universal approximation guarantees in the space of piecewise continuous functions which we introduced herein. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
