KKT-based optimality conditions for neural network approximation
Vinesha Peiris, Nadezda Sukhorukova, Julien Ugon

TL;DR
This paper derives necessary optimality conditions for neural network approximation in $l_1$ and max norms, using KKT conditions and convex analysis, focusing on shallow networks with one hidden layer.
Contribution
It introduces a novel approach to formulate nonsmooth neural network approximation problems as smooth constrained problems and applies KKT conditions for optimality analysis.
Findings
Provides necessary optimality conditions for neural network approximation.
Reformulates nonsmooth problems into smooth constrained optimization problems.
Uses convex analysis to express optimality conditions.
Abstract
In this paper, we obtain necessary optimality conditions for neural network approximation. We consider neural networks in Manhattan ( norm) and Chebyshev ( norm). The optimality conditions are based on neural networks with at most one hidden layer. We reformulate nonsmooth unconstrained optimisation problems as larger dimension constrained problems with smooth objective functions and constraints. Then we use KKT conditions to develop the necessary conditions and present the optimality conditions in terms of convex analysis and convex sets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
