KKT-based optimality conditions for neural network approximation

Vinesha Peiris; Nadezda Sukhorukova; Julien Ugon

arXiv:2506.17305·math.OC·June 24, 2025


TL;DR

This paper derives necessary optimality conditions for neural network approximation in $l_1$ and max norms, using KKT conditions and convex analysis, focusing on shallow networks with one hidden layer.

Contribution

It reformulates nonsmooth neural network approximation problems as smooth constrained problems in a higher dimension and applies KKT conditions to derive necessary optimality conditions.

Findings

01. Provides necessary optimality conditions for neural network approximation.

02. Reformulates nonsmooth problems into smooth constrained optimization problems.

03. Uses convex analysis to express optimality conditions.

Abstract

In this paper, we obtain necessary optimality conditions for neural network approximation. We consider neural networks in Manhattan ($l_{1}$ norm) and Chebyshev (max norm). The optimality conditions are based on neural networks with at most one hidden layer. We reformulate nonsmooth unconstrained optimisation problems as larger dimension constrained problems with smooth objective functions and constraints. Then we use KKT conditions to develop the necessary conditions and present the optimality conditions in terms of convex analysis and convex sets.
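
As an illustration of the reformulation described in the abstract, here is a minimal sketch of the standard epigraph construction for the Chebyshev case. The notation is assumed for this example and is not taken from the paper: $f$ is the target function, $t_1,\dots,t_n$ are sample points, $N(t;w)$ is the network output with parameter vector $w$, and $N$ is assumed differentiable in $w$. The nonsmooth problem $\min_w \max_i |f(t_i) - N(t_i;w)|$ can be written with one extra variable $z$ as

$$
\min_{w,\,z}\; z \quad \text{subject to} \quad f(t_i) - N(t_i;w) - z \le 0, \qquad N(t_i;w) - f(t_i) - z \le 0, \qquad i = 1,\dots,n.
$$

With multipliers $\lambda_i^{+}, \lambda_i^{-} \ge 0$ for the two constraint families, the KKT stationarity conditions in $z$ and $w$ read

$$
\sum_{i=1}^{n} \left(\lambda_i^{+} + \lambda_i^{-}\right) = 1, \qquad \sum_{i=1}^{n} \left(\lambda_i^{+} - \lambda_i^{-}\right) \nabla_w N(t_i;w) = 0,
$$

and complementary slackness allows a multiplier to be positive only at points where the deviation attains its maximum $z$. Under these assumptions, and for $z > 0$, the conditions say that the zero vector lies in the convex hull of the signed gradients $\operatorname{sign}\!\big(f(t_i) - N(t_i;w)\big)\,\nabla_w N(t_i;w)$ taken over the maximal deviation points, which is one way such conditions can be expressed in terms of convex sets. The $l_1$ case can be sketched the same way by introducing one variable $z_i$ per sample point and minimising $\sum_i z_i$ subject to $-z_i \le f(t_i) - N(t_i;w) \le z_i$.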
