Population Risk Bounds for Kolmogorov-Arnold Networks Trained by DP-SGD with Correlated Noise

Puyu Wang; Jan Schuchardt; Nikita Kalinin; Junyu Zhou; Sophie Fellenz; Christoph Lampert; Marius Kloft

arXiv:2605.12648·cs.LG·May 14, 2026

Population Risk Bounds for Kolmogorov-Arnold Networks Trained by DP-SGD with Correlated Noise

Puyu Wang, Jan Schuchardt, Nikita Kalinin, Junyu Zhou, Sophie Fellenz, Christoph Lampert, Marius Kloft

PDF

TL;DR

This paper derives the first population risk bounds for Kolmogorov-Arnold Networks trained with mini-batch SGD, including differentially private variants with correlated Gaussian noise, bridging theory and practical training methods.

Contribution

It introduces a novel analysis approach for correlated-noise DP training in non-convex neural networks, extending prior work to more realistic training scenarios.

Findings

01

Established population risk bounds for KANs with correlated noise

02

Provided sharper bounds for fixed-second-layer KANs

03

Developed new analysis techniques for correlated-noise DP training

Abstract

We establish the first population risk bounds for Kolmogorov-Arnold Networks (KANs) trained by mini-batch SGD with gradient clipping, covering non-private SGD as well as differentially private SGD (DP-SGD) with Gaussian perturbations that interpolate between independent and temporally correlated noise. This setting is substantially closer to practice than prior KAN theory along two axes: training is by mini-batch SGD, the standard recipe for modern networks, rather than full-batch gradient descent (GD); and correlated-noise mechanisms have empirically shown a more favorable privacy-utility tradeoff than independent-noise mechanisms. Our results cover the corresponding full-batch GD and independent-noise DP-GD results for KANs by Wang et al. (2026), while yielding sharper fixed-second-layer specializations. The technical core is a new analysis route for correlated-noise DP training in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.