Sprecher Networks: A Parameter-Efficient Kolmogorov-Arnold Architecture
Christian H\"agg, Kathl\'en Kohn, Giovanni Luca Marchetti, Boris Shapiro

TL;DR
Sprecher Networks are a novel, parameter-efficient architecture based on the Kolmogorov-Arnold representation, enabling scalable, deep models with linear width scaling and low memory footprint, suitable for resource-constrained environments.
Contribution
We introduce Sprecher Networks, a new spline-based architecture with linear width scaling and efficient memory use, suitable for deep, wide models in resource-limited settings.
Findings
Achieved stable training on deep residual models with 25 layers.
Demonstrated real-time digit classification on embedded devices.
Outperformed MLP and KAN baselines in regression and classification tasks.
Abstract
We introduce Sprecher Networks (SNs), a family of trainable architectures derived from David Sprecher's 1965 constructive form of the Kolmogorov-Arnold representation. Each SN block implements a "sum of shifted univariate functions" using only two shared learnable splines per block, a monotone inner spline and a general outer spline , together with a learnable shift parameter and a mixing vector shared across all output dimensions. Stacking these blocks yields deep, compositional models; for vector-valued outputs we append an additional non-summed output block. We also propose an optional lateral mixing operator enabling intra-block communication between output channels with only additional parameters. Owing to the vector (not matrix) mixing weights and spline sharing, SNs scale linearly in width, approximately…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Ferroelectric and Negative Capacitance Devices · Stochastic Gradient Optimization Techniques
