Deep Learning Alternatives of the Kolmogorov Superposition Theorem

Leonardo Ferreira Guilhoto; Paris Perdikaris

arXiv:2410.01990·cs.LG·May 18, 2026·2 cites

Deep Learning Alternatives of the Kolmogorov Superposition Theorem

Leonardo Ferreira Guilhoto, Paris Perdikaris

PDF

2 Repos 1 Video 3 Reviews

TL;DR

This paper introduces ActNet, a scalable deep learning model based on the Kolmogorov Superposition Theorem, which outperforms previous KST-based models and competes with state-of-the-art MLPs in PDE simulation tasks.

Contribution

The paper presents ActNet, a novel deep learning architecture that addresses limitations of traditional KST formulations and enhances function approximation in scientific computing.

Findings

01

ActNet outperforms Kolmogorov-Arnold Networks in benchmarks.

02

ActNet is competitive with top MLP-based approaches in PDE simulations.

03

The model demonstrates improved scalability and practical applicability.

Abstract

This paper explores alternative formulations of the Kolmogorov Superposition Theorem (KST) as a foundation for neural network design. The original KST formulation, while mathematically elegant, presents practical challenges due to its limited insight into the structure of inner and outer functions and the large number of unknown variables it introduces. Kolmogorov-Arnold Networks (KANs) leverage KST for function approximation, but they have faced scrutiny due to mixed results compared to traditional multilayer perceptrons (MLPs) and practical limitations imposed by the original KST formulation. To address these issues, we introduce ActNet, a scalable deep learning model that builds on the KST and overcomes many of the drawbacks of Kolmogorov's original formulation. We evaluate ActNet in the context of Physics-Informed Neural Networks (PINNs), a framework well-suited for leveraging KST's…

Peer Reviews

Decision·ICLR 2025 Spotlight

Reviewer 01Rating 8Confidence 3

Strengths

The paper is clear and justify its contribution, and put in context with the current state of art The paper justifies the change in the KAN architecture and its relationship. The connection with the multi-head transformer is a bit stretched tho. The paper provides some experiments to show the potential of changing the representation of the KST.

Weaknesses

The exposition is very good, the experiments show the advantage with respect to other KST architecture. There is only the evaluation in the PINN context. It would be nice to have more experiments, for example against some neural operators and training from data.

Reviewer 02Rating 6Confidence 4

Strengths

1. The authors provide a solid theoretical foundation for ActNet, detailing its universal approximation capabilities and presenting a well-motivated formulation based on KST. This paper compares the complexity of different formulations well. 2. Three 1D examples and two 2D examples are included to demonstrate that ActNet can achieve better performance. 3. This paper compares the model performance with other open-source JaxPi frameworks to enhance reproducibility and fairness.

Weaknesses

1. The selected examples primarily use sinusoidal forcing terms, including equations like Poisson, Helmholtz, and Allen-Cahn. The proposed model uses sine functions as basis functions, which justifies its improved performance over the spline basis functions used in KAN. In Figure 12, we can see that SIREN can have the best performance. Therefore, comprehensive benchmarks should be included for challenging 2D and 3D problems, including the Navier–Stokes equations and turbulence cases. 2. As this

Reviewer 03Rating 8Confidence 5

Strengths

1. **Originality**: The paper brings a fresh perspective to neural network design by leveraging alternative formulations of the Kolmogorov Superposition Theorem (KST). Introducing ActNet, based on Laczkovich's theorem, reflects a novel approach to overcoming the limitations of Kolmogorov-Arnold Networks (KANs), making KST more applicable to practical deep learning tasks, particularly within Physics-Informed Neural Networks (PINNs). 2. **Quality**: The research is thorough, with ActNet being tes

Weaknesses

1. Although ActNet performs well on PINNs, its comparisons are limited to specific benchmarks and do not consistently compare against the latest models for PINNs. 2. The paper lacks detailed ablation studies on critical design choices within ActNet, such as the basis functions used or the impact of ActLayer depth. Including these analyses would clarify the sensitivity of ActNet’s performance to these parameters and help guide future implementations or adaptations of the model. 3. While the pap

Code & Models

Repositories

Videos

Deep Learning Alternatives Of The Kolmogorov Superposition Theorem· slideslive

Taxonomy

TopicsAdvanced Data Processing Techniques