DropKAN: Regularizing KANs by masking post-activations

Mohammed Ghaith Altarabichi

arXiv:2407.13044·cs.LG·August 21, 2024·2 cites

DropKAN: Regularizing KANs by masking post-activations

Mohammed Ghaith Altarabichi

PDF

Open Access 1 Repo

TL;DR

DropKAN introduces a simple regularization technique for Kolmogorov-Arnold Networks by masking activations, which improves their generalization performance compared to standard Dropout.

Contribution

This paper presents DropKAN, a novel regularization method that embeds dropout masks within KAN layers, enhancing their generalization ability.

Findings

01

DropKAN outperforms standard Dropout in KANs on real datasets.

02

Embedding masks within KAN layers improves regularization effectiveness.

03

DropKAN is easy to implement with minimal coding effort.

Abstract

We propose DropKAN (Dropout Kolmogorov-Arnold Networks) a regularization method that prevents co-adaptation of activation function weights in Kolmogorov-Arnold Networks (KANs). DropKAN functions by embedding the drop mask directly within the KAN layer, randomly masking the outputs of some activations within the KANs' computation graph. We show that this simple procedure that require minimal coding effort has a regularizing effect and consistently lead to better generalization of KANs. We analyze the adaptation of the standard Dropout with KANs and demonstrate that Dropout applied to KANs' neurons can lead to unpredictable behavior in the feedforward pass. We carry an empirical study with real world Machine Learning datasets to validate our findings. Our results suggest that DropKAN is consistently a better alternative to using standard Dropout with KANs, and improves the generalization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ghaith81/dropkan
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition · Handwritten Text Recognition Techniques

MethodsDropout