A Gated Residual Kolmogorov-Arnold Networks for Mixtures of Experts

Hugo Inzirillo; Remi Genet

arXiv:2409.15161 · cs.LG · December 16, 2024 · 2 cites

TL;DR

This paper presents KAMoE, a Mixture of Experts (MoE) framework that uses Gated Residual Kolmogorov-Arnold Networks (GRKAN) as its gating function, aiming to improve efficiency and interpretability. In experiments on digital asset markets and real estate valuation, KAMoE consistently outperforms traditional MoE architectures.

Contribution

Introduces GRKAN as an alternative to the traditional MoE gating function, aiming to enhance efficiency and interpretability.

Findings

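01 KAMoE consistently outperforms traditional MoE architectures across various tasks and model types.

02 GRKAN outperforms standard Gating Residual Networks, particularly in LSTM-based models for sequential tasks.

03 The paper provides insights into the trade-offs between model complexity and performance gains in MoE and KAMoE architectures.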

Abstract

This paper introduces KAMoE, a novel Mixture of Experts (MoE) framework based on Gated Residual Kolmogorov-Arnold Networks (GRKAN). We propose GRKAN as an alternative to the traditional gating function, aiming to enhance efficiency and interpretability in MoE modeling. Through extensive experiments on digital asset markets and real estate valuation, we demonstrate that KAMoE consistently outperforms traditional MoE architectures across various tasks and model types. Our results show that GRKAN exhibits superior performance compared to standard Gating Residual Networks, particularly in LSTM-based models for sequential tasks. We also provide insights into the trade-offs between model complexity and performance gains in MoE and KAMoE architectures.
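
For readers new to the setup, the role a gating function plays in an MoE can be sketched in a few lines. This is a generic illustration under stated assumptions, not the authors' implementation: all names (`softmax`, `linear_gate`, `mixture_of_experts`, the toy experts) are hypothetical, and the gate is passed in as a callable so that a learned network such as GRKAN could take the place of the plain linear gate used here. The internals of GRKAN are not described on this page and are not reproduced.

```python
import math
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]


def softmax(logits: Sequence[float]) -> List[float]:
    """Turn raw gate scores into mixture weights that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def linear_gate(weights: Sequence[Vector],
                biases: Sequence[float]) -> Callable[[Vector], List[float]]:
    """Traditional gate (assumed here): one linear score per expert."""
    def gate(x: Vector) -> List[float]:
        return [sum(w * xi for w, xi in zip(row, x)) + b
                for row, b in zip(weights, biases)]
    return gate


def mixture_of_experts(x: Vector,
                       experts: Sequence[Callable[[Vector], float]],
                       gate: Callable[[Vector], List[float]]
                       ) -> Tuple[float, List[float]]:
    """Weighted sum of expert outputs; the weights come from the gate."""
    probs = softmax(gate(x))
    outputs = [expert(x) for expert in experts]
    y = sum(p * out for p, out in zip(probs, outputs))
    return y, probs


# Two toy experts (hypothetical): one sums the input, one negates the sum.
experts = [lambda x: sum(x), lambda x: -sum(x)]

# A gate with all-zero parameters weights both experts equally.
uniform_gate = linear_gate(weights=[[0.0, 0.0], [0.0, 0.0]], biases=[0.0, 0.0])

# A gate that strongly prefers the first expert when x[0] is large.
skewed_gate = linear_gate(weights=[[1.0, 0.0], [0.0, 0.0]], biases=[0.0, 0.0])

y_uniform, p_uniform = mixture_of_experts([1.0, 2.0], experts, uniform_gate)
y_skewed, p_skewed = mixture_of_experts([10.0, 0.0], experts, skewed_gate)
```

The gate is the only component that changes between a traditional MoE and a variant with a different gating network, so keeping it as a separate callable makes the comparison described in the abstract easy to picture: the experts and the weighted sum stay fixed while the gate is swapped.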

Code & Models

Repositories

remigenet/kamoe · pytorch · Official

Taxonomy

Topics: Advanced Bandit Algorithms Research · Financial Markets and Investment Strategies · Stochastic Gradient Optimization Techniques

Methods: Mixture of Experts