Interpretable Reinforcement Learning for Load Balancing using Kolmogorov-Arnold Networks

Kamal Singh; Sami Marouani; Ahmad Al Sheikh; Pham Tran Anh Quang; Amaury Habrard

arXiv:2505.14459·cs.LG·May 21, 2025

Interpretable Reinforcement Learning for Load Balancing using Kolmogorov-Arnold Networks

Kamal Singh, Sami Marouani, Ahmad Al Sheikh, Pham Tran Anh Quang, Amaury Habrard

PDF

Open Access

TL;DR

This paper introduces an interpretable reinforcement learning approach for load balancing in networks using Kolmogorov-Arnold Networks, enabling extraction of controller equations and better understanding of decision processes.

Contribution

The paper proposes using Kolmogorov-Arnold Networks within RL to achieve interpretability in network load balancing policies, which is a novel approach.

Findings

01

Effective load balancing policies learned with interpretability.

02

Improved network throughput and reduced delay.

03

Ability to extract explicit controller equations.

Abstract

Reinforcement learning (RL) has been increasingly applied to network control problems, such as load balancing. However, existing RL approaches often suffer from lack of interpretability and difficulty in extracting controller equations. In this paper, we propose the use of Kolmogorov-Arnold Networks (KAN) for interpretable RL in network control. We employ a PPO agent with a 1-layer actor KAN model and an MLP Critic network to learn load balancing policies that maximise throughput utility, minimize loss as well as delay. Our approach allows us to extract controller equations from the learned neural networks, providing insights into the decision-making process. We evaluate our approach using different reward functions demonstrating its effectiveness in improving network performance while providing interpretable policies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsEntropy Regularization · + ( 1 ) ⟷ 805 ⟷ ( 330 ) ⟷ 4056|How do I file a complaint with Expedia? · Proximal Policy Optimization