Kolmogorov-Arnold Network for Online Reinforcement Learning

Victor Augusto Kich; Jair Augusto Bottega; Raul Steinmetz; Ricardo; Bedin Grando; Ayano Yorozu; Akihisa Ohya

arXiv:2408.04841·cs.LG·September 4, 2024

Kolmogorov-Arnold Network for Online Reinforcement Learning

Victor Augusto Kich, Jair Augusto Bottega, Raul Steinmetz, Ricardo, Bedin Grando, Ayano Yorozu, Akihisa Ohya

PDF

Open Access 1 Repo

TL;DR

This paper investigates the integration of Kolmogorov-Arnold Networks into the PPO reinforcement learning framework, demonstrating comparable performance to traditional MLPs with fewer parameters on robotics benchmarks.

Contribution

It introduces the use of KANs as function approximators in PPO, showing their efficiency and potential advantages over MLPs in reinforcement learning.

Findings

01

KAN-based PPO achieves similar performance to MLP-based PPO.

02

KANs require fewer parameters, reducing memory usage.

03

The approach is effective on DeepMind Control Proprio Robotics benchmark.

Abstract

Kolmogorov-Arnold Networks (KANs) have shown potential as an alternative to Multi-Layer Perceptrons (MLPs) in neural networks, providing universal function approximation with fewer parameters and reduced memory usage. In this paper, we explore the use of KANs as function approximators within the Proximal Policy Optimization (PPO) algorithm. We evaluate this approach by comparing its performance to the original MLP-based PPO using the DeepMind Control Proprio Robotics benchmark. Our results indicate that the KAN-based reinforcement learning algorithm can achieve comparable performance to its MLP-based counterpart, often with fewer parameters. These findings suggest that KANs may offer a more efficient option for reinforcement learning models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

victorkich/kolmogorov-ppo
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsEntropy Regularization · Proximal Policy Optimization