Agile Reinforcement Learning through Separable Neural Architecture
Rajib Mostakim, Reza T. Batley, Sourav Saha

TL;DR
This paper introduces SPAN, a spline-based neural architecture for reinforcement learning that improves sample efficiency and success rates over traditional MLPs, especially in resource-constrained environments.
Contribution
The work proposes SPAN, a novel separable neural architecture with learnable preprocessing, enhancing parameter and computational efficiency in RL function approximation.
Findings
SPAN achieves 30-50% better sample efficiency.
SPAN outperforms MLPs with 1.3-9x higher success rates.
SPAN shows robustness to hyperparameter variations.
Abstract
Deep reinforcement learning (RL) is increasingly deployed in resource-constrained environments, yet the go-to function approximators - multilayer perceptrons (MLPs) - are often parameter-inefficient due to an imperfect inductive bias for the smooth structure of many value functions. This mismatch can also hinder sample efficiency and slow policy learning in this capacity-limited regime. Although model compression techniques exist, they operate post-hoc and do not improve learning efficiency. Recent spline-based separable architectures - such as Kolmogorov-Arnold Networks (KANs) - have been shown to offer parameter efficiency but are widely reported to exhibit significant computational overhead, especially at scale. In seeking to address these limitations, this work introduces SPAN (SPline-based Adaptive Networks), a novel function approximation approach to RL. SPAN adapts the low rank…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Model Reduction and Neural Networks · Neural Networks and Reservoir Computing
