Discovering Continuous-Time Memory-Based Symbolic Policies using Genetic Programming
Sigur de Vries, Sander Keemink, Marcel van Gerven

TL;DR
This paper introduces a method to evolve interpretable, memory-augmented symbolic policies for control tasks using genetic programming, enhancing transparency and performance in complex environments.
Contribution
The paper presents a novel approach combining continuous-time memory with symbolic policies optimized via genetic programming for improved control.
Findings
Memory-augmented symbolic policies outperform memory-less ones.
Memory improves performance in volatile environments.
Policies are interpretable and transparent.
Abstract
Artificial intelligence techniques are increasingly being applied to solve control problems, but often rely on black-box methods without transparent output generation. To improve the interpretability and transparency in control systems, models can be defined as white-box symbolic policies described by mathematical expressions. For better performance in partially observable and volatile environments, the symbolic policies are extended with memory represented by continuous-time latent variables, governed by differential equations. Genetic programming is used for optimisation, resulting in interpretable policies consisting of symbolic expressions. Our results show that symbolic policies with memory compare with black-box policies on a variety of control tasks. Furthermore, the benefit of the memory in symbolic policies is demonstrated on experiments where memory-less policies fall short.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Reinforcement Learning in Robotics · Evolution and Genetic Dynamics
MethodsFocus
