Leveraging Genetic Algorithms for Efficient Demonstration Generation in Real-World Reinforcement Learning Environments

Tom Maus; Asma Atamna; Tobias Glasmachers

arXiv:2507.00762·cs.LG·July 3, 2025

Leveraging Genetic Algorithms for Efficient Demonstration Generation in Real-World Reinforcement Learning Environments

Tom Maus, Asma Atamna, Tobias Glasmachers

PDF

Open Access

TL;DR

This paper explores using Genetic Algorithms to generate expert demonstrations that enhance reinforcement learning in industrial environments, leading to faster convergence and improved performance.

Contribution

It introduces a novel hybrid approach combining GAs with RL, demonstrating improved training efficiency and performance in a real-world inspired setting.

Findings

01

GA-generated demonstrations improve RL performance

02

PPO agents with GA data achieve higher rewards

03

Hybrid methods accelerate training convergence

Abstract

Reinforcement Learning (RL) has demonstrated significant potential in certain real-world industrial applications, yet its broader deployment remains limited by inherent challenges such as sample inefficiency and unstable learning dynamics. This study investigates the utilization of Genetic Algorithms (GAs) as a mechanism for improving RL performance in an industrially inspired sorting environment. We propose a novel approach in which GA-generated expert demonstrations are used to enhance policy learning. These demonstrations are incorporated into a Deep Q-Network (DQN) replay buffer for experience-based learning and utilized as warm-start trajectories for Proximal Policy Optimization (PPO) agents to accelerate training convergence. Our experiments compare standard RL training with rule-based heuristics, brute-force optimization, and demonstration data, revealing that GA-derived…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Advanced Multi-Objective Optimization Algorithms