CosmoCore-Evo: Evolutionary Dream-Replay Reinforcement Learning for Adaptive Code Generation
Santhosh Kumar Ravindran

TL;DR
CosmoCore-Evo enhances adaptive code generation by integrating evolutionary algorithms with reinforcement learning, enabling better handling of environment shifts and fostering emergent behaviors.
Contribution
It introduces an evolutionary extension to the CosmoCore framework, incorporating mutation and selection in RL trajectories for improved adaptability and novelty.
Findings
Achieves up to 35% higher solution novelty
Faster adaptation by 25% in benchmark tasks
Efficacy confirmed through ablation studies
Abstract
Building on the affective dream-replay reinforcement learning framework of CosmoCore, we introduce CosmoCore-Evo, an extension that incorporates evolutionary algorithms to enhance adaptability and novelty in code generation tasks. Inspired by anthropological aspects of human evolution, such as natural selection and adaptation in early hominids, CosmoCore-Evo treats RL trajectories as ``genomes'' that undergo mutation and selection during the nocturnal replay phase. This mechanism allows agents to break free from trained patterns, fostering emergent behaviors and improved performance in distribution-shifted environments, such as changing APIs or novel libraries. We augment the Dream Queue with evolutionary operations, including mutation of high-fitness trajectories and enterprise-tuned fitness functions that incorporate efficiency, compliance, and scalability metrics. Evaluated on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Artificial Intelligence in Games · Reinforcement Learning in Robotics
