CosmoCore-Evo: Evolutionary Dream-Replay Reinforcement Learning for Adaptive Code Generation

Published: December 20, 2025 | arXiv ID: 2512.21351v1

By: Santhosh Kumar Ravindran

BigTech Affiliations: Microsoft

Potential Business Impact:

Helps computers learn to create new, better code.

Business Areas:

Simulation Software

Building on the affective dream-replay reinforcement learning framework of CosmoCore, we introduce CosmoCore-Evo, an extension that incorporates evolutionary algorithms to enhance adaptability and novelty in code generation tasks. Inspired by anthropological aspects of human evolution, such as natural selection and adaptation in early hominids, CosmoCore-Evo treats RL trajectories as "genomes" that undergo mutation and selection during the nocturnal replay phase. This mechanism allows agents to break free from trained patterns, fostering emergent behaviors and improved performance in distribution-shifted environments, such as changing APIs or novel libraries. We augment the Dream Queue with evolutionary operations, including mutation of high-fitness trajectories and enterprise-tuned fitness functions that incorporate efficiency, compliance, and scalability metrics. Evaluated on extended benchmarks including HumanEval variants with shifts, BigCodeBench, and a custom PySpark pipeline simulation, CosmoCore-Evo achieves up to 35% higher novelty in solutions and 25% faster adaptation compared to the original CosmoCore and baselines like PPO and REAMER. Ablations confirm the role of evolutionary components in bridging the sentient gap for LLM agents. Code for replication, including a toy simulation, is provided.
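The mutation-and-selection step described in the abstract can be illustrated with a small toy. The sketch below is not the paper's code: the action vocabulary, the `TARGET` sequence, the three metric formulas, the weights, and every function name are hypothetical stand-ins chosen only to show the general shape of "select high-fitness trajectories, mutate them, keep the best" over a fixed-size queue.

```python
import random

# Hypothetical toy setup: a trajectory ("genome") is a tuple of discrete action ids.
ACTIONS = list(range(5))
TARGET = (1, 3, 0, 2, 4, 1)  # invented reference sequence used by the toy metrics


def evaluate(genome):
    """Toy stand-ins for efficiency, compliance and scalability, each in [0, 1]."""
    efficiency = sum(a == b for a, b in zip(genome, TARGET)) / len(TARGET)
    compliance = 0.0 if 4 in genome[:2] else 1.0  # pretend action 4 is disallowed early
    scalability = 1.0 - genome.count(0) / len(genome)  # pretend action 0 is costly
    return efficiency, compliance, scalability


def fitness(genome, weights=(0.6, 0.2, 0.2)):
    """Weighted sum of the three toy metrics (weights are an assumption)."""
    return sum(w * m for w, m in zip(weights, evaluate(genome)))


def mutate(genome, rate, rng):
    """Resample each action independently with probability `rate`."""
    return tuple(rng.choice(ACTIONS) if rng.random() < rate else a for a in genome)


def evolve_dream_queue(queue, rng, n_elite=4, rate=0.2):
    """One replay step: mutate the fittest trajectories, then keep the best overall.

    Parents stay in the candidate pool, so the best fitness never decreases.
    """
    elite = sorted(queue, key=fitness, reverse=True)[:n_elite]
    per_parent = max(1, len(queue) // max(1, len(elite)))
    children = [mutate(g, rate, rng) for g in elite for _ in range(per_parent)]
    pool = elite + children
    return sorted(pool, key=fitness, reverse=True)[:len(queue)]
```

To try it, seed a generator with `random.Random(0)`, fill a queue with eight random tuples of length `len(TARGET)`, and call `evolve_dream_queue` repeatedly. Keeping the parents in the pool is a deliberate simplification (plain elitism); the paper may use a different selection scheme.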

CosmoCore Affective Dream-Replay Reinforcement Learning for Code Generation

Software Engineering

Makes AI write better code by learning from mistakes.

20 Oct 2025 1

88%

CORE: Code-based Inverse Self-Training Framework with Graph Expansion for Virtual Agents

Machine Learning (CS)

Teaches robots to learn new tasks by watching.

5 Jan 2026 0

87%

CodeEvo: Interaction-Driven Synthesis of Code-centric Data through Hybrid and Iterative Feedback

Software Engineering

Teaches computers to write better code.

25 Jul 2025 1

Country of Origin

🇺🇸 United States

Page Count

12 pages
