TL;DR
The paper introduces ARES-LSHADE, a memetic differential evolution algorithm enhanced with LLM-driven research and a Polish phase, achieving near-perfect results on the GNBG benchmark.
Contribution
It presents a novel LLM-driven research loop and a memetic algorithm with adaptive components, advancing the state-of-the-art in benchmark optimization.
Findings
Achieved 510 wins out of 744 on GNBG functions.
Reached machine precision on 18 of 24 functions.
Identified the hardest functions with characteristic plateau signatures.
Abstract
We present ARES-LSHADE, a memetic differential-evolution variant submitted to the GECCO 2026 competition on LLM-designed evolutionary algorithms for the Generalized Numerical Benchmark Generator (GNBG). The algorithm builds on the LLM-LSHADE 2025 winner, contributing two new components: (a) a scout-augmented mutation operator with adaptive CMA-ES integration, produced by an autonomous research loop across approximately thirty LLM-driven design experiments, and (b) a multi-start L-BFGS-B polish phase that respects strict blackbox treatment of the benchmark. On the official 31-run-per-function evaluation with the competition-specified function-evaluation budgets, ARES-LSHADE obtains 510 of 744 wins (per-function gap below 1e-8), reaching machine precision on 18 of 24 functions. The remaining six functions exhibit characteristic plateau signatures consistent with GNBG's compositional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
