# Diagonal Acceleration for Covariance Matrix Adaptation Evolution   Strategies

**Authors:** Youhei Akimoto, Nikolaus Hansen

arXiv: 1905.05885 · 2019-05-16

## TL;DR

This paper introduces dd-CMA, an acceleration technique for CMA-ES that uses adaptive diagonal decoding to improve performance on both separable and non-separable problems, especially in high dimensions.

## Contribution

The paper presents a novel diagonal acceleration method for CMA-ES, enhancing its speed and scalability without sacrificing performance on complex, non-separable functions.

## Key findings

- dd-CMA-ES outperforms original CMA-ES on ill-conditioned functions
- Significant speedup observed in high-dimensional optimization
- Effective even with large population sizes up to dimension squared

## Abstract

We introduce an acceleration for covariance matrix adaptation evolution strategies (CMA-ES) by means of adaptive diagonal decoding (dd-CMA). This diagonal acceleration endows the default CMA-ES with the advantages of separable CMA-ES without inheriting its drawbacks. Technically, we introduce a diagonal matrix D that expresses coordinate-wise variances of the sampling distribution in DCD form. The diagonal matrix can learn a rescaling of the problem in the coordinates within linear number of function evaluations. Diagonal decoding can also exploit separability of the problem, but, crucially, does not compromise the performance on non-separable problems. The latter is accomplished by modulating the learning rate for the diagonal matrix based on the condition number of the underlying correlation matrix. dd-CMA-ES not only combines the advantages of default and separable CMA-ES, but may achieve overadditive speedup: it improves the performance, and even the scaling, of the better of default and separable CMA-ES on classes of non-separable test functions that reflect, arguably, a landscape feature commonly observed in practice.   The paper makes two further secondary contributions: we introduce two different approaches to guarantee positive definiteness of the covariance matrix with active CMA, which is valuable in particular with large population size; we revise the default parameter setting in CMA-ES, proposing accelerated settings in particular for large dimension.   All our contributions can be viewed as independent improvements of CMA-ES, yet they are also complementary and can be seamlessly combined. In numerical experiments with dd-CMA-ES up to dimension 5120, we observe remarkable improvements over the original covariance matrix adaptation on functions with coordinate-wise ill-conditioning. The improvement is observed also for large population sizes up to about dimension squared.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.05885/full.md

## Figures

48 figures with captions in the complete paper: https://tomesphere.com/paper/1905.05885/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/1905.05885/full.md

---
Source: https://tomesphere.com/paper/1905.05885