Large Language Model-Based Benchmarking Experiment Settings for Evolutionary Multi-Objective Optimization

Lie Meng Pang; Hisao Ishibuchi

arXiv:2502.21108·cs.NE·March 3, 2025

TL;DR

This paper investigates how large language models (LLMs) design benchmarking experiments for evolutionary multi-objective optimization (EMO) algorithms, and finds that they tend to suggest classical settings such as standard test problems and performance metrics.

Contribution

It is the first study to analyze the implicit assumptions made by LLMs when designing EMO benchmarking experiments, highlighting their tendency to favor traditional configurations.

Findings

01

LLMs often recommend classical benchmark problems such as ZDT, DTLZ, and WFG.

02

Standard performance metrics such as hypervolume (HV) and inverted generational distance (IGD) are commonly suggested by LLMs.

03

Experiments designed by LLMs align with conventional EMO evaluation practices.

Abstract

When we manually design an evolutionary optimization algorithm, we implicitly or explicitly assume a set of target optimization problems. In the case of automated algorithm design, target optimization problems are usually explicitly shown. Recently, the use of large language models (LLMs) for the design of evolutionary multi-objective optimization (EMO) algorithms has been examined in some studies. In those studies, target multi-objective problems are not always explicitly shown. It is well known in the EMO community that the performance evaluation results of EMO algorithms depend not only on test problems but also on many other factors such as performance indicators, reference point, termination condition, and population size. Thus, it is likely that the EMO algorithms designed by LLMs depend on those factors. In this paper, we try to examine the implicit assumption about the…
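To illustrate why a factor such as the reference point matters, the following minimal Python sketch computes the hypervolume (HV) of two small solution sets on a two-objective minimization problem under two different reference points. The function name `hypervolume_2d`, the two point sets, and the reference points are illustrative assumptions chosen for this example; they are not settings taken from the paper.

```python
from typing import Iterable, Tuple

Point = Tuple[float, float]


def hypervolume_2d(points: Iterable[Point], ref: Point) -> float:
    """Hypervolume of a 2-objective minimization set w.r.t. a reference point.

    Hypothetical helper for illustration only: it sweeps the points in
    ascending order of the first objective and sums the rectangular strips
    that each non-dominated point adds below the previous best f2 value.
    """
    # Only points that are strictly better than the reference point contribute.
    pts = sorted(p for p in points if p[0] < ref[0] and p[1] < ref[1])
    hv = 0.0
    best_f2 = ref[1]
    for f1, f2 in pts:
        if f2 < best_f2:  # dominated points add no new area and are skipped
            hv += (ref[0] - f1) * (best_f2 - f2)
            best_f2 = f2
    return hv


# Two illustrative sets on the linear front f1 + f2 = 1 (assumed data).
set_a = [(0.25, 0.75), (0.5, 0.5), (0.75, 0.25)]  # inner points only
set_b = [(0.0, 1.0), (0.5, 0.5), (1.0, 0.0)]      # includes both extremes

for ref in [(1.1, 1.1), (2.0, 2.0)]:
    hv_a = hypervolume_2d(set_a, ref)
    hv_b = hypervolume_2d(set_b, ref)
    better = "A" if hv_a > hv_b else "B"
    print(f"ref={ref}: HV(A)={hv_a:.4f}, HV(B)={hv_b:.4f}, better set: {better}")
```

With these illustrative sets, set A has the larger HV under the reference point (1.1, 1.1), while set B has the larger HV under (2.0, 2.0). The comparison result therefore changes with the reference point alone, which is one reason an unstated benchmarking setting can influence which algorithm appears to perform better.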

Taxonomy

Methods: Sparse Evolutionary Training