Exploring the True Potential: Evaluating the Black-box Optimization   Capability of Large Language Models

Beichen Huang; Xingyu Wu; Yu Zhou; Jibin Wu; Liang Feng; Ran Cheng,; Kay Chen Tan

arXiv:2404.06290·cs.NE·July 9, 2024·3 cites

Exploring the True Potential: Evaluating the Black-box Optimization Capability of Large Language Models

Beichen Huang, Xingyu Wu, Yu Zhou, Jibin Wu, Liang Feng, Ran Cheng,, Kay Chen Tan

PDF

Open Access

TL;DR

This paper systematically evaluates large language models' capabilities in black-box optimization, revealing their limitations in numerical tasks but potential in non-numerical and heuristic-based problems.

Contribution

It provides the first comprehensive analysis of LLMs in numerical and non-numerical optimization, highlighting their strengths and weaknesses.

Findings

01

LLMs perform poorly on pure numerical tasks due to domain mismatch.

02

LLMs can solve non-numerical problems and leverage heuristics from prompts.

03

This work offers new insights into LLMs' role in diverse optimization scenarios.

Abstract

Large language models (LLMs) have demonstrated exceptional performance not only in natural language processing tasks but also in a great variety of non-linguistic domains. In diverse optimization scenarios, there is also a rising trend of applying LLMs. However, whether the application of LLMs in the black-box optimization problems is genuinely beneficial remains unexplored. This paper endeavors to offer deep insights into the potential of LLMs in optimization through a comprehensive investigation, which covers both discrete and continuous optimization problems to assess the efficacy and distinctive characteristics that LLMs bring to this field. Our findings reveal both the limitations and advantages of LLMs in optimization. Specifically, on the one hand, despite the significant power consumed for running the models, LLMs exhibit subpar performance in pure numerical tasks, primarily due…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling