StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving

Chang Gao; Haiyun Jiang; Deng Cai; Shuming Shi; Wai Lam

arXiv:2311.08803 · cs.CL · November 12, 2024 · 1 cites

TL;DR

StrategyLLM introduces a multi-agent framework enabling large language models to generate, evaluate, and optimize problem-solving strategies, significantly improving performance and consistency across diverse reasoning tasks without human input.

Contribution

This work presents a multi-agent approach in which four LLM-based agents (strategy generator, executor, optimizer, and evaluator) generate, evaluate, and select strategies for a task, improving the generalizability and consistency of few-shot prompts for LLMs.

Findings

01 Outperforms baseline CoT-SC on 13 datasets across 4 tasks.

02 Achieves notable accuracy improvements in math, reasoning, and symbolic tasks.

03 Demonstrates applicability to various LLMs and scenarios.

Abstract

Most existing prompting methods suffer from the issues of generalizability and consistency, as they often rely on instance-specific solutions that may not be applicable to other instances and lack task-level consistency across the selected few-shot examples. To address these limitations, we propose a comprehensive framework, StrategyLLM, allowing LLMs to perform inductive reasoning, deriving general strategies from specific task instances, and deductive reasoning, applying these general strategies to particular task examples, for constructing generalizable and consistent few-shot prompts. It employs four LLM-based agents: strategy generator, executor, optimizer, and evaluator, working together to generate, evaluate, and select promising strategies for a given task. Experimental results demonstrate that StrategyLLM outperforms the competitive baseline CoT-SC that requires human-annotated…
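The interplay of the four agents described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the function name `select_strategies`, the callable signatures, the accuracy `threshold`, the `max_rounds` limit, and the toy arithmetic task that stands in for real LLM calls are all hypothetical. The sketch only shows the general shape of the loop: derive candidate strategies from examples, apply each one to those examples, revise the ones that fall short, and keep the best-scoring ones.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Example = Tuple[str, str]  # (question, gold answer)


@dataclass
class Candidate:
    strategy: str
    accuracy: float


def select_strategies(
    examples: List[Example],
    generate: Callable[[List[Example]], List[str]],   # hypothetical generator agent
    execute: Callable[[str, str], str],               # hypothetical executor agent
    optimize: Callable[[str, List[Example]], str],    # hypothetical optimizer agent
    threshold: float = 0.75,   # assumed cutoff, not taken from the paper
    max_rounds: int = 2,       # assumed number of optimization rounds
    top_k: int = 1,
) -> List[Candidate]:
    def accuracy(strategy: str) -> float:
        # Deductive step: apply the general strategy to each specific example.
        hits = sum(execute(strategy, q) == a for q, a in examples)
        return hits / len(examples)

    pending = generate(examples)  # inductive step: examples -> general strategies
    kept: List[Candidate] = []
    for round_idx in range(max_rounds + 1):
        failed = []
        for strategy in pending:
            acc = accuracy(strategy)
            if acc >= threshold:
                kept.append(Candidate(strategy, acc))
            else:
                failed.append(strategy)
        if not failed or round_idx == max_rounds:
            break
        # Give unqualified strategies another chance after revision.
        pending = [optimize(s, examples) for s in failed]

    # Evaluator role (simplified here): rank qualified strategies by accuracy.
    return sorted(kept, key=lambda c: c.accuracy, reverse=True)[:top_k]


# Toy stand-ins for LLM calls: a "strategy" is just a label that picks a rule.
def toy_generate(examples: List[Example]) -> List[str]:
    return ["concat", "add"]


def toy_execute(strategy: str, question: str) -> str:
    a, b = question.split("+")
    return str(int(a) + int(b)) if strategy == "add" else a + b


def toy_optimize(strategy: str, examples: List[Example]) -> str:
    return "add"


examples = [("2+3", "5"), ("10+4", "14")]
best = select_strategies(examples, toy_generate, toy_execute, toy_optimize)
print(best[0].strategy, best[0].accuracy)
```

In a real setting each callable would wrap a prompted LLM, and the final ranking would more plausibly use held-out examples than the ones used to derive the strategies; the sketch reuses a single example set only to stay short.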

Code & Models

Repositories

gao-xiao-bai/strategyllm · Official

Videos

StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving · slideslive

Taxonomy

Topics: Topic Modeling · Advanced Text Analysis Techniques · AI in Service Interactions