StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving
Chang Gao, Haiyun Jiang, Deng Cai, Shuming Shi, Wai Lam

TL;DR
StrategyLLM introduces a multi-agent framework enabling large language models to generate, evaluate, and optimize problem-solving strategies, significantly improving performance and consistency across diverse reasoning tasks without human input.
Contribution
This work presents a novel multi-agent approach with strategy generation, evaluation, and optimization modules, enhancing generalizability and consistency in few-shot prompting for LLMs.
Findings
Outperforms baseline CoT-SC on 13 datasets across 4 tasks.
Achieves notable accuracy improvements in math, reasoning, and symbolic tasks.
Demonstrates applicability to various LLMs and scenarios.
Abstract
Most existing prompting methods suffer from the issues of generalizability and consistency, as they often rely on instance-specific solutions that may not be applicable to other instances and lack task-level consistency across the selected few-shot examples. To address these limitations, we propose a comprehensive framework, StrategyLLM, allowing LLMs to perform inductive reasoning, deriving general strategies from specific task instances, and deductive reasoning, applying these general strategies to particular task examples, for constructing generalizable and consistent few-shot prompts. It employs four LLM-based agents: strategy generator, executor, optimizer, and evaluator, working together to generate, evaluate, and select promising strategies for a given task. Experimental results demonstrate that StrategyLLM outperforms the competitive baseline CoT-SC that requires human-annotated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · AI in Service Interactions
