Auto-Evolve: Enhancing Large Language Model's Performance via Self-Reasoning Framework
Krishna Aswani, Huilin Lu, Pranav Patankar, Priya Dhalwani, Iris Tan,, Jayant Ganeshmohan, Simon Lacasse

TL;DR
Auto-Evolve is a novel framework that enables large language models to self-create and refine reasoning modules dynamically, significantly improving their problem-solving performance across diverse tasks without relying on static prompts.
Contribution
The paper introduces Auto-Evolve, a framework allowing LLMs to generate and refine reasoning modules autonomously, surpassing state-of-the-art prompt strategies in performance.
Findings
Auto-Evolve outperforms CoT by up to 10.4%.
It achieves an average improvement of 7% across multiple models.
Iterative refinement boosts performance by 2.8% on average.
Abstract
Recent advancements in prompt engineering strategies, such as Chain-of-Thought (CoT) and Self-Discover, have demonstrated significant potential in improving the reasoning abilities of Large Language Models (LLMs). However, these state-of-the-art (SOTA) prompting strategies rely on single or fixed set of static seed reasoning modules like "think step by step" or "break down this problem" intended to simulate human approach to problem-solving. This constraint limits the flexibility of models in tackling diverse problems effectively. In this paper, we introduce Auto-Evolve, a novel framework that enables LLMs to self-create dynamic reasoning modules and downstream action plan, resulting in significant improvements over current SOTA methods. We evaluate Auto-Evolve on the challenging BigBench-Hard (BBH) dataset with Claude 2.0, Claude 3 Sonnet, Mistral Large, and GPT 4, where it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Sparse Evolutionary Training · Linear Layer · Cosine Annealing · Dropout · Byte Pair Encoding · Softmax · Multi-Head Attention · Layer Normalization
