Auto-Evolve: Enhancing Large Language Model's Performance via   Self-Reasoning Framework

Krishna Aswani; Huilin Lu; Pranav Patankar; Priya Dhalwani; Iris Tan,; Jayant Ganeshmohan; Simon Lacasse

arXiv:2410.06328·cs.CL·October 15, 2024

Auto-Evolve: Enhancing Large Language Model's Performance via Self-Reasoning Framework

Krishna Aswani, Huilin Lu, Pranav Patankar, Priya Dhalwani, Iris Tan,, Jayant Ganeshmohan, Simon Lacasse

PDF

Open Access

TL;DR

Auto-Evolve is a novel framework that enables large language models to self-create and refine reasoning modules dynamically, significantly improving their problem-solving performance across diverse tasks without relying on static prompts.

Contribution

The paper introduces Auto-Evolve, a framework allowing LLMs to generate and refine reasoning modules autonomously, surpassing state-of-the-art prompt strategies in performance.

Findings

01

Auto-Evolve outperforms CoT by up to 10.4%.

02

It achieves an average improvement of 7% across multiple models.

03

Iterative refinement boosts performance by 2.8% on average.

Abstract

Recent advancements in prompt engineering strategies, such as Chain-of-Thought (CoT) and Self-Discover, have demonstrated significant potential in improving the reasoning abilities of Large Language Models (LLMs). However, these state-of-the-art (SOTA) prompting strategies rely on single or fixed set of static seed reasoning modules like "think step by step" or "break down this problem" intended to simulate human approach to problem-solving. This constraint limits the flexibility of models in tackling diverse problems effectively. In this paper, we introduce Auto-Evolve, a novel framework that enables LLMs to self-create dynamic reasoning modules and downstream action plan, resulting in significant improvements over current SOTA methods. We evaluate Auto-Evolve on the challenging BigBench-Hard (BBH) dataset with Claude 2.0, Claude 3 Sonnet, Mistral Large, and GPT 4, where it…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Sparse Evolutionary Training · Linear Layer · Cosine Annealing · Dropout · Byte Pair Encoding · Softmax · Multi-Head Attention · Layer Normalization