Automatic Model Selection with Large Language Models for Reasoning

James Xu Zhao; Yuxi Xie; Kenji Kawaguchi; Junxian He; Michael Qizhe; Xie

arXiv:2305.14333·cs.CL·October 24, 2023·2 cites

Automatic Model Selection with Large Language Models for Reasoning

James Xu Zhao, Yuxi Xie, Kenji Kawaguchi, Junxian He, Michael Qizhe, Xie

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel method using large language models to dynamically select the most effective reasoning approach—either Chain-of-Thought or Program-Aided Language Models—for improved accuracy and efficiency across multiple datasets.

Contribution

The paper introduces a model selection technique leveraging LLMs to choose between CoT and PAL methods, achieving state-of-the-art results and reducing computational costs.

Findings

01

Significant performance improvements on eight reasoning datasets.

02

Achieved new state-of-the-art accuracies on GSM8K and SVAMP.

03

Complementary to self-consistency, further enhancing performance.

Abstract

Chain-of-Thought (CoT) and Program-Aided Language Models (PAL) represent two distinct reasoning methods, each with its own strengths. CoT employs natural language, offering flexibility and interpretability, while PAL utilizes programming language, yielding more structured and rigorous logic. We introduce a model selection method to combine the best of both worlds by employing a large language model (LLM) to dynamically select between them. Our theoretical analysis underscores the feasibility of this method, which is further corroborated by empirical results. Our proposed method demonstrates significant performance improvements across eight reasoning datasets with Codex, ChatGPT, and GPT-4. Additionally, our method is complementary to self-consistency; when integrated, it can further enhance performance while significantly reducing computation costs. Moreover, we achieve new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xuzhao0/model-selection-reasoning
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Machine Learning and Algorithms · Software System Performance and Reliability

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Position-Wise Feed-Forward Layer · Dense Connections · Absolute Position Encodings · Adam · Label Smoothing · Residual Connection