Guiding Language Model Reasoning with Planning Tokens

Xinyi Wang; Lucas Caccia; Oleksiy Ostapenko; Xingdi Yuan; William Yang; Wang; Alessandro Sordoni

arXiv:2310.05707·cs.CL·August 8, 2024·2 cites

Guiding Language Model Reasoning with Planning Tokens

Xinyi Wang, Lucas Caccia, Oleksiy Ostapenko, Xingdi Yuan, William Yang, Wang, Alessandro Sordoni

PDF

Open Access

TL;DR

This paper introduces a hierarchical planning token approach to guide large language models in reasoning tasks, improving accuracy with minimal additional parameters across multiple datasets.

Contribution

It proposes a novel hierarchical generation scheme using planning tokens to structurally guide reasoning, requiring negligible additional parameters.

Findings

01

Significant accuracy improvements on math and QA datasets

02

Effective across multiple large language models

03

Minimal increase in trainable parameters

Abstract

Large language models (LLMs) have recently attracted considerable interest for their ability to perform complex reasoning tasks, such as chain-of-thought (CoT) reasoning. However, most of the existing approaches to enhance this ability rely heavily on data-driven methods, while neglecting the structural aspects of the model's reasoning capacity. To encourage a more structural generation of CoT steps, we propose a hierarchical generation scheme: we let the LM generate a planning token at the start of each reasoning step, intuitively serving as a high-level plan of the current step, and add their embeddings to the model parameters. Our approach requires a negligible increase in trainable parameters (0.001%) and can be applied through either full fine-tuning or a more parameter-efficient scheme. We demonstrate our method's effectiveness by applying it to three different LLMs, showing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Multi-Agent Systems and Negotiation