Understanding the Language Model to Solve the Symbolic Multi-Step Reasoning Problem from the Perspective of Buffer Mechanism

Zhiwei Wang; Yunji Wang; Zhongwang Zhang; Zhangchen Zhou; Hui Jin; Tianyang Hu; Jiacheng Sun; Zhenguo Li; Yaoyu Zhang; Zhi-Qin John Xu

arXiv:2405.15302·cs.AI·September 10, 2025·2 cites

Understanding the Language Model to Solve the Symbolic Multi-Step Reasoning Problem from the Perspective of Buffer Mechanism

Zhiwei Wang, Yunji Wang, Zhongwang Zhang, Zhangchen Zhou, Hui Jin, Tianyang Hu, Jiacheng Sun, Zhenguo Li, Yaoyu Zhang, Zhi-Qin John Xu

PDF

Open Access 1 Video

TL;DR

This paper investigates how Transformer-based language models perform multi-step symbolic reasoning by analyzing their internal buffer mechanisms and introduces a simple algorithm that significantly improves reasoning performance across multiple datasets.

Contribution

The study introduces the buffer mechanism concept and a novel random matrix-based algorithm that enhances reasoning ability with minimal additional parameters.

Findings

01

Significant performance improvements on 7 reasoning datasets

02

Buffer mechanism provides insights into internal reasoning processes

03

A simple 132-parameter algorithm boosts reasoning capabilities

Abstract

Large language models have consistently struggled with complex reasoning tasks, such as mathematical problem-solving. Investigating the internal reasoning mechanisms of these models can help us design better model architectures and training strategies, ultimately enhancing their reasoning capability. In this study, we constructed a symbolic multi-step reasoning task to investigate the information propagation mechanisms in Transformer models when solving the task through direct answering and Chain-of-Thought (CoT) reasoning. We introduced the concept of buffer mechanism: the model stores various information in distinct buffers and selectively extracts it through the query-key matrix. We proposed a random matrix-based algorithm to enhance the model's reasoning ability. This algorithm introduces only 132 trainable parameters, yet leads to significant performance improvements on 7…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Understanding the Language Model to Solve the Symbolic Multi-Step Reasoning Problem from the Perspective of Buffer Mechanism· underline

Taxonomy

TopicsConstraint Satisfaction and Optimization · Logic, programming, and type systems · Model-Driven Software Engineering Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Linear Warmup With Cosine Annealing · Attention Dropout · Discriminative Fine-Tuning · Weight Decay · GPT-2 · Linear Layer · Byte Pair Encoding