MeTHanol: Modularized Thinking Language Models with Intermediate Layer Thinking, Decoding and Bootstrapping Reasoning

Ningyuan Xi; Xiaoyu Wang; Yetao Wu; Teng Chen; Qingqing Gu; Yue Zhao; Jinxian Qu; Zhonglin Jiang; Yong Chen; Luo Ji

arXiv:2409.12059·cs.CL·April 30, 2026

TL;DR

MeTHanol introduces a modular approach to enhance language models' reasoning by leveraging intermediate layer decoding, dual-layer fine-tuning, and a two-pass inference mechanism, leading to improved cognitive and reflective capabilities.

Contribution

This work presents a novel modular framework with intermediate layer decoding and dual-layer fine-tuning to improve reasoning and self-reflection in large language models.

Findings

01

An intermediate layer can learn to decode fluent language tokens.

02

Two-pass inference enhances reasoning and response quality.

03

The model demonstrates improved cognitive behaviors and adaptability.

Abstract

Current research efforts focus on enhancing the thinking and reasoning capabilities of large language models (LLMs) through prompting, data-driven emergence, and inference-time computation. In this study, we consider stimulating a language model's thinking and cognitive abilities from a modular perspective that mimics the architecture of the human brain. We select a specific intermediate attention layer and attach newly implemented language heads to it. We conduct dual-layer fine-tuning on annotated (query, thought, answer) samples and show that the intermediate layer can also learn to decode fluent and reasonable language tokens. A two-pass inference mechanism is designed to generate thoughts first and formal responses second. The entire framework is called the modularized thinking language model (MeTHanol), which can enhance an LLM's cognitive behaviors, as indicated by Theory of Mind (ToM) and vignette-based experiments. Case…
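The two-pass mechanism described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the model is a random stand-in built from the Python standard library, and every name in it (`ToyModel`, `THINK_LAYER`, `thought_head`, `answer_head`, `two_pass_generate`) is hypothetical. It also assumes that the second pass conditions on the query plus the generated thought, which the abstract does not state explicitly. The only point is the control flow: one head reads an intermediate layer to decode a thought, and a second head reads the final layer to decode the answer.

```python
import math
import random

VOCAB = ["<eos>", "a", "b", "c", "d"]  # toy vocabulary; id 0 ends decoding
DIM = 8
NUM_LAYERS = 4
THINK_LAYER = 2  # hypothetical choice of the intermediate "thinking" layer


def make_matrix(rows, cols, rng):
    """Random weight matrix standing in for trained parameters."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vec):
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]


class ToyModel:
    """Layer stack with two language heads (an assumption for illustration)."""

    def __init__(self, seed=0):
        rng = random.Random(seed)
        self.embed = make_matrix(len(VOCAB), DIM, rng)
        self.layers = [make_matrix(DIM, DIM, rng) for _ in range(NUM_LAYERS)]
        self.thought_head = make_matrix(len(VOCAB), DIM, rng)  # reads THINK_LAYER
        self.answer_head = make_matrix(len(VOCAB), DIM, rng)   # reads final layer

    def hidden_states(self, token_ids):
        # Crude context summary: mean of token embeddings (no real attention).
        h = [sum(self.embed[t][i] for t in token_ids) / len(token_ids)
             for i in range(DIM)]
        states = []
        for layer in self.layers:
            h = [math.tanh(v) for v in matvec(layer, h)]
            states.append(h)
        return states

    def next_token(self, token_ids, use_thought_head):
        states = self.hidden_states(token_ids)
        if use_thought_head:
            logits = matvec(self.thought_head, states[THINK_LAYER - 1])
        else:
            logits = matvec(self.answer_head, states[-1])
        return max(range(len(VOCAB)), key=lambda i: logits[i])  # greedy pick


def decode(model, context, use_thought_head, max_new_tokens=5):
    """Greedy decoding that stops at <eos> or after max_new_tokens."""
    out = []
    for _ in range(max_new_tokens):
        tok = model.next_token(context + out, use_thought_head)
        if tok == 0:
            break
        out.append(tok)
    return out


def two_pass_generate(model, query_ids):
    # Pass 1: decode a thought from the intermediate layer's head.
    thought = decode(model, query_ids, use_thought_head=True)
    # Pass 2 (assumed conditioning): decode the answer from the final
    # layer's head, given the query followed by the thought.
    answer = decode(model, query_ids + thought, use_thought_head=False)
    return thought, answer


if __name__ == "__main__":
    thought_ids, answer_ids = two_pass_generate(ToyModel(seed=0), [1, 2, 3])
    print("thought:", [VOCAB[i] for i in thought_ids])
    print("answer:", [VOCAB[i] for i in answer_ids])
```

Because the weights are random, the decoded tokens carry no meaning. In a real setting both heads would be trained on (query, thought, answer) samples, as the abstract describes for dual-layer fine-tuning.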

Code & Models

Repositories

https://bachozean.github.io/methanol-page
github
