Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Yao Tang; Li Dong; Yaru Hao; Qingxiu Dong; Furu Wei; Jiatao Gu

arXiv:2601.08808·cs.CL·January 14, 2026

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Yao Tang, Li Dong, Yaru Hao, Qingxiu Dong, Furu Wei, Jiatao Gu

PDF

Open Access

TL;DR

Multiplex Thinking introduces a stochastic soft reasoning mechanism for large language models that maintains multiple reasoning paths simultaneously, improving performance on math reasoning tasks while keeping sequence length short.

Contribution

It proposes a novel multiplex reasoning method that combines sampling and aggregation to enhance reasoning without increasing sequence length.

Findings

01

Outperforms discrete CoT and RL baselines on math benchmarks

02

Produces shorter reasoning sequences with higher accuracy

03

Adapts between discrete and soft reasoning based on confidence

Abstract

Large language models often solve complex reasoning tasks more effectively with Chain-of-Thought (CoT), but at the cost of long, low-bandwidth token sequences. Humans, by contrast, often reason softly by maintaining a distribution over plausible next steps. Motivated by this, we propose Multiplex Thinking, a stochastic soft reasoning mechanism that, at each thinking step, samples K candidate tokens and aggregates their embeddings into a single continuous multiplex token. This preserves the vocabulary embedding prior and the sampling dynamics of standard discrete generation, while inducing a tractable probability distribution over multiplex rollouts. Consequently, multiplex trajectories can be directly optimized with on-policy reinforcement learning (RL). Importantly, Multiplex Thinking is self-adaptive: when the model is confident, the multiplex token is nearly discrete and behaves like…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques