Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

DiJia Su; Hanlin Zhu; Yingchen Xu; Jiantao Jiao; Yuandong Tian; Qinqing Zheng

arXiv:2502.03275·cs.CL·September 3, 2025

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

DiJia Su, Hanlin Zhu, Yingchen Xu, Jiantao Jiao, Yuandong Tian, Qinqing Zheng

PDF

Open Access

TL;DR

This paper introduces a hybrid token approach combining latent discrete tokens and text tokens to improve reasoning efficiency and performance in large language models, reducing input length and computational costs.

Contribution

The paper proposes a novel hybrid token representation using VQ-VAE for reasoning tasks, enabling models to abstract reasoning steps and improve performance across benchmarks.

Findings

01

Outperforms baseline methods on reasoning benchmarks

02

Reduces input length and computational resources

03

Enables effective training with mixed token types

Abstract

Large Language Models (LLMs) excel at reasoning and planning when trained on chainof-thought (CoT) data, where the step-by-step thought process is explicitly outlined by text tokens. However, this results in lengthy inputs where many words support textual coherence rather than core reasoning information, and processing these inputs consumes substantial computation resources. In this work, we propose a hybrid representation of the reasoning process, where we partially abstract away the initial reasoning steps using latent discrete tokens generated by VQ-VAE, significantly reducing the length of reasoning traces. We explore the use of latent trace abstractions in two scenarios: 1) training the model from scratch for the Keys-Finding Maze problem, 2) fine-tuning LLMs on this hybrid data with an extended vocabulary including unseen latent tokens, for both logical and mathematical reasoning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies

MethodsVQ-VAE