Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning

Yifan Wang; Shiyu Li; Peiming Li; Xiaochen Yang; Yang Tang; Zheng Wei

arXiv:2601.14750·cs.CL·April 21, 2026

Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning

Yifan Wang, Shiyu Li, Peiming Li, Xiaochen Yang, Yang Tang, Zheng Wei

PDF

1 Repo

TL;DR

Render-of-Thought (RoT) transforms textual reasoning chains into images to make latent reasoning explicit, improving efficiency and traceability in large language model reasoning tasks.

Contribution

RoT is the first framework to render reasoning steps as images, enabling explicit, traceable reasoning without additional pre-training overhead.

Findings

01

Achieves 3-4x token compression compared to explicit CoT

02

Provides substantial inference acceleration

03

Maintains competitive reasoning performance

Abstract

Chain-of-Thought (CoT) prompting has achieved remarkable success in unlocking the reasoning capabilities of Large Language Models (LLMs). Although CoT prompting enhances reasoning, its verbosity imposes substantial computational overhead. Recent works often focus exclusively on outcome alignment and lack supervision on the intermediate reasoning process. These deficiencies obscure the analyzability of the latent reasoning chain. To address these challenges, we introduce Render-of-Thought (RoT), the first framework to reify the reasoning chain by rendering textual steps into images, making the latent rationale explicit and traceable. Specifically, we leverage the vision encoders of existing Vision Language Models (VLMs) as semantic anchors to align the vision embeddings with the textual space. This design ensures plug-and-play implementation without incurring additional pre-training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TencentBAC/RoT
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.