TACIT: Transformation-Aware Capturing of Implicit Thought

Daniel Nobrega

arXiv:2602.07061·cs.LG·February 10, 2026

TACIT: Transformation-Aware Capturing of Implicit Thought

Daniel Nobrega

PDF

Open Access

TL;DR

TACIT introduces a diffusion-based transformer that visualizes implicit reasoning in pixel space, demonstrating rapid, holistic maze-solving with insights into neural reasoning processes.

Contribution

The paper presents a novel pixel-space diffusion transformer enabling interpretable visual reasoning and revealing holistic, insight-like solution emergence.

Findings

01

192x reduction in training loss over 100 epochs

02

22.7x improvement in L2 distance to ground truth

03

Solution emergence within 2% of the transformation process

Abstract

We present TACIT (Transformation-Aware Capturing of Implicit Thought), a diffusion-based transformer for interpretable visual reasoning. Unlike language-based reasoning systems, TACIT operates entirely in pixel space using rectified flow, enabling direct visualization of the reasoning process at each inference step. We demonstrate the approach on maze-solving, where the model learns to transform images of unsolved mazes into solutions. Key results on 1 million synthetic maze pairs include: - 192x reduction in training loss over 100 epochs - 22.7x improvement in L2 distance to ground truth - Only 10 Euler steps required (vs. 100-1000 for typical diffusion models) Quantitative analysis reveals a striking phase transition phenomenon: the solution remains invisible for 68% of the transformation (zero recall), then emerges abruptly at t=0.70 within just 2% of the process. Most…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAction Observation and Synchronization · Multimodal Machine Learning Applications · Embodied and Extended Cognition