From Sequence to Structure: Uncovering Substructure Reasoning in Transformers

Xinnan Dai; Kai Yang; Jay Revolinsky; Kai Guo; Aoran Wang; Bohang Zhang; Jiliang Tang

arXiv:2507.10435·cs.CL·October 21, 2025

From Sequence to Structure: Uncovering Substructure Reasoning in Transformers

Xinnan Dai, Kai Yang, Jay Revolinsky, Kai Guo, Aoran Wang, Bohang Zhang, Jiliang Tang

PDF

TL;DR

This paper investigates how decoder-only Transformers understand and extract graph substructures, revealing internal mechanisms and demonstrating their ability to handle complex graph data like molecular graphs.

Contribution

It introduces the Induced Substructure Filtration (ISF) framework, providing both empirical and theoretical insights into substructure reasoning in Transformers.

Findings

01

Transformers can identify substructures within graphs across layers.

02

The ISF framework captures consistent internal dynamics in LLMs.

03

Transformers successfully extract substructures from attributed graphs like molecules.

Abstract

Recent studies suggest that large language models (LLMs) possess the capability to solve graph reasoning tasks. Notably, even when graph structures are embedded within textual descriptions, LLMs can still effectively answer related questions. This raises a fundamental question: How can a decoder-only Transformer architecture understand underlying graph structures? To address this, we start with the substructure extraction task, interpreting the inner mechanisms inside the transformers and analyzing the impact of the input queries. Specifically, through both empirical results and theoretical analysis, we present Induced Substructure Filtration (ISF), a perspective that captures the substructure identification in the multi-layer transformers. We further validate the ISF process in LLMs, revealing consistent internal dynamics across layers. Building on these insights, we explore the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.