Adaptive Tokenization: On the Hop-Overpriority Problem in Tokenized Graph Learning Models
Zhibiao Wang, Yunlong Zhou, Ziwei Zhang, Mengmei Zhang, Shirui Pan, Chunming Hu, Xiao Wang

TL;DR
This paper introduces LGTL, a learnable token list for Graph Transformers that adaptively balances local and global information, addressing the hop-overpriority problem and improving performance on diverse graph types.
Contribution
The paper proposes LGTL, a novel plug-and-play module that replaces hand-designed token lists with an adaptive, learnable approach for better graph representation in TGLMs.
Findings
LGTL effectively addresses the hop-overpriority problem.
LGTL improves performance on heterophilic and homophilic graphs.
Experimental results validate LGTL's superiority across benchmarks.
Abstract
Graph Transformers, leveraging the global attention to capture long-range dependencies in graph structures, have significantly advanced graph machine learning, but face prohibitive computational complexity. Tokenized Graph Learning Models (TGLMs) address this issue by converting graphs into ordered token lists for scalable processing. Besides, TGLMs also empower Large Language Models (LLMs) to handle text-attributed graphs more effectively and thus are also employed in Graph LLMs. However, existing TGLMs rely on hand-designed token lists and their adaptability to diverse graph learning scenarios remains unexplored. In this paper, we first conduct extensive empirical and theoretical preliminary studies for hand-designed token lists. Surprisingly, we identify an unexplored hop-overpriority problem: the common pre-defined token lists overemphasize nearby nodes and overwhelm the ability of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Text and Document Classification Technologies
MethodsSoftmax · Attention Is All You Need · fast speak--How do I Speak to someone at Expedia?
