Hierarchical Compression of Text-Rich Graphs via Large Language Models
Shichang Zhang, Da Zheng, Jiani Zhang, Qi Zhu, Xiang song, Soji, Adeshina, Christos Faloutsos, George Karypis, Yizhou Sun

TL;DR
This paper presents Hierarchical Compression (HiCom), a novel method that enables large language models to effectively process and encode text-rich graphs by hierarchically compressing node texts, improving node classification performance.
Contribution
HiCom introduces a hierarchical text compression technique that aligns LLM capabilities with graph structures, addressing computational challenges and enhancing performance on text-rich graphs.
Findings
HiCom outperforms GNNs and LLM backbones in node classification accuracy.
Achieves a 3.48% average performance boost on five datasets.
More efficient than using LLMs directly for dense graph regions.
Abstract
Text-rich graphs, prevalent in data mining contexts like e-commerce and academic graphs, consist of nodes with textual features linked by various relations. Traditional graph machine learning models, such as Graph Neural Networks (GNNs), excel in encoding the graph structural information, but have limited capability in handling rich text on graph nodes. Large Language Models (LLMs), noted for their superior text understanding abilities, offer a solution for processing the text in graphs but face integration challenges due to their limitation for encoding graph structures and their computational complexities when dealing with extensive text in large neighborhoods of interconnected nodes. This paper introduces ``Hierarchical Compression'' (HiCom), a novel method to align the capabilities of LLMs with the structure of text-rich graphs. HiCom processes text in a node's neighborhood in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Algorithms and Data Compression
MethodsALIGN
