GeoGNN: Quantifying and Mitigating Semantic Drift in Text-Attributed Graphs
Liangwei Yang, Jing Ma, Jianguo Zhang, Zhiwei Liu, Jielin Qiu, Shirley Kokane, Shiyu Wang, Haolin Chen, Rithesh Murthy, Ming Zhu, Huan Wang, Weiran Yao, Caiming Xiong, Shelby Heinecke

TL;DR
This paper introduces GeoGNN, a manifold-aware graph neural network that mitigates semantic drift in text-attributed graphs by respecting the geometry of pretrained language model embeddings, leading to improved performance.
Contribution
It presents a quantitative framework for measuring semantic drift and proposes a geodesic aggregation mechanism that preserves semantic fidelity on curved manifolds.
Findings
GeoGNN reduces semantic drift in experiments.
It outperforms baseline models on benchmark datasets.
Manifold-aware aggregation improves text-graph learning.
Abstract
Graph neural networks (GNNs) on text--attributed graphs (TAGs) typically encode node texts using pretrained language models (PLMs) and propagate these embeddings through linear neighborhood aggregation. However, the representation spaces of modern PLMs are highly non--linear and geometrically structured, where textual embeddings reside on curved semantic manifolds rather than flat Euclidean spaces. Linear aggregation on such manifolds inevitably distorts geometry and causes semantic drift--a phenomenon where aggregated representations deviate from the intrinsic manifold, losing semantic fidelity and expressive power. To quantitatively investigate this problem, this work introduces a local PCA--based metric that measures the degree of semantic drift and provides the first quantitative framework to analyze how different aggregation mechanisms affect manifold structure. Building upon these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Multimodal Machine Learning Applications
