Matcha: Mitigating Graph Structure Shifts with Test-Time Adaptation
Wenxuan Bao, Zhichen Zeng, Zhining Liu, Hanghang Tong, Jingrui He

TL;DR
Matcha is a novel test-time adaptation framework for graph neural networks that effectively mitigates structure shifts, improving robustness and performance on both synthetic and real-world graph data.
Contribution
We introduce Matcha, a new method for adapting GNNs to structure shifts at test time, including a clustering loss and compatibility with existing TTA algorithms.
Findings
Matcha significantly improves GNN performance under structure shifts.
It effectively handles combined structure and attribute shifts.
Experimental results validate robustness across diverse datasets.
Abstract
Powerful as they are, graph neural networks (GNNs) are known to be vulnerable to distribution shifts. Recently, test-time adaptation (TTA) has attracted attention due to its ability to adapt a pre-trained model to a target domain, without re-accessing the source domain. However, existing TTA algorithms are primarily designed for attribute shifts in vision tasks, where samples are independent. These methods perform poorly on graph data that experience structure shifts, where node connectivity differs between source and target graphs. We attribute this performance gap to the distinct impact of node attribute shifts versus graph structure shifts: the latter significantly degrades the quality of node representations and blurs the boundaries between different node categories. To address structure shifts in graphs, we propose Matcha, an innovative framework designed for effective and…
Peer Reviews
Decision·ICLR 2025 Poster
1. The paper points out the problem of structure shifts, which is indeed overlooked in previous research on TTA, or not focused explicitly at least. 2. The proposed AdaRC framework provides a simple yet effective solution for graphs with structure shifts at test time. It is compatible with existing Test-Time Adaptation algorithms, which allows better adaption. 3. The authors conduct a detailed theoretical analysis of the impact of structure shifts on a single-layer GCN on a CSBM-generated gra
1. The theoretical analysis focuses on single-layer GCNs and CSBM-generated graphs, which represent a simplified model where structure shifts are easily identified. However, the evolving real-world graph structures are far more complex. It is unclear whether the findings on this simplified model still hold for real-world graphs, particularly in domains where the task accuracy is not high. 2. The experimental results, which show promising performance, are primarily based on synthetic datasets o
The paper analyzes the performance gap in a simple single-layer Graph Convolutional Network (GCN) using graphs generated from the Community Structure Benchmark Model (CSBM). It effectively investigates the impacts of attribute shifts and structural shifts on performance. The proposed PIC loss is both simple and effective, allowing for seamless integration with existing frameworks and methods. Its effectiveness is validated through experimental results.
It is worth noting that, as I am not deeply familiar with this specific area, my comments here may lack technical depth. I will rely on the insights from other expert reviewers and will focus primarily on their evaluations. Here, a few potential concerns include: - The performance improvements on the Twitch-E and OGB-Arxiv datasets appear marginal. Could the authors provide further analysis to clarify the reasons behind these limited gains? - The gains achieved seem to vary significantly acros
- AdaRC introduces a new approach to handling structure shifts by adapting hop-aggregation parameters, which allows the model to improve node representation quality in ways traditional TTA methods cannot. - The experimental evaluation is comprehensive, covering both synthetic and real-world datasets, and demonstrates substantial improvements in accuracy and robustness across different shifts, particularly structure shifts. - The theoretical insights into the impact of structure and attribute s
- While the PIC loss offers advantages, its application may face scalability challenges with very large graphs due to the computational overhead introduced by clustering operations. The method's efficiency under extreme graph sizes could be explored further. - AdaRC's performance relies on the optimization of hop-aggregation parameters, which may not generalize across all GNN architectures without tuning. This reliance might limit its applicability for varied and complex GNN models. - AdaRC ha
Videos
Taxonomy
TopicsGraph Theory and Algorithms · Advanced Graph Neural Networks · Software Testing and Debugging Techniques
MethodsSoftmax · Attention Is All You Need
