Graph homophily booster: Reimagining the role of discrete features in heterophilic graph learning
Ruizhong Qiu, Ting-Wei Li, Gaotang Li, Hanghang Tong

TL;DR
This paper introduces GRAPHITE, a novel graph transformation framework that explicitly increases homophily in heterophilic graphs, significantly improving GNN performance on challenging datasets by creating feature nodes for better message passing.
Contribution
The work is the first to directly transform graphs to enhance homophily, addressing the root cause of heterophily rather than just architectural modifications.
Findings
GRAPHITE significantly outperforms state-of-the-art methods on heterophilic datasets.
It achieves comparable accuracy to leading methods on homophilic graphs.
The method increases graph homophily with minimal size increase.
Abstract
Graph neural networks (GNNs) have emerged as a powerful tool for modeling graph-structured data. However, existing GNNs often struggle with heterophilic graphs, where connected nodes tend to have dissimilar features or labels. While numerous methods have been proposed to address this challenge, they primarily focus on architectural designs without directly targeting the root cause of the heterophily problem. These approaches still perform even worse than the simplest MLPs on challenging heterophilic datasets. For instance, our experiments show that 21 latest GNNs still fall behind the MLP on the Actor dataset. This critical challenge calls for an innovative approach to addressing graph heterophily beyond architectural designs. To bridge this gap, we propose and study a new and unexplored paradigm: directly increasing the graph homophily via a carefully designed graph transformation. In…
Peer Reviews
Decision·ICLR 2026 Poster
1. First work to explicitly transform graph structure to increase homophily, offering a fundamentally different approach from architectural GNN modifications. 2. Provides formal proofs showing GRAPHITE guarantees homophily improvement (Theorem 3) with only linear growth in graph size. 3. Consistently outperforms 25 baselines across 4 heterophilic datasets with improvements up to 5.35%, while maintaining competitive performance on homophilic graphs.
1. Method is specifically designed for discrete node features, limiting applicability to graphs with continuous features without discretization. 2. While theoretical complexity is linear, practical implementation with 8 GNN layers and 512 hidden dimensions may be computationally expensive for very large graphs. 3. Feature node representations (Equation 5) use simple averaging, potentially overlooking more sophisticated feature aggregation strategies. 4. No exploration of how the transformation a
1) Introduces a graph transformation method that effectively boosts homophily on heterophilic graphs. 2) Both theoretical analysis and empirical experiments support the superior performance of GRAPHITE on heterophilic graphs. 3) Demonstrates good compatibility and enhancement effects with existing GNN architectures.
1) Limited Novelty in Core Idea: While the paper introduces the GRAPHITE method to boost homophily in heterophilic graphs, the fundamental concept of adding feature nodes or shortcut connections between similar nodes is not entirely novel. Many prior works have explored similar strategies for improving graph neural network (GNN) performance, such as feature augmentation or graph transformation methods. The core idea lacks significant innovation, which might reduce its contribution to the broader
1. Paradigm shift: GRAPHITE directly enhances graph homophily through structural transformation, offering a fundamental and effective new perspective. Moreover, its design is simple and efficient: by using feature nodes as hubs, it avoids the O(|V|²) edge explosion and achieves provably improved homophily with only O(|V|) added nodes and O(|E|) added edges. 2. It elegantly decouples feature similarity from the original graph structure to enable semantics-aware message passing: by introducing f
1. Although the authors list the hyperparameter search ranges and training configurations in Section 4.1 and Appendix B.3, the paper lacks sensitivity analysis of these parameters, which weakens the robustness argument of the method. 2. The homophily definition used in the theoretical analysis (based on feature intersection) is inconsistent with the experimental metrics (feature/adjusted homophily). Although the two are intuitively positively correlated, the paper lacks rigorous derivation or e
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Complex Network Analysis Techniques
