A Scalable Inter-edge Correlation Modeling in CopulaGNN for Link Sign Prediction
Jinkyu Sung, Myunggeum Jee, Joonseok Lee

TL;DR
This paper introduces a scalable Gaussian copula-based method within CopulaGNN for link sign prediction on signed graphs, effectively modeling edge dependencies while reducing computational complexity.
Contribution
It proposes a novel scalable approach that models edge correlations via a Gramian-based correlation matrix, enabling efficient inference on large graphs.
Findings
Achieves faster convergence than baseline models
Maintains competitive prediction accuracy
Theoretically proven to have linear convergence
Abstract
Link sign prediction on a signed graph is a task to determine whether the relationship represented by an edge is positive or negative. Since the presence of negative edges violates the graph homophily assumption that adjacent nodes are similar, regular graph methods have not been applicable without auxiliary structures to handle them. We aim to directly model the latent statistical dependency among edges with the Gaussian copula and its corresponding correlation matrix, extending CopulaGNN (Ma et al., 2021). However, a naive modeling of edge-edge relations is computationally intractable even for a graph with moderate scale. To address this, we propose to 1) represent the correlation matrix as a Gramian of edge embeddings, significantly reducing the number of parameters, and 2) reformulate the conditional probability distribution to dramatically reduce the inference cost. We…
Peer Reviews
Decision·ICLR 2026 Poster
- The main problem/ task is one of the important topic in GNN. The problem formulation (in Section 2) is clear. The paper is well organized and relevant theorem has been provided to support the statements. - The main idea: using the CopulaGNN for signed graph learning has been well justified theoretically. The idea of reducing the computational cost in two ways: training phase and inference phase, is straightforward and supported well. - For the experimental results, the computational reducti
- Some of the details are missing requiring further clarifications.(see questions) - How does the low-rank multivariate Gaussian brings, in what extent, how significant ? The authors simply state that Woodbury reformulation is used to improve computational efficiency. - The proposed model uses SNEA as their backbone and show the improvements in computational efficiency. However, from practical point, considering that the best prediction performances are achieved among TrustSGCN, SLGNN, furth
- The paper has a clear and significant narrative: It considers the natural approach of edge-edge correlation modeling, which has some general relevance in graph modeling, identifies the scalability problem, and provides a solution. - The writing is clear and grammatical. The diagram of Figure 1 is helpful for understanding the core concepts. - The paper includes conceptual, theoretical, and experimental components. - Several ablations are provided to strengthen the paper's claims.
- The method is competitive with others in terms of prediction performance, but not clearly superior and arguably inferior to one other method on the chosen datasets. - The diversity of the datasets is limited, with two Bitcoin datasets and two Wikipedia datasets. - There is little discussion of how the prior methods work, including SNEA, which is used as the encoder backbone in this paper. - There is little discussion of the modeling side of the proposed approach, e.g., what do the learned emb
1. The paper proposes an innovative hypothesis that there exists a statistical dependence between edges connected by common nodes, thus extending GNNs from unsigned to signed graphs. 2. By introducing a Gramian-based correlation matrix for edge dependencies and a Woodbury matrix rewrite for computational efficiency, the paper significantly reduces memory usage and computational cost, greatly accelerates model convergence, and achieves good scalability while maintaining performance comparable to
1. The paper contains a large number of mathematical formulas, which makes it somewhat difficult to read. 2. The innovation of the framework mainly relies on the creative combination of existing tools (Gaussian Copula, Gramian construction, and the Woodbury identity), lacking some novelty. 3. Model training depends on hyperparameters η and ε, making it difficult to directly obtain optimal model performance. 4. Section 4 spends some length analyzing the model's convergence, which is not direct
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Human Pose and Action Recognition
