GRAMA: Adaptive Graph Autoregressive Moving Average Models
Moshe Eliasof, Alessio Gravina, Andrea Ceni, Claudio Gallicchio,, Davide Bacciu, Carola-Bibiane Sch\"onlieb

TL;DR
GRAMA introduces a learnable ARMA-based graph model that effectively captures long-range dependencies while maintaining permutation equivariance, outperforming existing methods on diverse datasets.
Contribution
It presents a novel adaptive ARMA framework for graph modeling that preserves permutation equivariance and enhances long-range interaction learning.
Findings
Outperforms baseline models on 14 datasets
Effectively captures long-range dependencies
Maintains permutation equivariance
Abstract
Graph State Space Models (SSMs) have recently been introduced to enhance Graph Neural Networks (GNNs) in modeling long-range interactions. Despite their success, existing methods either compromise on permutation equivariance or limit their focus to pairwise interactions rather than sequences. Building on the connection between Autoregressive Moving Average (ARMA) and SSM, in this paper, we introduce GRAMA, a Graph Adaptive method based on a learnable Autoregressive Moving Average (ARMA) framework that addresses these limitations. By transforming from static to sequential graph data, GRAMA leverages the strengths of the ARMA framework, while preserving permutation equivariance. Moreover, GRAMA incorporates a selective attention mechanism for dynamic learning of ARMA coefficients, enabling efficient and flexible long-range information propagation. We also establish theoretical connections…
Peer Reviews
Decision·Submitted to ICLR 2025
1. I agree that using more states that used in SSM can preserve more higher-order informations in long-range problem setting, which can be helpful for certain long-range multi-polynomial regression problems. Extending SSM to graph domain can be a good attemp. 2. Overall the design is easy to follow.
1. The paper is like a naive design of a new model without clear motivation. It appears to just extend SSM/ARMA to graph domain for a new model or a new paper, without clear motivation from the graph problem side. While long-range problem is an issue in graph domain, adapting an existing model to this domain without domain-specific designs and motivations is not helpful for the domain. This is just like a "combine A and B" paper. 2. While the presented framework "claims" supporting any GNN bac
1. This paper introduces a new way of combining ARMA with GNNs, addressing limitations in handling long-range interactions and permutation equivariance in graph data. 2. This paper provides a theoretical foundation that establishes connections between GRAMA and SSMs. 3. The authors have tested the proposed GRAMA on a wide range of tasks and benchmarks, and the overall paper is experimentally solid.
1. While I acknowledge that the authors conducted extensive experiments, several highly relevant baselines or benchmarks are notably absent. In Section 5.2, the graph property prediction datasets employed are not mainstream. Why not utilize more widely recognized real-world datasets such as ZINC or OGBG-MOLHIV/MOLPCBA? Additionally, the chosen heterophily GNNs in Table 4 do not represent the current state-of-the-art models; please refer to [1,2]. 2. The baselines selected by the authors across
The paper presents an innovative approach for sequential learning on graphs, specifically adapting the ARMA/SSM framework to graph data while preserving permutation equivariance. Experimental results demonstrate that GRAMA achieves strong performance, consistently outperforming its backbone model and competing effectively with state-of-the-art methods. NB: These are shortly formulated but important strengths.
The paper has three main weaknesses: (1) model presentation readability, (2) lack of references and attributions, and (3) limited discussion on potential model limitations. 1) Model Presentation Readability The model presentation is challenging to follow, and improvements in clarity are necessary for readers to grasp the structure and function of GRAMA fully. - For instance, in the caption of Figure 1, it is stated that the blocks are linear systems; however, they contain GNN layers, which ar
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Data Management and Algorithms
MethodsSoftmax · Attention Is All You Need · ARMA GNN · Focus
