Caterpillar GNN: Replacing Message Passing with Efficient Aggregation
Marek \v{C}ern\'y

TL;DR
This paper introduces Caterpillar GNNs, a new class of graph neural networks that replace traditional message passing with efficient walk-based aggregation, balancing expressivity and inductive bias for scalable and effective graph learning.
Contribution
It proposes a novel aggregation method over walk incidence matrices, providing a spectrum between message passing and simpler walk-based methods, with theoretical expressivity analysis and practical improvements.
Findings
Caterpillar GNNs match MPGNNs on benchmarks with fewer nodes in hidden layers.
The approach offers a trade-off between expressivity and inductive bias.
The method scales seamlessly between classical message passing and simpler walk-based techniques.
Abstract
Message-passing graph neural networks (MPGNNs) dominate modern graph learning. Typical efforts enhance MPGNN's expressive power by enriching the adjacency-based aggregation. In contrast, we introduce an efficient aggregation over walk incidence-based matrices that are constructed to deliberately trade off some expressivity for stronger and more structured inductive bias. Our approach allows for seamless scaling between classical message-passing and simpler methods based on walks. We rigorously characterize the expressive power at each intermediate step using homomorphism counts over a hierarchy of generalized caterpillar graphs. Based on this foundation, we propose Caterpillar GNNs, whose robust graph-level aggregation successfully tackles a benchmark specifically designed to challenge MPGNNs. Moreover, we demonstrate that, on real-world datasets, Caterpillar GNNs achieve comparable…
Peer Reviews
Decision·Submitted to ICLR 2026
Relaxing strict graph isomorphism tests to obtain more efficient computation is an interesting idea that could help the graph learning community strike a better balance between theory and practice.
* To be fair, the paper’s clarity could be much improved; I spent considerable time ensuring I understood the main ideas. I recognize that a theory-heavy paper cannot be fully popularized, but there are concrete steps that would markedly raise readability. For example: * Provide a glossary at the start of the appendix. It may be long in this case, but it is necessary. * “Caterpillar graphs” are not formally defined until the end of page 6, yet the term appears frequently before that. The sa
- The approach appears to be novel and incorporating important insights from graph theory and automata theory fields. - Deterministic and permutation-equivariant walk selection adds a principled structural constraint.
1) Poor organisation and clarity - The exposition is dense and introduces key definitions late, which makes it difficult to follow the main ideas. - The first three pages (i.e., from the beginning until Section 3) introduce the terminology and concepts used in the paper. However, a formal definition of “caterpillar”, a fundamental concept of this submission, does not appear until page 6 in Section 4. On page 1, the authors refer to caterpillars simply as a subclass of trees, without providin
- The mechanism presented is novel and well-founded, connecting random walk-based models with message-passing GNNs. - There is a rigorous theoretical analysis of the expressive power of the method using homomorphism counts. - The paper contains multiple interesting theoretical insights. I also find the use of automata theory to prove results from Sec. 3 and Sec. 4 interesting. - Different from some other random-walk-based methods, Caterpillar GNNs can preserve (permutation) equivariance and dete
The main weaknesses of this paper are scoping and empirics: - **Scoping:** The paper is primarily theoretical, proposing a novel aggregation mechanism along with the new Caterpillar GNNs and providing an extensive theoretical analysis. However, its presentation within the constraints of a short conference paper feels limited. While the main text introduces some of the key ideas and results, much of the theoretical novelty and discussion are deferred to the Supplementary Material. I recognize the
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Topic Modeling
