Caterpillar GNN: Replacing Message Passing with Efficient Aggregation

Marek \v{C}ern\'y

arXiv:2506.06784·cs.LG·September 29, 2025

Caterpillar GNN: Replacing Message Passing with Efficient Aggregation

Marek \v{C}ern\'y

PDF

Open Access 3 Reviews

TL;DR

This paper introduces Caterpillar GNNs, a new class of graph neural networks that replace traditional message passing with efficient walk-based aggregation, balancing expressivity and inductive bias for scalable and effective graph learning.

Contribution

It proposes a novel aggregation method over walk incidence matrices, providing a spectrum between message passing and simpler walk-based methods, with theoretical expressivity analysis and practical improvements.

Findings

01

Caterpillar GNNs match MPGNNs on benchmarks with fewer nodes in hidden layers.

02

The approach offers a trade-off between expressivity and inductive bias.

03

The method scales seamlessly between classical message passing and simpler walk-based techniques.

Abstract

Message-passing graph neural networks (MPGNNs) dominate modern graph learning. Typical efforts enhance MPGNN's expressive power by enriching the adjacency-based aggregation. In contrast, we introduce an efficient aggregation over walk incidence-based matrices that are constructed to deliberately trade off some expressivity for stronger and more structured inductive bias. Our approach allows for seamless scaling between classical message-passing and simpler methods based on walks. We rigorously characterize the expressive power at each intermediate step using homomorphism counts over a hierarchy of generalized caterpillar graphs. Based on this foundation, we propose Caterpillar GNNs, whose robust graph-level aggregation successfully tackles a benchmark specifically designed to challenge MPGNNs. Moreover, we demonstrate that, on real-world datasets, Caterpillar GNNs achieve comparable…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 2Confidence 4

Strengths

Relaxing strict graph isomorphism tests to obtain more efficient computation is an interesting idea that could help the graph learning community strike a better balance between theory and practice.

Weaknesses

* To be fair, the paper’s clarity could be much improved; I spent considerable time ensuring I understood the main ideas. I recognize that a theory-heavy paper cannot be fully popularized, but there are concrete steps that would markedly raise readability. For example: * Provide a glossary at the start of the appendix. It may be long in this case, but it is necessary. * “Caterpillar graphs” are not formally defined until the end of page 6, yet the term appears frequently before that. The sa

Reviewer 02Rating 2Confidence 3

Strengths

- The approach appears to be novel and incorporating important insights from graph theory and automata theory fields. - Deterministic and permutation-equivariant walk selection adds a principled structural constraint.

Weaknesses

1) Poor organisation and clarity - The exposition is dense and introduces key definitions late, which makes it difficult to follow the main ideas. - The first three pages (i.e., from the beginning until Section 3) introduce the terminology and concepts used in the paper. However, a formal definition of “caterpillar”, a fundamental concept of this submission, does not appear until page 6 in Section 4. On page 1, the authors refer to caterpillars simply as a subclass of trees, without providin

Reviewer 03Rating 6Confidence 3

Strengths

- The mechanism presented is novel and well-founded, connecting random walk-based models with message-passing GNNs. - There is a rigorous theoretical analysis of the expressive power of the method using homomorphism counts. - The paper contains multiple interesting theoretical insights. I also find the use of automata theory to prove results from Sec. 3 and Sec. 4 interesting. - Different from some other random-walk-based methods, Caterpillar GNNs can preserve (permutation) equivariance and dete

Weaknesses

The main weaknesses of this paper are scoping and empirics: - **Scoping:** The paper is primarily theoretical, proposing a novel aggregation mechanism along with the new Caterpillar GNNs and providing an extensive theoretical analysis. However, its presentation within the constraints of a short conference paper feels limited. While the main text introduces some of the key ideas and results, much of the theoretical novelty and discussion are deferred to the Supplementary Material. I recognize the

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Topic Modeling