FlexVector: A SpMM Vector Processor with Flexible VRF for GCNs on Varying-Sparsity Graphs

Bohan Li; Shengmin Li; Xinyu Shi; Enyi Yao; Francky Catthoor; Simei Yang

arXiv:2604.10113·cs.DC·April 14, 2026

FlexVector: A SpMM Vector Processor with Flexible VRF for GCNs on Varying-Sparsity Graphs

Bohan Li, Shengmin Li, Xinyu Shi, Enyi Yao, Francky Catthoor, Simei Yang

PDF

TL;DR

FlexVector is a novel vector processor architecture designed to efficiently accelerate sparse-dense matrix multiplication in GCN inference, addressing irregular workloads with flexible register files and graph-aware preprocessing.

Contribution

It introduces a row-wise dataflow and flexible VRFs, combined with graph-aware preprocessing, to improve GCN inference performance and energy efficiency.

Findings

01

Achieves 3.78x speedup over baseline

02

Reduces energy consumption by 40.5%

03

Maintains comparable area cost

Abstract

Graph Convolutional Networks (GCNs) are widely adopted for tasks involving relational or graph-structured data and can be formulated as two-stage sparse-dense matrix multiplication (SpMM) during inference. However, existing accelerators often struggle with the irregular workloads induced by power-law node degree distributions. In this work, we propose FlexVector, a vector-processor-based architecture that efficiently accelerates SpMM for GCN inference. To address irregular computation patterns, FlexVector adopts a row-wise, product-based dataflow that regularizes SpMM execution and exposes vector parallelism through full-row access to vector registers, eliminating the need for multi-banked register file designs. Building on this dataflow, it introduces software-managed, flexible vector register files (VRFs) that adapt to irregular data access patterns, without sacrificing memory access…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.