RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations

Mingshu Zhao; Yi Luo; Yong Ouyang

arXiv:2412.19628·cs.CV·July 31, 2025

RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations

Mingshu Zhao, Yi Luo, Yong Ouyang

PDF

Open Access 1 Repo 10 Models

TL;DR

RecConv introduces a recursive convolution strategy that efficiently expands the receptive field with minimal parameter increase and constant FLOPs, enabling more efficient vision transformer models.

Contribution

It proposes RecConv, a recursive decomposition method for multi-frequency representations that maintains constant FLOPs while significantly increasing the receptive field.

Findings

01

RecConv achieves a linear parameter growth with decomposition levels.

02

RecConv maintains constant FLOPs regardless of receptive field expansion.

03

RecNeXt-M3 outperforms comparable models with similar FLOPs on COCO.

Abstract

Recent advances in vision transformers (ViTs) have demonstrated the advantage of global modeling capabilities, prompting widespread integration of large-kernel convolutions for enlarging the effective receptive field (ERF). However, the quadratic scaling of parameter count and computational complexity (FLOPs) with respect to kernel size poses significant efficiency and optimization challenges. This paper introduces RecConv, a recursive decomposition strategy that efficiently constructs multi-frequency representations using small-kernel convolutions. RecConv establishes a linear relationship between parameter growth and decomposing levels which determines the effective receptive field $k \times 2^{ℓ}$ for a base kernel $k$ and $ℓ$ levels of decomposition, while maintaining constant FLOPs regardless of the ERF expansion. Specifically, RecConv achieves a parameter expansion of only…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

suous/recnext
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Neural Networks and Applications

MethodsBalanced Selection