Compact Vision Transformer by Reduction of Kernel Complexity

Yancheng Wang; Yingzhen Yang

arXiv:2507.12780·cs.CV·July 18, 2025

Compact Vision Transformer by Reduction of Kernel Complexity

Yancheng Wang, Yingzhen Yang

PDF

Open Access

TL;DR

This paper introduces KCR-Transformer, a compact vision transformer with channel selection guided by a novel theoretical generalization bound, reducing computational cost while maintaining or improving accuracy.

Contribution

The work proposes a new KCR-Transformer block with differentiable channel selection and provides a rigorous generalization bound, enabling effective channel pruning in vision transformers.

Findings

01

Reduces FLOPs of vision transformers significantly.

02

Maintains or improves prediction accuracy after pruning.

03

Achieves superior performance on various vision tasks.

Abstract

Self-attention and transformer architectures have become foundational components in modern deep learning. Recent efforts have integrated transformer blocks into compact neural architectures for computer vision, giving rise to various efficient vision transformers. In this work, we introduce Transformer with Kernel Complexity Reduction, or KCR-Transformer, a compact transformer block equipped with differentiable channel selection, guided by a novel and sharp theoretical generalization bound. KCR-Transformer performs input/output channel selection in the MLP layers of transformer blocks to reduce the computational cost. Furthermore, we provide a rigorous theoretical analysis establishing a tight generalization bound for networks equipped with KCR-Transformer blocks. Leveraging such strong theoretical results, the channel pruning by KCR-Transformer is conducted in a generalization-aware…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCCD and CMOS Imaging Sensors · Infrared Target Detection Methodologies · Image Processing Techniques and Applications