CPUBone: Efficient Vision Backbone Design for Devices with Low Parallelization Capabilities

Moritz Nottebaum; Matteo Dunnhofer; Christian Micheloni

arXiv:2603.26425·cs.CV·March 31, 2026

TL;DR

CPUBone is a vision backbone designed specifically for CPU inference. It balances the number of operations (MACs) against hardware-efficient execution (MACs per second), and the authors report state-of-the-art speed-accuracy trade-offs across diverse CPU devices.

Contribution

The paper introduces CPUBone, a CPU-optimized vision backbone that reduces computational cost through two modifications to standard convolutions, grouping and smaller kernel sizes, while maintaining high hardware efficiency.

Findings

01

CPUBone achieves superior speed-accuracy trade-offs on various CPU devices.

02

Modified convolutions reduce MACs while preserving hardware efficiency.

03

CPUBone transfers efficiency gains effectively to object detection and segmentation tasks.

Abstract

Recent research on vision backbone architectures has predominantly focused on optimizing efficiency for hardware platforms with high parallel processing capabilities. This category increasingly includes embedded systems such as mobile phones and embedded AI accelerator modules. CPUs, in contrast, cannot parallelize operations to the same degree, so models for them benefit from a specific design philosophy that balances the number of operations (MACs) against hardware-efficient execution, measured as MACs per second (MACpS). In pursuit of this, we investigate two modifications to standard convolutions aimed at reducing computational cost: grouping convolutions and reducing kernel sizes. While both adaptations substantially decrease the total number of MACs required for inference, sustaining low latency requires preserving hardware efficiency. Our experiments across…
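The trade-off between MACs and MACpS can be made concrete with the standard MAC count of a 2D convolution. The following is a minimal sketch, not code from the paper or its repository: the function names, the layer shape, and the throughput figures are all hypothetical, and latency is approximated as MACs divided by MACpS.

```python
def conv2d_macs(h_out, w_out, c_in, c_out, kernel_size, groups=1):
    """Multiply-accumulate count of one 2D convolution (bias ignored)."""
    if c_in % groups != 0 or c_out % groups != 0:
        raise ValueError("groups must divide both c_in and c_out")
    # Each output value sums over (c_in / groups) * k * k products.
    return h_out * w_out * c_out * (c_in // groups) * kernel_size ** 2


def latency_seconds(macs, macs_per_second):
    """Approximate latency, assuming latency = MACs / MACpS."""
    return macs / macs_per_second


# Hypothetical layer: 56x56 output, 64 -> 64 channels.
baseline = conv2d_macs(56, 56, 64, 64, kernel_size=3)
grouped = conv2d_macs(56, 56, 64, 64, kernel_size=3, groups=2)
small_kernel = conv2d_macs(56, 56, 64, 64, kernel_size=2)

# Placeholder throughputs, NOT measurements: if grouping halves the MACs
# but the layer also runs at half the MACpS, latency does not improve.
baseline_latency = latency_seconds(baseline, 2e9)
grouped_latency = latency_seconds(grouped, 1e9)

print(baseline, grouped, small_kernel)
print(baseline_latency, grouped_latency)
```

With two groups the MAC count halves, and moving from a 3x3 to a 2x2 kernel scales it by 4/9. The placeholder throughputs illustrate why a MAC reduction only lowers latency if the achieved MACpS does not drop by a similar factor.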

Code & Models

Repositories

altair199797/CPUBone (GitHub)