LowFormer: Hardware Efficient Design for Convolutional Transformer Backbones
Moritz Nottebaum, Matteo Dunnhofer, Christian Micheloni

TL;DR
LowFormer is a new convolutional transformer backbone that emphasizes real-world efficiency by optimizing throughput and latency, achieving faster performance with comparable or better accuracy across various hardware platforms.
Contribution
The paper introduces LowFormer, a hardware-efficient backbone design that combines macro and micro architectural optimizations, including a slimmed-down MultiHead Self-Attention, validated across multiple hardware types.
Findings
LowFormer significantly improves throughput and latency.
It maintains or surpasses state-of-the-art accuracy.
Effective for object detection and semantic segmentation.
Abstract
Research in efficient vision backbones is evolving into models that are a mixture of convolutions and transformer blocks. A smart combination of both, architecture-wise and component-wise is mandatory to excel in the speedaccuracy trade-off. Most publications focus on maximizing accuracy and utilize MACs (multiply accumulate operations) as an efficiency metric. The latter however often do not measure accurately how fast a model actually is due to factors like memory access cost and degree of parallelism. We analyzed common modules and architectural design choices for backbones not in terms of MACs, but rather in actual throughput and latency, as the combination of the latter two is a better representation of the efficiency of models in real applications. We applied the conclusions taken from that analysis to create a recipe for increasing hardware-efficiency in macro design.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnalog and Mixed-Signal Circuit Design · Magnetic Properties and Applications · Photovoltaic System Optimization Techniques
MethodsFocus
