PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context
Maximilian Augustin, Syed Shakib Sarwar, Mostafa Elhoushi, Sai Qian Zhang, Yuecheng Li, Barbara De Salvo

TL;DR
This paper introduces PETAH, a method for efficient task adaptation in hybrid transformer models for resource-limited vision applications, combining pruning and adaptation to outperform existing techniques.
Contribution
The paper presents PETAH, a novel task adaptation approach for hybrid transformers that improves performance and efficiency in resource-constrained environments.
Findings
PETAH outperforms existing task-adaptation methods for ViTs.
PETAH models require fewer parameters and are more hardware-efficient.
Combining PETAH with pruning yields highly compact, multi-task models.
Abstract
Following their success in natural language processing (NLP), there has been a shift towards transformer models in computer vision. While transformers perform well and offer promising multi-tasking performance, due to their high compute requirements, many resource-constrained applications still rely on convolutional or hybrid models that combine the benefits of convolution and attention layers and achieve the best results in the sub 100M parameter range. Simultaneously, task adaptation techniques that allow for the use of one shared transformer backbone for multiple downstream tasks, resulting in great storage savings at negligible cost in performance, have not yet been adopted for hybrid transformers. In this work, we investigate how to achieve the best task-adaptation performance and introduce PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers. We further combine PETAH…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Parallel Computing and Optimization Techniques · Ferroelectric and Negative Capacitance Devices
MethodsSoftmax · Attention Is All You Need · Convolution · Pruning
