PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context

Maximilian Augustin; Syed Shakib Sarwar; Mostafa Elhoushi; Sai Qian Zhang; Yuecheng Li; Barbara De Salvo

arXiv:2410.17661·cs.AI·October 2, 2025

PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers in a resource-limited Context

Maximilian Augustin, Syed Shakib Sarwar, Mostafa Elhoushi, Sai Qian Zhang, Yuecheng Li, Barbara De Salvo

PDF

Open Access

TL;DR

This paper introduces PETAH, a method for efficient task adaptation in hybrid transformer models for resource-limited vision applications, combining pruning and adaptation to outperform existing techniques.

Contribution

The paper presents PETAH, a novel task adaptation approach for hybrid transformers that improves performance and efficiency in resource-constrained environments.

Findings

01

PETAH outperforms existing task-adaptation methods for ViTs.

02

PETAH models require fewer parameters and are more hardware-efficient.

03

Combining PETAH with pruning yields highly compact, multi-task models.

Abstract

Following their success in natural language processing (NLP), there has been a shift towards transformer models in computer vision. While transformers perform well and offer promising multi-tasking performance, due to their high compute requirements, many resource-constrained applications still rely on convolutional or hybrid models that combine the benefits of convolution and attention layers and achieve the best results in the sub 100M parameter range. Simultaneously, task adaptation techniques that allow for the use of one shared transformer backbone for multiple downstream tasks, resulting in great storage savings at negligible cost in performance, have not yet been adopted for hybrid transformers. In this work, we investigate how to achieve the best task-adaptation performance and introduce PETAH: Parameter Efficient Task Adaptation for Hybrid Transformers. We further combine PETAH…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Parallel Computing and Optimization Techniques · Ferroelectric and Negative Capacitance Devices

MethodsSoftmax · Attention Is All You Need · Convolution · Pruning