Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Spatiotemporal Modeling
Wele Gedara Chaminda Bandara, Vishal M. Patel

TL;DR
Attention Prompt Tuning (APT) offers a parameter-efficient method for adapting pre-trained models to video action recognition by injecting prompts into attention mechanisms, significantly reducing computational costs while improving performance.
Contribution
We propose a novel APT method that injects prompts into attention mechanisms and introduces a reparameterization technique, enhancing efficiency and robustness for spatiotemporal video modeling.
Findings
Reduces FLOPs and latency in video action recognition tasks.
Achieves superior performance compared to existing prompt tuning methods.
Effective on multiple benchmark datasets.
Abstract
In this paper, we introduce Attention Prompt Tuning (APT) - a computationally efficient variant of prompt tuning for video-based applications such as action recognition. Prompt tuning approaches involve injecting a set of learnable prompts along with data tokens during fine-tuning while keeping the backbone frozen. This approach greatly reduces the number of learnable parameters compared to full tuning. For image-based downstream tasks, normally a couple of learnable prompts achieve results close to those of full tuning. However, videos, which contain more complex spatiotemporal information, require hundreds of tunable prompts to achieve reasonably good results. This reduces the parameter efficiency observed in images and significantly increases latency and the number of floating-point operations (FLOPs) during inference. To tackle these issues, we directly inject the prompts into the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Traffic Prediction and Management Techniques · Scientific Computing and Data Management
MethodsSparse Evolutionary Training
