Super-Prompting: Utilizing Model-Independent Contextual Data to Reduce Data Annotation Required in Visual Commonsense Tasks
Navid Rezaei, Marek Z. Reformat

TL;DR
This paper introduces prompt-based fine-tuning techniques for smaller transformer models to reduce data annotation needs in visual commonsense reasoning, achieving comparable results with less training data and saving resources.
Contribution
It presents a model-agnostic prompt-based fine-tuning approach that significantly reduces data requirements for visual commonsense tasks across language and multimodal transformers.
Findings
Achieves comparable performance with only 35-40% of training data
Reduces time and financial costs of data annotation
Applicable to various transformer models with minimal changes
Abstract
Pre-trained language models have shown excellent results in few-shot learning scenarios using in-context learning. Although it is impressive, the size of language models can be prohibitive to make them usable in on-device applications, such as sensors or smartphones. With smaller language models, task-specific data annotation is needed to fine-tune the language model for a specific purpose. However, data annotation can have a substantial financial and time burden for small research groups, startups, and even companies. In this paper, we analyze different prompt-based fine-tuning techniques to improve results on both language and multimodal causal transformer models. To evaluate our results, we use a dataset focusing on visual commonsense reasoning in time. Our results show that by simple model-agnostic prompt-based fine-tuning, comparable results can be reached by only using 35%-40% of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
