Super-Prompting: Utilizing Model-Independent Contextual Data to Reduce   Data Annotation Required in Visual Commonsense Tasks

Navid Rezaei; Marek Z. Reformat

arXiv:2204.11922·cs.CL·April 27, 2022

Super-Prompting: Utilizing Model-Independent Contextual Data to Reduce Data Annotation Required in Visual Commonsense Tasks

Navid Rezaei, Marek Z. Reformat

PDF

Open Access

TL;DR

This paper introduces prompt-based fine-tuning techniques for smaller transformer models to reduce data annotation needs in visual commonsense reasoning, achieving comparable results with less training data and saving resources.

Contribution

It presents a model-agnostic prompt-based fine-tuning approach that significantly reduces data requirements for visual commonsense tasks across language and multimodal transformers.

Findings

01

Achieves comparable performance with only 35-40% of training data

02

Reduces time and financial costs of data annotation

03

Applicable to various transformer models with minimal changes

Abstract

Pre-trained language models have shown excellent results in few-shot learning scenarios using in-context learning. Although it is impressive, the size of language models can be prohibitive to make them usable in on-device applications, such as sensors or smartphones. With smaller language models, task-specific data annotation is needed to fine-tune the language model for a specific purpose. However, data annotation can have a substantial financial and time burden for small research groups, startups, and even companies. In this paper, we analyze different prompt-based fine-tuning techniques to improve results on both language and multimodal causal transformer models. To evaluate our results, we use a dataset focusing on visual commonsense reasoning in time. Our results show that by simple model-agnostic prompt-based fine-tuning, comparable results can be reached by only using 35%-40% of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques