DynaMaR: Dynamic Prompt with Mask Token Representation
Xiaodi Sun, Sunny Rajagopalan, Priyanka Nigam, Weiyi Lu, Yi Xu,, Belinda Zeng, Trishul Chilimbi

TL;DR
DynaMaR introduces a dynamic prompt method with mask token representation that enhances fine-tuning of large language models, reducing overfitting and manual effort, leading to significant performance improvements in various tasks.
Contribution
The paper presents DynaMaR, a novel dynamic prompt approach that addresses overfitting and manual prompt design issues in prompt-based fine-tuning.
Findings
10% average improvement in few-shot settings
3.7% improvement in data-rich settings
Effective across four e-commerce applications
Abstract
Recent research has shown that large language models pretrained using unsupervised approaches can achieve significant performance improvement on many downstream tasks. Typically when adapting these language models to downstream tasks, like a classification or regression task, we employ a fine-tuning paradigm in which the sentence representation from the language model is input to a task-specific head; the model is then fine-tuned end-to-end. However, with the emergence of models like GPT-3, prompt-based fine-tuning has been proven to be a successful approach for few-shot tasks. Inspired by this work, we study discrete prompt technologies in practice. There are two issues that arise with the standard prompt approach. First, it can overfit on the prompt template. Second, it requires manual effort to formulate the downstream task as a language model problem. In this paper, we propose an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
Methods{Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Multi-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Adam · Refunds@Expedia|||How do I get a full refund from Expedia? · Residual Connection · Dropout
