Memory-assisted prompt editing to improve GPT-3 after deployment

Aman Madaan; Niket Tandon; Peter Clark; Yiming Yang

arXiv:2201.06009·cs.CL·February 21, 2023

Memory-assisted prompt editing to improve GPT-3 after deployment

Aman Madaan, Niket Tandon, Peter Clark, Yiming Yang

PDF

1 Repo 2 Videos

TL;DR

This paper introduces a memory-assisted prompt editing method that enables large language models like GPT-3 to improve their accuracy through user feedback without retraining, by leveraging a growing case memory for error correction.

Contribution

It presents a novel approach combining memory and user feedback to enhance GPT-3's performance post-deployment without costly retraining.

Findings

01

Significant accuracy improvements on lexical and ethical reasoning tasks

02

Effective correction of misunderstandings through simulated user interactions

03

Demonstrates low-cost utility enhancement for large pre-trained language models

Abstract

Large LMs such as GPT-3 are powerful, but can commit mistakes that are obvious to humans. For example, GPT-3 would mistakenly interpret "What word is similar to good?" to mean a homophone, while the user intended a synonym. Our goal is to effectively correct such errors via user interactions with the system but without retraining, which will be prohibitively costly. We pair GPT-3 with a growing memory of recorded cases where the model misunderstood the user's intents, along with user feedback for clarification. Such a memory allows our system to produce enhanced prompts for any new query based on the user feedback for error correction on similar cases in the past. On four tasks (two lexical tasks, two advanced ethical reasoning tasks), we show how a (simulated) user can interactively teach a deployed GPT-3, substantially increasing its accuracy over the queries with different kinds of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

madaan/memprompt
pytorchOfficial

Videos

Author Interview - Memory-assisted prompt editing to improve GPT-3 after deployment· youtube

Memory-assisted prompt editing to improve GPT-3 after deployment (Machine Learning Paper Explained)· youtube

Taxonomy

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Attention Dropout · Layer Normalization · Residual Connection · Adam · Dropout · Weight Decay