Robustness of Learning from Task Instructions

Jiasheng Gu; Hongyu Zhao; Hanzi Xu; Liangyu Nie; Hongyuan Mei and; Wenpeng Yin

arXiv:2212.03813·cs.CL·May 24, 2023·1 cites

Robustness of Learning from Task Instructions

Jiasheng Gu, Hongyu Zhao, Hanzi Xu, Liangyu Nie, Hongyuan Mei and, Wenpeng Yin

PDF

Open Access 1 Repo

TL;DR

This paper investigates how robust language models are when learning from task instructions that vary in form, length, and expression, aiming to improve task generalization without extensive task-specific data.

Contribution

It is the first systematic study of language model robustness to instruction variability, addressing a key challenge in instruction-based task generalization.

Findings

01

Models handle paraphrased instructions with some robustness.

02

Instruction length variability affects model performance.

03

Robustness varies depending on instruction manipulation methods.

Abstract

Traditional supervised learning mostly works on individual tasks and requires training on a large set of task-specific examples. This paradigm seriously hinders the development of task generalization since preparing a task-specific example set is costly. To build a system that can quickly and easily generalize to new tasks, task instructions have been adopted as an emerging trend of supervision recently. These instructions give the model the definition of the task and allow the model to output the appropriate answer based on the instructions and inputs. However, task instructions are often expressed in different forms, which can be interpreted from two threads: first, some instructions are short sentences and are pretrained language model (PLM) oriented, such as prompts, while other instructions are paragraphs and are human-oriented, such as those in Amazon MTurk; second, different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jiashenggu/tk-instruct
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications