Did You Read the Instructions? Rethinking the Effectiveness of Task   Definitions in Instruction Learning

Fan Yin; Jesse Vig; Philippe Laban; Shafiq Joty; Caiming Xiong,; Chien-Sheng Jason Wu

arXiv:2306.01150·cs.CL·June 5, 2023·1 cites

Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning

Fan Yin, Jesse Vig, Philippe Laban, Shafiq Joty, Caiming Xiong,, Chien-Sheng Jason Wu

PDF

Open Access 1 Repo

TL;DR

This paper investigates how task definitions influence instruction learning in large language models, revealing that minimal, key information suffices and proposing strategies to enhance model understanding and performance.

Contribution

It introduces an automatic method to compress task definitions and proposes two strategies to improve instruction comprehension in LLMs, leading to significant performance gains.

Findings

01

Model performance drops mainly when removing task output information.

02

60% of task definition tokens can be removed without performance loss.

03

Structured key information and meta-tuning improve instruction following by 4.2 Rouge-L.

Abstract

Large language models (LLMs) have shown impressive performance in following natural language instructions to solve unseen tasks. However, it remains unclear whether models truly understand task definitions and whether the human-written definitions are optimal. In this paper, we systematically study the role of task definitions in instruction learning. We first conduct an ablation analysis informed by human annotations to understand which parts of a task definition are most important, and find that model performance only drops substantially when removing contents describing the task output, in particular label information. Next, we propose an automatic algorithm to compress task definitions to a minimal supporting set of tokens, and find that 60\% of tokens can be removed while maintaining or even improving model performance. Based on these results, we propose two strategies to help…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fanyin3639/rethinking-instruction-effectiveness
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsTest