What Makes Instruction Learning Hard? An Investigation and a New Challenge in a Synthetic Environment
Matthew Finlayson, Kyle Richardson, Ashish Sabharwal, Peter Clark

TL;DR
This paper investigates the challenges of instruction learning in large transformer models using a synthetic environment focused on regular expression tasks, revealing difficulties with complex instructions and long context tracking.
Contribution
It introduces Hard RegSet, a new challenging dataset for instruction learning, and provides insights into the properties that make instruction learning difficult for models.
Findings
Models struggle with large regular languages and less precise instructions.
Longer context tracking increases difficulty in instruction execution.
Fine-tuned models achieve only 65.6% accuracy on Hard RegSet.
Abstract
The instruction learning paradigm -- where a model learns to perform new tasks from task descriptions alone -- has become popular in general-purpose model research. The capabilities of large transformer models as instruction learners, however, remain poorly understood. We use a controlled synthetic environment to characterize such capabilities. Specifically, we use the task of deciding whether a given string matches a regular expression (viewed as an instruction) to identify properties of tasks, instructions, and instances that make instruction learning challenging. For instance, we find that our model, a fine-tuned T5-based text2text transformer, struggles with large regular languages, suggesting that less precise instructions are challenging for models. Additionally, instruction executions that require tracking longer contexts of prior steps are also more difficult. We use our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Neural Network Applications · Multimodal Machine Learning Applications
