Investigating the Zone of Proximal Development of Language Models for In-Context Learning
Peng Cui, Mrinmaya Sachan

TL;DR
This paper introduces a novel framework based on educational psychology to analyze and improve in-context learning in large language models by measuring their Zone of Proximal Development, leading to more efficient inference and fine-tuning strategies.
Contribution
It adapts the ZPD concept and item response theory to LLMs, providing new insights and methods for optimizing in-context learning and fine-tuning processes.
Findings
Predicting ZPD improves ICL efficiency by selecting beneficial demonstrations.
A human-like curriculum based on ZPD enhances fine-tuning performance.
Framework reveals complex behaviors of in-context learning in LLMs.
Abstract
In this paper, we introduce a learning analytics framework to analyze the in-context learning (ICL) behavior of large language models (LLMs) through the lens of the Zone of Proximal Development (ZPD), an established theory in educational psychology. ZPD delineates the space between what a learner is capable of doing unsupported and what the learner cannot do even with support. We adapt this concept to ICL, measuring the ZPD of LLMs based on model performance on individual examples with and without ICL. Furthermore, we propose an item response theory (IRT) model to predict the distribution of zones for LLMs. Our findings reveal a series of intricate and multifaceted behaviors of ICL, providing new insights into understanding and leveraging this technique. Finally, we demonstrate how our framework can enhance LLM in both inference and fine-tuning scenarios: (1) By predicting a model's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Speech and dialogue systems · Natural Language Processing Techniques
