Investigating the Learning Behaviour of In-context Learning: A Comparison with Supervised Learning
Xindi Wang, Yufei Wang, Can Xu, Xiubo Geng, Bowen Zhang, Chongyang, Tao, Frank Rudzicz, Robert E. Mercer, Daxin Jiang

TL;DR
This paper compares in-context learning (ICL) and supervised learning (SL) in large language models, revealing ICL's robustness to label noise and its performance scaling with model size, advancing understanding of ICL's learning behavior.
Contribution
It provides empirical insights into how ICL differs from SL in handling label perturbations and how model size influences ICL performance, contributing to understanding ICL's learning dynamics.
Findings
Gold labels significantly impact ICL performance, especially in large models.
ICL is less sensitive to label noise and imbalance than SL.
ICL performance approaches SL as model size increases.
Abstract
Large language models (LLMs) have shown remarkable capacity for in-context learning (ICL), where learning a new task from just a few training examples is done without being explicitly pre-trained. However, despite the success of LLMs, there has been little understanding of how ICL learns the knowledge from the given prompts. In this paper, to make progress toward understanding the learning behaviour of ICL, we train the same LLMs with the same demonstration examples via ICL and supervised learning (SL), respectively, and investigate their performance under label perturbations (i.e., noisy labels and label imbalance) on a range of classification tasks. First, via extensive experiments, we find that gold labels have significant impacts on the downstream in-context performance, especially for large language models; however, imbalanced labels matter little to ICL across all model sizes.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
