In-Context Semi-Supervised Learning
Jiashuo Fan, Paul Rosu, Aaron T. Wang, Zeyu Michael Li, Lawrence Carin, and Xiang Cheng

TL;DR
This paper introduces in-context semi-supervised learning (IC-SSL), demonstrating how Transformers can leverage unlabeled data alongside few labels to improve representation learning and prediction accuracy in low-label scenarios.
Contribution
It presents the concept of IC-SSL, showing how Transformers utilize unlabeled context for robust, context-dependent representations, advancing understanding of in-context learning with limited labels.
Findings
Transformers effectively leverage unlabeled context for learning.
IC-SSL improves performance in low-label regimes.
Provides foundational insights into unlabeled data exploitation.
Abstract
There has been significant recent interest in understanding the capacity of Transformers for in-context learning (ICL), yet most theory focuses on supervised settings with explicitly labeled pairs. In practice, Transformers often perform well even when labels are sparse or absent, suggesting crucial structure within unlabeled contextual demonstrations. We introduce and study in-context semi-supervised learning (IC-SSL), where a small set of labeled examples is accompanied by many unlabeled points, and show that Transformers can leverage the unlabeled context to learn a robust, context-dependent representation. This representation enables accurate predictions and markedly improves performance in low-label regimes, offering foundational insights into how Transformers exploit unlabeled context for representation learning within the ICL framework.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Topic Modeling
