Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models
Jiaao Chen, Xiaoman Pan, Dian Yu, Kaiqiang Song, Xiaoyang Wang, Dong, Yu, Jianshu Chen

TL;DR
This paper introduces skills-in-context (SKiC), a prompt structure that significantly enhances large language models' ability to perform compositional reasoning and generalize across tasks by demonstrating foundational skills within the same prompt.
Contribution
The paper proposes SKiC, a novel in-context learning framework that improves compositional generalization and latent skill utilization in large language models.
Findings
SKiC enables near-perfect systematic generalization with minimal exemplars.
It unlocks the latent potential of pre-trained skills in LLMs.
Fine-tuning with SKiC data enhances zero-shot problem-solving capabilities.
Abstract
We investigate how to elicit compositional generalization capabilities in large language models (LLMs). Compositional generalization empowers LLMs to solve complex problems by combining foundational skills, a critical reasoning ability akin to human intelligence. However, even the most advanced LLMs currently struggle with this form of reasoning. We examine this problem within the framework of in-context learning and find that demonstrating both foundational skills and compositional examples grounded in these skills within the same prompt context is crucial. We refer to this prompt structure as skills-in-context (SKiC). With as few as two exemplars, this in-context learning structure enables LLMs to tackle more challenging problems requiring innovative skill combinations, achieving near-perfect systematic generalization across a broad range of tasks. Intriguingly, SKiC also unlocks the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
