In-Context Compositional Learning via Sparse Coding Transformer
Wei Chen, Jingxi Yu, Zichen Miao, Qiang Qiu

TL;DR
This paper introduces a sparse coding reformulation of Transformer attention to improve in-context compositional learning, enabling better generalization on tasks requiring understanding of compositional rules.
Contribution
It proposes a novel sparse coding-based attention mechanism that enhances Transformers' ability to learn and generalize compositional rules from context examples.
Findings
Improved performance on S-RAVEN and RAVEN datasets for compositional tasks.
Enhanced generalization in tasks where standard Transformers fail.
Effective representation of compositional structure through learned sparse coefficients.
Abstract
Transformer architectures have achieved remarkable success across language, vision, and multimodal tasks, and there is growing demand for them to address in-context compositional learning tasks. In these tasks, models solve the target problems by inferring compositional rules from context examples, which are composed of basic components structured by underlying rules. However, some of these tasks remain challenging for Transformers, which are not inherently designed to handle compositional tasks and offer limited structural inductive bias. In this work, inspired by the principle of sparse coding, we propose a reformulation of the attention to enhance its capability for compositional tasks. In sparse coding, data are represented as sparse combinations of dictionary atoms with coefficients that capture their compositional rules. Specifically, we reinterpret the attention block as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Advanced Neural Network Applications
