Learning to Perform Complex Tasks through Compositional Fine-Tuning of Language Models
Victor S. Bursztyn, David Demeter, Doug Downey, Larry Birnbaum

TL;DR
This paper introduces compositional fine-tuning (CFT), a method that decomposes complex tasks into components and fine-tunes smaller language models, outperforming end-to-end learning and chain of thought prompting in various domains.
Contribution
CFT is a novel approach that explicitly decomposes tasks and fine-tunes smaller models, enabling effective learning of complex tasks without relying on huge pretrained models.
Findings
CFT outperforms end-to-end learning with equal data.
CFT performs comparably to chain of thought prompting with much smaller models.
CFT is applicable to domains without pretraining data.
Abstract
How to usefully encode compositional task structure has long been a core challenge in AI. Recent work in chain of thought prompting has shown that for very large neural language models (LMs), explicitly demonstrating the inferential steps involved in a target task may improve performance over end-to-end learning that focuses on the target task alone. However, chain of thought prompting has significant limitations due to its dependency on huge pretrained LMs. In this work, we present compositional fine-tuning (CFT): an approach based on explicitly decomposing a target task into component tasks, and then fine-tuning smaller LMs on a curriculum of such component tasks. We apply CFT to recommendation tasks in two domains, world travel and local dining, as well as a previously studied inferential task (sports understanding). We show that CFT outperforms end-to-end learning even with equal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsEmirates Airlines Office in Dubai
