Learning to Perform Complex Tasks through Compositional Fine-Tuning of   Language Models

Victor S. Bursztyn; David Demeter; Doug Downey; Larry Birnbaum

arXiv:2210.12607·cs.CL·October 25, 2022·1 cites

Learning to Perform Complex Tasks through Compositional Fine-Tuning of Language Models

Victor S. Bursztyn, David Demeter, Doug Downey, Larry Birnbaum

PDF

Open Access 1 Repo

TL;DR

This paper introduces compositional fine-tuning (CFT), a method that decomposes complex tasks into components and fine-tunes smaller language models, outperforming end-to-end learning and chain of thought prompting in various domains.

Contribution

CFT is a novel approach that explicitly decomposes tasks and fine-tunes smaller models, enabling effective learning of complex tasks without relying on huge pretrained models.

Findings

01

CFT outperforms end-to-end learning with equal data.

02

CFT performs comparably to chain of thought prompting with much smaller models.

03

CFT is applicable to domains without pretraining data.

Abstract

How to usefully encode compositional task structure has long been a core challenge in AI. Recent work in chain of thought prompting has shown that for very large neural language models (LMs), explicitly demonstrating the inferential steps involved in a target task may improve performance over end-to-end learning that focuses on the target task alone. However, chain of thought prompting has significant limitations due to its dependency on huge pretrained LMs. In this work, we present compositional fine-tuning (CFT): an approach based on explicitly decomposing a target task into component tasks, and then fine-tuning smaller LMs on a curriculum of such component tasks. We apply CFT to recommendation tasks in two domains, world travel and local dining, as well as a previously studied inferential task (sports understanding). We show that CFT outperforms end-to-end learning even with equal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vbursztyn/compositional-fine-tuning
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsEmirates Airlines Office in Dubai