Divide-or-Conquer? Which Part Should You Distill Your LLM?

Zhuofeng Wu; He Bai; Aonan Zhang; Jiatao Gu; VG Vinod Vydiswaran; Navdeep Jaitly; Yizhe Zhang

arXiv:2402.15000 · cs.CL · November 20, 2024 · 2 cites

TL;DR

This paper explores a divide-and-conquer approach to LLM reasoning in which the problem-decomposition and problem-solving phases are distilled separately. It finds that distilling decomposition generalizes well and enables cost-efficient inference.

Contribution

It introduces a method to distill problem decomposition and solving capabilities separately, demonstrating the effectiveness of decomposition distillation for reasoning tasks.

Findings

01. Distilling problem decomposition improves generalization across tasks.

02. Distilling problem solving is harder and reduces performance.

03. Pairing a distilled decomposition model with a large LLM for solving enables cost-effective reasoning.

Abstract

Recent methods have demonstrated that Large Language Models (LLMs) can solve reasoning tasks better when they are encouraged to solve subtasks of the main task first. In this paper, we devise a similar strategy that breaks down reasoning tasks into a problem-decomposition phase and a problem-solving phase, and show that the strategy is able to outperform a single-stage solution. Further, we hypothesize that decomposition should be easier to distill into a smaller model than problem solving, because the latter requires large amounts of domain knowledge while the former only requires learning general problem-solving strategies. We propose methods to distill these two capabilities and evaluate their impact on reasoning outcomes and inference cost. We find that we can distill the problem decomposition phase and at the same time achieve good generalization across tasks, datasets,…
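The two-phase strategy described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the `LLM` callable type, the prompt wording, and the function names `decompose`, `solve`, and `two_stage_answer` are all hypothetical, and the stub models stand in for real model calls.

```python
from typing import Callable, List

# Hypothetical interface: any function that maps a prompt string to a completion string.
LLM = Callable[[str], str]


def decompose(question: str, decomposer: LLM) -> List[str]:
    """Phase 1: ask a (possibly small, distilled) model for subquestions, one per line."""
    prompt = (
        "Break the question into simpler subquestions, one per line.\n"
        f"Question: {question}"
    )
    raw = decomposer(prompt)
    # Drop blank lines and surrounding whitespace from the model output.
    return [line.strip() for line in raw.splitlines() if line.strip()]


def solve(question: str, subquestions: List[str], solver: LLM) -> str:
    """Phase 2: ask the solver model to answer, guided by the subquestions."""
    hints = "\n".join(f"- {s}" for s in subquestions)
    prompt = (
        f"Question: {question}\n"
        f"Answer these subquestions first:\n{hints}\n"
        "Final answer:"
    )
    return solver(prompt).strip()


def two_stage_answer(question: str, decomposer: LLM, solver: LLM) -> str:
    """Run decomposition, then solving; the two models may differ in size."""
    return solve(question, decompose(question, decomposer), solver)


if __name__ == "__main__":
    # Stub models (assumptions for illustration only; replace with real model calls).
    def stub_decomposer(prompt: str) -> str:
        return "How many apples were there at the start?\nHow many were eaten?"

    def stub_solver(prompt: str) -> str:
        return " 3 "

    print(two_stage_answer("Tom had 5 apples and ate 2. How many are left?",
                           stub_decomposer, stub_solver))
```

Keeping the decomposer and solver behind the same callable interface makes it straightforward to swap a small distilled model into either phase independently and compare the resulting accuracy and inference cost.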

Code & Models

Repositories

apple/ml-divide-or-conquer (Official)

Videos

Divide-or-Conquer? Which Part Should You Distill Your LLM? · Underline

Taxonomy

Topics: Artificial Intelligence in Law · Legal Education and Practice Innovations · Law, AI, and Intellectual Property