Divide-or-Conquer? Which Part Should You Distill Your LLM?
Zhuofeng Wu, He Bai, Aonan Zhang, Jiatao Gu, VG Vinod Vydiswaran,, Navdeep Jaitly, Yizhe Zhang

TL;DR
This paper explores a divide-and-conquer approach to LLM reasoning, distilling problem decomposition and solving phases separately, and finds that decomposition distillation is effective for generalization and cost-efficient inference.
Contribution
It introduces a method to distill problem decomposition and solving capabilities separately, demonstrating the effectiveness of decomposition distillation for reasoning tasks.
Findings
Distilling problem decomposition improves generalization across tasks.
Distilling problem solving is challenging and reduces performance.
Combining distilled decomposition with large LLMs enables cost-effective reasoning.
Abstract
Recent methods have demonstrated that Large Language Models (LLMs) can solve reasoning tasks better when they are encouraged to solve subtasks of the main task first. In this paper we devise a similar strategy that breaks down reasoning tasks into a problem decomposition phase and a problem solving phase and show that the strategy is able to outperform a single stage solution. Further, we hypothesize that the decomposition should be easier to distill into a smaller model compared to the problem solving because the latter requires large amounts of domain knowledge while the former only requires learning general problem solving strategies. We propose methods to distill these two capabilities and evaluate their impact on reasoning outcomes and inference cost. We find that we can distill the problem decomposition phase and at the same time achieve good generalization across tasks, datasets,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsArtificial Intelligence in Law · Legal Education and Practice Innovations · Law, AI, and Intellectual Property
