Orca 2: Teaching Small Language Models How to Reason

Arindam Mitra; Luciano Del Corro; Shweti Mahajan; Andres Codas,; Clarisse Simoes; Sahaj Agarwal; Xuxi Chen; Anastasia Razdaibiedina; Erik; Jones; Kriti Aggarwal; Hamid Palangi; Guoqing Zheng; Corby Rosset; Hamed; Khanpour; Ahmed Awadallah

arXiv:2311.11045·cs.AI·November 23, 2023·30 cites

Orca 2: Teaching Small Language Models How to Reason

Arindam Mitra, Luciano Del Corro, Shweti Mahajan, Andres Codas,, Clarisse Simoes, Sahaj Agarwal, Xuxi Chen, Anastasia Razdaibiedina, Erik, Jones, Kriti Aggarwal, Hamid Palangi, Guoqing Zheng, Corby Rosset, Hamed, Khanpour, Ahmed Awadallah

PDF

Open Access 10 Models 3 Datasets

TL;DR

Orca 2 advances small language models by teaching diverse reasoning strategies and task-specific solution selection, significantly improving their performance on complex reasoning benchmarks without relying solely on imitation learning.

Contribution

It introduces a training approach that enables small LMs to learn multiple reasoning techniques and select the most effective one per task, surpassing similar-sized models.

Findings

01

Outperforms models of similar size on complex reasoning benchmarks.

02

Achieves performance comparable to much larger models.

03

Supports research with publicly available Orca 2 weights.

Abstract

Orca 1 learns from rich signals, such as explanation traces, allowing it to outperform conventional instruction-tuned models on benchmarks like BigBench Hard and AGIEval. In Orca 2, we continue exploring how improved training signals can enhance smaller LMs' reasoning abilities. Research on training small LMs has often relied on imitation learning to replicate the output of more capable models. We contend that excessive emphasis on imitation may restrict the potential of smaller models. We seek to teach small LMs to employ different solution strategies for different tasks, potentially different from the one used by the larger model. For example, while larger models might provide a direct answer to a complex task, smaller models may not have the same capacity. In Orca 2, we teach the model various reasoning techniques (step-by-step, recall then generate, recall-reason-generate, direct…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)

MethodsSparse Evolutionary Training