Orca 2: Teaching Small Language Models How to Reason
Arindam Mitra, Luciano Del Corro, Shweti Mahajan, Andres Codas,, Clarisse Simoes, Sahaj Agarwal, Xuxi Chen, Anastasia Razdaibiedina, Erik, Jones, Kriti Aggarwal, Hamid Palangi, Guoqing Zheng, Corby Rosset, Hamed, Khanpour, Ahmed Awadallah

TL;DR
Orca 2 advances small language models by teaching diverse reasoning strategies and task-specific solution selection, significantly improving their performance on complex reasoning benchmarks without relying solely on imitation learning.
Contribution
It introduces a training approach that enables small LMs to learn multiple reasoning techniques and select the most effective one per task, surpassing similar-sized models.
Findings
Outperforms models of similar size on complex reasoning benchmarks.
Achieves performance comparable to much larger models.
Supports research with publicly available Orca 2 weights.
Abstract
Orca 1 learns from rich signals, such as explanation traces, allowing it to outperform conventional instruction-tuned models on benchmarks like BigBench Hard and AGIEval. In Orca 2, we continue exploring how improved training signals can enhance smaller LMs' reasoning abilities. Research on training small LMs has often relied on imitation learning to replicate the output of more capable models. We contend that excessive emphasis on imitation may restrict the potential of smaller models. We seek to teach small LMs to employ different solution strategies for different tasks, potentially different from the one used by the larger model. For example, while larger models might provide a direct answer to a complex task, smaller models may not have the same capacity. In Orca 2, we teach the model various reasoning techniques (step-by-step, recall then generate, recall-reason-generate, direct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗microsoft/Orca-2-7bmodel· 1.2k dl· ♡ 2231.2k dl♡ 223
- 🤗microsoft/Orca-2-13bmodel· 3.2k dl· ♡ 6663.2k dl♡ 666
- 🤗LoneStriker/Orca-2-13b-4.0bpw-h6-exl2model· 2 dl· ♡ 12 dl♡ 1
- 🤗LoneStriker/Orca-2-13b-6.0bpw-h6-exl2model· 2 dl2 dl
- 🤗LoneStriker/Orca-2-13b-8.0bpw-h8-exl2model· 3 dl· ♡ 33 dl♡ 3
- 🤗LoneStriker/Orca-2-13b-3.0bpw-h6-exl2model· 5 dl5 dl
- 🤗LoneStriker/Orca-2-13b-5.0bpw-h6-exl2model· 3 dl· ♡ 13 dl♡ 1
- 🤗LoneStriker/Orca-2-7b-4.0bpw-h6-exl2model· 5 dl5 dl
- 🤗LoneStriker/Orca-2-7b-6.0bpw-h6-exl2model· 2 dl2 dl
- 🤗LoneStriker/Orca-2-7b-8.0bpw-h8-exl2model· 2 dl2 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)
MethodsSparse Evolutionary Training
