Orca: Progressive Learning from Complex Explanation Traces of GPT-4
Subhabrata Mukherjee, Arindam Mitra, Ganesh Jawahar, Sahaj Agarwal,, Hamid Palangi, Ahmed Awadallah

TL;DR
Orca is a 13-billion parameter model trained to imitate GPT-4's reasoning process using rich explanation traces, significantly outperforming prior models on complex reasoning benchmarks and approaching ChatGPT's performance.
Contribution
This work introduces Orca, a novel model that learns from complex explanation traces of GPT-4, improving reasoning capabilities beyond existing instruction-tuned models.
Findings
Orca surpasses state-of-the-art models like Vicuna-13B on BBH and AGIEval benchmarks.
Orca achieves parity with ChatGPT on the BBH benchmark.
Orca performs competitively on standardized exams without chain-of-thought prompting.
Abstract
Recent research has focused on enhancing the capability of smaller models through imitation learning, drawing on the outputs generated by large foundation models (LFMs). A number of issues impact the quality of these models, ranging from limited imitation signals from shallow LFM outputs; small scale homogeneous training data; and most notably a lack of rigorous evaluation resulting in overestimating the small model's capability as they tend to learn to imitate the style, but not the reasoning process of LFMs. To address these challenges, we develop Orca (We are working with our legal team to publicly release a diff of the model weights in accordance with LLaMA's release policy to be published at https://aka.ms/orca-lm), a 13-billion parameter model that learns to imitate the reasoning process of LFMs. Orca learns from rich signals from GPT-4 including explanation traces; step-by-step…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗yanolja/YanoljaNEXT-EEVE-Instruct-10.8Bmodel· 3.6k dl· ♡ 1653.6k dl♡ 165
- 🤗teddylee777/EEVE-Korean-Instruct-10.8B-v1.0-ggufmodel· 811 dl· ♡ 23811 dl♡ 23
- 🤗pankajmathur/orca_alpaca_3bmodel· 16 dl· ♡ 1216 dl♡ 12
- 🤗pankajmathur/orca_dolly_3bmodel· 8 dl· ♡ 38 dl♡ 3
- 🤗pankajmathur/orca_mini_13bmodel· 107 dl· ♡ 100107 dl♡ 100
- 🤗pankajmathur/orca_mini_3bmodel· 721 dl· ♡ 165721 dl♡ 165
- 🤗pankajmathur/orca_mini_7bmodel· 87 dl· ♡ 1887 dl♡ 18
- 🤗TheBloke/orca_mini_13B-GPTQmodel· 798 dl· ♡ 44798 dl♡ 44
- 🤗TheBloke/orca_mini_13B-GGMLmodel· ♡ 56♡ 56
- 🤗TheBloke/orca_mini_7B-GPTQmodel· 20 dl· ♡ 1220 dl♡ 12
Videos
State of AI 2023: Highlights of 163 Page Report + Eureka Self-Improvement, MEG, Suno AI and GPT F· youtube
Orca: The Model Few Saw Coming· youtube
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Topic Modeling
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Label Smoothing · Adam · Absolute Position Encodings · Residual Connection · Dropout · Position-Wise Feed-Forward Layer · Layer Normalization
