Orca: Progressive Learning from Complex Explanation Traces of GPT-4

Subhabrata Mukherjee; Arindam Mitra; Ganesh Jawahar; Sahaj Agarwal,; Hamid Palangi; Ahmed Awadallah

arXiv:2306.02707·cs.CL·June 6, 2023·67 cites

Orca: Progressive Learning from Complex Explanation Traces of GPT-4

Subhabrata Mukherjee, Arindam Mitra, Ganesh Jawahar, Sahaj Agarwal,, Hamid Palangi, Ahmed Awadallah

PDF

Open Access 4 Repos 10 Models 5 Datasets 2 Videos

TL;DR

Orca is a 13-billion parameter model trained to imitate GPT-4's reasoning process using rich explanation traces, significantly outperforming prior models on complex reasoning benchmarks and approaching ChatGPT's performance.

Contribution

This work introduces Orca, a novel model that learns from complex explanation traces of GPT-4, improving reasoning capabilities beyond existing instruction-tuned models.

Findings

01

Orca surpasses state-of-the-art models like Vicuna-13B on BBH and AGIEval benchmarks.

02

Orca achieves parity with ChatGPT on the BBH benchmark.

03

Orca performs competitively on standardized exams without chain-of-thought prompting.

Abstract

Recent research has focused on enhancing the capability of smaller models through imitation learning, drawing on the outputs generated by large foundation models (LFMs). A number of issues impact the quality of these models, ranging from limited imitation signals from shallow LFM outputs; small scale homogeneous training data; and most notably a lack of rigorous evaluation resulting in overestimating the small model's capability as they tend to learn to imitate the style, but not the reasoning process of LFMs. To address these challenges, we develop Orca (We are working with our legal team to publicly release a diff of the model weights in accordance with LLaMA's release policy to be published at https://aka.ms/orca-lm), a 13-billion parameter model that learns to imitate the reasoning process of LFMs. Orca learns from rich signals from GPT-4 including explanation traces; step-by-step…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

State of AI 2023: Highlights of 163 Page Report + Eureka Self-Improvement, MEG, Suno AI and GPT F· youtube

Orca: The Model Few Saw Coming· youtube

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Topic Modeling

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Label Smoothing · Adam · Absolute Position Encodings · Residual Connection · Dropout · Position-Wise Feed-Forward Layer · Layer Normalization