Sparks of Artificial General Intelligence: Early experiments with GPT-4
S\'ebastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes, Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott, Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang

TL;DR
This paper investigates early experiments with GPT-4, highlighting its broad capabilities across multiple domains, near-human performance, and potential as an early step toward artificial general intelligence, while discussing limitations and societal implications.
Contribution
It presents an early investigation of GPT-4's capabilities, demonstrating its general intelligence traits and discussing the challenges in advancing toward true AGI.
Findings
GPT-4 exhibits human-level performance across diverse tasks.
GPT-4 surpasses previous models like ChatGPT in multiple domains.
The model shows potential as an early form of AGI.
Abstract
Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4, was trained using an unprecedented scale of compute and data. In this paper, we report on our investigation of an early version of GPT-4, when it was still in active development by OpenAI. We contend that (this early version of) GPT-4 is part of a new cohort of LLMs (along with ChatGPT and Google's PaLM for example) that exhibit more general intelligence than previous AI models. We discuss the rising capabilities and implications of these models. We demonstrate that, beyond its mastery of language, GPT-4 can solve novel and difficult tasks that span mathematics, coding, vision, medicine, law, psychology and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
OpenAI’s GPT-4: A 70-Year Old Lesson!· youtube
OpenAI's GPT-4: A Spark Of Intelligence!· youtube
'Sparks of AGI' - Bombshell GPT-4 Paper: Fully Read w/ 15 Revelations· youtube
Debate: Sparks versus embers· youtube
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Topic Modeling · Machine Learning in Healthcare
MethodsMulti-Head Attention · Attention Is All You Need · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Softmax · Linear Layer · Byte Pair Encoding · Layer Normalization · Residual Connection · Dense Connections
