Sparks of Artificial General Intelligence: Early experiments with GPT-4

Sébastien Bubeck; Varun Chandrasekaran; Ronen Eldan; Johannes Gehrke; Eric Horvitz; Ece Kamar; Peter Lee; Yin Tat Lee; Yuanzhi Li; Scott Lundberg; Harsha Nori; Hamid Palangi; Marco Tulio Ribeiro; Yi Zhang

arXiv:2303.12712 · cs.CL · April 17, 2023 · 1.5k cites

TL;DR

This paper investigates early experiments with GPT-4, highlighting its broad capabilities across multiple domains, near-human performance, and potential as an early step toward artificial general intelligence, while discussing limitations and societal implications.

Contribution

It presents an early investigation of GPT-4's capabilities, demonstrating its general intelligence traits and discussing the challenges in advancing toward true AGI.

Findings

01 GPT-4 performs close to human level across diverse tasks.

02 GPT-4 surpasses previous models such as ChatGPT in multiple domains.

03 The model shows potential as an early, still incomplete form of AGI.

Abstract

Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4, was trained using an unprecedented scale of compute and data. In this paper, we report on our investigation of an early version of GPT-4, when it was still in active development by OpenAI. We contend that (this early version of) GPT-4 is part of a new cohort of LLMs (along with ChatGPT and Google's PaLM for example) that exhibit more general intelligence than previous AI models. We discuss the rising capabilities and implications of these models. We demonstrate that, beyond its mastery of language, GPT-4 can solve novel and difficult tasks that span mathematics, coding, vision, medicine, law, psychology and…

Videos

OpenAI’s GPT-4: A 70-Year Old Lesson! · YouTube

OpenAI's GPT-4: A Spark Of Intelligence! · YouTube

'Sparks of AGI' - Bombshell GPT-4 Paper: Fully Read w/ 15 Revelations · YouTube

Debate: Sparks versus embers · YouTube

Taxonomy

Topics: Artificial Intelligence in Healthcare and Education · Topic Modeling · Machine Learning in Healthcare

Methods: Multi-Head Attention · Attention Is All You Need · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Softmax · Linear Layer · Byte Pair Encoding · Layer Normalization · Residual Connection · Dense Connections