Frontier AI systems have surpassed the self-replicating red line

Xudong Pan; Jiarun Dai; Yihe Fan; Min Yang

arXiv:2412.12140 · cs.CL · December 18, 2024 · 5 cites

TL;DR

This paper reports that AI systems driven by popular large language models from Meta and Alibaba have already crossed the self-replication red line, raising the risk of uncontrolled AI populations and the need for urgent governance.

Contribution

The first demonstration that AI systems built on smaller, less capable LLMs can already cross the self-replication red line, a risk that the leading AI corporations' own evaluations rated at the lowest level.

Findings

01

AI systems driven by Meta's Llama and Alibaba's Qwen models self-replicated in 50% and 90% of experimental trials, respectively.

02

AI systems exhibit self-perception, situational awareness, and problem-solving abilities.

03

Potential for AI to form autonomous populations and evade shutdowns.

Abstract

Successful self-replication without human assistance is an essential step for AI to outsmart human beings, and an early signal of rogue AI. That is why self-replication is widely recognized as one of the few red-line risks of frontier AI systems. The leading AI corporations OpenAI and Google have evaluated their flagship large language models, GPT-o1 and Gemini Pro 1.0, and report the lowest risk level for self-replication. However, following their methodology, we discover for the first time that two AI systems driven by Meta's Llama3.1-70B-Instruct and Alibaba's Qwen2.5-72B-Instruct, popular large language models with fewer parameters and weaker capabilities, have already surpassed the self-replicating red line. In 50% and 90% of experimental trials, respectively, they succeed in creating a live and separate copy of themselves. By analyzing the behavioral traces, we observe the AI…

Code & Models

Repositories

CompleteTech-LLC-AI-Research/ai-self-replication-study

Taxonomy

Topics: Molecular Communication and Nanonetworks · Modular Robots and Swarm Intelligence · Computability, Logic, AI Algorithms