Safe Artificial General Intelligence via Distributed Ledger Technology

Kristen W. Carlson

arXiv:1902.03689·cs.CY·August 20, 2024

TL;DR

This paper proposes a set of conceptual components, built around distributed ledger technology (DLT), intended to keep most known artificial general intelligence (AGI) scenarios from harming humanity and to align AGI values and goals with human values.

Contribution

It introduces a set of axioms and a blockchain-based architecture to enhance transparency, security, and control in AGI development for safety and alignment.

Findings

1. DLT reduces hacking risks and enhances auditability.
2. Smart contracts enable rapid, automated AI governance.
3. Decentralized components improve safety and accountability.
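
The auditability claim rests on a general property of ledgers: each entry commits to the hash of the entry before it, so a later change to any recorded entry becomes detectable. The sketch below illustrates that property only. It is not the architecture proposed in the paper; it runs in a single process with no consensus, networking, or smart contracts, and the names `entry_hash`, `append`, `verify`, and the `policy_update` payloads are hypothetical.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder "previous hash" for the first entry (assumption)


def entry_hash(index, payload, prev_hash):
    """Hash an entry's contents together with the previous entry's hash."""
    body = json.dumps(
        {"index": index, "payload": payload, "prev": prev_hash},
        sort_keys=True,
    )
    return hashlib.sha256(body.encode("utf-8")).hexdigest()


def append(ledger, payload):
    """Append a payload, linking it to the hash of the last entry."""
    prev_hash = ledger[-1]["hash"] if ledger else GENESIS
    index = len(ledger)
    ledger.append({
        "index": index,
        "payload": payload,
        "prev": prev_hash,
        "hash": entry_hash(index, payload, prev_hash),
    })


def verify(ledger):
    """Return True only if every stored hash and back-link is intact."""
    prev_hash = GENESIS
    for index, entry in enumerate(ledger):
        if entry["index"] != index or entry["prev"] != prev_hash:
            return False
        if entry["hash"] != entry_hash(index, entry["payload"], prev_hash):
            return False
        prev_hash = entry["hash"]
    return True


# Hypothetical audit trail of two governance events.
ledger = []
append(ledger, {"event": "policy_update", "version": 1})
append(ledger, {"event": "policy_update", "version": 2})
intact_before = verify(ledger)

# Tamper with a recorded entry without recomputing the chain.
ledger[0]["payload"]["version"] = 99
intact_after = verify(ledger)
```

In a real distributed ledger, many independent nodes hold copies of the chain, so an attacker would also have to rewrite every later entry on most of those copies, which this single-copy sketch does not model.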

Abstract

Background. Expert observers and artificial intelligence (AI) progression metrics indicate AI will exceed human intelligence within a few decades. Whether general AI that exceeds human capabilities (AGI) will be the single greatest boon in history or a disaster is unknown. No proofs exist that AGI will benefit humans or that AGI will not harm or eliminate humans.

Objective. I propose a set of logically distinct conceptual components that are necessary and sufficient to 1) ensure that most known AGI scenarios will not harm humanity and 2) robustly align AGI values and goals with human values.

Methods. By systematically addressing each pathway category to malevolent AI, we can induce the methods/axioms required to redress the category.

Results and Discussion. Distributed ledger technology (DLT, blockchain) is integral to this proposal, e.g. to reduce the probability of hacking,…
