Safe Artificial General Intelligence via Distributed Ledger Technology
Kristen W. Carlson

TL;DR
This paper proposes a comprehensive framework using distributed ledger technology to ensure safe development and alignment of artificial general intelligence with human values, addressing potential risks and vulnerabilities.
Contribution
It introduces a set of axioms and a blockchain-based architecture to enhance transparency, security, and control in AGI development for safety and alignment.
Findings
DLT reduces hacking risks and enhances auditability.
Smart contracts enable rapid, automated AI governance.
Decentralized components improve safety and accountability.
Abstract
Background. Expert observers and artificial intelligence (AI) progression metrics indicate AI will exceed human intelligence within a few decades. Whether general AI that exceeds human capabilities (AGI) will be the single greatest boon in history or a disaster is unknown. No proofs exist that AGI will benefit humans or that AGI will not harm or eliminate humans. Objective. I propose a set of logically distinct conceptual components that are necessary and sufficient to 1) ensure that most known AGI scenarios will not harm humanity and 2) robustly align AGI values and goals with human values. Methods. By systematically addressing each pathway category to malevolent AI we can induce the methods/axioms required to redress the category. Results and Discussion. Distributed ledger technology (DLT, blockchain) is integral to this proposal, e.g. to reduce the probability of hacking,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
