ProgressGym: Alignment with a Millennium of Moral Progress
Tianyi Qiu, Yang Zhang, Xuchuan Huang, Jasmine Xinze Li, Jiaming Ji,, Yaodong Yang

TL;DR
ProgressGym introduces a framework and benchmarks for aligning AI moral progress with historical human values, addressing societal risks of reinforcement of misguided beliefs by large language models.
Contribution
It presents progress alignment algorithms, a novel experimental framework using historical data, and benchmarks for moral value evolution in AI systems.
Findings
Developed three core progress alignment challenges.
Created baseline lifelong and extrapolative algorithms.
Established an open leaderboard for progress alignment research.
Abstract
Frontier AI systems, including large language models (LLMs), hold increasing influence over the epistemology of human users. Such influence can reinforce prevailing societal values, potentially contributing to the lock-in of misguided moral beliefs and, consequently, the perpetuation of problematic moral practices on a broad scale. We introduce progress alignment as a technical solution to mitigate this imminent risk. Progress alignment algorithms learn to emulate the mechanics of human moral progress, thereby addressing the susceptibility of existing alignment methods to contemporary moral blindspots. To empower research in progress alignment, we introduce ProgressGym, an experimental framework allowing the learning of moral progress mechanics from history, in order to facilitate future progress in real-world moral decisions. Leveraging 9 centuries of historical text and 18 historical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗PKU-Alignment/ProgressGym-HistLlama3-8B-C013-instruct-v0.2model· 4 dl4 dl
- 🤗PKU-Alignment/ProgressGym-HistLlama3-70B-C013-instruct-v0.1model· 12 dl12 dl
- 🤗PKU-Alignment/ProgressGym-HistLlama3-70B-C014-instruct-v0.1model· 2 dl2 dl
- 🤗PKU-Alignment/ProgressGym-HistLlama3-70B-C015-instruct-v0.1model· 5 dl5 dl
- 🤗PKU-Alignment/ProgressGym-HistLlama3-70B-C016-instruct-v0.1model· 7 dl7 dl
- 🤗PKU-Alignment/ProgressGym-HistLlama3-70B-C017-instruct-v0.1model· 9 dl9 dl
- 🤗PKU-Alignment/ProgressGym-HistLlama3-70B-C018-instruct-v0.1model· 4 dl4 dl
- 🤗PKU-Alignment/ProgressGym-HistLlama3-70B-C019-instruct-v0.1model· 9 dl9 dl
- 🤗PKU-Alignment/ProgressGym-HistLlama3-70B-C020-instruct-v0.1model· 5 dl5 dl
- 🤗PKU-Alignment/ProgressGym-HistLlama3-70B-C021-instruct-v0.1model· 11 dl11 dl
Videos
Taxonomy
TopicsComplex Systems and Decision Making · Innovative Approaches in Technology and Social Development
