Decoding Alignment: A Critical Survey of LLM Development Initiatives through Value-setting and Data-centric Lens

Ilias Chalkidis

arXiv:2508.16982·cs.CL·August 26, 2025

Decoding Alignment: A Critical Survey of LLM Development Initiatives through Value-setting and Data-centric Lens

Ilias Chalkidis

PDF

TL;DR

This paper critically surveys recent LLM development initiatives, focusing on how they set values and utilize data for alignment, revealing the socio-technical challenges and practices across leading organizations.

Contribution

It provides a comprehensive audit of publicly available documentation from major LLM projects, emphasizing the value-setting and data-centric aspects of alignment practices.

Findings

01

Diverse approaches to value-setting across initiatives

02

Limited transparency in proprietary models' alignment processes

03

Highlighting socio-technical challenges in alignment practices

Abstract

AI Alignment, primarily in the form of Reinforcement Learning from Human Feedback (RLHF), has been a cornerstone of the post-training phase in developing Large Language Models (LLMs). It has also been a popular research topic across various disciplines beyond Computer Science, including Philosophy and Law, among others, highlighting the socio-technical challenges involved. Nonetheless, except for the computational techniques related to alignment, there has been limited focus on the broader picture: the scope of these processes, which primarily rely on the selected objectives (values), and the data collected and used to imprint such objectives into the models. This work aims to reveal how alignment is understood and applied in practice from a value-setting and data-centric perspective. For this purpose, we investigate and survey (`audit') publicly available documentation released by 6…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.