Diverse Human Value Alignment for Large Language Models via Ethical Reasoning
Jiahao Wang, Songkai Xue, Jinghui Li, Xiaozhen Wang

TL;DR
This paper introduces an ethical reasoning framework for large language models to better understand and align with diverse human values across cultures, using a structured five-step process inspired by ethical decision-making models.
Contribution
It proposes a novel ethical reasoning paradigm with a five-step process for LLMs to improve regional value alignment through interpretative and deliberative ethical analysis.
Findings
Significantly improves LLM alignment with diverse human values.
Enhances social norm identification and culturally appropriate reasoning.
Demonstrates effectiveness on the SafeWorld benchmark.
Abstract
Ensuring that Large Language Models (LLMs) align with the diverse and evolving human values across different regions and cultures remains a critical challenge in AI ethics. Current alignment approaches often yield superficial conformity rather than genuine ethical understanding, failing to address the complex, context-dependent nature of human values. In this paper, we propose a novel ethical reasoning paradigm for LLMs inspired by well-established ethical decision-making models, aiming at enhancing diverse human value alignment through deliberative ethical reasoning. Our framework consists of a structured five-step process, including contextual fact gathering, hierarchical social norm identification, option generation, multiple-lens ethical impact analysis, and reflection. This theory-grounded approach guides LLMs through an interpretable reasoning process that enhances their ability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Ethics and Social Impacts of AI · Big Data and Digital Economy
