MORALISE: A Structured Benchmark for Moral Alignment in Visual Language Models

Xiao Lin; Zhining Liu; Ze Yang; Gaotang Li; Ruizhong Qiu; Shuke Wang; Hui Liu; Haotian Li; Sumit Keswani; Vishwa Pardeshi; Huijun Zhao; Wei Fan; Hanghang Tong

arXiv:2505.14728·cs.CV·May 22, 2025

MORALISE: A Structured Benchmark for Moral Alignment in Visual Language Models

Xiao Lin, Zhining Liu, Ze Yang, Gaotang Li, Ruizhong Qiu, Shuke Wang, Hui Liu, Haotian Li, Sumit Keswani, Vishwa Pardeshi, Huijun Zhao, Wei Fan, Hanghang Tong

PDF

Open Access

TL;DR

MORALISE is a new benchmark dataset designed to evaluate the moral alignment of vision-language models using diverse, expert-verified real-world data across multiple moral domains, highlighting current models' limitations.

Contribution

This work introduces MORALISE, a comprehensive, expert-annotated benchmark with 2,481 image-text pairs covering 13 moral topics, for assessing moral judgment and reasoning in vision-language models.

Findings

01

Current models show significant moral limitations.

02

MORALISE challenges models with diverse moral violations.

03

Benchmark is publicly available for future research.

Abstract

Warning: This paper contains examples of harmful language and images. Reader discretion is advised. Recently, vision-language models have demonstrated increasing influence in morally sensitive domains such as autonomous driving and medical analysis, owing to their powerful multimodal reasoning capabilities. As these models are deployed in high-stakes real-world applications, it is of paramount importance to ensure that their outputs align with human moral values and remain within moral boundaries. However, existing work on moral alignment either focuses solely on textual modalities or relies heavily on AI-generated images, leading to distributional biases and reduced realism. To overcome these limitations, we introduce MORALISE, a comprehensive benchmark for evaluating the moral alignment of vision-language models (VLMs) using diverse, expert-verified real-world data. We begin by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods

MethodsALIGN