Modality-Fair Preference Optimization for Trustworthy MLLM Alignment

Songtao Jiang; Yan Zhang; Ruizhe Chen; Tianxiang Hu; Yeying Jin; Qinglin He; Yang Feng; Jian Wu; Zuozhu Liu

arXiv:2410.15334 · cs.CV · June 9, 2025

TL;DR

This paper introduces Modality-Fair Preference Optimization (MFPO), a novel training method that improves the trustworthiness of multimodal large language models by aligning visual and textual modalities more effectively.

Contribution

The paper proposes MFPO, which combines a multimodal preference dataset, an image reward loss, and an iterative training strategy to enhance MLLM trustworthiness.

Findings

01. MFPO significantly improves performance on trustworthiness benchmarks.

02. 7B models trained with MFPO outperform larger models in trustworthiness.

03. Enhanced alignment reduces hallucination and improves the model's use of the input image.

Abstract

Multimodal large language models (MLLMs) have achieved remarkable success across various tasks. However, separate training of visual and textual encoders often results in misalignment between the modalities. Such misalignment may lead models to generate content that is absent from the input image, a phenomenon referred to as hallucination. These inaccuracies severely undermine the trustworthiness of MLLMs in real-world applications. Despite attempts to optimize text preferences to mitigate this issue, our initial investigation indicates that the trustworthiness of MLLMs remains inadequate. Specifically, these models tend to provide preferred answers even when the input image is heavily distorted. Analysis of visual token attention also indicates that the model focuses primarily on the surrounding context rather than the key object referenced in the question. These findings highlight a…

Taxonomy

Topics: Access Control and Trust · Multi-Agent Systems and Negotiation