RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback
Tianyu Yu, Yuan Yao, Haoye Zhang, Taiwen He, Yifeng Han, and Ganqu Cui, Jinyi Hu, Zhiyuan Liu, Hai-Tao Zheng, Maosong Sun, and Tat-Seng Chua

TL;DR
RLHF-V improves multimodal large language models by using fine-grained human feedback to significantly reduce hallucinations, enhancing trustworthiness with high data and computational efficiency.
Contribution
The paper introduces RLHF-V, a novel method that leverages segment-level human feedback for behavior alignment, significantly reducing hallucinations in MLLMs.
Findings
Reduces hallucination rate by 34.8% with only 1.4k data samples.
Outperforms models trained on larger datasets in trustworthiness.
Achieves state-of-the-art trustworthiness among open-source MLLMs.
Abstract
Multimodal Large Language Models (MLLMs) have recently demonstrated impressive capabilities in multimodal understanding, reasoning, and interaction. However, existing MLLMs prevalently suffer from serious hallucination problems, generating text that is not factually grounded in associated images. The problem makes existing MLLMs untrustworthy and thus impractical in real-world (especially high-stakes) applications. To address the challenge, we present RLHF-V, which enhances MLLM trustworthiness via behavior alignment from fine-grained correctional human feedback. Specifically, RLHF-V collects human preference in the form of segment-level corrections on hallucinations, and performs dense direct preference optimization over the human feedback. Comprehensive experiments on five benchmarks in both automatic and human evaluation show that, RLHF-V can enable substantially more trustworthy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗openbmb/RLHF-Vmodel· 36 dl· ♡ 1836 dl♡ 18
- 🤗openbmb/RLHF-V-SFTmodel· 21 dl· ♡ 321 dl♡ 3
- 🤗openbmb/OmniLMM-12Bmodel· 127 dl· ♡ 72127 dl♡ 72
- 🤗openbmb/MiniCPM-V-2model· 78k dl· ♡ 49578k dl♡ 495
- 🤗openbmb/RLAIF-V-7Bmodel· 27 dl· ♡ 1127 dl♡ 11
- 🤗openbmb/RLAIF-V-12Bmodel· 15 dl· ♡ 715 dl♡ 7
- 🤗DeclanBracken/MiniCPM-Llama3-V-2.5-Transcriptormodel· 3 dl3 dl
- 🤗SwordElucidator/MiniCPM-Llama3-V-2_5model· 5 dl5 dl
- 🤗DeclanBracken/MiniCPM-Llama3-V-2_5-Transcriptor-V3model· 1 dl1 dl
- 🤗seanlong/MiniCPM-Llama3-V-2_5model· 2 dl2 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsBalanced Selection
