Understanding the Role of Hallucination in Reinforcement Post-Training of Multimodal Reasoning Models

Gengwei Zhang; Jie Peng; Zhen Tan; Mufan Qiu; Hossein Nourkhiz Mahjoub; Vaishnav Tadiparthi; Kwonjoon Lee; Yanyong Zhang; Tianlong Chen

arXiv:2604.03179·cs.LG·April 6, 2026

Understanding the Role of Hallucination in Reinforcement Post-Training of Multimodal Reasoning Models

Gengwei Zhang, Jie Peng, Zhen Tan, Mufan Qiu, Hossein Nourkhiz Mahjoub, Vaishnav Tadiparthi, Kwonjoon Lee, Yanyong Zhang, Tianlong Chen

PDF

TL;DR

This paper introduces the Hallucination-as-Cue Framework to analyze how reinforcement learning post-training influences multimodal reasoning models, revealing that hallucination plays a significant role in their improved performance.

Contribution

It proposes a novel analytical framework to diagnose RL training effects on multimodal models by using hallucination-inductive corruptions, challenging existing assumptions.

Findings

01

RL post-training can improve reasoning even with hallucination-inductive corruptions

02

Models trained with RL outperform standard training in some benchmarks

03

Hallucination significantly influences the reasoning capabilities of multimodal models

Abstract

The recent success of reinforcement learning (RL) in large reasoning models has inspired the growing adoption of RL for post-training Multimodal Large Language Models (MLLMs) to enhance their visual reasoning capabilities. Although many studies have reported improved performance, it remains unclear whether RL training truly enables models to learn from visual information. In this work, we propose the Hallucination-as-Cue Framework, an analytical framework designed to investigate the effects of RL-based post-training on multimodal reasoning models from the perspective of model hallucination. Specifically, we introduce hallucination-inductive, modality-specific corruptions that remove or replace essential information required to derive correct answers, thereby forcing the model to reason by hallucination. By applying these corruptions during both training and evaluation, our framework…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.