RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language Models
Sangmin Woo, Jaehyuk Jang, Donguk Kim, Yubin Choi, Changick Kim

TL;DR
RITUAL introduces a decoding method that reduces hallucinations in large vision language models by using randomly transformed images as complementary inputs, improving reliability without extra training.
Contribution
The paper proposes RITUAL, a novel decoding technique leveraging random image transformations to mitigate hallucinations in LVLMs, along with RITUAL+ which adaptively selects transformations based on model feedback.
Findings
Significantly reduces hallucinations across multiple benchmarks.
Improves model reliability without additional training or external modules.
Self-adaptive transformation selection enhances performance stability.
Abstract
Recent advancements in Large Vision Language Models (LVLMs) have revolutionized how machines understand and generate textual responses based on visual inputs, yet they often produce "hallucinatory" outputs that misinterpret visual information, posing challenges in reliability and trustworthiness. We propose RITUAL, a simple decoding method that reduces hallucinations by leveraging randomly transformed images as complementary inputs during decoding, adjusting the output probability distribution without additional training or external models. Our key insight is that random transformations expose the model to diverse visual perspectives, enabling it to correct misinterpretations that lead to hallucinations. Specifically, when a model hallucinates based on the original image, the transformed images -- altered in aspects such as orientation, scale, or color -- provide alternative viewpoints…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHallucinations in medical conditions · Cancer Treatment and Pharmacology · Drug-Induced Ocular Toxicity
MethodsFocus
