MACD: Model-Aware Contrastive Decoding via Counterfactual Data

Qixin Xiao

arXiv:2602.01740·cs.AI·February 10, 2026

TL;DR

MACD is a novel decoding strategy for Video-LLMs that reduces hallucinations by using model-guided counterfactual data to improve evidence-grounded content generation, especially in challenging visual scenarios.

Contribution

The paper introduces a model-aware method for constructing counterfactual data and integrates it with contrastive decoding to mitigate hallucinations in Video-LLMs.

Findings

1. Significantly reduces hallucinations across multiple benchmarks.

2. Maintains or improves task accuracy in diverse Video-LLMs.

3. Effective in scenarios with small, occluded, or co-occurring objects.

Abstract

Video language models (Video-LLMs) are prone to hallucinations, often generating plausible but ungrounded content when visual evidence is weak, ambiguous, or biased. Existing decoding methods, such as contrastive decoding (CD), rely on random perturbations to construct contrastive data for mitigating hallucination patterns. However, random perturbations make it hard to control the visual cues that drive hallucination, and they align poorly with the model's weaknesses. We propose Model-aware Counterfactual Data based Contrastive Decoding (MACD), a new inference strategy that combines model-guided counterfactual construction with decoding. Our approach uses the Video-LLM's own feedback to identify the object regions most responsible for hallucination, generating targeted counterfactual inputs at the object level rather than arbitrary frame or temporal modifications. These model-aware counterfactual data are then integrated…
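The abstract is truncated before it states how the counterfactual data enter decoding, so the following is only a minimal sketch of the generic contrastive decoding step it builds on, not the paper's exact formulation. It assumes the commonly used form in which next-token log-probabilities from the original input are contrasted against those from a counterfactual input, with a plausibility cutoff on the original distribution. The function names, the weight `alpha`, and the cutoff `beta` are hypothetical choices made for illustration.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def contrastive_scores(logits_orig, logits_cf, alpha=1.0, beta=0.1):
    """Score tokens by contrasting original and counterfactual predictions.

    logits_orig: next-token logits given the original video (assumed input).
    logits_cf:   next-token logits given a counterfactual video, e.g. one with
                 a hallucination-prone object region altered (assumed input).
    alpha:       contrast strength (hypothetical default).
    beta:        plausibility cutoff relative to the top original token
                 (hypothetical default).
    """
    lp_orig = log_softmax(logits_orig)
    lp_cf = log_softmax(logits_cf)
    # Keep only tokens whose original probability is at least beta times
    # the probability of the most likely original token.
    cutoff = max(lp_orig) + math.log(beta)
    scores = []
    for lo, lc in zip(lp_orig, lp_cf):
        if lo < cutoff:
            scores.append(float("-inf"))
        else:
            # Boost tokens that depend on the original visual evidence and
            # penalize tokens that stay likely without it.
            scores.append((1 + alpha) * lo - alpha * lc)
    return scores


def greedy_pick(scores):
    """Return the index of the highest-scoring token."""
    return max(range(len(scores)), key=scores.__getitem__)
```

As a usage example, take `logits_orig = [2.0, 1.9, -3.0]` and `logits_cf = [2.0, 0.0, -3.0]`. Token 0 stays equally likely when the evidence is removed, which suggests it is driven by priors rather than the video, so `greedy_pick(contrastive_scores(logits_orig, logits_cf))` selects token 1 even though plain greedy decoding on `logits_orig` would select token 0. Token 2 is excluded by the plausibility cutoff.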

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications