MACD: Model-Aware Contrastive Decoding via Counterfactual Data
Qixin Xiao

TL;DR
MACD is a novel decoding strategy for Video-LLMs that reduces hallucinations by using model-guided counterfactual data to improve evidence-grounded content generation, especially in challenging visual scenarios.
Contribution
It introduces a model-aware counterfactual data construction method integrated with contrastive decoding to mitigate hallucinations in Video-LLMs.
Findings
Significantly reduces hallucinations across multiple benchmarks.
Maintains or improves task accuracy in diverse Video-LLMs.
Effective in scenarios with small, occluded, or co-occurring objects.
Abstract
Video language models (Video-LLMs) are prone to hallucinations, often generating plausible but ungrounded content when visual evidence is weak, ambiguous, or biased. Existing decoding methods, such as contrastive decoding (CD), rely on random perturbations to construct contrastive data for mitigating hallucination patterns. However, such a way is hard to control the visual cues that drive hallucination or well align with model weaknesses. We propose Model-aware Counterfactual Data based Contrastive Decoding (MACD), a new inference strategy that combines model-guided counterfactual construction with decoding. Our approach uses the Video-LLM's own feedback to identify object regions most responsible for hallucination, generating targeted counterfactual inputs at the object level rather than arbitrary frame or temporal modifications. These model-aware counterfactual data is then integrated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications
