DOPRA: Decoding Over-accumulation Penalization and Re-allocation in   Specific Weighting Layer

Jinfeng Wei; Xiaofeng Zhang

arXiv:2407.15130·cs.CL·July 24, 2024

DOPRA: Decoding Over-accumulation Penalization and Re-allocation in Specific Weighting Layer

Jinfeng Wei, Xiaofeng Zhang

PDF

TL;DR

DOPRA is a decoding-based method that reduces hallucinations in multi-modal large language models by applying layer-specific penalties and reallocation strategies during decoding, without needing extra training data.

Contribution

It introduces a novel decoding technique that mitigates hallucinations in MLLMs through layer-specific penalties and token reallocation, avoiding additional training or external knowledge.

Findings

01

Reduces hallucinations in MLLMs during decoding.

02

Improves alignment of generated content with actual image data.

03

Does not require extra training data or external resources.

Abstract

In this work, we introduce DOPRA, a novel approach designed to mitigate hallucinations in multi-modal large language models (MLLMs). Unlike existing solutions that typically involve costly supplementary training data or the integration of external knowledge sources, DOPRA innovatively addresses hallucinations by decoding specific weighted layer penalties and redistribution, offering an economical and effective solution without additional resources. DOPRA is grounded in unique insights into the intrinsic mechanisms controlling hallucinations within MLLMs, especially the models' tendency to over-rely on a subset of summary tokens in the self-attention matrix, neglecting critical image-related information. This phenomenon is particularly pronounced in certain strata. To counteract this over-reliance, DOPRA employs a strategy of weighted overlay penalties and redistribution in specific…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsALIGN