AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
Han Wang, Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal

TL;DR
AdaCAD introduces an adaptive decoding method for large language models that dynamically balances contextual and parametric knowledge, significantly improving performance on QA and summarization tasks by accurately handling knowledge conflicts.
Contribution
The paper presents AdaCAD, a novel instance-level adaptive decoding approach that measures and adjusts for knowledge conflicts, outperforming static contrastive methods across multiple datasets and models.
Findings
Achieves 14.21% higher QA accuracy over static baselines.
Improves summarization factuality by 6.19 points (AlignScore).
Mitigates performance drops when conflicts are absent.
Abstract
Knowledge conflict arises from discrepancies between information in the context of a large language model (LLM) and the knowledge stored in its parameters. This can hurt performance when using standard decoding techniques, which tend to ignore the context. Existing test-time contrastive methods seek to address this by comparing the LLM's output distribution with and without the context and adjust the model according to the contrast between them. However, we find that these methods frequently misjudge the degree of conflict and struggle to handle instances that vary in their amount of conflict, with static methods over-adjusting when conflict is absent. We propose a fine-grained, instance-level approach called AdaCAD, which dynamically infers the weight of adjustment based on the degree of conflict, as measured by the Jensen-Shannon divergence between distributions representing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAI-based Problem Solving and Planning
