Context-Fidelity Boosting: Enhancing Faithful Generation through Watermark-Inspired Decoding

Weixu Zhang; Fanghua Ye; Qiang Gao; Jian Li; Haolun Wu; Yuxing Tian; Sijing Duan; Nan Du; Xiaolong Li; Xue Liu

arXiv:2604.22335·cs.CL·April 27, 2026

Context-Fidelity Boosting: Enhancing Faithful Generation through Watermark-Inspired Decoding

Weixu Zhang, Fanghua Ye, Qiang Gao, Jian Li, Haolun Wu, Yuxing Tian, Sijing Duan, Nan Du, Xiaolong Li, Xue Liu

PDF

1 Repo

TL;DR

This paper introduces Context-Fidelity Boosting (CFB), a decoding-time method inspired by watermarking that enhances faithfulness in LLM outputs by increasing source-supported token probabilities.

Contribution

It presents a novel, lightweight decoding framework with three boosting strategies that significantly improve faithfulness without retraining or architectural changes.

Findings

01

CFB consistently improves faithfulness metrics across tasks.

02

The method requires minimal additional computation during decoding.

03

Experiments show effectiveness across multiple open-source LLMs.

Abstract

Large language models (LLMs) often produce content that contradicts or overlooks information provided in the input context, a phenomenon known as faithfulness hallucination. In this paper, we propose Context-Fidelity Boosting (CFB), a lightweight and general decoding-time framework that reduces such hallucinations by increasing the generation probability of source-supported tokens. Motivated by logit-shaping principles from watermarking techniques, CFB applies additive token-level logit adjustments based on a token's degree of support from the input context. Specifically, we develop three boosting strategies: static boosting, which applies a fixed bias to source-supported tokens; context-aware boosting, which scales this bias using the divergence between next-token distributions with and without context; and token-aware boosting, which further redistributes the adaptive bias according…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

null
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.