The Confidence Paradox: Can LLM Know When It's Wrong

Sahil Tripathi; Md Tabrez Nafis; Imran Hussain; Jiechao Gao

arXiv:2506.23464·cs.AI·October 29, 2025

The Confidence Paradox: Can LLM Know When It's Wrong

Sahil Tripathi, Md Tabrez Nafis, Imran Hussain, Jiechao Gao

PDF

Open Access

TL;DR

The paper introduces HonestVQA, a self-supervised framework that calibrates model confidence with correctness and ethics in Document VQA, improving accuracy and reducing overconfidence across multiple datasets.

Contribution

It presents HonestVQA, a novel model-agnostic, self-supervised approach with new metrics for ethical confidence calibration in Document VQA.

Findings

01

Improves accuracy and F1 scores by up to 4.3%

02

Reduces overconfidence in model predictions

03

Achieves 78.9% accuracy and 76.1% F1-score across datasets

Abstract

Document Visual Question Answering (DocVQA) models often produce overconfident or ethically misaligned responses, especially under uncertainty. Existing models like LayoutLMv3, UDOP, and DONUT focus on accuracy but lack ethical calibration. We propose HonestVQA, a model-agnostic, self-supervised framework that aligns model confidence with correctness using weighted loss and contrastive learning. We introduce two new metrics Honesty Score (H-Score) and Ethical Confidence Index (ECI)-to evaluate ethical alignment. HonestVQA improves accuracy and F1 by up to 4.3% across SpDocVQA, InfographicsVQA, and SROIE datasets, while reducing overconfidence. It also generalizes well across domains, achieving 78.9% accuracy and 76.1% F1-score.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Advanced Graph Neural Networks