Metacognitive Sensitivity for Test-Time Dynamic Model Selection

Le Tuan Minh Trinh; Le Minh Vu Pham; Thi Minh Anh Pham; An Duc Nguyen

arXiv:2512.10451·cs.LG·December 12, 2025

Metacognitive Sensitivity for Test-Time Dynamic Model Selection

Le Tuan Minh Trinh, Le Minh Vu Pham, Thi Minh Anh Pham, An Duc Nguyen

PDF

Open Access

TL;DR

This paper introduces a new framework inspired by human metacognition to evaluate and improve the confidence calibration of deep learning models, using a novel measure called meta-d' for dynamic model selection.

Contribution

It proposes a psychologically-grounded measure of metacognitive sensitivity and applies it to test-time model selection, enhancing accuracy across multiple datasets and model types.

Findings

01

Metacognitive sensitivity correlates with model accuracy.

02

Dynamic model selection improves joint-inference accuracy.

03

The approach generalizes across CNNs and VLMs.

Abstract

A key aspect of human cognition is metacognition - the ability to assess one's own knowledge and judgment reliability. While deep learning models can express confidence in their predictions, they often suffer from poor calibration, a cognitive bias where expressed confidence does not reflect true competence. Do models truly know what they know? Drawing from human cognitive science, we propose a new framework for evaluating and leveraging AI metacognition. We introduce meta-d', a psychologically-grounded measure of metacognitive sensitivity, to characterise how reliably a model's confidence predicts its own accuracy. We then use this dynamic sensitivity score as context for a bandit-based arbiter that performs test-time model selection, learning which of several expert models to trust for a given task. Our experiments across multiple datasets and deep learning model combinations…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Explainable Artificial Intelligence (XAI) · Gaussian Processes and Bayesian Inference