Measuring LLM Trust Allocation Across Conflicting Software Artifacts
Noshin Ulfat, Ahsanul Ameen Sabit, Soneya Binta Hossain

TL;DR
This paper introduces TRACE, a framework for evaluating how well LLMs assess trustworthiness across conflicting software artifacts, revealing strengths and weaknesses in their artifact-level trust reasoning.
Contribution
The paper presents TRACE, a novel structured evaluation framework for artifact-level trust assessment in LLMs, highlighting their capabilities and blind spots in software artifact evaluation.
Findings
Models detect explicit documentation bugs with 67-94% accuracy.
Detection of contradictions between Javadoc and implementation ranges from 50-91%.
Models poorly calibrate confidence and struggle with subtle implementation drift.
Abstract
LLM-based software engineering assistants fail not only by producing incorrect outputs, but also by allocating trust to the wrong artifact when code, documentation, and tests disagree. Existing evaluations focus mainly on downstream outcomes and therefore cannot reveal whether a model recognized degraded evidence, identified the unreliable source, or calibrated its trust across artifacts. We present TRACE (Trust Reasoning over Artifacts for Calibrated Evaluation), a framework that elicits structured artifact-level trust traces over Javadoc, method signatures, implementations, and test prefixes under blind perturbations. Using 22,339 valid traces from seven models on 456 curated Java method bundles, we evaluate per-artifact quality assessment, inconsistency detection, affected artifact attribution, and source prioritization. Across all models, quality penalties are largely localized to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
