Evidence Sufficiency Under Delayed Ground Truth: Proxy Monitoring for Risk Decision Systems
Oleg Solozobov

TL;DR
This paper introduces a formal model and proxy monitoring framework to assess evidence sufficiency in risk decision systems operating under delayed ground truth, addressing the degradation of evidence quality during the blind period.
Contribution
It formalizes an evidence sufficiency model with four dimensions and develops a proxy indicator framework to estimate sufficiency degradation without labels.
Findings
Proxy monitoring detects covariate and mixed drift with 100% detection rate.
Concept drift without feature change remains undetected, highlighting a fundamental limitation.
Blind period simulation shows monotone sufficiency degradation, with concept drift degrading fastest.
Abstract
Machine learning systems in fraud detection, credit scoring, and clinical risk assessment operate under delayed ground truth: outcome labels arrive days to months after the decision they evaluate. During this blind period, governance evidence degrades through mechanisms that neither drift detection methods nor governance frameworks adequately address. This paper formalizes an evidence sufficiency model with four dimensions (completeness, freshness, reliability, representativeness) and a decision-readiness gate that quantifies how label latency degrades evidence quality. The model maps three drift types to dimension-specific degradation trajectories. A complementary proxy indicator framework comprising seven measurement categories estimates sufficiency degradation without labels, with explicit coverage mapping and characterized blind spots per drift type. Evaluation on the IEEE-CIS Fraud…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
