TRUST-LAPSE: An Explainable and Actionable Mistrust Scoring Framework for Model Monitoring
Nandita Bhaskhar, Daniel L. Rubin, Christopher Lee-Messer

TL;DR
TRUST-LAPSE is a novel framework for continuous, explainable model monitoring that assesses mistrust using latent-space metrics and detects data shifts across diverse domains, outperforming existing methods.
Contribution
Introduces TRUST-LAPSE, a mistrust scoring framework utilizing latent-space metrics and sequential analysis for effective, explainable model monitoring in real-world applications.
Findings
Achieves state-of-the-art AUROCs in vision, audio, and EEG domains.
Outperforms baselines by over 10 points in mistrust detection.
Exposes limitations of popular baseline methods.
Abstract
Continuous monitoring of trained ML models to determine when their predictions should and should not be trusted is essential for their safe deployment. Such a framework ought to be high-performing, explainable, post-hoc and actionable. We propose TRUST-LAPSE, a "mistrust" scoring framework for continuous model monitoring. We assess the trustworthiness of each input sample's model prediction using a sequence of latent-space embeddings. Specifically, (a) our latent-space mistrust score estimates mistrust using distance metrics (Mahalanobis distance) and similarity metrics (cosine similarity) in the latent-space and (b) our sequential mistrust score determines deviations in correlations over the sequence of past input representations in a non-parametric, sliding-window based algorithm for actionable continuous monitoring. We evaluate TRUST-LAPSE via two downstream tasks: (1)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Data Stream Mining Techniques · Advanced Neural Network Applications
