TRUST-LAPSE: An Explainable and Actionable Mistrust Scoring Framework   for Model Monitoring

Nandita Bhaskhar; Daniel L. Rubin; Christopher Lee-Messer

arXiv:2207.11290·cs.LG·July 14, 2023

TRUST-LAPSE: An Explainable and Actionable Mistrust Scoring Framework for Model Monitoring

Nandita Bhaskhar, Daniel L. Rubin, Christopher Lee-Messer

PDF

Open Access 1 Repo

TL;DR

TRUST-LAPSE is a novel framework for continuous, explainable model monitoring that assesses mistrust using latent-space metrics and detects data shifts across diverse domains, outperforming existing methods.

Contribution

Introduces TRUST-LAPSE, a mistrust scoring framework utilizing latent-space metrics and sequential analysis for effective, explainable model monitoring in real-world applications.

Findings

01

Achieves state-of-the-art AUROCs in vision, audio, and EEG domains.

02

Outperforms baselines by over 10 points in mistrust detection.

03

Exposes limitations of popular baseline methods.

Abstract

Continuous monitoring of trained ML models to determine when their predictions should and should not be trusted is essential for their safe deployment. Such a framework ought to be high-performing, explainable, post-hoc and actionable. We propose TRUST-LAPSE, a "mistrust" scoring framework for continuous model monitoring. We assess the trustworthiness of each input sample's model prediction using a sequence of latent-space embeddings. Specifically, (a) our latent-space mistrust score estimates mistrust using distance metrics (Mahalanobis distance) and similarity metrics (cosine similarity) in the latent-space and (b) our sequential mistrust score determines deviations in correlations over the sequence of past input representations in a non-parametric, sliding-window based algorithm for actionable continuous monitoring. We evaluate TRUST-LAPSE via two downstream tasks: (1)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nanbhas/trustlapse
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEEG and Brain-Computer Interfaces · Data Stream Mining Techniques · Advanced Neural Network Applications