Conservative Prediction via Data-Driven Confidence Minimization

Caroline Choi; Fahim Tajwar; Yoonho Lee; Huaxiu Yao and; Ananya Kumar; Chelsea Finn

arXiv:2306.04974·cs.LG·June 4, 2024·1 cites

Conservative Prediction via Data-Driven Confidence Minimization

Caroline Choi, Fahim Tajwar, Yoonho Lee, Huaxiu Yao and, Ananya Kumar, Chelsea Finn

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper introduces a data-driven confidence minimization framework to improve conservative predictions in machine learning, enhancing out-of-distribution detection and selective classification by leveraging carefully curated uncertainty datasets.

Contribution

It provides a theoretical analysis guiding the choice of auxiliary datasets for confidence minimization and proposes the DCM framework for better OOD detection and selective classification.

Findings

01

DCM outperforms existing methods on multiple datasets.

02

Reduces false positive rates significantly in OOD detection.

03

Theoretical insights improve confidence minimization strategies.

Abstract

In safety-critical applications of machine learning, it is often desirable for a model to be conservative, abstaining from making predictions on unknown inputs which are not well-represented in the training data. However, detecting unknown examples is challenging, as it is impossible to anticipate all potential inputs at test time. To address this, prior work (Hendrycks et al., 2018) minimizes model confidence on an auxiliary outlier dataset carefully curated to be disjoint from the training distribution. We theoretically analyze the choice of auxiliary dataset for confidence minimization, revealing two actionable insights: (1) if the auxiliary set contains unknown examples similar to those seen at test time, confidence minimization leads to provable detection of unknown test examples, and (2) if the first condition is satisfied, it is unnecessary to filter out known examples for…

Peer Reviews

Decision·Submitted to ICLR 2024

Reviewer 01Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

- Good empirical results - The paper is well-written and easy to follow

Weaknesses

- There isn't much technical novelty on top of OE. The method is mainly about selecting a new uncertainty dataset, which is a fine direction to explore, but the approach is technically simple and possible not substantial enough for ICLR. - The proof seems fairly obvious; it seems to be saying that datasets are separable even when the training data are noisy. I may have missed some details, but surely this is already well-known and a standard result in learning theory. I'm worried that this proo

Reviewer 02Rating 6· marginally above the acceptance thresholdConfidence 2

Strengths

Overall I think the paper is well motivated and well written. The proposed method is well motivated by the theretical analysis and the empirical performance is convincing.

Weaknesses

* For the theretical part, what can we say when the following does not hold "(1) if the auxiliary set contains unknown examples similar to those seen at test time, confidence minimization leads to provable detection of unknown test examples". Specifically, what if the auxiliary set DOES NOT contain unknown examples similar to those seen at test time. It is important to know the theretical property in this case. * The experiment results are shown in CIFAR. I would be more interested in experimen

Reviewer 03Rating 5· marginally below the acceptance thresholdConfidence 4

Strengths

This paper proposes to use an auxiliary dataset combined with a penalized loss function to reduce the confidence in unseen samples. To further understand the proposed method, the authors analyze the proposed method theoretically and get two reasonable interpretations of the proposed method. To validate the efficacy of the proposed method, several datasets are selected to conduct experiments with different counterpart methods. The proposed method shows good performance on selective classification

Weaknesses

It seems that the Figure 2 is not in correct order. By observing Table1, we can find a performance drop on iid setting for relative simple datasets that can achieve classification accuracy more than 99\%. And we can also find an enhancement in the ood setting. However on relatively hard setting like FMoW, the iid performance is enhanced by the performance, but the ood and iid+ood performance does not show a significant gap compared with other methods. Take the loss into consideration, I am wo

Code & Models

Repositories

tajwarfahim/dcm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning

MethodsTest