Learning to Abstain From Uninformative Data

Yikai Zhang; Songzhu Zheng; Mina Dalirrooyfard; Pengxiang Wu; Anderson; Schneider; Anant Raj; Yuriy Nevmyvaka; Chao Chen

arXiv:2309.14240·cs.LG·September 26, 2023

Learning to Abstain From Uninformative Data

Yikai Zhang, Songzhu Zheng, Mina Dalirrooyfard, Pengxiang Wu, Anderson, Schneider, Anant Raj, Yuriy Nevmyvaka, Chao Chen

PDF

Open Access

TL;DR

This paper introduces a novel method for learning in noisy environments with high uninformative data, ensuring near-optimal decisions by distinguishing informative samples during training and testing.

Contribution

It proposes a new loss function and an iterative algorithm to jointly optimize a predictor and a selector for handling uninformative data effectively.

Findings

01

The method guarantees near-optimal decision-making under high noise conditions.

02

Empirical results show improved performance over existing approaches.

03

The approach is applicable in domains like finance and healthcare.

Abstract

Learning and decision-making in domains with naturally high noise-to-signal ratio, such as Finance or Healthcare, is often challenging, while the stakes are very high. In this paper, we study the problem of learning and acting under a general noisy generative process. In this problem, the data distribution has a significant proportion of uninformative samples with high noise in the label, while part of the data contains useful information represented by low label noise. This dichotomy is present during both training and inference, which requires the proper handling of uninformative data during both training and testing. We propose a novel approach to learning under these conditions via a loss inspired by the selective learning theory. By minimizing this loss, the model is guaranteed to make a near-optimal decision by distinguishing informative data from uninformative data and making…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Neural Networks and Applications · Machine Learning and Algorithms