A Mathematical Foundation for Robust Machine Learning based on   Bias-Variance Trade-off

Ou Wu; Weiyao Zhu; Yingjun Deng; Haixiang Zhang; Qinghu; Hou

arXiv:2106.05522·cs.LG·November 23, 2021

A Mathematical Foundation for Robust Machine Learning based on Bias-Variance Trade-off

Ou Wu, Weiyao Zhu, Yingjun Deng, Haixiang Zhang, Qinghu, Hou

PDF

Open Access

TL;DR

This paper develops a mathematical framework for robust machine learning by analyzing the impact of unequal sample contributions through bias-variance trade-off, enabling better comparison and improvement of algorithms.

Contribution

It introduces a unified theoretical foundation for robust machine learning, clarifies differences among strategies, and proposes a combined approach based on classical methods.

Findings

01

Provides definitions and properties for RML based on bias-variance trade-off

02

Analyzes and compares classical learning algorithms within the new framework

03

Proposes a unified method combining two classical strategies

Abstract

A common assumption in machine learning is that samples are independently and identically distributed (i.i.d). However, the contributions of different samples are not identical in training. Some samples are difficult to learn and some samples are noisy. The unequal contributions of samples has a considerable effect on training performances. Studies focusing on unequal sample contributions (e.g., easy, hard, noisy) in learning usually refer to these contributions as robust machine learning (RML). Weighing and regularization are two common techniques in RML. Numerous learning algorithms have been proposed but the strategies for dealing with easy/hard/noisy samples differ or even contradict with different learning algorithms. For example, some strategies take the hard samples first, whereas some strategies take easy first. Conducting a clear comparison for existing RML algorithms in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Face and Expression Recognition · Imbalanced Data Classification Techniques