Boosting in the presence of outliers: adaptive classification with   non-convex loss functions

Alexander Hanbo Li; Jelena Bradic

arXiv:1510.01064·stat.ML·August 25, 2017

Boosting in the presence of outliers: adaptive classification with non-convex loss functions

Alexander Hanbo Li, Jelena Bradic

PDF

TL;DR

This paper introduces a new family of non-convex loss functions and a boosting framework called Arch Boost, designed to improve classification robustness against outliers, with theoretical analysis and extensive numerical validation.

Contribution

It proposes the $oldsymbol{ ext{ extgamma}}$-robust losses and the Arch Boost framework, enhancing robustness and adaptability in boosting algorithms dealing with contaminated data.

Findings

01

The new algorithms demonstrate increased robustness to outliers.

02

Theoretical analysis confirms robustness properties and local curvature-based guarantees.

03

Numerical results show advantages over existing boosting methods with outliers.

Abstract

This paper examines the role and efficiency of the non-convex loss functions for binary classification problems. In particular, we investigate how to design a simple and effective boosting algorithm that is robust to the outliers in the data. The analysis of the role of a particular non-convex loss for prediction accuracy varies depending on the diminishing tail properties of the gradient of the loss -- the ability of the loss to efficiently adapt to the outlying data, the local convex properties of the loss and the proportion of the contaminated data. In order to use these properties efficiently, we propose a new family of non-convex losses named $γ$ -robust losses. Moreover, we present a new boosting framework, {\it Arch Boost}, designed for augmenting the existing work such that its corresponding classification algorithm is significantly more adaptable to the unknown data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.