DeepGUM: Learning Deep Robust Regression with a Gaussian-Uniform Mixture   Model

St\'ephane Lathuili\`ere; Pablo Mesejo; Xavier Alameda-Pineda; Radu; Horaud

arXiv:1808.09211·cs.CV·August 29, 2018·5 cites

DeepGUM: Learning Deep Robust Regression with a Gaussian-Uniform Mixture Model

St\'ephane Lathuili\`ere, Pablo Mesejo, Xavier Alameda-Pineda, Radu, Horaud

PDF

Open Access

TL;DR

DeepGUM introduces a robust deep regression method using a Gaussian-uniform mixture model, effectively handling outliers during training without manual thresholding, and demonstrating improved reliability across multiple tasks.

Contribution

We propose DeepGUM, a novel deep regression approach that automatically detects and mitigates outliers using EM, enhancing robustness without manual outlier proportion setting.

Findings

01

Outperforms standard methods in noisy conditions

02

Effective across facial, fashion, age, and pose estimation tasks

03

Automatically adapts to evolving outlier distributions

Abstract

In this paper, we address the problem of how to robustly train a ConvNet for regression, or deep robust regression. Traditionally, deep regression employs the L2 loss function, known to be sensitive to outliers, i.e. samples that either lie at an abnormal distance away from the majority of the training samples, or that correspond to wrongly annotated targets. This means that, during back-propagation, outliers may bias the training process due to the high magnitude of their gradient. In this paper, we propose DeepGUM: a deep regression model that is robust to outliers thanks to the use of a Gaussian-uniform mixture model. We derive an optimization algorithm that alternates between the unsupervised detection of outliers using expectation-maximization, and the supervised training with cleaned samples using stochastic gradient descent. DeepGUM is able to adapt to a continuously evolving…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning · Advanced Statistical Methods and Models