DeepGUM: Learning Deep Robust Regression with a Gaussian-Uniform Mixture Model
St\'ephane Lathuili\`ere, Pablo Mesejo, Xavier Alameda-Pineda, Radu, Horaud

TL;DR
DeepGUM introduces a robust deep regression method using a Gaussian-uniform mixture model, effectively handling outliers during training without manual thresholding, and demonstrating improved reliability across multiple tasks.
Contribution
We propose DeepGUM, a novel deep regression approach that automatically detects and mitigates outliers using EM, enhancing robustness without manual outlier proportion setting.
Findings
Outperforms standard methods in noisy conditions
Effective across facial, fashion, age, and pose estimation tasks
Automatically adapts to evolving outlier distributions
Abstract
In this paper, we address the problem of how to robustly train a ConvNet for regression, or deep robust regression. Traditionally, deep regression employs the L2 loss function, known to be sensitive to outliers, i.e. samples that either lie at an abnormal distance away from the majority of the training samples, or that correspond to wrongly annotated targets. This means that, during back-propagation, outliers may bias the training process due to the high magnitude of their gradient. In this paper, we propose DeepGUM: a deep regression model that is robust to outliers thanks to the use of a Gaussian-uniform mixture model. We derive an optimization algorithm that alternates between the unsupervised detection of outliers using expectation-maximization, and the supervised training with cleaned samples using stochastic gradient descent. DeepGUM is able to adapt to a continuously evolving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning · Advanced Statistical Methods and Models
