Outlier-robust neural network training: variation regularization meets trimmed loss to prevent functional breakdown

Akifumi Okuno; Shotaro Yagishita

arXiv:2308.02293·stat.ME·May 14, 2025

Outlier-robust neural network training: variation regularization meets trimmed loss to prevent functional breakdown

Akifumi Okuno, Shotaro Yagishita

PDF

Open Access 2 Repos

TL;DR

This paper proposes a novel training method for neural networks that combines a transformed trimmed loss with higher-order variation regularization to enhance robustness against outliers, backed by theoretical guarantees.

Contribution

It introduces a new outlier-robust training framework for neural networks that controls model capacity and maintains high robustness, extending classical robust statistics to nonlinear models.

Findings

01

The method retains a high functional breakdown point.

02

The approach effectively suppresses overfitting to outliers.

03

The stochastic optimization algorithm converges theoretically.

Abstract

In this study, we tackle the challenge of outlier-robust predictive modeling using highly expressive neural networks. Our approach integrates two key components: (1) a transformed trimmed loss (TTL), a computationally efficient variant of the classical trimmed loss, and (2) higher-order variation regularization (HOVR), which imposes smoothness constraints on the prediction function. While traditional robust statistics typically assume low-complexity models such as linear and kernel models, applying TTL alone to modern neural networks may fail to ensure robustness, as their high expressive power allows them to fit both inliers and outliers, even when a robust loss is used. To address this, we revisit the traditional notion of breakdown point and adapt it to the nonlinear function setting, introducing a regularization scheme via HOVR that controls the model's capacity and suppresses…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Computational Physics and Python Applications · Stochastic Gradient Optimization Techniques