Provably robust learning of regression neural networks using $\beta$-divergences
Abhik Ghosh, Suryasis Jana

TL;DR
This paper introduces rRNet, a robust regression neural network framework based on $eta$-divergence, providing theoretical robustness guarantees and demonstrating practical advantages over existing methods.
Contribution
It develops a broad, theoretically grounded robust learning method for regression NNs using $eta$-divergence, with convergence and robustness guarantees.
Findings
rRNet achieves bounded influence functions for suitable $eta$
It attains a 50 ext% asymptotic breakdown point
Simulation and real-data experiments show improved robustness
Abstract
Regression neural networks (NNs) are most commonly trained by minimizing the mean squared prediction error, which is highly sensitive to outliers and data contamination. Existing robust training methods for regression NNs are often limited in scope and rely primarily on empirical validation, with only a few offering partial theoretical guarantees. In this paper, we propose a new robust learning framework for regression NNs based on the -divergence (also known as the density power divergence) which we call `rRNet'. It applies to a broad class of regression NNs, including models with non-smooth activation functions and error densities, and recovers the classical maximum likelihood learning as a special case. The rRNet is implemented via an alternating optimization scheme, for which we establish convergence guarantees to stationary points under mild, verifiable conditions. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Advanced Statistical Methods and Models
