Adaptive Regularization via Residual Smoothing in Deep Learning Optimization

Junghee Cho; Junseok Kwon; Byung-Woo Hong

arXiv:1907.09750·cs.LG·September 2, 2019·1 cites

Open Access

TL;DR

This paper introduces an adaptive regularization method for deep learning that uses residual-based smoothing to improve generalization, outperforming standard optimization algorithms in image classification tasks.

Contribution

The paper proposes a novel regularization algorithm based on residual smoothing. It adaptively adjusts regularity during deep learning optimization using a diffusion process driven by the heat equation.
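For orientation, a heat-equation diffusion with spatially varying diffusivity is commonly written as below. The symbols are assumptions introduced here for illustration and are not taken from the paper: u is the quantity being smoothed, x indexes its elements, t is the diffusion time, r is the residual, and c is a diffusivity that depends on r.

```latex
\frac{\partial u}{\partial t}(x, t) = \nabla \cdot \bigl( c(r(x)) \, \nabla u(x, t) \bigr)
```

When c is constant this reduces to the standard heat equation, which smooths u uniformly; letting c vary with the residual is what makes the amount of smoothing differ from element to element.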

Findings

01. Outperforms common optimization algorithms in generalization
02. Effective in image classification benchmarks
03. Demonstrates improved model robustness

Abstract

We present an adaptive regularization algorithm that can be effectively applied to the optimization problem in a deep learning framework. Our regularization algorithm takes into account the fitness of the data to the current state of the model when determining regularity, in order to achieve better generalization. The degree of regularization at each element in the target space of the neural network architecture is determined adaptively, based on the residual at each optimization iteration. The algorithm applies a diffusion process driven by the heat equation, with spatially varying diffusivity that depends on the probability density function of the residual under a certain distribution. Our data-driven regularity is imposed by adaptively smoothing a simplified objective function in which the explicit regularization term is omitted in an alternating…
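The idea of residual-dependent smoothing can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a 1-D array of values, a hypothetical Laplace-style mapping from residual to diffusivity (`residual_diffusivity`, with an illustrative `scale` parameter), and a plain explicit finite-difference step (`diffuse_step`). The choice that large residuals receive less smoothing is made only for illustration; the abstract does not specify the form or direction of the mapping.

```python
import numpy as np


def residual_diffusivity(residual, scale=1.0):
    """Map per-element residuals to diffusivities in (0, 1].

    Assumption (not from the paper): an unnormalised Laplace-style
    density exp(-|r| / scale), so larger residuals diffuse less.
    """
    return np.exp(-np.abs(residual) / scale)


def diffuse_step(u, c, dt=0.2):
    """One explicit step of u_t = d/dx(c(x) * u_x) on a 1-D grid.

    Uses unit grid spacing and zero-flux boundaries. With c <= 1 the
    step is stable for dt <= 0.5.
    """
    # Diffusivity at the interface between neighbouring cells.
    c_half = 0.5 * (c[1:] + c[:-1])
    # Flux across each interior interface.
    flux = c_half * np.diff(u)
    out = u.astype(float)  # astype returns a copy
    out[:-1] += dt * flux  # inflow from the right-hand neighbour
    out[1:] -= dt * flux   # matching outflow, so the total is conserved
    return out


# Hypothetical data: a spike in u, with large residuals on the right side.
u = np.array([0.0, 0.0, 1.0, 0.0, 0.0])
residual = np.array([0.0, 0.0, 0.0, 5.0, 5.0])
smoothed = diffuse_step(u, residual_diffusivity(residual))
```

In this sketch the spike spreads more readily toward the low-residual side, while the total of `u` is preserved because every interface flux is added to one cell and subtracted from its neighbour.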

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Gaussian Processes and Bayesian Inference