Adaptive Regularization via Residual Smoothing in Deep Learning Optimization
Junghee Cho, Junseok Kwon, Byung-Woo Hong

TL;DR
This paper introduces an adaptive regularization method for deep learning that uses residual-based smoothing to improve generalization, outperforming standard optimization algorithms in image classification tasks.
Contribution
The paper proposes a novel residual smoothing-based regularization algorithm that adaptively adjusts regularity using a heat equation driven diffusion process in deep learning optimization.
Findings
Outperforms common optimization algorithms in generalization
Effective in image classification benchmarks
Demonstrates improved model robustness
Abstract
We present an adaptive regularization algorithm that can be effectively applied to the optimization problem in deep learning framework. Our regularization algorithm aims to take into account the fitness of data to the current state of model in the determination of regularity to achieve better generalization. The degree of regularization at each element in the target space of the neural network architecture is determined based on the residual at each optimization iteration in an adaptive way. Our adaptive regularization algorithm is designed to apply a diffusion process driven by the heat equation with spatially varying diffusivity depending on the probability density function following a certain distribution of residual. Our data-driven regularity is imposed by adaptively smoothing a simplified objective function in which the explicit regularization term is omitted in an alternating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Gaussian Processes and Bayesian Inference
