Exploiting Adam-like Optimization Algorithms to Improve the Performance of Convolutional Neural Networks
Loris Nanni, Gianluca Maguolo, Alessandra Lumini

TL;DR
This paper compares Adam-like optimization algorithms for training CNNs, demonstrating that certain variants and ensemble methods can achieve high accuracy on medical image classification tasks.
Contribution
It introduces a comparison of Adam-based optimizers and ensemble strategies, improving CNN performance and reproducibility in medical imaging.
Findings
Adam variants outperform traditional SGD in accuracy.
Ensemble of ResNet50 models enhances performance.
Proposed methods achieve state-of-the-art accuracy.
Abstract
Stochastic gradient descent (SGD) is the main approach for training deep networks: it moves towards the optimum of the cost function by iteratively updating the parameters of a model in the direction of the gradient of the loss evaluated on a minibatch. Several variants of SGD have been proposed to make adaptive step sizes for each parameter (adaptive gradient) and take into account the previous updates (momentum). Among several alternative of SGD the most popular are AdaGrad, AdaDelta, RMSProp and Adam which scale coordinates of the gradient by square roots of some form of averaging of the squared coordinates in the past gradients and automatically adjust the learning rate on a parameter basis. In this work, we compare Adam based variants based on the difference between the present and the past gradients, the step size is adjusted for each parameter. We run several tests benchmarking…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · AI in cancer detection
MethodsRMSProp · Stochastic Gradient Descent · Adam · AdaDelta · AdaGrad
