Deep Learning Training Procedure Augmentations

Cristian Simionescu

arXiv:2211.14395·cs.CV·November 29, 2022

Deep Learning Training Procedure Augmentations

Cristian Simionescu

PDF

Open Access

TL;DR

This paper introduces novel deep learning training techniques that enhance performance, convergence speed, and robustness, while providing new insights into the training process and optimization landscape.

Contribution

It presents two new training methods—Perfect Ordering Approximation and Cascading Sum Augmentation—that improve training efficiency and model robustness across tasks.

Findings

01

Improved training time with Perfect Ordering Approximation.

02

Enhanced prediction performance and robustness with Cascading Sum Augmentation.

03

Insights into convergence speed and optimization landscape smoothness.

Abstract

Recent advances in Deep Learning have greatly improved performance on various tasks such as object detection, image segmentation, sentiment analysis. The focus of most research directions up until very recently has been on beating state-of-the-art results. This has materialized in the utilization of bigger and bigger models and techniques which help the training procedure to extract more predictive power out of a given dataset. While this has lead to great results, many of which with real-world applications, other relevant aspects of deep learning have remained neglected and unknown. In this work, we will present several novel deep learning training techniques which, while capable of offering significant performance gains they also reveal several interesting analysis results regarding convergence speed, optimization landscape smoothness, and adversarial robustness. The methods presented…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · COVID-19 diagnosis using AI · Adversarial Robustness in Machine Learning

MethodsMixup