Nth Absolute Root Mean Error

Siddhartha Dhar Choudhury; Shashank Pandey

arXiv:1810.00421·cs.LG·December 16, 2019

Nth Absolute Root Mean Error

Siddhartha Dhar Choudhury, Shashank Pandey

PDF

TL;DR

The paper introduces NARME, a novel loss function that significantly accelerates neural network training for regression tasks, reducing epochs needed and maintaining high accuracy, especially beneficial for large datasets.

Contribution

It proposes NARME, a new loss function that improves training speed and efficiency for neural network regression models compared to existing loss functions.

Findings

01

NARME reduces training epochs to about one-tenth of traditional loss functions.

02

NARME achieves high accuracy with less training time.

03

Experiments demonstrate faster convergence using NARME.

Abstract

Neural network training process takes long time when the size of training data is huge, without the large set of training values the neural network is unable to learn features. This dilemma between time and size of data is often solved using fast GPUs, but we present a better solution for a subset of those problems. To reduce the time for training a regression model using neural network we introduce a loss function called Nth Absolute Root Mean Error (NARME). It helps to train regression models much faster compared to other existing loss functions. Experiments show that in most use cases NARME reduces the required number of epochs to almost one-tenth of that required by other commonly used loss functions, and also achieves great accuracy in the small amount of time in which it was trained.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.