On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector   Regression

Jun Qi; Jun Du; Sabato Marco Siniscalchi; Xiaoli Ma; Chin-Hui Lee

arXiv:2008.07281·eess.AS·August 18, 2020

On Mean Absolute Error for Deep Neural Network Based Vector-to-Vector Regression

Jun Qi, Jun Du, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee

PDF

TL;DR

This paper analyzes the properties of mean absolute error (MAE) as a loss function for deep neural network vector-to-vector regression, demonstrating its advantages over mean squared error (MSE) through theoretical bounds and speech enhancement experiments.

Contribution

It presents new theoretical bounds for MAE in DNN regression and shows its suitability over MSE, supported by speech enhancement results.

Findings

01

MAE has a Lipschitz continuity property useful for performance bounds

02

MAE can be interpreted as modeling Laplacian errors

03

Speech enhancement experiments favor MAE over MSE

Abstract

In this paper, we exploit the properties of mean absolute error (MAE) as a loss function for the deep neural network (DNN) based vector-to-vector regression. The goal of this work is two-fold: (i) presenting performance bounds of MAE, and (ii) demonstrating new properties of MAE that make it more appropriate than mean squared error (MSE) as a loss function for DNN based vector-to-vector regression. First, we show that a generalized upper-bound for DNN-based vector- to-vector regression can be ensured by leveraging the known Lipschitz continuity property of MAE. Next, we derive a new generalized upper bound in the presence of additive noise. Finally, in contrast to conventional MSE commonly adopted to approximate Gaussian errors for regression, we show that MAE can be interpreted as an error modeled by Laplacian distribution. Speech enhancement experiments are conducted to corroborate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.