Probabilistic performance estimators for computational chemistry   methods: the empirical cumulative distribution function of absolute errors

Pascal Pernot; and Andreas Savin

arXiv:1801.03305·physics.chem-ph·March 19, 2018

Probabilistic performance estimators for computational chemistry methods: the empirical cumulative distribution function of absolute errors

Pascal Pernot, and Andreas Savin

PDF

1 Repo

TL;DR

This paper introduces the use of empirical cumulative distribution functions of absolute errors for better benchmarking in computational chemistry, providing more informative error probabilities than traditional statistics.

Contribution

It proposes new error statistics based on the empirical CDF of unsigned errors, improving benchmarking and ranking of computational methods.

Findings

01

Empirical CDF-based error probabilities are more informative.

02

Traditional error statistics do not reliably predict prediction error amplitudes.

03

Standard errors of benchmarking statistics depend on dataset size.

Abstract

Benchmarking studies in computational chemistry use reference datasets to assess the accuracy of a method through error statistics. The commonly used error statistics, such as the mean signed and mean unsigned errors, do not inform end-users on the expected amplitude of prediction errors attached to these methods. We show that, the distributions of model errors being neither normal nor zero-centered, these error statistics cannot be used to infer prediction error probabilities. To overcome this limitation, we advocate for the use of more informative statistics, based on the empirical cumulative distribution function of unsigned errors, namely (1) the probability for a new calculation to have an absolute error below a chosen threshold, and (2) the maximal amplitude of errors one can expect with a chosen high confidence level. Those statistics are also shown to be well suited for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ppernot/ECDFT
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.