Metrics for Benchmarking and Uncertainty Quantification: Quality, Applicability, and a Path to Best Practices for Machine Learning in Chemistry

Gaurav Vishwakarma; Aditya Sonpal; Johannes Hachmann

arXiv:2010.00110 · physics.chem-ph · January 26, 2021

TL;DR

This review emphasizes the importance of proper metrics for benchmarking and quantifying uncertainty in machine learning models for chemistry, highlighting their role in model validation, comparison, and establishing best practices.

Contribution

It discusses the current state and challenges of statistical metrics and uncertainty quantification in chemical machine learning, proposing a pathway toward standardized best practices.

Findings

01. Metrics are often overlooked in chemical ML validation

02. Uncertainty quantification enhances model reliability and applicability

03. Guidelines for best practices are proposed

Abstract

This review draws attention to two issues of concern in making machine learning work in the chemical and materials domain: statistical loss function metrics for the validation and benchmarking of data-derived models, and the uncertainty quantification of the predictions they make. These topics are often overlooked or underappreciated, as chemists typically have only limited training in statistics. Aside from helping to assess the quality, reliability, and applicability of a given model, these metrics are also key to comparing the performance of different models and thus to developing guidelines and best practices for the successful application of machine learning in chemistry.
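As a rough illustration of the kind of loss function metrics the abstract refers to, the sketch below computes three widely used regression metrics (MAE, RMSE, and R²) and a simple bootstrap interval on one of them. The function names, the toy data, and the choice of a percentile bootstrap are assumptions made for this example; they are not taken from the paper and should not be read as the authors' recommended procedure.

```python
import math
import random


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error; penalizes large errors more than MAE."""
    return math.sqrt(
        sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    )


def r2(y_true, y_pred):
    """Coefficient of determination (undefined if y_true is constant)."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def bootstrap_interval(metric, y_true, y_pred, n_resamples=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for a metric (illustrative helper).

    Resamples (true, predicted) pairs with replacement and returns the
    alpha/2 and 1 - alpha/2 percentiles of the resampled metric values.
    """
    rng = random.Random(seed)
    n = len(y_true)
    scores = []
    for _ in range(n_resamples):
        idx = [rng.randrange(n) for _ in range(n)]
        scores.append(metric([y_true[i] for i in idx], [y_pred[i] for i in idx]))
    scores.sort()
    lower = scores[int((alpha / 2) * n_resamples)]
    upper = scores[min(n_resamples - 1, int((1 - alpha / 2) * n_resamples))]
    return lower, upper


# Toy data, invented for this example only.
y_true = [1.0, 2.0, 3.0, 4.0]
y_pred = [1.5, 2.0, 2.5, 4.0]

mae_value = mae(y_true, y_pred)    # 0.25
rmse_value = rmse(y_true, y_pred)  # sqrt(0.125), about 0.354
r2_value = r2(y_true, y_pred)      # 0.9
mae_low, mae_high = bootstrap_interval(mae, y_true, y_pred)
```

Reporting several metrics side by side, together with an interval on each, gives a fuller picture than a single number: MAE and RMSE diverge when a few predictions are badly off, and a wide interval signals that the test set is too small to support a firm comparison between models. The bootstrap here quantifies uncertainty in the metric itself, which is a different question from the uncertainty of an individual prediction.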
