Model predictivity assessment: incremental test-set selection and   accuracy evaluation

Elias Fekhari (EDF R&D PRISME); Bertrand Iooss (EDF R&D PRISME; IMT,; GdR MASCOT-NUM); Joseph Mur\'e; Luc Pronzato (I3S; GdR MASCOT-NUM),; Maria-Jo\~ao Rendas

arXiv:2207.03724·math.ST·July 11, 2022·SIS

Model predictivity assessment: incremental test-set selection and accuracy evaluation

Elias Fekhari (EDF R&D PRISME), Bertrand Iooss (EDF R&D PRISME, IMT,, GdR MASCOT-NUM), Joseph Mur\'e, Luc Pronzato (I3S, GdR MASCOT-NUM),, Maria-Jo\~ao Rendas

PDF

TL;DR

This paper introduces a new method for assessing model predictivity by optimally selecting test points and weighting errors, improving accuracy over traditional methods, and demonstrated on an industrial electricity prediction case.

Contribution

It proposes a novel predictivity criterion combined with incremental test set selection methods, including support points and kernel herding, for more accurate model evaluation.

Findings

01

Weighted incremental test selection improves prediction error estimates.

02

Kernel herding and support points outperform traditional test set methods.

03

Method reduces reliance on costly cross-validation techniques.

Abstract

Unbiased assessment of the predictivity of models learnt by supervised machine-learning methods requires knowledge of the learned function over a reserved test set (not used by the learning algorithm). The quality of the assessment depends, naturally, on the properties of the test set and on the error statistic used to estimate the prediction error. In this work we tackle both issues, proposing a new predictivity criterion that carefully weights the individual observed errors to obtain a global error estimate, and using incremental experimental design methods to "optimally" select the test points on which the criterion is computed. Several incremental constructions are studied, including greedy-packing (coffee-house design), support points and kernel herding techniques. Our results show that the incremental and weighted versions of the latter two, based on Maximum Mean Discrepancy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.