What can we Learn by Predicting Accuracy?

Olivier Risser-Maroix; Benjamin Chamand

arXiv:2208.01358·cs.LG·August 24, 2022

What can we Learn by Predicting Accuracy?

Olivier Risser-Maroix, Benjamin Chamand

PDF

Open Access 1 Video

TL;DR

This paper uses a data-driven, experimental approach with symbolic regression to discover an accurate, explainable formula predicting classifier accuracy across many datasets, offering new insights into loss function design.

Contribution

It introduces a novel, experimental method to derive an accurate, interpretable formula for predicting accuracy, challenging traditional intuition-based loss function development.

Findings

01

Discovered a formula with 0.96 Pearson correlation to accuracy

02

The formula explains previous insights on loss design

03

Achieved high correlation across 260 datasets

Abstract

This paper seeks to answer the following question: \textit{"What can we learn by predicting accuracy?"}. Indeed, classification is one of the most popular tasks in machine learning, and many loss functions have been developed to maximize this non-differentiable objective function. Unlike past work on loss function design, which was guided mainly by intuition and theory before being validated by experimentation, here we propose to approach this problem in the opposite way: we seek to extract knowledge by experimentation. This data-driven approach is similar to that used in physics to discover general laws from data. We used a symbolic regression method to automatically find a mathematical expression highly correlated with a linear classifier's accuracy. The formula discovered on more than 260 datasets of embeddings has a Pearson's correlation of 0.96 and a $r^{2}$ of 0.93. More…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

What can we Learn by Predicting Accuracy?· youtube

Taxonomy

TopicsEvolutionary Algorithms and Applications · Neural Networks and Applications · Machine Learning and Data Classification