Causal Inference Tools for a Better Evaluation of Machine Learning
Micha\"el Soumm

TL;DR
This paper introduces a framework applying econometric statistical methods like OLS, ANOVA, and logistic regression to enhance the evaluation, interpretability, and fairness analysis of machine learning models.
Contribution
It bridges econometrics and machine learning by providing a comprehensive guide on applying statistical tools for deeper model analysis and evaluation.
Findings
Statistical methods reveal subtle model patterns.
Enhanced understanding of model behavior and fairness.
Guidelines for implementation and interpretation.
Abstract
We present a comprehensive framework for applying rigorous statistical techniques from econometrics to analyze and improve machine learning systems. We introduce key statistical methods such as Ordinary Least Squares (OLS) regression, Analysis of Variance (ANOVA), and logistic regression, explaining their theoretical foundations and practical applications in machine learning evaluation. The document serves as a guide for researchers and practitioners, detailing how these techniques can provide deeper insights into model behavior, performance, and fairness. We cover the mathematical principles behind each method, discuss their assumptions and limitations, and provide step-by-step instructions for their implementation. The paper also addresses how to interpret results, emphasizing the importance of statistical significance and effect size. Through illustrative examples, we demonstrate how…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification
