Is your model predicting the past?

Moritz Hardt; Michael P. Kim

arXiv:2206.11673·cs.LG·March 12, 2024·1 cites

Is your model predicting the past?

Moritz Hardt, Michael P. Kim

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a framework with statistical tests called backward baselines to distinguish whether machine learning models predict future outcomes or simply recite past patterns, supported by theory and empirical evaluation.

Contribution

It proposes a novel set of statistical tests for auditing models to determine if they predict future or just reflect past data, with practical guidance and empirical validation.

Findings

01

Backward baselines effectively distinguish prediction types.

02

Theoretical guidance aids interpretation of baseline results.

03

Empirical tests on survey data demonstrate practical utility.

Abstract

When does a machine learning model predict the future of individuals and when does it recite patterns that predate the individuals? In this work, we propose a distinction between these two pathways of prediction, supported by theoretical, empirical, and normative arguments. At the center of our proposal is a family of simple and efficient statistical tests, called backward baselines, that demonstrate if, and to what extent, a model recounts the past. Our statistical theory provides guidance for interpreting backward baselines, establishing equivalences between different baselines and familiar statistical concepts. Concretely, we derive a meaningful backward baseline for auditing a prediction system as a black box, given only background variables and the system's predictions. Empirically, we evaluate the framework on different prediction tasks derived from longitudinal panel surveys,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

socialfoundations/backward_baselines
noneOfficial

Videos

Is Your Model Predicting the Past?· youtube

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI · Health, Environment, Cognitive Aging