The Strong, Weak and Benign Goodhart's law. An independence-free and paradigm-agnostic formalisation

Adrien Majka; El-Mahdi El-Mhamdi

arXiv:2505.23445·stat.ML·September 5, 2025

The Strong, Weak and Benign Goodhart's law. An independence-free and paradigm-agnostic formalisation

Adrien Majka, El-Mahdi El-Mhamdi

PDF

Open Access

TL;DR

This paper provides a formal analysis of Goodhart's law, examining how dependence between proxy metrics and goals affects optimization outcomes, especially under different tail distribution assumptions, and introduces a paradigm-agnostic framework.

Contribution

It relaxes previous independence assumptions and offers a formal, general framework to understand Goodhart's law across various learning paradigms and dependence scenarios.

Findings

01

Dependence does not alter Goodhart's effect with light-tailed goal and discrepancy.

02

Heavy-tailed discrepancy can cause over-optimization inversely proportional to tail heaviness.

03

The framework is paradigm-agnostic and applicable to diverse settings.

Abstract

Goodhart's law is a famous adage in policy-making that states that ``When a measure becomes a target, it ceases to be a good measure''. As machine learning models and the optimisation capacity to train them grow, growing empirical evidence reinforced the belief in the validity of this law without however being formalised. Recently, a few attempts were made to formalise Goodhart's law, either by categorising variants of it, or by looking at how optimising a proxy metric affects the optimisation of an intended goal. In this work, we alleviate the simplifying independence assumption, made in previous works, and the assumption on the learning paradigm made in most of them, to study the effect of the coupling between the proxy metric and the intended goal on Goodhart's law. Our results show that in the case of light tailed goal and light tailed discrepancy, dependence does not change the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Computational and Text Analysis Methods