Why are there many equally good models? An Anatomy of the Rashomon Effect
Harsh Parikh

TL;DR
This paper investigates the Rashomon effect in machine learning, explaining why multiple models can perform equally well and categorizing the causes into statistical, structural, and procedural sources, with implications for various applications.
Contribution
It provides a unified framework for understanding the origins of model multiplicity, integrating insights from multiple disciplines and discussing its implications.
Findings
Statistical sources decrease with more data.
Structural sources persist asymptotically and require different data or assumptions.
Procedural sources depend on optimization choices and constraints.
Abstract
The Rashomon effect -- the existence of multiple, distinct models that achieve nearly equivalent predictive performance -- has emerged as a fundamental phenomenon in modern machine learning and statistics. In this paper, we explore the causes underlying the Rashomon effect, organizing them into three categories: statistical sources arising from finite samples and noise in the data-generating process; structural sources arising from non-convexity of optimization objectives and unobserved variables that create fundamental non-identifiability; and procedural sources arising from limitations of optimization algorithms and deliberate restrictions to suboptimal model classes. We synthesize insights from machine learning, statistics, and optimization literature to provide a unified framework for understanding why the multiplicity of good models arises. A key distinction emerges: statistical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI · Machine Learning and Data Classification
