The Double-Edged Nature of the Rashomon Set for Trustworthy Machine Learning
Ethan Hsu, Harry Chen, Chudi Zhong, Lesia Semenova

TL;DR
This paper explores how the multiplicity of near-optimal models in the Rashomon set impacts trustworthiness, revealing a trade-off between robustness and privacy in machine learning.
Contribution
It provides a theoretical and empirical analysis of the Rashomon set's dual role, highlighting its effects on robustness and privacy in ML models.
Findings
Diverse models enable reactive robustness against attacks.
Sparse interpretable models preserve privacy but are fragile.
Diversity in the Rashomon set increases information leakage.
Abstract
Real-world machine learning (ML) pipelines rarely produce a single model; instead, they produce a Rashomon set of many near-optimal ones. We show that this multiplicity reshapes key aspects of trustworthiness. At the individual-model level, sparse interpretable models tend to preserve privacy but are fragile to adversarial attacks. In contrast, the diversity within a large Rashomon set enables reactive robustness: even when an attack breaks one model, others often remain accurate. Rashomon sets are also stable under small distribution shifts. However, this same diversity increases information leakage, as disclosing more near-optimal models provides an attacker with progressively richer views of the training data. Through theoretical analysis and empirical studies of sparse decision trees and linear models, we characterize this robustness-privacy trade-off and highlight the dual role of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Explainable Artificial Intelligence (XAI)
