DIVERSE: Disagreement-Inducing Vector Evolution for Rashomon Set Exploration
Gilles Eerlings, Brent Zoomers, Jori Liesenborgs, Gustavo Rovelo Ruiz, Kris Luyten

TL;DR
DIVERSE is a framework that efficiently explores the Rashomon set of neural networks by generating diverse high-performing models without retraining, using feature modulation and evolutionary strategies.
Contribution
It introduces a novel method combining FiLM layers and CMA-ES to systematically explore model diversity within the Rashomon set without retraining.
Findings
DIVERSE uncovers multiple functionally distinct models with high accuracy.
The method achieves diversity comparable to retraining-based approaches.
It reduces computational costs while exploring model multiplicity.
Abstract
We propose DIVERSE, a framework for systematically exploring the Rashomon set of deep neural networks, the collection of models that match a reference model's accuracy while differing in their predictive behavior. DIVERSE augments a pretrained model with Feature-wise Linear Modulation (FiLM) layers and uses Covariance Matrix Adaptation Evolution Strategy (CMA-ES) to search a latent modulation space, generating diverse model variants without retraining or gradient access. Across MNIST, PneumoniaMNIST, and CIFAR-10, DIVERSE uncovers multiple high-performing yet functionally distinct models. Our experiments show that DIVERSE offers a competitive and efficient exploration of the Rashomon set, making it feasible to construct diverse sets that maintain robustness and performance while supporting well-balanced model multiplicity. While retraining remains the baseline to generate Rashomon sets,…
Peer Reviews
Decision·ICLR 2026 Poster
- Innovative approximation of Quality-Diversity evolution - Well written paper
- Limited Ablation study on evolution side.
1. The paper is well-written with a clear objective, all notions are introduced clearly. 2. The proposed approach has the potential to be a fundamental tool in many domains of machine learning. 3. The proposed approach is sound and well grounded in the literature. 4. The ablation study is quite furnished and the experimental protocol sound
1. The paper lacks qualitative or illustrative experiments to helps the reader gain intuitions on the significance of the results. 2. The paper lacks experiments with downstream tasks (uncertainty quantification, ensembling, ...) to better asses if generated models with DIVERSE are actually effective for practical tasks. 3. The paper lacks quantification about how smaller the explored set of models induced by FiLM is to the true $\epsilon$-Rashomon set.
This paper proposes a relatively simple, inexpensive and elegant solution to the Rashomon Set exploration problem. S1: The proposed method is relatively simple, yet effective. This makes it very relevant to solve the Rashomon Set exploration problem. S2: The evaluation of the algorithm is good, with metrics covering both class predictions and output probabilities. Furthermore, multiple hyperparameters are explored and DIVERSE is compared to existing baselines. S3: The paper is generally clear
I think that the paper is overall quite solid. However, I believe its main weak points are related to its impact and motivation. W1: Based on the Introduction and the Conclusion of the paper, I do not understand why Rashomon Set exploration is an important problem to solve. I would like the authors to better motivate why their work is impactful for research in Machine Learning. W2: Furthermore, I think the results insufficiently show that DIVERSE is better than dropout-based Rashomon explorati
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
