TL;DR
This paper introduces a personalized model synthesis approach that uses a neural network to estimate user-specific interpretability, enabling tailored, more user-preferred models in high-stakes AI applications.
Contribution
It presents a bi-objective evolutionary algorithm that incorporates user feedback via active learning to personalize interpretability estimation for model synthesis.
Findings
The approach learns diverse interpretability estimations for different users.
Users prefer models generated with personalized interpretability over generic indices.
The method effectively adapts to individual interpretability preferences.
Abstract
High-stakes applications require AI-generated models to be interpretable. Current algorithms for the synthesis of potentially interpretable models rely on objectives or regularization terms that represent interpretability only coarsely (e.g., model size) and are not designed for a specific user. Yet, interpretability is intrinsically subjective. In this paper, we propose an approach for the synthesis of models that are tailored to the user by enabling the user to steer the model synthesis process according to her or his preferences. We use a bi-objective evolutionary algorithm to synthesize models with trade-offs between accuracy and a user-specific notion of interpretability. The latter is estimated by a neural network that is trained concurrently to the evolution using the feedback of the user, which is collected using uncertainty-based active learning. To maximize usability, the user…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
