Contextual Explanation Networks
Maruan Al-Shedivat, Avinava Dubey, Eric P. Xing

TL;DR
Contextual Explanation Networks (CEN) are a novel architecture that generate instance-specific explanations through intermediate probabilistic models, improving interpretability and performance in classification and survival analysis tasks.
Contribution
CEN introduces a unified model that predicts and explains simultaneously by generating intermediate graphical models, enhancing interpretability and robustness over post-hoc explanation methods.
Findings
CEN achieves competitive accuracy on image, text, and survival tasks.
CEN provides consistent, instance-specific explanations without additional computational cost.
CEN helps detect misleading explanations from post-hoc methods.
Abstract
Modern learning algorithms excel at producing accurate but complex models of the data. However, deploying such models in the real-world requires extra care: we must ensure their reliability, robustness, and absence of undesired biases. This motivates the development of models that are equally accurate but can be also easily inspected and assessed beyond their predictive performance. To this end, we introduce contextual explanation networks (CEN)---a class of architectures that learn to predict by generating and utilizing intermediate, simplified probabilistic models. Specifically, CENs generate parameters for intermediate graphical models which are further used for prediction and play the role of explanations. Contrary to the existing post-hoc model-explanation tools, CENs learn to predict and to explain simultaneously. Our approach offers two major advantages: (i) for each prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Topic Modeling
