On the definition and importance of interpretability in scientific machine learning
Conor Rowan, Alireza Doostan

TL;DR
This paper discusses the concept of interpretability in scientific machine learning, emphasizing understanding mechanisms over sparsity, and proposes a clear, operational definition tailored for physical sciences.
Contribution
It clarifies the definition of interpretability in SciML, critiques existing notions, and introduces an operational, mechanism-focused interpretation to guide future research.
Findings
Sparsity is often conflated with interpretability in SciML.
Existing interpretability methods from outside sciences are inadequate for SciML.
A mechanism-based interpretability definition can better guide scientific discovery.
Abstract
Though neural networks trained on large datasets have been successfully used to describe and predict many physical phenomena, there is a sense among scientists that, unlike traditional scientific models comprising simple mathematical expressions, their findings cannot be integrated into the body of scientific knowledge. Critics of machine learning's inability to produce human-understandable relationships have converged on the concept of "interpretability" as its point of departure from more traditional forms of science. As the growing interest in interpretability has shown, researchers in the physical sciences seek not just predictive models, but also to uncover the fundamental principles that govern a system of interest. However, clarity around a definition of interpretability and the precise role that it plays in science is lacking in the literature. In this work, we argue that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
