A Theoretical Survey on Foundation Models
Shi Fu, Yuzhu Chen, Yingjie Wang, Dacheng Tao

TL;DR
This paper reviews interpretable methods rooted in machine learning theory that aim to understand the inner workings of foundation models more accurately and resource-efficiently than traditional explainability techniques.
Contribution
It provides a comprehensive survey of interpretable methods for foundation models, emphasizing theoretical foundations and practical insights into their mechanisms and behaviors.
Findings
Identifies limitations of post-hoc explainability methods.
Highlights the importance of resource-light, accurate interpretability techniques.
Suggests future research directions for understanding foundation models.
Abstract
Understanding the inner mechanisms of black-box foundation models (FMs) is essential yet challenging in artificial intelligence and its applications. Over the last decade, the long-running focus has been on their explainability, leading to the development of post-hoc explainable methods to rationalize the specific decisions already made by black-box FMs. However, these explainable methods have certain limitations in terms of faithfulness and resource requirement. Consequently, a new class of interpretable methods should be considered to unveil the underlying mechanisms of FMs in an accurate, comprehensive, heuristic, and resource-light way. This survey aims to review those interpretable methods that comply with the aforementioned principles and have been successfully applied to FMs. These methods are deeply rooted in machine learning theory, covering the analysis of generalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Philosophy and History of Science · Credit Risk and Financial Regulations
MethodsFocus
