A Comprehensive Survey on the Risks and Limitations of Concept-based Models
Sanchit Sinha, Aidong Zhang

TL;DR
This survey reviews the risks, limitations, and recent advances of concept-based models, highlighting challenges like concept leakage and robustness issues, and discusses future research directions for their reliable deployment.
Contribution
It provides a comprehensive overview of the challenges and mitigation strategies for concept-based models, emphasizing their limitations and future research avenues.
Findings
Concept leakage and entangled representations are common issues.
Limited robustness to adversarial perturbations affects reliability.
Recent advances aim to improve model interpretability and robustness.
Abstract
Concept-based Models are a class of inherently explainable networks that improve upon standard Deep Neural Networks by providing a rationale behind their predictions using human-understandable `concepts'. With these models being highly successful in critical applications like medical diagnosis and financial risk prediction, there is a natural push toward their wider adoption in sensitive domains to instill greater trust among diverse stakeholders. However, recent research has uncovered significant limitations in the structure of such networks, their training procedure, underlying assumptions, and their susceptibility to adversarial vulnerabilities. In particular, issues such as concept leakage, entangled representations, and limited robustness to perturbations pose challenges to their reliability and generalization. Additionally, the effectiveness of human interventions in these models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Healthcare
MethodsFocus
