In Transformer We Trust? A Perspective on Transformer Architecture Failure Modes
Trishit Mondal, Ameya D. Jagtap

TL;DR
This paper critically assesses the trustworthiness of transformer architectures across various high-stakes domains, highlighting vulnerabilities, risks, and open challenges to ensure their reliable deployment.
Contribution
It provides a comprehensive review of transformer reliability issues, including interpretability, robustness, fairness, and privacy, across multiple scientific and engineering fields.
Findings
Identifies structural vulnerabilities in transformer models.
Highlights domain-specific risks in safety-critical applications.
Outlines open research challenges for trustworthy deployment.
Abstract
Transformer architectures have revolutionized machine learning across a wide range of domains, from natural language processing to scientific computing. However, their growing deployment in high-stakes applications, such as computer vision, natural language processing, healthcare, autonomous systems, and critical areas of scientific computing including climate modeling, materials discovery, drug discovery, nuclear science, and robotics, necessitates a deeper and more rigorous understanding of their trustworthiness. In this work, we critically examine the foundational question: \textitHow trustworthy are transformer models?} We evaluate their reliability through a comprehensive review of interpretability, explainability, robustness against adversarial attacks, fairness, and privacy. We systematically examine the trustworthiness of transformer-based models in safety-critical applications…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Security and Verification in Computing · Explainable Artificial Intelligence (XAI)
