In Transformer We Trust? A Perspective on Transformer Architecture Failure Modes

Trishit Mondal; Ameya D. Jagtap

arXiv:2602.14318·cs.LG·February 17, 2026

In Transformer We Trust? A Perspective on Transformer Architecture Failure Modes

Trishit Mondal, Ameya D. Jagtap

PDF

Open Access

TL;DR

This paper critically assesses the trustworthiness of transformer architectures across various high-stakes domains, highlighting vulnerabilities, risks, and open challenges to ensure their reliable deployment.

Contribution

It provides a comprehensive review of transformer reliability issues, including interpretability, robustness, fairness, and privacy, across multiple scientific and engineering fields.

Findings

01

Identifies structural vulnerabilities in transformer models.

02

Highlights domain-specific risks in safety-critical applications.

03

Outlines open research challenges for trustworthy deployment.

Abstract

Transformer architectures have revolutionized machine learning across a wide range of domains, from natural language processing to scientific computing. However, their growing deployment in high-stakes applications, such as computer vision, natural language processing, healthcare, autonomous systems, and critical areas of scientific computing including climate modeling, materials discovery, drug discovery, nuclear science, and robotics, necessitates a deeper and more rigorous understanding of their trustworthiness. In this work, we critically examine the foundational question: \textitHow trustworthy are transformer models?} We evaluate their reliability through a comprehensive review of interpretability, explainability, robustness against adversarial attacks, fairness, and privacy. We systematically examine the trustworthiness of transformer-based models in safety-critical applications…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Security and Verification in Computing · Explainable Artificial Intelligence (XAI)