Robustness in Large Language Models: A Survey of Mitigation Strategies and Evaluation Metrics

Pankaj Kumar; Subhankar Mishra

arXiv:2505.18658·cs.CL·November 7, 2025

Robustness in Large Language Models: A Survey of Mitigation Strategies and Evaluation Metrics

Pankaj Kumar, Subhankar Mishra

PDF

TL;DR

This survey reviews the challenges of robustness in Large Language Models, analyzing sources of vulnerabilities, mitigation strategies, evaluation metrics, and future research directions to improve reliability in real-world applications.

Contribution

It provides a comprehensive overview of robustness issues in LLMs, categorizes sources of non-robustness, and discusses current mitigation and evaluation strategies.

Findings

01

Identification of key sources of non-robustness in LLMs

02

Analysis of state-of-the-art mitigation techniques

03

Discussion of gaps in current evaluation metrics

Abstract

Large Language Models (LLMs) have emerged as a promising cornerstone for the development of natural language processing (NLP) and artificial intelligence (AI). However, ensuring the robustness of LLMs remains a critical challenge. To address these challenges and advance the field, this survey provides a comprehensive overview of current studies in this area. First, we systematically examine the nature of robustness in LLMs, including its conceptual foundations, the importance of consistent performance across diverse inputs, and the implications of failure modes in real-world applications. Next, we analyze the sources of non-robustness, categorizing intrinsic model limitations, data-driven vulnerabilities, and external adversarial factors that compromise reliability. Following this, we review state-of-the-art mitigation strategies, and then we discuss widely adopted benchmarks, emerging…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.