Robustness of Prompting: Enhancing Robustness of Large Language Models Against Prompting Attacks

Lin Mu; Guowei Chu; Li Ni; Lei Sang; Zhize Wu; Peiquan Jin; Yiwen Zhang

arXiv:2506.03627·cs.CL·June 5, 2025

Robustness of Prompting: Enhancing Robustness of Large Language Models Against Prompting Attacks

Lin Mu, Guowei Chu, Li Ni, Lei Sang, Zhize Wu, Peiquan Jin, Yiwen Zhang

PDF

Open Access

TL;DR

This paper introduces RoP, a novel prompting strategy that enhances the robustness of large language models against input perturbations by using error correction and guidance stages, significantly improving performance on various reasoning tasks.

Contribution

The paper proposes RoP, a new prompting method with error correction and guidance stages, explicitly designed to improve LLM robustness against adversarial input perturbations.

Findings

01

RoP significantly improves robustness against adversarial perturbations.

02

RoP maintains high accuracy with minimal degradation on clean inputs.

03

Experiments show effectiveness across arithmetic, commonsense, and logical reasoning tasks.

Abstract

Large Language Models (LLMs) have demonstrated remarkable performance across various tasks by effectively utilizing a prompting strategy. However, they are highly sensitive to input perturbations, such as typographical errors or slight character order errors, which can substantially degrade their performance. Despite advances in prompting techniques, developing a prompting strategy that explicitly mitigates the negative impact of such perturbations remains an open challenge. To bridge this gap, we propose Robustness of Prompting (RoP), a novel prompting strategy specifically designed to enhance the robustness of LLMs. RoP consists of two stages: Error Correction and Guidance. In the Error Correction stage, RoP applies diverse perturbation methods to generate adversarial examples, which are then used to construct prompts that automatically correct input errors. In the Guidance stage, RoP…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Explainable Artificial Intelligence (XAI)