Vulnerability of LLMs to Vertically Aligned Text Manipulations

Zhecheng Li; Yiwei Wang; Bryan Hooi; Yujun Cai; Zhen Xiong; Nanyun Peng; Kai-wei Chang

arXiv:2410.20016·cs.CL·September 29, 2025

Vulnerability of LLMs to Vertically Aligned Text Manipulations

Zhecheng Li, Yiwei Wang, Bryan Hooi, Yujun Cai, Zhen Xiong, Nanyun Peng, Kai-wei Chang

PDF

Open Access

TL;DR

This paper investigates how vertically aligned text manipulations significantly impair the performance of large language models in classification tasks, revealing vulnerabilities related to tokenization and attention mechanisms.

Contribution

It provides a comprehensive analysis of the impact of vertical text input on various LLMs and explores underlying causes, offering insights into model vulnerabilities and potential mitigation strategies.

Findings

01

Vertical text significantly reduces LLM accuracy in classification.

02

Chain-of-Thought reasoning does not mitigate vertical input issues.

03

Tokenization and attention mechanisms are key causes of vulnerability.

Abstract

Vertical text input is commonly encountered in various real-world applications, such as mathematical computations and word-based Sudoku puzzles. While current large language models (LLMs) have excelled in natural language tasks, they remain vulnerable to variations in text formatting. Recent research demonstrates that modifying input formats, such as vertically aligning words for encoder-based models, can substantially lower accuracy in text classification tasks. While easily understood by humans, these inputs can significantly mislead models, posing a potential risk of bypassing detection in real-world scenarios involving harmful or sensitive information. With the expanding application of LLMs, a crucial question arises: Do decoder-based LLMs exhibit similar vulnerabilities to vertically formatted text input? In this paper, we investigate the impact of vertical text input on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSecurity and Verification in Computing · Digital Rights Management and Security · Advanced Malware Detection Techniques

MethodsSoftmax · Attention Is All You Need