ChartAttack: Testing the Vulnerability of LLMs to Malicious Prompting in Chart Generation

Jesus-German Ortiz-Barajas; Jonathan Tonglet; Vivek Gupta; Iryna Gurevych

arXiv:2601.12983·cs.CL·March 26, 2026

ChartAttack: Testing the Vulnerability of LLMs to Malicious Prompting in Chart Generation

Jesus-German Ortiz-Barajas, Jonathan Tonglet, Vivek Gupta, Iryna Gurevych

PDF

Open Access

TL;DR

This paper introduces ChartAttack, a framework to evaluate and improve the robustness of multimodal large language models against maliciously manipulated chart prompts that can mislead data interpretation.

Contribution

We propose ChartAttack and AttackViz, novel tools for testing and enhancing MLLMs' resistance to misleading chart generation and interpretation.

Findings

01

ChartAttack reduces MLLM QA accuracy by 17.2 points in-domain.

02

AttackViz effectively identifies misleading chart prompts.

03

Fine-tuning with AttackViz improves model robustness.

Abstract

Multimodal large language models (MLLMs) are increasingly used to automate chart generation from data tables, enabling efficient data analysis and reporting but also introducing new misuse risks. In this work, we introduce ChartAttack, a novel framework for evaluating how MLLMs can be misused to generate misleading charts at scale. ChartAttack injects misleaders into chart designs, aiming to induce incorrect interpretations of the underlying data. Furthermore, we create AttackViz, a chart question-answering (QA) dataset where each (chart specification, QA) pair is labeled with effective misleaders and their induced incorrect answers. ChartAttack significantly degrades QA performance, reducing MLLM accuracy by 17.2 points in-domain and 11.9 cross-domain. Preliminary human results (limited sample size) indicate a 20.2-point accuracy drop. Finally, we demonstrate that AttackViz can be used…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Natural Language Processing Techniques