Chasing Shadows: Pitfalls in LLM Security Research

Jonathan Evertz; Niklas Risse; Nicolai Neuer; Andreas M\"uller; Philipp Normann; Gaetano Sapia; Srishti Gupta; David Pape; Soumya Shaw; Devansh Srivastav; Christian Wressnegger; Erwin Quiring; Thorsten Eisenhofer; Daniel Arp; Lea Sch\"onherr

arXiv:2512.09549·cs.CR·December 16, 2025

Chasing Shadows: Pitfalls in LLM Security Research

Jonathan Evertz, Niklas Risse, Nicolai Neuer, Andreas M\"uller, Philipp Normann, Gaetano Sapia, Srishti Gupta, David Pape, Soumya Shaw, Devansh Srivastav, Christian Wressnegger, Erwin Quiring, Thorsten Eisenhofer, Daniel Arp, Lea Sch\"onherr

PDF

Open Access

TL;DR

This paper identifies nine common pitfalls in LLM security research that threaten validity, assesses their prevalence in recent papers, and provides guidelines to improve research rigor and reproducibility.

Contribution

It introduces a comprehensive list of pitfalls specific to LLM security research and evaluates their impact across recent publications, offering practical recommendations.

Findings

01

Every paper contained at least one pitfall

02

Most pitfalls are unrecognized in current research

03

Empirical case studies show pitfalls can mislead evaluation

Abstract

Large language models (LLMs) are increasingly prevalent in security research. Their unique characteristics, however, introduce challenges that undermine established paradigms of reproducibility, rigor, and evaluation. Prior work has identified common pitfalls in traditional machine learning research, but these studies predate the advent of LLMs. In this paper, we identify nine common pitfalls that have become (more) relevant with the emergence of LLMs and that can compromise the validity of research involving them. These pitfalls span the entire computation process, from data collection, pre-training, and fine-tuning to prompting and evaluation. We assess the prevalence of these pitfalls across all 72 peer-reviewed papers published at leading Security and Software Engineering venues between 2023 and 2024. We find that every paper contains at least one pitfall, and each pitfall appears…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInformation and Cyber Security · Advanced Malware Detection Techniques · Adversarial Robustness in Machine Learning