Benchmarking Large Language Models for Cryptanalysis and Side-Channel Vulnerabilities

Utsav Maskey; Chencheng Zhu; Usman Naseem

arXiv:2505.24621·cs.CL·September 18, 2025

Benchmarking Large Language Models for Cryptanalysis and Side-Channel Vulnerabilities

Utsav Maskey, Chencheng Zhu, Usman Naseem

PDF

Open Access 1 Datasets

TL;DR

This paper evaluates the ability of large language models to perform cryptanalysis and identify vulnerabilities in cryptographic systems, revealing their strengths and limitations in security-related tasks.

Contribution

It introduces a new benchmark dataset and systematically assesses LLMs' cryptanalytic capabilities using zero-shot and few-shot prompts.

Findings

01

LLMs can decrypt certain ciphertexts with moderate success

02

Side-channel vulnerabilities can be exploited by LLMs

03

LLMs show limitations in generalizing across diverse cryptographic contexts

Abstract

Recent advancements in large language models (LLMs) have transformed natural language understanding and generation, leading to extensive benchmarking across diverse tasks. However, cryptanalysis - a critical area for data security and its connection to LLMs' generalization abilities - remains underexplored in LLM evaluations. To address this gap, we evaluate the cryptanalytic potential of state-of-the-art LLMs on ciphertexts produced by a range of cryptographic algorithms. We introduce a benchmark dataset of diverse plaintexts, spanning multiple domains, lengths, writing styles, and topics, paired with their encrypted versions. Using zero-shot and few-shot settings along with chain-of-thought prompting, we assess LLMs' decryption success rate and discuss their comprehension abilities. Our findings reveal key insights into LLMs' strengths and limitations in side-channel scenarios and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Sakonii/EncryptionDataset
dataset· 11 dl
11 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCryptographic Implementations and Security · Advanced Malware Detection Techniques