Classifying World War II Era Ciphers with Machine Learning

Brooke Dalton; Mark Stamp

arXiv:2307.00501·cs.LG·August 31, 2023

Classifying World War II Era Ciphers with Machine Learning

Brooke Dalton, Mark Stamp

PDF

Open Access

TL;DR

This study evaluates machine learning and deep learning methods for classifying WWII era ciphers solely from ciphertext, achieving over 97% accuracy with 1000 characters, and compares their effectiveness across different scenarios.

Contribution

It systematically compares classic and deep learning models for cipher classification, revealing that traditional models perform comparably to deep learning in this context.

Findings

01

Classic models perform as well as deep learning models.

02

Over 97% accuracy with 1000 characters in the most realistic scenario.

03

More similar ciphers are slightly harder to distinguish.

Abstract

We determine the accuracy with which machine learning and deep learning techniques can classify selected World War II era ciphers when only ciphertext is available. The specific ciphers considered are Enigma, M-209, Sigaba, Purple, and Typex. We experiment with three classic machine learning models, namely, Support Vector Machines (SVM), $k$ -Nearest Neighbors ( $k$ -NN), and Random Forest (RF). We also experiment with four deep learning neural network-based models: Multi-Layer Perceptrons (MLP), Long Short-Term Memory (LSTM), Extreme Learning Machines (ELM), and Convolutional Neural Networks (CNN). Each model is trained on features consisting of histograms, digrams, and raw ciphertext letter sequences. Furthermore, the classification problem is considered under four distinct scenarios: Fixed plaintext with fixed keys, random plaintext with fixed keys, fixed plaintext with random keys, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligence, Security, War Strategy · Handwritten Text Recognition Techniques