Classifying World War II Era Ciphers with Machine Learning
Brooke Dalton, Mark Stamp

TL;DR
This study evaluates machine learning and deep learning methods for classifying WWII era ciphers solely from ciphertext, achieving over 97% accuracy with 1000 characters, and compares their effectiveness across different scenarios.
Contribution
It systematically compares classic and deep learning models for cipher classification, revealing that traditional models perform comparably to deep learning in this context.
Findings
Classic models perform as well as deep learning models.
Over 97% accuracy with 1000 characters in the most realistic scenario.
More similar ciphers are slightly harder to distinguish.
Abstract
We determine the accuracy with which machine learning and deep learning techniques can classify selected World War II era ciphers when only ciphertext is available. The specific ciphers considered are Enigma, M-209, Sigaba, Purple, and Typex. We experiment with three classic machine learning models, namely, Support Vector Machines (SVM), -Nearest Neighbors (-NN), and Random Forest (RF). We also experiment with four deep learning neural network-based models: Multi-Layer Perceptrons (MLP), Long Short-Term Memory (LSTM), Extreme Learning Machines (ELM), and Convolutional Neural Networks (CNN). Each model is trained on features consisting of histograms, digrams, and raw ciphertext letter sequences. Furthermore, the classification problem is considered under four distinct scenarios: Fixed plaintext with fixed keys, random plaintext with fixed keys, fixed plaintext with random keys, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligence, Security, War Strategy · Handwritten Text Recognition Techniques
