Jigsaw Cryptanalysis of Audio Scrambling Systems
Hamzeh Ghasemzadeh, Mehdi Tajik Khass, Hamed Mehrara

TL;DR
This paper presents a cipher-text only attack on speech permutation-only ciphers, exploiting speech redundancies and advanced techniques to successfully recover audio signals with high intelligibility, surpassing previous methods.
Contribution
It introduces a novel cipher-text only attack on speech ciphers using a combination of signal processing and puzzle-solving techniques, demonstrating effective decryption without plaintext access.
Findings
Achieves 87.8% objective intelligibility
Achieves 92.9% subjective intelligibility
Outperforms previous methods by over 50% in objective scores
Abstract
Recently it was shown that permutation-only multimedia ciphers can completely be broken in a chosen-plaintext scenario. Apparently, chosen-plaintext scenario models a very resourceful adversary and does not hold in many practical situations. To show that these ciphers are totally broken, we propose a cipher-text only attack on these ciphers. To that end, we investigate speech permutation-only ciphers and show that inherent redundancies of speech signal can pave the path for a successful cipher-text only attack. For this task different concepts and techniques are merged together. First, Short Time Fourier Transform (STFT) is employed to extract regularities of audio signal in both time and frequency. Then, it is shown that cipher-texts can be considered as a set of scrambled puzzles. Then different techniques such as estimation, image processing, branch and bound, and graph theory are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Digital Media Forensic Detection · Handwritten Text Recognition Techniques
