Comparison of Entropy Calculation Methods for Ransomware Encrypted File Identification
Simon R Davies, Richard Macfarlane, William J. Buchanan

TL;DR
This paper compares 53 entropy calculation methods to improve ransomware encrypted file detection, aiming to identify the most effective techniques for distinguishing encrypted data from other file types.
Contribution
It provides a comprehensive evaluation of various entropy tests, highlighting the most accurate methods for ransomware detection based on entropy analysis.
Findings
Shannon entropy is commonly used but not always optimal.
Certain tests outperform others in differentiating encrypted files.
The study identifies the most reliable entropy methods for ransomware detection.
Abstract
Ransomware is a malicious class of software that utilises encryption to implement an attack on system availability. The target's data remains encrypted and is held captive by the attacker until a ransom demand is met. A common approach used by many crypto-ransomware detection techniques is to monitor file system activity and attempt to identify encrypted files being written to disk, often using a file's entropy as an indicator of encryption. However, often in the description of these techniques, little or no discussion is made as to why a particular entropy calculation technique is selected or any justification given as to why one technique is selected over the alternatives. The Shannon method of entropy calculation is the most commonly-used technique when it comes to file encryption identification in crypto-ransomware detection techniques. Overall, correctly encrypted data should be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
