Using Convolutional Neural Networks to Detect Compression Algorithms

Shubham Bharadwaj

arXiv:2111.09034·cs.CV·January 13, 2022

Using Convolutional Neural Networks to Detect Compression Algorithms

Shubham Bharadwaj

PDF

Open Access

TL;DR

This paper demonstrates how convolutional neural networks can effectively identify different compression algorithms used on file fragments, aiding digital forensic analysis.

Contribution

It introduces a CNN-based approach to classify compression algorithms, addressing a gap in digital forensics literature.

Findings

01

Accurately identified compress, lzip, and bzip2 algorithms

02

Used a dataset of files compressed with various algorithms

03

Showed CNN's effectiveness in compression algorithm detection

Abstract

Machine learning is penetrating various domains virtually, thereby proliferating excellent results. It has also found an outlet in digital forensics, wherein it is becoming the prime driver of computational efficiency. A prominent feature that exhibits the effectiveness of ML algorithms is feature extraction that can be instrumental in the applications for digital forensics. Convolutional Neural Networks are further used to identify parts of the file. To this end, we observed that the literature does not include sufficient information about the identification of the algorithms used to compress file fragments. With this research, we attempt to address this gap as compression algorithms are beneficial in generating higher entropy comparatively as they make the data more compact. We used a base dataset, compressed every file with various algorithms, and designed a model based on that. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital and Cyber Forensics · Digital Media Forensic Detection · Advanced Malware Detection Techniques