File fragment recognition based on content and statistical features

Marzieh Masoumi; Ahmad Keshavarz; Reza Fotohi

arXiv:2102.12699·cs.CR·February 26, 2021

TL;DR

This paper proposes a method for recognizing file fragments by extracting content and statistical features, reducing feature sets, and classifying with multiple algorithms to improve accuracy in cybercrime investigations.

Contribution

It introduces a new approach combining feature reduction and multiple classifiers for improved file fragment recognition accuracy.

Findings

01 Achieved higher accuracy than previous methods

02 Effectively distinguished 6 file types

03 Reduced feature set improved classification speed

Abstract

The rapid development and widespread use of digital devices such as smartphones have put people at risk of internet crime. Evidence of a crime stored in a computer file can easily be made unreachable by changing the file's prefix or by other techniques. In more complex cases, either the file is divided into parts, or the parts of the file that carry information about its type are deleted; this is where the file fragment recognition problem arises. To address it, known files are divided into fragments, and different classification algorithms are applied to recognize the type of each fragment. Because of its importance in cybercrime investigation and in antivirus software, the problem of identifying the type of a file fragment has received considerable attention and has been addressed in many articles. Increasing the accuracy in this field on the types of widely used files due to the sensitivity of the…
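
To make the idea of splitting known files into fragments and describing each fragment by statistical features more concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the 512-byte fragment size, the function names, and the choice of a byte-frequency histogram plus Shannon entropy as features are all hypothetical, since the excerpt above does not list the features the paper uses.

```python
import math
from collections import Counter


def split_fragments(data: bytes, size: int = 512) -> list[bytes]:
    """Split data into fixed-size fragments; a short trailing piece is dropped.

    The 512-byte default is an assumption for illustration only.
    """
    return [data[i:i + size] for i in range(0, len(data) - size + 1, size)]


def shannon_entropy(fragment: bytes) -> float:
    """Entropy in bits per byte: 0 for constant data, 8 for uniform bytes."""
    if not fragment:
        return 0.0
    n = len(fragment)
    return -sum((c / n) * math.log2(c / n) for c in Counter(fragment).values())


def byte_histogram(fragment: bytes) -> list[float]:
    """Relative frequency of each of the 256 possible byte values."""
    counts = Counter(fragment)
    n = len(fragment) or 1  # avoid division by zero on an empty fragment
    return [counts.get(b, 0) / n for b in range(256)]


def fragment_features(fragment: bytes) -> list[float]:
    """Hypothetical feature vector: 256 histogram bins followed by entropy."""
    return byte_histogram(fragment) + [shannon_entropy(fragment)]
```

A feature vector like this would then be passed through a feature-reduction step and on to one or more classifiers; those stages are omitted here because the excerpt does not specify which methods are used.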
