File fragment recognition based on content and statistical features
Marzieh Masoumi, Ahmad Keshavarz, Reza Fotohi

TL;DR
This paper proposes a method for recognizing file fragments by extracting content and statistical features, reducing feature sets, and classifying with multiple algorithms to improve accuracy in cybercrime investigations.
Contribution
It introduces a new approach combining feature reduction and multiple classifiers for improved file fragment recognition accuracy.
Findings
Achieved higher accuracy than previous methods
Effectively distinguished 6 file types
Reduced feature set improved classification speed
Abstract
Nowadays, the speed up development and use of digital devices such as smartphones have put people at risk of internet crimes. The evidence of present crimes in a computer file can be easily unreachable by changing the prefix of a file or other algorithms. In more complex cases, either file divided into different parts or the parts of a file that has information about the file type are deleted, where the file fragment recognition issue is discussed. The known files are divided into different fragments, and different classification algorithms are used to solve the problems of file fragment recognition. The issue of identifying the type of file fragment due to its importance in cybercrime issues as well as antivirus has been highly emphasized and has been addressed in many articles. Increasing the accuracy in this field on the types of widely used files due to the sensitivity of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
