ChatGPT for Digital Forensic Investigation: The Good, The Bad, and The Unknown
Mark Scanlon, Frank Breitinger, Christopher Hargreaves, Jan-Niclas, Hilgert, John Sheppard

TL;DR
This paper evaluates ChatGPT's capabilities and limitations in digital forensics, highlighting its potential as a supportive tool while cautioning about risks and current unsuitability for certain applications.
Contribution
It provides a comprehensive assessment of GPT-4's performance in digital forensic tasks, identifying strengths, risks, and practical considerations for its use in the field.
Findings
ChatGPT shows potential in artefact understanding and evidence searching.
Risks include inaccuracies and privacy concerns when uploading evidence.
Certain forensic tasks are currently unsuitable for ChatGPT use.
Abstract
The disruptive application of ChatGPT (GPT-3.5, GPT-4) to a variety of domains has become a topic of much discussion in the scientific community and society at large. Large Language Models (LLMs), e.g., BERT, Bard, Generative Pre-trained Transformers (GPTs), LLaMA, etc., have the ability to take instructions, or prompts, from users and generate answers and solutions based on very large volumes of text-based training data. This paper assesses the impact and potential impact of ChatGPT on the field of digital forensics, specifically looking at its latest pre-trained LLM, GPT-4. A series of experiments are conducted to assess its capability across several digital forensic use cases including artefact understanding, evidence searching, code generation, anomaly detection, incident response, and education. Across these topics, its strengths and risks are outlined and a number of general…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital and Cyber Forensics · Artificial Intelligence in Healthcare and Education · Advanced Malware Detection Techniques
Methodstravel james · Attention Is All You Need · Absolute Position Encodings · Label Smoothing · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Transformer · Linear Layer · Multi-Head Attention · Softmax
