ChatGPT for Digital Forensic Investigation: The Good, The Bad, and The   Unknown

Mark Scanlon; Frank Breitinger; Christopher Hargreaves; Jan-Niclas; Hilgert; John Sheppard

arXiv:2307.10195·cs.CR·July 21, 2023·1 cites

ChatGPT for Digital Forensic Investigation: The Good, The Bad, and The Unknown

Mark Scanlon, Frank Breitinger, Christopher Hargreaves, Jan-Niclas, Hilgert, John Sheppard

PDF

Open Access 1 Repo

TL;DR

This paper evaluates ChatGPT's capabilities and limitations in digital forensics, highlighting its potential as a supportive tool while cautioning about risks and current unsuitability for certain applications.

Contribution

It provides a comprehensive assessment of GPT-4's performance in digital forensic tasks, identifying strengths, risks, and practical considerations for its use in the field.

Findings

01

ChatGPT shows potential in artefact understanding and evidence searching.

02

Risks include inaccuracies and privacy concerns when uploading evidence.

03

Certain forensic tasks are currently unsuitable for ChatGPT use.

Abstract

The disruptive application of ChatGPT (GPT-3.5, GPT-4) to a variety of domains has become a topic of much discussion in the scientific community and society at large. Large Language Models (LLMs), e.g., BERT, Bard, Generative Pre-trained Transformers (GPTs), LLaMA, etc., have the ability to take instructions, or prompts, from users and generate answers and solutions based on very large volumes of text-based training data. This paper assesses the impact and potential impact of ChatGPT on the field of digital forensics, specifically looking at its latest pre-trained LLM, GPT-4. A series of experiments are conducted to assess its capability across several digital forensic use cases including artefact understanding, evidence searching, code generation, anomaly detection, incident response, and education. Across these topics, its strengths and risks are outlined and a number of general…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

markscanlonucd/chatgpt-for-digital-forensics
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital and Cyber Forensics · Artificial Intelligence in Healthcare and Education · Advanced Malware Detection Techniques

Methodstravel james · Attention Is All You Need · Absolute Position Encodings · Label Smoothing · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Transformer · Linear Layer · Multi-Head Attention · Softmax