Evaluating the Reliability of Digital Forensic Evidence Discovered by Large Language Model: A Case Study

Jeel Piyushkumar Khatiwala; Daniel Kwaku Ntiamoah Addai; and Weifeng Xu

arXiv:2602.20202·cs.CR·February 25, 2026

Evaluating the Reliability of Digital Forensic Evidence Discovered by Large Language Model: A Case Study

Jeel Piyushkumar Khatiwala, Daniel Kwaku Ntiamoah Addai, and Weifeng Xu

PDF

Open Access

TL;DR

This paper presents a structured framework utilizing large language models and a digital forensic knowledge graph to automate artifact extraction, validate evidence, and improve the reliability of AI-identified digital forensic evidence.

Contribution

The paper introduces a novel, scalable framework combining LLM analysis and a forensic knowledge graph to enhance the credibility and traceability of AI-derived digital evidence.

Findings

01

Achieved over 95% accuracy in artifact extraction.

02

Ensured strong chain-of-custody support.

03

Maintained robust contextual consistency in forensic relationships.

Abstract

The growing reliance on AI-identified digital evidence raises significant concerns about its reliability, particularly as large language models (LLMs) are increasingly integrated into forensic investigations. This paper proposes a structured framework that automates forensic artifact extraction, refines data through LLM-driven analysis, and validates results using a Digital Forensic Knowledge Graph (DFKG). Evaluated on a 13 GB forensic image dataset containing 61 applications, 2,864 databases, and 5,870 tables, the framework ensures artifact traceability and evidentiary consistency through deterministic Unique Identifiers (UIDs) and forensic cross-referencing. We propose this methodology to address challenges in ensuring the credibility and forensic integrity of AI-identified evidence, reducing classification errors, and advancing scalable, auditable methodologies. A comprehensive case…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital and Cyber Forensics · Digital Media Forensic Detection · Forensic and Genetic Research