AutoDFBench 1.0: A Benchmarking Framework for Digital Forensic Tool Testing and Generated Code Evaluation
Akila Wickramasekara, Tharusha Mihiranga, Aruna Withanage, Buddhima Weerasinghe, Frank Breitinger, John Sheppard, Mark Scanlon

TL;DR
AutoDFBench 1.0 is a comprehensive, automated benchmarking framework for digital forensic tools and code, supporting diverse forensic tasks and AI-generated code, enabling standardized, reproducible evaluations.
Contribution
It introduces the first unified, extensible benchmarking framework that automates validation and comparison of digital forensic tools and scripts across multiple forensic tasks.
Findings
Supports evaluation of conventional and AI-generated forensic code
Includes extensive ground truth data with 63 test cases and nearly 11,000 scenarios
Provides structured, standardized metrics for fair comparison
Abstract
The National Institute of Standards and Technology (NIST) Computer Forensic Tool Testing (CFTT) programme has become the de facto standard for providing digital forensic tool testing and validation. However to date, no comprehensive framework exists to automate benchmarking across the diverse forensic tasks included in the programme. This gap results in inconsistent validation, challenges in comparing tools, and limited validation reproducibility. This paper introduces AutoDFBench 1.0, a modular benchmarking framework that supports the evaluation of both conventional DF tools and scripts, as well as AI-generated code and agentic approaches. The framework integrates five areas defined by the CFTT programme: string search, deleted file recovery, file carving, Windows registry recovery, and SQLite data recovery. AutoDFBench 1.0 includes ground truth data comprising of 63 test cases and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital and Cyber Forensics · Forensic Fingerprint Detection Methods · Forensic and Genetic Research
