Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessments

Regan Bolton; Mohammadreza Sheikhfathollahi; Simon Parkinson; Vanessa Vulovic; Gary Bamford; Dan Basher; Howard Parkinson

arXiv:2505.01307·cs.SE·May 12, 2026

Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessments

Regan Bolton, Mohammadreza Sheikhfathollahi, Simon Parkinson, Vanessa Vulovic, Gary Bamford, Dan Basher, Howard Parkinson

PDF

TL;DR

DRAFT is a novel fine-tuning approach for large language models that enhances safety-critical software assessments by integrating dual document retrieval and a semi-automated dataset generation process.

Contribution

It introduces a new fine-tuning framework with dual-retrieval architecture and a semi-automated dataset creation method for improved compliance assessment.

Findings

01

GPT-4o-mini with DRAFT shows a 7% accuracy improvement.

02

Qualitative enhancements in evidence handling and reasoning.

03

Effective in complex regulatory compliance scenarios.

Abstract

Safety critical software assessment requires robust assessment against complex regulatory frameworks, a process traditionally limited by manual evaluation. This paper presents Document Retrieval-Augmented Fine-Tuning (DRAFT), a novel approach that enhances the capabilities of a large language model (LLM) for safety-critical compliance assessment. DRAFT builds upon existing Retrieval-Augmented Generation (RAG) techniques by introducing a novel fine-tuning framework that accommodates our dual-retrieval architecture, which simultaneously accesses both software documentation and applicable reference standards. To fine-tune DRAFT, we develop a semi-automated dataset generation methodology that incorporates variable numbers of relevant documents with meaningful distractors, closely mirroring real-world assessment scenarios. Experiments with GPT-4o-mini demonstrate a 7% improvement in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.