Security Document Classification with a Fine-Tuned Local Large Language Model: Benchmark Data and an Open-Source System

Ivan Dobrovolskyi

arXiv:2605.20368·cs.CR·May 21, 2026

Security Document Classification with a Fine-Tuned Local Large Language Model: Benchmark Data and an Open-Source System

Ivan Dobrovolskyi

PDF

TL;DR

This paper introduces TorchSight, an open-source local system using a fine-tuned Qwen 3.5 27B model for security document classification, achieving high accuracy and outperforming commercial models while maintaining local data control.

Contribution

The study presents a new open-source system with a fine-tuned large language model trained on extensive data for security document classification, demonstrating superior accuracy over commercial models.

Findings

01

Model achieved 95.0% category accuracy on main benchmark

02

Outperformed commercial models with 75.4-79.9% accuracy

03

External validation showed 93.8% accuracy, indicating good generalization

Abstract

Organizations that scan documents for sensitive information face a practical problem. Cloud services require data to be sent to external infrastructure, while rule-based tools often miss threats that depend on context. This study presents TorchSight, an open-source local system for security document classification built around a fine-tuned Qwen 3.5 27B model. The model was trained on 78,358 samples from 13 permissively licensed sources and GPT-4 synthetic data covering seven security categories and 51 subcategories. In the main evaluation on 1,000 documents, the model reached 95.0% category-level accuracy (95% confidence interval: 93.5-96.2). The tested commercial models scored 75.4-79.9% under the same prompting protocol. On a separate external set of 500 held-out samples, the model reached 93.8% accuracy, which suggests that performance extends beyond the main benchmark, although the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.