Construction and evaluation of classifiers for forensic document   analysis

Christopher P. Saunders; Linda J. Davis; Andrea C. Lamas; John J.; Miller; Donald T. Gantz

arXiv:1004.0678·stat.AP·March 14, 2015

Construction and evaluation of classifiers for forensic document analysis

Christopher P. Saunders, Linda J. Davis, Andrea C. Lamas, John J., Miller, Donald T. Gantz

PDF

TL;DR

This paper develops and evaluates classifiers for forensic document analysis using a statistical approach, introducing a Bayesian cross-validation method to assess classifier performance with limited samples per writer.

Contribution

It presents a novel Bayesian cross-validation technique for evaluating classifiers in forensic document analysis with small sample sizes.

Findings

01

Classifiers achieved near perfect accuracy in predicting writers.

02

The Bayesian cross-validation method effectively assesses classifier performance.

03

The approach is suitable for datasets with many writers and few samples per writer.

Abstract

In this study we illustrate a statistical approach to questioned document examination. Specifically, we consider the construction of three classifiers that predict the writer of a sample document based on categorical data. To evaluate these classifiers, we use a data set with a large number of writers and a small number of writing samples per writer. Since the resulting classifiers were found to have near perfect accuracy using leave-one-out cross-validation, we propose a novel Bayesian-based cross-validation method for evaluating the classifiers.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.