# Global forensic geolocation with deep neural networks

**Authors:** Neal S. Grantham, Brian J. Reich, Eric B. Laber, Krishna Pacifici,, Robert R. Dunn, Noah Fierer, Matthew Gebert, Julia S. Allwood, Seth A. Faith

arXiv: 1905.11765 · 2019-05-29

## TL;DR

This paper introduces a deep neural network ensemble algorithm for forensic geolocation using microbial DNA data, achieving high accuracy in continental and global dust sample analyses.

## Contribution

It presents a novel ensemble deep learning approach for microbial geolocation, leveraging Voronoi partitions and high-throughput sequencing data.

## Key findings

- Over 50% of errors under 100 km in continental data
- Nearly 90% accuracy in country-level classification globally
- Effective for forensic applications with microbial DNA

## Abstract

An important problem in forensic analyses is identifying the provenance of materials at a crime scene, such as biological material on a piece of clothing. This procedure, known as geolocation, is conventionally guided by expert knowledge of the biological evidence and therefore tends to be application-specific, labor-intensive, and subjective. Purely data-driven methods have yet to be fully realized due in part to the lack of a sufficiently rich data source. However, high-throughput sequencing technologies are able to identify tens of thousands of microbial taxa using DNA recovered from a single swab collected from nearly any object or surface. We present a new algorithm for geolocation that aggregates over an ensemble of deep neural network classifiers trained on randomly-generated Voronoi partitions of a spatial domain. We apply the algorithm to fungi present in each of 1300 dust samples collected across the continental United States and then to a global dataset of dust samples from 28 countries. Our algorithm makes remarkably good point predictions with more than half of the geolocation errors under 100 kilometers for the continental analysis and nearly 90% classification accuracy of a sample's country of origin for the global analysis. We suggest that the effectiveness of this model sets the stage for a new, quantitative approach to forensic geolocation.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.11765/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/1905.11765/full.md

## References

27 references — full list in the complete paper: https://tomesphere.com/paper/1905.11765/full.md

---
Source: https://tomesphere.com/paper/1905.11765