deepNoC: A deep learning system to assign the number of contributors to a short tandem repeat DNA profile
Duncan Taylor, Melissa A. Humphries

TL;DR
deepNoC is a deep learning system that accurately estimates the number of contributors to a DNA profile using simulated data, with high performance and explainability, aiding forensic DNA analysis.
Contribution
This work introduces a novel pipeline for generating large labeled training data via simulation and applies deep neural networks to improve contributor estimation accuracy.
Findings
Achieves 89% accuracy in estimating 1-10 contributors.
Requires only a few hundred real profiles for fine-tuning.
Provides explainability features for user interpretation.
Abstract
A common task in forensic biology is to interpret and evaluate short tandem repeat DNA profiles. The first step in these interpretations is to assign a number of contributors to the profiles, a task that is most often performed manually by a scientist using their knowledge of DNA profile behaviour. Studies using constructed DNA profiles have shown that as DNA profiles become more complex, and the number of DNA-donating individuals increases, the ability for scientists to assign the target number. There have been a number of machine learning algorithms developed that seek to assign the number of contributors to a DNA profile, however due to practical limitations in being able to generate DNA profiles in a laboratory, the algorithms have been based on summaries of the available information. In this work we develop an analysis pipeline that simulates the electrophoretic signal of an STR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
