Calculating LRs for presence of body fluids from mRNA assay data in mixtures
R.J.F. Ypma, P.A. Maaskant-van Wijk, R.D. Gill, M. Sjerps, M. van den, Berge

TL;DR
This paper develops a probabilistic likelihood ratio system for identifying multiple body fluids in forensic samples using mRNA data, incorporating in silico mixtures and machine learning classifiers to improve accuracy and robustness.
Contribution
It introduces a novel LR framework capable of analyzing mixed body fluid samples with probabilistic classification and calibration, enhancing forensic mRNA analysis.
Findings
Logistic regression provides robust and interpretable results.
The LR system performs well on simulated and real mixture samples.
The approach improves probabilistic identification of body fluids in forensic evidence.
Abstract
Messenger RNA (mRNA) profiling can identify body fluids present in a stain, yielding information on what activities could have taken place at a crime scene. To account for uncertainty in such identifications, recent work has focused on devising statistical models to allow for probabilistic statements on the presence of body fluids. A major hurdle for practical adoption is that evidentiary stains are likely to contain more than one body fluid and current models are ill-suited to analyse such mixtures. Here, we construct a likelihood ratio (LR) system that can handle mixtures, considering the hypotheses H1: the sample contains at least one of the body fluids of interest (and possibly other body fluids); H2: the sample contains none of the body fluids of interest (but possibly other body fluids). Thus, the LR-system outputs an LR-value for any combination of mRNA profile and set of body…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
