Incorporating Naive Bayes Classification to Address Subpopulation Structure in Familial DNA Search
Akaraphon Jantaraphum, Chanagarn Laoiam, Budsaba Rerkamnuaychoke, Jittima Shotivaranon, Monchai Kooakachai

TL;DR
This paper introduces LRCLASS, a new likelihood ratio method that uses Naive Bayes classification to better account for population substructure in familial DNA searches, improving detection power.
Contribution
The study presents LRCLASS, integrating Naive Bayes classification into LR-based familial DNA analysis to address subpopulation effects, a novel approach in forensic genetics.
Findings
LRCLASS outperforms existing methods in detecting full-sibling relationships.
Naive Bayes and multinomial logistic regression classifiers show comparable performance.
The approach enhances the robustness of relationship inference under population substructure.
Abstract
Familial DNA search evaluates the genetic relatedness of two individuals by comparing the likelihood of their observed DNA profiles under two competing hypotheses-the null hypothesis that the individuals are unrelated and the alternative hypothesis that they are related-most commonly through the likelihood ratio (LR). Standard LR-based approaches typically assume a uniform genetic background; however, this assumption is rarely valid due to population substructure, where allele frequencies vary among subpopulations and can bias relationship inference. Existing modifications-such as LR calculations based on average allele frequencies (LRLAF) and strategies using maximum, minimum, or average likelihood ratios (LRMAX, LRMIN, LRAVG)-help mitigate these challenges but remain limited in their ability to fully address subpopulation differences. This study introduces a new LR-based statistic,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Forensic and Genetic Research · Genomic variations and chromosomal abnormalities
