# Robust imputation-based method for eye, hair, and skin colour prediction from low-coverage ancient DNA

**Authors:** Zoltán Maróti, Emil Nyerki, Tibor Török, Gergely István Varga, Tibor Kalmár

PMC · DOI: 10.1038/s41598-026-38372-3 · 2026-02-05

## TL;DR

This paper introduces a new method to predict eye, hair, and skin color from low-quality ancient DNA using imputation techniques.

## Contribution

The novel contribution is a robust imputation-based framework called aHISPlex for accurate phenotype prediction from degraded ancient DNA.

## Key findings

- Phenotype predictions from ancient DNA can be accurate even at low genome coverages (0.1–0.5×).
- The aHISPlex framework includes tools and reference datasets for efficient analysis of degraded DNA samples.
- Factors like genome coverage and postmortem damage significantly affect imputation efficiency.

## Abstract

The prediction of externally visible traits (eye, hair, and skin colours) from DNA can provide valuable information for contemporary and ancient human populations. The validated HIrisPlex-S method is the primary tool in forensics for phenotyping modern samples. The HIrisPlex-S multiplex PCR assay can handle trace DNA from modern samples, but the analysis of degraded, low-coverage ancient DNA (aDNA) presents additional challenges. Genotype imputation has recently proven successful in effectively filling in missing information in aDNA sequences. To assess the feasibility of this approach, we evaluated how key factors, such as genome coverage, minor allele frequency, extent of postmortem damage, and the population origin of the test individual, influence the efficiency of imputing HIrisPlex-S markers and predicting phenotypes. We used high-coverage sequence data from modern individuals and ancient remains for the evaluation. Our results demonstrate that even with genome coverages as low as 0.1–0.5×, the proposed workflow can predict phenotypes from degraded ancient (or forensic) whole genome sequence (WGS) data with good accuracy. To aid the archaeogenetics community, we have developed aHISPlex. This user-friendly, easily deployable imputation-based framework includes the new bioinformatics tools and the pre-made reference data sets required for the whole analysis.

The online version contains supplementary material available at 10.1038/s41598-026-38372-3.

## Full-text entities

- **Genes:** UNG (uracil DNA glycosylase) [NCBI Gene 7374] {aka DGU, HIGM4, HIGM5, UDG, UNG1, UNG15}, MC1R (melanocortin 1 receptor) [NCBI Gene 4157] {aka CMM5, MSH-R, SHEP2}
- **Diseases:** pigmentation (MESH:D010859), PMD (MESH:D011180)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** G > A, rs1110400, A > G, rs312262906, rs1805006, rs885479, rs1805008, rs12913832, C > T, rs2228479, rs201326893, g.89919344dup, rs11547464, rs1805009

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12923854/full.md

---
Source: https://tomesphere.com/paper/PMC12923854