Identification of a novel HLA-G allele, HLA-G*01:66, using PolyseqOne and oxford nanopore sequencing
Ran Li, Liyan Sun, Siqi Cai

TL;DR
Researchers discovered a new HLA-G allele, HLA-G*01:66, using advanced sequencing techniques.
Contribution
The paper introduces the novel HLA-G*01:66 allele identified through PolyseqOne and Oxford Nanopore sequencing.
Findings
HLA-G*01:66 has a missense nucleotide substitution at position 508 in exon 3.
The new allele was identified using PolyseqOne and Oxford Nanopore sequencing technologies.
Abstract
HLA-G*01:66 differs from HLA-G*01:01:22:01 by one missense nucleotide substitution at position 508 in exon 3.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1- —Sanming Project of Medicine in Shenzen Municipality10.13039/501100012151
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReproductive System and Pregnancy · Reproductive Biology and Fertility · Reproductive Physiology in Livestock
HLA-G, a non-classical class I molecule of the human MHC Ib family, exhibits limited protein polymorphism but marked diversity at the DNA level. As of the IPD-IMGT/HLA Database Release 3.62 (October 2025), a total of 194 alleles encoding 61 proteins have been documented (Robinson et al., 2024; Barker et al., 2023). Here, we report the identification of a novel allele, HLA-G01:66, which differs from HLA-G01:01:22:01 by a single nucleotide change in exon 3. The novel allele was identified during a population genetic study designed to characterize HLA diversity in healthy Chinese volunteers. HLA typing was performed as part of this broad screening effort; the two carriers were not selected based on any clinical or familial criteria.
Genomic DNA was extracted from peripheral blood using the iPure DNA HS kit (IGE Biotechnology, Guangzhou, China). HLA typing across 12 loci (HLA-A, -B, -C, -DRB1, -DRB3/4/5, -DQA1, -DQB1, -DPA1, -DPB1, etc.) was performed via long-range multiplex PCR using ApexHF HS DNA Polymerase (Accurate Biology, Changsha, China). Routine typing was conducted on the PolyseqOne nanopore platform (Polyseq Inc., Beijing, China), and novel alleles were confirmed using Oxford Nanopore Technologies (ONT, Oxford, United Kingdom). PolyseqOne libraries were prepared using PY-DTB101/102 and PY-BLP101 kits (Polyseq Inc., Beijing, China), sequenced on PY-NFC001 flow cells for 12 h, and base-called with Kant v1.0.1 in high-accuracy mode (420 bp/s). Fastq files from both platforms were error-corrected with NanoFix-AI. HLA genotyping was then performed with NanoHLA- Resolve Assign v1.0.5 (DAFEI Biotech, Guangzhou, China) against the IPD-IMGT/HLA Database. The error-corrected consensus sequence for HLA-G*01:66 from the two platforms was identical across the full 7.1 kb, with no nucleotide discrepancies.
The novel allele was submitted to GenBank (Accession No. PX126642) and the IPD-IMGT/HLA Database (HWS10101356). It was officially named HLA-G01:66 by the WHO Nomenclature Committee in September 2025, following current naming guidelines (Marsh et al., 2010). HLA-G01:66 is a novel allele differing from HLA-G*01:01:22:01 by a single nucleotide change at position 508 in exon 3 (A>G). This mutation converts codon 146 from AAG to GAG, thereby substituting the encoded lysine with glutamic acid (K146E) (Figure 1). The potential functional impact of this substitution remains to be determined.
Of the 163 healthy individuals genotyped (corresponding to 326 alleles), two were found to be heterozygous for the novel HLA-G01:66 allele. The other HLA-G alleles in these carriers were HLA-G01:01:04 and HLA-G01:01:01. Despite shared similarities at other HLA loci, the two individuals were confirmed to be unrelated. Their detailed HLA genotypes are presented in Table 1. The screening yielded an observed allele frequency of 0.61% for the HLA-G01:66 allele. We note that the true population frequency may differ, as our sample size offers limited precision for this estimate.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Barker D. J. Maccari G. Georgiou X. Cooper M. A. Flicek P. Robinson J. (2023). The IPD-IMGT/HLA database. Nucleic Acids Research 51 (D 1), D 1053–D 1060. 10.1093/nar/gkac 1011 36350643 PMC 9825470 · doi ↗ · pubmed ↗
- 2Marsh S. G. Albert E. D. Bodmer W. F. Bontrop R. E. Dupont B. Erlich H. A. (2010). Nomenclature for factors of the HLA system, 2010. Tissue Antigens 75 (4), 291–455. 10.1111/j.1399-0039.2010.01466.x 20356336 PMC 2848993 · doi ↗ · pubmed ↗
- 3Robinson J. Barker D. J. Marsh S. G. E. (2024). 25 years of the IPD-IMGT/HLA database. HLA 103 (6), e 15549. 10.1111/tan.15549 38936817 · doi ↗ · pubmed ↗
