# Complex genetic effects linked to plasma protein abundance in the UK Biobank

**Authors:** Arnor I. Sigurdsson, Justus F. Gräf, Zhiyu Yang, Kirstine Ravn, Jonas Meisner, Roman Thielemann, Henry Webel, Roelof A. J. Smit, Lili Niu, Matthias Mann, Zhiyu Yang, Zhiyu Yang, Andrea Ganna, Bjarni Vilhjalmsson, Benjamin M. Neale, Jens-Christian Holm, Andrea Ganna, Torben Hansen, Ruth J. F. Loos, Simon Rasmussen

PMC · DOI: 10.1038/s41467-025-67235-0 · 2025-12-14

## TL;DR

This paper introduces a deep learning method to uncover complex genetic effects on plasma protein levels using data from the UK Biobank.

## Contribution

The study presents EIR-auto-GP, a novel deep learning approach for identifying non-linear and complex genetic effects on plasma proteins.

## Key findings

- 123 proteins were found to correlate with non-linear covariates in the UK Biobank cohort.
- 15 proteins showed genetic dominance and epistasis effects.
- A novel interaction between ABO and FUT3 loci was identified, along with dominance effects on CD209 and CLEC4M.

## Abstract

Understanding genetic associations of proteins is important for studying the molecular effect of genetic variation. A key component of this is to understand the role of complex genetic effects such as dominance and epistasis that are associated with plasma proteins. Therefore, we develop EIR-auto-GP, a deep learning-based approach, to identify complex effects that are associated with protein quantitative trait loci (pQTLs). Applying this method to the UK Biobank proteomics cohort of 48,594 individuals, we identify 123 proteins that are correlated with non-linear covariates and 15 with genetic dominance and epistasis. We uncover a novel interaction between the ABO and FUT3 loci and demonstrate dominance effects of the ABO locus on plasma levels of pathogen recognition receptors CD209 and CLEC4M. Furthermore, we replicate these findings and the methodology across Olink and mass spectrometry-based cohorts. Our approach presents a systematic, large-scale attempt to identify complex effects of plasma protein levels.

The authors present a deep learning approach to uncover complex genetic effects on circulating protein levels. They reveal new interactions and dominance patterns using UK Biobank proteomics data.

## Linked entities

- **Genes:** ABO (ABO, alpha 1-3-N-acetylgalactosaminyltransferase and alpha 1-3-galactosyltransferase) [NCBI Gene 28], FUT3 (fucosyltransferase 3 (Lewis blood group)) [NCBI Gene 2525], CD209 (CD209 molecule) [NCBI Gene 30835], CLEC4M (C-type lectin domain family 4 member M) [NCBI Gene 10332]
- **Proteins:** CD209 (CD209 molecule), CLEC4M (C-type lectin domain family 4 member M)

## Full-text entities

- **Genes:** CD209 (CD209 molecule) [NCBI Gene 30835] {aka CDSIGN, CLEC4L, DC-SIGN, DC-SIGN1, hDC-SIGN}, CLEC4M (C-type lectin domain family 4 member M) [NCBI Gene 10332] {aka CD209L, CD209L1, CD299, DC-SIGN2, DC-SIGNR, DCSIGNR}, ABO (ABO, alpha 1-3-N-acetylgalactosaminyltransferase and alpha 1-3-galactosyltransferase) [NCBI Gene 28] {aka A3GALNT, A3GALT1, GTA, GTB, NAGAT}, FUT3 (fucosyltransferase 3 (Lewis blood group)) [NCBI Gene 2525] {aka CD174, FT3B, FucT-III, LE, Les}

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12804929/full.md

---
Source: https://tomesphere.com/paper/PMC12804929