# Digenic Analysis Finds Highly Interactive Genetic Variants Underlying Polygenic Traits

**Authors:** Gao Wang, Jurg Ott

PMC · DOI: 10.18103/mra.v11i10.4604 · 2024-06-14

## TL;DR

This paper introduces a new method to find interacting genetic variants in complex diseases like AMD and Parkinson's by analyzing genotype pairs.

## Contribution

The novel approach uses a statistical framework to identify highly connected genetic variants in polygenic traits through digenic analysis.

## Key findings

- AMD genetic variants are more interconnected than those in Parkinson's Disease.
- The method identified 12 and 8 significant variants for AMD and PD, respectively.
- Some identified variants match results from other machine learning methods.

## Abstract

We briefly review our recently published approach to mining digenic genotype patterns, which consist of two genotypes each originating in a different DNA variant. We do this for a genetic case-control study by evaluating all possible pairs of genotypes, distributing the workload over numerous CPUs (threads) in a high-performance computing environment and apply our methods to two known datasets, age-related macular degeneration (AMD) and Parkinson Disease (PD). Based on a list of (e.g., 100,000) genotype pairs with largest genotype pair frequency differences between cases and controls, we determine the number Nu of unique variants occurring in this list. For each unique variant, we find the number of genotype pairs it participates in, which identifies a set of variants “connected” with the given unique variant. Among the total of variants “connected” with all unique variants, only a subset of variants is unique. The ratio of all connected variants divided by that subset of variants is a measure for the overall density or connectedness of variants interacting with each other. We find that variants for the AMD data are much more interconnected than those for PD, at least based on the 100,000 genotype pairs with largest chi-square we investigated. Further, for each of the Nu unique variants, we use the number of variants connected with it as a test statistic, weighted by the inverse of the rank at which the unique variant first occurred in the original list of genotype patterns. This weighing scheme ties the number of connections to the genetics of the trait and allows us to obtain, for each of the Nu unique variants, an empirical significance level by permuting ranks. We find 12 and 8 significant, highly connected variants for AMD and PD, respectively, some of which have previously been identified by other machine learning methods, thus providing credence to our approach. Among the 100,000 genotype pairs investigated for each of AMD and PD, significant variants showed connections with up to 7,093 and 3,777 other variants, respectively. Our approach has been implemented in a freely available piece of software, the Digenic Network Test. Thus, our statistical genetics method can provide important information on the genetic architecture of polygenic traits.

## Linked entities

- **Diseases:** age-related macular degeneration (MONDO:0005150), Parkinson Disease (MONDO:0005180)

## Full-text entities

- **Diseases:** PD (MESH:D010300), AMD (MESH:D008268)

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC11177775/full.md

---
Source: https://tomesphere.com/paper/PMC11177775