# Predicting type 2 diabetes via machine learning integration of multiple omics from human pancreatic islets

**Authors:** Tina Rönn, Alexander Perfilyev, Nikolay Oskolkov, Charlotte Ling

PMC · DOI: 10.1038/s41598-024-64846-3 · Scientific Reports · 2024-06-25

## TL;DR

This study uses machine learning to combine multiple types of biological data from pancreatic islets to predict type 2 diabetes with high accuracy.

## Contribution

A novel machine learning approach integrating multiOmics data to predict T2D and identify new biomarkers.

## Key findings

- The multiOmics integration achieved 91% accuracy in predicting T2D with an AUC of 0.96.
- Biomarkers like SACS, TXNIP, OPRD1, and ANO1 were identified as key contributors to T2D mechanisms.
- The approach revealed novel links between DNA methylation, gene expression, and SNPs in T2D.

## Abstract

Type 2 diabetes (T2D) is the fastest growing non-infectious disease worldwide. Impaired insulin secretion from pancreatic beta-cells is a hallmark of T2D, but the mechanisms behind this defect are insufficiently characterized. Integrating multiple layers of biomedical information, such as different Omics, may allow more accurate understanding of complex diseases such as T2D. Our aim was to explore and use Machine Learning to integrate multiple sources of biological/molecular information (multiOmics), in our case RNA-sequening, DNA methylation, SNP and phenotypic data from islet donors with T2D and non-diabetic controls. We exploited Machine Learning to perform multiOmics integration of DNA methylation, expression, SNPs, and phenotypes from pancreatic islets of 110 individuals, with ~ 30% being T2D cases. DNA methylation was analyzed using Infinium MethylationEPIC array, expression was analyzed using RNA-sequencing, and SNPs were analyzed using HumanOmniExpress arrays. Supervised linear multiOmics integration via DIABLO based on Partial Least Squares (PLS) achieved an accuracy of 91 ± 15% of T2D prediction with an area under the curve of 0.96 ± 0.08 on the test dataset after cross-validation. Biomarkers identified by this multiOmics integration, including SACS and TXNIP DNA methylation, OPRD1 and RHOT1 expression and a SNP annotated to ANO1, provide novel insights into the interplay between different biological mechanisms contributing to T2D. This Machine Learning approach of multiOmics cross-sectional data from human pancreatic islets achieved a promising accuracy of T2D prediction, which may potentially find broad applications in clinical diagnostics. In addition, it delivered novel candidate biomarkers for T2D and links between them across the different Omics.

## Linked entities

- **Genes:** SACS (sacsin molecular chaperone) [NCBI Gene 26278], TXNIP (thioredoxin interacting protein) [NCBI Gene 10628], OPRD1 (opioid receptor delta 1) [NCBI Gene 4985], RHOT1 (ras homolog family member T1) [NCBI Gene 55288], ANO1 (anoctamin 1) [NCBI Gene 55107]
- **Diseases:** Type 2 diabetes (MONDO:0005148), T2D (MONDO:0005148)

## Full-text entities

- **Genes:** RHOT1 (ras homolog family member T1) [NCBI Gene 55288] {aka ARHT1, MIRO-1, MIRO1}, OPRD1 (opioid receptor delta 1) [NCBI Gene 4985] {aka DOP, DOR, DOR1, OPRD}, TXNIP (thioredoxin interacting protein) [NCBI Gene 10628] {aka ARRDC6, EST01027, HHCPA78, THIF, VDUP1}, INS (insulin) [NCBI Gene 3630] {aka IDDM, IDDM1, IDDM2, ILPR, IRDN, MODY10}, SACS (sacsin molecular chaperone) [NCBI Gene 26278] {aka ARSACS, DNAJC29, PPP1R138, SPAX6}, ANO1 (anoctamin 1) [NCBI Gene 55107] {aka DOG1, INDMS, MYMY7, ORAOV2, TAOS2, TMEM16A}
- **Diseases:** T2D (MESH:D003924), diabetic (MESH:D003920)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11199577/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11199577/full.md

## References

59 references — full list in the complete paper: https://tomesphere.com/paper/PMC11199577/full.md

---
Source: https://tomesphere.com/paper/PMC11199577