Common interacting genetic variation shapes susceptibility to type 1 diabetes in a Colombian Caribbean community: In search of shared genetic markers
Gloria Garavito-De Egea, Alex Domínguez-Vargas, Jorge I. Vélez, Gustavo Aroca, Luis Fang, Elkin Navarro-Quiroz, Zilac Espitaleta, Kenny Del Toro-Camargo, Leticia Martínez-Ariza, Tatiana González-Vargas, Susana García, Mauricio Arcos-Burgos, Eduardo Egea

Abstract
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDiabetes and associated disorders · Pancreatic function and diabetes · Genetic Associations and Epidemiology
Genome-wide association studies (GWASs) have identified hundreds of loci across the human genome conferring susceptibility to autoimmune diseases (AIDs), some of which are shared between more than two diseases. However, this univariate approach has limitations in detecting complex genotype-phenotype correlations. In this work, we carried out whole-exome sequencing of Colombian Caribbean patients with type 1 diabetes (T1D), lupus nephritis (LN), and juvenile idiopathic arthritis (JIA), to evaluate functional exomic variation, i.e., single nucleotide polymorphisms (SNPs), and to outline common and rare variations underpinning the susceptibility to these autoimmune diseases. Single and multi-locus linear mixed-effects models fit the data to identify T1D-associated genomic variants and the most likely genetic architecture underpinning AID risk. Variations associated with T1D susceptibility pointed to genes related to glycoprotein oligosaccharide biosynthesis, phospholipid binding, pancreatic adenocarcinoma, systolic blood pressure, and fasting insulin metabolism, among others that highlight MGAT5 (PFDR = 1.64 × 10^−22^), RUNX1 (PFDR = 1.8 × 10^−12^), PSD3 (PFDR = 8.1 × 10^−12^), and HLA-DBP2 (PFDR = 2.18 × 10^−9^). Our study outlines oligogenic common variation underpinning the susceptibility to develop T1D. These genetic polymorphisms are also shared by patients with other AIDs such as LN and JIA, indicating that the shared genetic architecture (defined by pleiotropy and epistasis) shapes the genetic susceptibility of these disorders in this multiethnic population.
AIDs are estimated to impact 5%–8% of people living in America, representing a public health impact and a significant epidemiological burden. GWASs suggest that a set of shared genetic risk factors is underlying the etiology of AIDs.1 It is noteworthy that several AIDs show a clear family grouping, such as inflammatory bowel disease, while others, such as T1D, autoimmune thyroiditis, and celiac disease, can manifest as comorbid diseases among them.2 Indeed, the gene-sharing concept between AIDs is not clear to date, nor is it ruled out that this is due to ‘pleiotropic’ factors that predispose to multiple AIDs through shared mechanisms or if numerous independent risk factors are overlooked or add up to give these endotypes.2^,^3
Genetic evidence has stated that around 44% of SNPs found in GWASs on AIDs are shared by two or more of the following diseases: celiac disease, Crohn's disease, psoriasis, multiple sclerosis (MS), rheumatoid arthritis, T1D, and systemic lupus erythematosus. Moreover, T1D, LN, and JIA have previously been considered part of an AID cluster with genetic and clinical interactions.1 Several epidemiological studies, along with our findings, support overlapping genetic factors, for example, shared heritability in autoimmune diseases.3^,^4 In previous studies, our group demonstrated that transmission of AID susceptibility is fit either for i) the effects of major Mendelian loci or ii) oligogenic cooperating loci, or for iii) the inclusive models of hundreds of loci interacting with environmental effects.5
In this study, we report the analysis of whole-exome sequencing of 75 patients with autoimmune phenotypes (T1D, LN, and JIA, n = 25 from each group) ascertained from Barranquilla, the capital city of the Atlántico state in the Colombian Caribbean coast. We studied the association of common polymorphic variants (minor allele frequency ≥0.01) with T1D using single- and multi-locus linear mixed-effect models (SLMEM and MLMEM, respectively) with up to 10 steps in the backward/forward optimization algorithm.
The demographic characteristics of individuals included in this study are summarized in Table S1. The mean age for study participants was 20 ± 9.6 years. There were 55 (73%, 55/75) women and 20 men (27%, 20/75). Symptom lengths went from two to up to over 20 years. LN patients described a disease debut at 13.9 ± 2.6 years of age on average, T1D patients at 6.7 ± 4 years of age, and JIA patients reported a diagnosis of the disease at 6.5 ± 3.9 years of age.
DNA from patients was subject to whole exome capture, amplification, and sequencing. We identified a total of 2,779,380 SNPs. This number was reduced to 87,638 common variants with potential functional effects at the end of the filtering process. These variants were harbored in chromosomal regions reported to confer susceptibility to T1D and manually compiled and curated by our group. Manhattan plots depicting genome significances reached after the maximization of three MLMEMs (additive, dominant, and recessive) are presented in Figure 1.Figure 1. Genome-wide association analysis of patients (n = 75). The Manhattan plots of the association results are shown when (a) additive, (b) dominant, and (c) recessive multi-locus linear-mixed effects are used. The plots show the top markers detected with FDR-corrected *p-*values below 0.05 presenting –log_10_ (P) versus the SNPs coordinates throughout the genome. FDR, false discovery rate.Fig. 1
We contrasted several MLMEMs, including all 87,638 common variants in genes previously reported to confer susceptibility to T1D. An MLMEM with four, nine, and six steps in the forward/backward selection algorithm were selected under the additive (Table S2a and Fig. 1a), dominant (Table S2b and Fig. 1b), and recessive (Table S2c and Fig. 1c) forms of transmission, respectively. The advantage of these models is the inclusion of both fixed (genotype markers and covariates of any type) and random effects (family or population structure), the latter of which accounts for potential inbreeding by including a kinship matrix (the identity-by-descent matrix). In our case, the identity-by-descent matrix was estimated between all pairs of individuals using all markers located within genes of interest. A SLMEM assumes that all loci have a negligible effect on the trait (simulating a multifactorial model), while an MLMEM assumes that several interacting loci have a significant effect (non-linear epistatic effects).
Four variants confer susceptibly to T1D under the additive inheritance model (Table S2a). Of these, variants mapping to the MGAT5 ( = 0.747, odds ratio/OR = 2.11, PFDR = 3.18 × 10^−10^), POM121C ( = 0.419, OR = 1.520, PFDR = 1.75 × 10^−3^), and UHRF1BP1 ( = 0.684, OR = 1.982, PFDR = 4.28 × 10^−3^) genes confer susceptibility to T1D, while the variant mapping to RUNX1 ( = 0.747, OR = 0.586, PFDR = 3.19 × 10^−4^) is protective (Table S2a).
We identified nine variants associated with T1D under the dominant inheritance model (Table S2b). Of these, variants harbored in the MGAT5 ( = 1.336, OR = 3.8, PFDR = 1.64 × 10^−22^), PSD3 ( = 0.947, OR = 2.578, PFDR = 1.8 × 10^−12^), HLA-DBP2 ( = 0.607, OR = 1.835, PFDR = 2.18 × 10^−9^), CD86 ( = 0.381, OR = 1.464, PFDR = 3.97 × 10^−3^), KSR2 ( = 0.418, OR = 1.519, PFDR = 1.17 × 10^−2^), and EDEM3 ( = 0.162, OR = 1.176, PFDR = 2.33 × 10^−2^) genes confer susceptibility to T1D. In contrast, variants within the RUNX1 ( = −0.968, OR = 0.379, PFDR = 1.8 × 10^−12^), GRB2 ( = −0.808, OR = 0.446, PFDR = 4.88 × 10^−11^), and GABRA2 ( = −0.223, OR = 0.8, PFDR = 3.49 × 10^−5^) genes are protective.
In the recessive transmission MLMEM model, we identified six variants associated with T1D (Table S2c and Fig. 1c). Among these, variants harbored in the LINC00578 ( = −1.833, OR = 0.159, PFDR = 2.43 × 10^−11^) and PTPN2 ( = −0.645, OR = 0.525, PFDR = 1.87 × 10^−2^) genes are protective, and those within the LINC1250 ( = 1.628, OR = 5.094, PFDR = 3.49 × 10^−5^), GABRA2 ( = 0.89, OR = 2.435, PFDR = 3.49 × 10^−5^), RBFOX1 ( = 0.825, OR = 2.282, PFDR = 3.49 × 10^−5^), and SCUBE3 ( = 0.724, OR = 2.062, PFDR = 3.49 × 10^−5^) genes confer susceptibility to T1D.
In summary, our study outlines oligogenic common variation underpinning the susceptibility to develop T1D. These genetic polymorphisms are also shared by patients suffering from other diseases such as LN and JIA, indicating that the shared genetic architecture defined by pleiotropy and epistasis shapes the genetic susceptibility of these disorders in this multiethnic population. Given the predicted functional nature of these genetic variants, it is very likely that in this understudied multiethnic population, genes harboring these mutations are major contributors to AID immunopathology and provide new insights into the autoimmune tautology in this group of diseases.
Ethics declaration
The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Ethics Committee of Universidad del Norte and Universidad Simón Bolívar, Barranquilla, Colombia (Approval # 00032, 13 October 2011). Informed consent was obtained from all subjects involved in the study.
Author contributions
Conceptualization: G.G.-D.E., J.I.V., M.A.-B., E.E., and L.F. Methodology: G.G.-D.E., M.A-B., and J.I.V. Validation: J.I.V. and M.A.-B. Formal analysis: G.G.-D.E., J.I.V., M.A.-B., A.D.-V., and E.E. Data curation: G.G.-D.E., J.I.V., M.A.-B., S.G., K.T.-C., E.E., A.D.-V., T.G-V., and L.F. Visualization: J.I.V., K.T.-C., M.A.-B. and L.M.-A. Resources: G.G.-D., E.E., G.A., T.G-V., and Z.E. Writing—original draft preparation: G.G.-D.E, J.I.V., M.A.-B., E.E., and A.D.-V. Writing—review & editing: G.G.-D.E, J.I.V, A.D.-V, E.N.-Q., G.A, Z. E, S.G and E.E. Project administration: G.G.-D.E, J.I.V, and E.E. All authors have read and agreed to the published version of the manuscript.
Conflict of interests
The authors declare no conflict of interests.
Funding
This study was financed by Minciencias title proyect “Variantes genéticas y mutaciones genómicas identificadas por secuenciación de última generación (NGS) en familias colombianas: en busca de marcadores comunes de tautología autoinmune – No. 121577758377” and was partially supported from 10.13039/501100004245Universidad del Norte, Barranquilla, Colombia. The sponsor of the study had no role in study design, data collection, data analysis, data interpretation, or writing of the paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Lettre G.Rioux J.D.Autoimmune diseases: insights from genome-wide association studies Hum Mol Genet 17R 22008 R 116R 1211885219910.1093/hmg/ddn 246PMC 2782355 · doi ↗ · pubmed ↗
- 2Zhernakova A.van Diemen C.C.Wijmenga C.Detecting shared pathogenesis from the shared genetics of immune-related diseases Nat Rev Genet 101200943551909283510.1038/nrg 2489 · doi ↗ · pubmed ↗
- 3Cotsapas C.Voight B.F.Rossin E.Pervasive sharing of genetic effects in autoimmune disease P Lo S Genet 782011 e 100225410.1371/journal.pgen.1002254 PMC 315413721852963 · doi ↗ · pubmed ↗
- 4Caliskan M.Brown C.D.Maranville J.C.A catalog of GWAS fine-mapping efforts in autoimmune disease Am J Hum Genet 108420215495633379844310.1016/j.ajhg.2021.03.009PMC 8059376 · doi ↗ · pubmed ↗
- 5Vélez J.I.Lopera F.Silva C.T.Familial Alzheimer's disease and recessive modifiers Mol Neurobiol 5722020103510433166470210.1007/s 12035-019-01798-0PMC 7031188 · doi ↗ · pubmed ↗
